Research Presentation
Research Presentation
Research Presentation
3. What is descriptive
statistics?
Descriptive statistics are used to describe the characteristics or features of a
dataset. The term ‘descriptive statistics can be used to describe both
individual quantitative observations (also known as ‘summary statistics’) as
well as the overall process of obtaining insights from these data. We can use
descriptive statistics to describe both an entire population and an individual
sample. Because they are merely explanatory, descriptive statistics are not
heavily concerned with the differences between the two types of data.
So what measures do descriptive statistics look at? While there are many,
important ones include:
Distribution
Central tendency
Variability
What is distribution?
DistrThe distributions use the frequency of different outcomes (or data points)
in a population or sample. We can show it as numbers in a list or table, or we
can represent it graphically. As a basic example, the following list shows the
number of those with different hair colors in a dataset of 286 people.
We can also represent this information visually, for instance in a pie chart.
Generally, using visualizations is common practice in descriptive statistics. It
helps us more readily spot patterns or trends in a dataset.
Once again, using our hair color example, we can determine that the mean
measurement is 57.2 (the total value of all the measurements, divided by the
number of values), the median is 39 (the central value) and the mode is 13
(because it appears twice, which is more than any of the other data points).
Although this is a heavily simplified example, for many areas of data analysis
these core measures underpin how we summarize the features of a data
sample or population. Summarizing these kinds of statistics is the first step in
determining other key characteristics of a dataset, for example, its variability.
This leads us to our next point…
What is variability?
The variability, or dispersion, of a dataset, describes how values are
distributed or spread out. Identifying variability relies on understanding the
central tendency measurements of a dataset. However, like central tendency,
variability is not just one measure. It is a term used to describe a range of
measurements. Common measures of variability include:
4. What is inferential
statistics?
So, we’ve established that descriptive statistics focus on summarizing the key
features of a dataset. Meanwhile, inferential statistics focusfocusesaking
generalizations about a larger population based on a representative sample of
that population. Because inferential statistics focuses on making predictions
(rather than stating facts) its results are usually in the form of a probability.
Hypothesis testing
Confidence intervals
Regression and correlation analysis
The hypothesis test, in this case, might ask something like: ‘Does the vaccine
reduce severe illness caused by covid-19?’ By collecting data from different
sample groups, we can infer if the vaccine will be effective. If all samples
show similar results and we know that they are representative and random,
we can generalize that the vaccine will have the same effect on the population
at large. On the flip side, if one sample shows higher or lower efficacy than the
others, we must investigate why this might be. For instance, maybe there was
a mistake in the sampling process, or perhaps the vaccine was delivered
differently to that group. It’s due to a dosing error that one of the Covid
vaccines act Which shows how important hypothesis testing can be. If the
outlier group had simply been written off, the vaccine would have been less
effective!
For example, let’s say you’ve measured the tails of 40 randomly selected cats.
You get a mean length of 17.5cm. You also know the standard deviation of tail
lengths is 2cm. Using a special formula, we can say the mean length of tails in
the full population of cats is 17.5cm, with a 95% confidence interval.
Essentially, this tells us that we are 95% certain that the population mean
(which we cannot know without measuring the full population) falls within the
given range. This technique is very helpful for measuring the degree of
accuracy within a sampling method.
What we’ve described here is just a small selection of a great many inferential
techniques that you can use within data analytics. However, they provide a
tantalizing taste of the sort of predictive power that inferential statistics can
offer.
Descriptive statistics:
Inferential statistics:
One last thing to note: while we’ve presented descriptive and inferential
statistics in a binary way, they are often used in conjunction. Together, these
powerful statistical techniques are the foundational bedrock on which data
analytics is built.
To learn more about the role that descriptive and inferential statistics play in
data analytics, check out our free, five-day short course. If that’s your
interest in pursuing data analytics as a career, why not then check out
the best online data analytics courses on the market? For more
introductory data analytics topics, see the following:
Data Analytics
All articles
Will Hillier
Contributor to the CareerFoundry blog
A British-born writer based in Berlin, Will has spent the last 10 years writing about
education and technology, and the intersection between the two. He has a borderline
fanatical interest in STEM and has been published in TES, the Daily Telegraph, SecEd
magazine, and more,e. His fiction has been short- and longlisted for over a dozen
awards.
1. Descriptive Statistics :
Descriptive statistics uses data that providesdescribesulation either through
numerical calculatcalculationsh or tables. Itablesvides a graphical summary of
data. It is simply used for summarizing objects, etc. There are two categories in
this as follows
a). The measure of central tendency –
Measure The measurer tendency is also known as summary statistics that
are to represenrepresentter a point or a particular value of a data set or
sample set.
In statistics, there are three common measures of central tendency as
shown below:
(i) Mean :
It is measured as an average of all value invaluesmple set.
For example,
(ii) Median :
It measures of central the value of a sample set. In these, the data
set is ordered from lowest to highest value and then finds the exact
middle.
For example,
1. Descriptive Statistics :
Descriptive statistics uses data that providesdescribesulation either through
numerical calculatcalculationsh or tables. Itablesvides a graphical summary of
data. It is simply used for summarizing objects, etc. There are two categories in
this as follows
a). The measure of central tendency –
Measure The measurer tendency is also known as summary statistics that is
to represent the center point or a particular value of a data set or sample set.
In statistics, there are three common measures of central tendency as
shown below:
(i) Mean :
It is measured a by average the of all value invaluesmple set.
For example,
(ii) Median :
It measures a of central the value of a sample set. In these, data
setthe is ordered from lowest to highest value and then finds exact
mithe ddle.
For example,
(ii) Median :
It is measure a of central the value of a sample set. In these, data
setthe is ordered from lowest to highest value and then finds exact
mithe ddle.
For example,
(iii) Mode :
It is value mothe st frequently arrived in sample sthe et. The value
repeated most of time in the central the set is actuallyr example,
Descriptive Statistics
Descriptive statistics give information that describes the data in some manner. For example,
suppose a pet shop sells cats, dogs, birds, and fish. If 100 pets are sold, and 40 out of the 100
were dogs, then one description of the data on the pets sold would be that 40% were dogs.
This same pet shop may conduct a study on the number of fish sold each day for one month and
determine that an average of 10 fish were sold each day. The average is an example of
descriptive statistics.
Some other measurements in descriptive statistics answer questions such as 'How widely
dispersed is this data?', 'Are there a lot of different values?' or 'Are many of the values the
same?', 'What value is in the middle of this data?', 'Where does a particular data value stand
concerning the other values in the data set?'
A graphical representation of data is another method of descriptive statistics. Examples of this
visual representation are histograms, bar graphs, and pie graphs, to name a few. Using these
methods, the data is described by compiling it into a graph, table, or other visual representation.
This provides a quick method to make comparisons between different data sets and to spot the
smallest and largest values and trends or changes over some time. If the pet shop owner wanted
to know what type of pet was purchased most in the summer, a graph might be a good medium to
compare the number of each type of pet sold and the months of the year.
Inferential Statistics
Now, suppose you need to collect data on a very large population. For example, suppose you
want to know the average height of all the men in a city with a population of so many million
residents. It isn't very practical to try and get the height of each man.
This is where inferential statistics come into play. Inferential statistics make inferences about
populations using data drawn from the population. Instead of using the entire population to
gather the data, the statistician will collect a sample or samples from millions of residents and
make inferences about the entire population using the sample.
The sample is a set of data taken from the population to represent the population. Probability
distributions, hypothesis testing, correlation testing, and regression analysis all fall under the
category of inferential statistics.
To unlock this lesson you must be a Study.com Member. Create your account
Bryce S.
Student
United States
A textbook can only get you so far. The video aid provided by Study.com helps connect the dots
for a much deeper understanding.
Related Lessons
Related Courses
Next Lesson
Ch 2. Summary
Descriptive Statistics
Descriptive statistics give information that describes the data in some manner. For example,
suppose a pet shop sells cats, dogs, birds and fish. If 100 pets are sold, and 40 out of the 100
were dogs, then one description of the data on the pets sold would be that 40% were dogs.
This same pet shop may conduct a study on the number of fish sold each day for one month and
determine that an average of 10 fish were sold each day. The average is an example of
descriptive statistics.
Some other measurements in descriptive statistics answer questions such as 'How widely
dispersed is this data?', 'Are there a lot of different values?' or 'Are many of the values the
same?', 'What value is in the middle of this data?', 'Where does a particular data value stand with
respect with the other values in the data set?'
A graphical representation of data is another method of descriptive statistics. Examples of this
visual representation are histograms, bar graphs and pie graphs, to name a few. Using these
methods, the data is described by compiling it into a graph, table or other visual representation.
This provides a quick method to make comparisons between different data sets and to spot the
smallest and largest values and trends or changes over a period of time. If the pet shop owner
wanted to know what type of pet was purchased most in the summer, a graph might be a good
medium to compare the number of each type of pet sold and the months of the year.
Inferential Statistics
Now, suppose you need to collect data on a very large population. For example, suppose you
want to know the average height of all the men in a city with a population of so many million
residents. It isn't very practical to try and get the height of each man.
This is where inferential statistics comes into play. Inferential statistics makes inferences about
populations using data drawn from the population. Instead of using the entire population to
gather the data, the statistician will collect a sample or samples from the millions of residents and
make inferences about the entire population using the sample.
The sample is a set of data taken from the population to represent the population. Probability
distributions, hypothesis testing, correlation testing and regression analysis all fall under the
category of inferential statistics.
To unlock this lesson you must be a Study.com Member. Create your account
The field of statistics is divided into two major divisions: descriptive and
inferential. Each of these segments is important, offering different techniques
that accomplish different objectives. Descriptive statistics describe what is going
on in a population or data set. Inferential statistics, by contrast, allow scientists
to take findings from a sample group and generalize them to a larger population.
The two types of statistics have some important differences.
Descriptive Statistics
Descriptive statistics is the type of statistics that probably springs to most
people’s minds when they hear the word “statistics.” In this branch of statistics,
the goal is to describe. Numerical measures are used to tell about features of a set
of data. There are a number of items that belong in this portion of statistics, such
as:
Measures of central tendency capture general trends within the data and are
calculated and expressed as the mean, median, and mode. A mean tells scientists
the mathematical average of all of a data set, such as the average age at first
marriage; the median represents the middle of the data distribution, like the age
that sits in the middle of the range of ages at which people first marry; and, the
mode might be the most common age at which people first marry.
Measures of spread describe how the data are distributed and relate to each
other, including:
Measures of spread are often visually represented in tables, pie and bar charts,
and histograms to aid in the understanding of the trends within the data.
Inferential Statistics
Inferential statistics are produced through complex mathematical calculations
that allow scientists to infer trends about a larger population based on a study of
a sample taken from it. Scientists use inferential statistics to examine the
relationships between variables within a sample and then make generalizations
or predictions about how those variables will relate to a larger population.
Even though inferential statistics uses some similar calculations — such as the
mean and standard deviation — the focus is different for inferential statistics.
Inferential statistics start with a sample and then generalizes to a population.
This information about a population is not stated as a number. Instead, scientists
express these parameters as a range of potential numbers, along with a degree of
confidence.