Measures of Central Tendency
Measures of central tendency describe a set of data by identifying the central position in the data set as a single representative value.
We come across new data every day. We find them in newspapers, articles, in our bank statements, mobile and electricity bills. Now the question arises whether we can figure out some important features of the data by considering only certain representatives of the data.
This is possible by using measures of central tendency. In the following sections, we will look at the different measures of central tendency. We will also learn how to calculate them and under what situations they are most appropriate to be used. We can think of it as a tendency of data to cluster around a middle value. In statistics, the three most common measures of central tendencies are mean, median, and mode. Let’s begin by understanding the meaning of mean, median, and mode along with an example to support our understanding.
What Are Measures of Central Tendency?
Measures of central tendency are the values that describe a data set by identifying the central position of the data. There are 3 main measures of central tendency  Mean, Median and Mode.
Let us study about the measures of central tendency, their formulas, usage and types in detail below:
Mean
The mean (often called the average) is most likely the measure of central tendency that you are most familiar with. It is also known as average. Mean is simply the sum of all the components in a group or collection, divided by the number of components.
It is denoted by x̄, pronounced “x bar”.
Mean = Sum of the terms/ Number of terms
Example:
To understand the definition, let us look at the weights of 8 boys in kilograms: 45, 39, 53, 45, 43, 48, 50, 45. So, in the above example, there are 8 boys.
Therefore, the average of the group:
Average = Sum of the weights/Number of boys
= (45 + 39 + 53 + 45 + 43 + 48 + 50 + 45)/8
= 368/8
= 46
Thus, the average weight of the group is 46 kilograms.
Let us now see how to calculate the mean for different types of data along with an example.
Case 1:
So, if we have n values in a data set and they have values x_{1},x_{2}, …,x_{n}, the sample mean, usually denoted by x― (pronounced "x bar"), is:
x̅ = x1 + x2 + ...... +xn/n
This formula is usually written in a slightly different manner using the Greek capital letter, ∑, pronounced "sigma", which means "sum of...":
x̅ = ∑x/n
You may have noticed that the above formula refers to the sample mean. So, why have we called it a sample mean? This is because, in statistics, samples and populations have very different meanings and these differences are very important, even if, in the case of the mean, they are calculated in the same way. To acknowledge that we are calculating the population mean and not the sample mean, we use the Greek lower case letter "mu", denoted as μ:
μ = ∑x/n
Example: If the heights of 5 people are 142 cm, 150 cm, 149 cm, 156 cm, and 153 cm.
Find the mean height.
Mean height
x̅ = 142+150+149+156+153/5
= 750/5
=150
Case 2:
Let there be n number of items in a list x1, x2, x3, … , xn. Let the frequency of each item be f1, f2, f3, … , fn respectively. The mean can be calculated using the formula given below.
x̅ = f1x1 + f2x2 + f3x3 +.....+ fnxn / f1 + f2 + f3 + .....+ fn
OR
x̅ = ∑fixi/n
Consider the following example.
Example:
Find the mean of the following distribution:
x  4  6  9  10  15 
f  5  10  10  7  8 
Solution:
Calculation table for arithmetic mean:
x_{i} 
f_{i} 
x_{i}f_{i} 
4  5  20 
6  10  60 
9  10  90 
10  7  70 
15  8  120 
∑fi=40  ∑xifi=360 
∴ Mean=¯x= ∑xifi/∑fi = 360/40 = 9
∴ Mean = 9
Case 3:
When the items in a list are written in the form of a range, for example, 1020, we need to first calculate the class mark.
Class Mark = Upper Limit + Lower Limit / 2
Then, the mean can be calculated using the formula given below, where xi will be the classmark for each item.
Example:
Here is an example where the data is in the form of class intervals. The following table indicates the data on the number of patients visiting a hospital in a month. Find the average number of patients visiting the hospital in a day.
Number of patients 
Number of days visiting hospital 
010  2 
1020  6 
2030  9 
3040  7 
4050  4 
5060  2 
Solution
In this case, we find the classmark (also called as midpoint of a class) for each class.
Classmark = lower limit + upper limit/2
Let x_{1}, x_{2}, x_{3} ……x_{n} be the class marks of the respective classes.
Hence, we get the following table
Classmark (x_{i})  frequency (f_{i})  x_{i}f_{i} 
5  2  10 
15  6  90 
25  9  225 
35  7  245 
45  4  180 
55  2  110 
Total  ∑fi = 30  ∑fixi = 860 
Mean = x̅ = ∑fixi/∑fi
= 860/30
= 28.67
When not to use the mean
The mean has one main disadvantage: it is particularly sensitive to outliers. These are values that are unusually larger or smaller compared to the rest of the data. For example, consider the salary of staff at a factory below:
Staff  1  2  3  4  5  6  7  8  9  10 
Salary  15k  18k  16k  14k  15k  15k  12k  17k  90k  95k 
The mean salary for these ten staff is $30.7k. However, the raw data suggests that this mean value does not accurately reflect the typical salary of a worker, because most workers have salaries in the $12k to 18k range. Thus the mean is being skewed by the two large salaries. Therefore, in this situation, we would like to have a better measure of central tendency. As we will find out later, taking the median would be a better measure of central tendency in this situation.
Median
The value of the middlemost observation that is obtained after arranging the data in ascending order is called the median of the data. The advantage of using the median as a central tendency is that it is less affected by outliers and skewed data. To calculate the median, let us suppose we have the data below:
65  55  89  56  35  14  56  55  87  45  92 
Firstly, we need to rearrange that data into ascending order:
14  35  45  55  55  56  56  65  87  89  92 
The median mark will be the middle mark  here, 56 (highlighted in bold). It is the middle mark because it lies in the exact center as there are 5 scores before it and 5 scores after it. This works very well when we have an odd number of scores, but what when we have an even number of scores? What if you had 10 scores? Well, then we simply take the middle two scores and find their average. Let us look at the example below:
65  55  89  56  35  14  56  55  87  45 
Rearranging that data into ascending order:
14  35  45  55  55  56  56  65  87  89 
We now take the 5^{th} and 6^{th} score in our data set and average them. We get a median of 55.5.
Let us now learn how to calculate the median for different types of data along with a supporting example.
Case 1: Ungrouped Data
Step 1: Arrange the data in ascending or descending order.
Step 2: Let the total number of observations be \(n\).
To find the median, we need to consider if \(n\) is even or odd.
If \(n\) is odd, then use the formula:
Median = [(n+1)/2]^{th} observation
Example:
Let's consider the data: 56, 67, 54, 34, 78, 43, 23. What is the median?
For finding the mean, arrange the data in ascending order: 23, 34, 43, 54, 56, 67, 78.
Here, n (no.of observations) = 7
So,
Median = (7 + 1)/2 = 4^{th} observation
Median = 54
Case 2: Grouped Data
If \(n\) is even, then use the formula:
Median = \(\dfrac {\dfrac{n}{2}^{th}\text {obs.} + (\dfrac{n}{2}+1)^{th}\text {obs.}}{2}\)
Step 1: Find the median class.
When the data is continuous and in the form of a frequency distribution, the median is found as shown below:
Let n = total number of observations i.e. ∑fi
Note: Median Class is the class where n/2 lies.
Step 2: Use the following formula to find the median.
Median = l + [(n/2c)/f] × h
c = cumulative frequency of the class preceding the median class where,
l = lower limit of the median class
f = frequency of the median class
h = class size
Example:
Find the mode of the given data:
Marks Obtained  020  2040  4060  6080  80100 
Number of students  5  10  12  6  3 
Solution
The highest frequency \(=\) 12, so the modal class is 4060.
And,
 l = lower limit of modal class = 40
 fm = frequency of modal class =12
 f1= frequency of class preceding modal class = 10
 f2 = frequency of class succeeding modal class = 6
 h = class width \(=\) 20
Using the mode formula,
Mode = l + [(fmf1)(2fmf1f2)] × h
= 40+[(1210)(2 × 12  106)] × 20
= 40+[(2/8) ] × 20
= 45
Mode = 45
Mode
The value which appears most often in the given data i.e. the observation with the highest frequency is called the mode of data.
Case 1: Ungrouped Data
For ungrouped data, we just need to identify the observation which occurs maximum times.
Mode = Observation with maximum frequency
For example in the data: 6, 8, 9, 3, 4, 6, 7, 6, 3 the value 6 appears the most number of times. Thus, mode = 6. An easy way to remember mode is: Most Often Data Entered. Depending upon the number of modes the data has, it can be called unimodal, bimodal, trimodal, or multimodal. The example discussed above has only 1 mode, so it is unimodal. Note: A data may have no mode, 1 mode, or more than 1 mode.
Case 2: Grouped Data
When the data is continuous, the mode can be found using the following steps:
Step 1: Find modal class i.e. the class with maximum frequency.
Step 2: Find mode using the following formula:
Mode = l + [(fmf1)/ (2fmf1f2)] × h
where,
 l = lower limit of modal class,
 fm = frequency of modal class,
 f1= frequency of class preceding modal class,
 f2 = frequency of class succeeding modal class,
 h = class width
Consider the following example to understand the formula.
Example 1
Find the mode of the given data:
Marks Obtained  020  2040  4060  6080  80100 
Number of students  5  10  12  6  3 
Solution
The highest frequency \(=\) 12, so the modal class is 4060.
And,
 l = lower limit of modal class = 40
 fm = frequency of modal class =12
 f1 = frequency of class preceding modal class = 10
 f2 = frequency of class succeeding modal class = 6
 h= class width = 20
Using the mode formula,
Mode = l + [(fmf1)/(2fmf1f2)] × h
= 40+[(1210)/(2 × 12  106) ] × 20
= 40+[2/8] × 20
= 45
Mode = 45
Empirical Relation Between Measures of Central Tendency
The three measures of central tendency i.e. mean, median, and mode are closely connected by the following relations (called an empirical relationship).
2Mean + Mode = 3Median
For instance, if we are asked to calculate the mean, median, and mode of continuous grouped data, then we can calculate mean and median using the formulae as discussed in the previous sections and then find mode using the empirical relation.
Example: We have data with mode 65 and a median of 61.6, then, we can find the mean using the above relation.
2Mean + Mode = 3Median
2Mean = 3Median  Mode
2Mean = 3 × 61.6  65
2Mean = 119.8
Mean = 119.8/2 = 59.9
Difference between Mean and Average
The term average is frequently used in everyday life to denote a value that is typical for a group of quantities. Average rainfall in a month or the average age of employees of an organization are typical examples.
We might read an article stating "People spend an average of 2 hours every day on social media." We understand from the use of the term average that not everyone is spending 2 hours a day on social media but some spend more time and some less. However, we can understand from the term average that 2 hours is a good indicator of the amount of time spent on social media per day.
Most people use average and mean interchangeably even though they are not the same.
 Average is the value that indicates what is most likely to be expected.
 They help to summarise large data into a single value.
An average tends to lie centrally with the values of the observations arranged in ascending order of magnitude. So, we call an average measure of the central tendency of the data. Averages are of different types. What we refer to as mean i.e. the arithmetic mean is one of the averages. Mean is called the mathematical average whereas median and mode are positional averages.
Difference between Mean and Median
The mean is known as the mathematical average whereas the median is known as the positional average. To understand the difference between the two, consider the following example.
A department of an organization has 5 employees which include a supervisor and four executives. The executives draw a salary of ₹10,000 per month while the supervisor gets ₹40,000.
Mean=(10000+10000+10000+10000+40000)/5 =80000/5 = 16000
Thus, the mean salary is $16,000.
To find the median, we consider the ascending order: 10000, 10000, 10000, 10000, 40000.
n=5, so (n+1)(2) = 3.
Thus, the median is the 3rd observation. Median = 10000. Thus, the median is $10,000 per month.
Now let us compare the two measures of central tendencies.
We can observe that the mean salary of $16,000 does not give even an estimated salary of any of the employees whereas the median salary represents the data more effectively. One of the weaknesses of mean is that it gets affected by extreme values. Look at the following graph to understand how extreme values affect mean and median:
So, the mean is to be used when we don't have extremes in the data. If we have extreme points, then the median gives a better estimation.
Here's a quick summary of the differences between the two.
Mean Vs Median  

Mean  Median  
Definition  Average of given data (Mathematical Average)  The central value of data (Positional Average) 
Calculation  Add all values and divide by the total number of observations  Arrange data in ascending / descending order and find the middle value 
Values of data  Every value is considered for calculation  Every value is not considered 
Effect of extreme points  Greatly affected by extreme points  Doesn't get affected by extreme points 
Related Articles:
 Summary statistics
 Arithmetic Mean
 Data Handling and its Types
 Frequency Distribution
 Data Collection Methods
 Graphs in Statistics
 How to Find Median
 Mean and Variance
Important Notes
1. The three most common measures of central tendency are mean, median, and mode.
2. Mean is simply the sum of all the components in a group or collection, divided by the number of components.
3. The value of the middlemost observation obtained after arranging the data in ascending order is called the median of the data.
4. The value which appears most often in the given data i.e. the observation with the highest frequency is called the mode of data.
5. The three measures of central values i.e. mean, median and mode are closely connected by the following relations (called an empirical relationship): 2Mean + Mode = 3Median
Solved Examples:

Example 1: The mean monthly salary of 10 workers of a group is $1445. One more worker whose monthly salary is $1500 has joined the group. Find the mean monthly salary of 11 workers of the group using the measures of central tendency formula.
Solution:
Here, n=10, x̅ =1445
Using the formula,
x̅ = ∑fixi/n
Therefore ∑xi = x̅ × n
∑xi =1445 ×10
=14450
10 workers salary = $14450
11 workers salary = $14450 + 1500 = $15950
Average salary = 15950/11
=1450
Answer: Average salary of 11 workers = $1450 
Example 2:
Here is an example where the data is in the form of class intervals. The following table indicates the data on the number of patients visiting a hospital in a month. Find the average number of patients visiting the hospital in a day using the measures of central tendency formula.
Number of patients Number of days visiting hospital
010 2 1020 6 2030 9 3040 7 4050 4 5060 2 Solution:
In this case, we find the classmark (also called as midpoint of a class) for each class.
Note: Classmark = lower limit + upper limit / 2
Let x_{1}, x_{2}, x_{3} ……x_{n} be the class marks of the respective classes.
Hence, we get the following table
Classmark (x_{i}) frequency (f_{i}) x_{i}f_{i} 5 2 10 15 6 90 25 9 225 35 7 245 45 4 180 55 2 110 Total ∑fi=30 ∑fixi=860 ∴ Mean = x = ∑xifi / ∑fi = 860/30 = 28.67
Answer: Mean of patients visiting the hospital in a day = 28.67

Example 3:
A survey on the heights (in cm) of 50 girls of class X was conducted at a school and the following data was obtained:
Height (in cm) 120130 130140 140150 150160 160170 Total Number of girls 2 8 12 20 8 50 Find the mode and median of the above data using the measures of central tendency formula.
Solution:
Modal class= 150160
[as it has maximum frequency]
l =150, h &=10, fm =20, f1 =12, f2=8
Mode = l + [(fmf1)/(2fmf1f2)] × h
= 150 + [(20120/(2 × 20128)] × 10= 150 + 4
=154
To find the median, we need cumulative frequencies.
Consider the table:
Class Intervals No. of girls (f_{i}) Cumulative frequency (c) 120130 2 2 130140 8 2+8=10 140150 12 = f_{1} 10+12=22 (c) 150160 20 = f_{m} 22+20=42 160170 8 = f_{2} 42+8=50 (n) n = 50
n/2 = 25
Median class = 150160
l =150, c= 22, f=20, h= 10Median = l +[(n/2 c)/f] × h
= 150+[(50/2 22)/20] × 10= 150 + 1.5
= 151.5
Answer: Mode = 154, Median = 151.5
FAQs on Measures of Central Tendency
What Are the Measures of Central Tendency?
The most common measures of central tendency are the arithmetic mean, the median, and the mode.
What Are Measures of Central Tendency Examples?
Central tendency is a statistic that represents the single value of the entire population or a dataset. Some of the important examples of measures of central tendency include mode, median, arithmetic mean and geometric mean, etc.
What Is the Definition of Measures of Central Tendency?
A measure of central tendency is a single value that attempts to describe a set of data by identifying the central position within that set of data. As such, measures of central tendency are sometimes called measures of central location.
What Are Good Measures of Central Tendency?
The mean is the most frequently used measure of central tendency because it uses all values in the data set to give you an average. For data from skewed distributions, the median is better than the mean because it isn't influenced by extremely large values.
Where Can We Use Measures of Central Tendency in Our Daily Affairs?
Central tendency is very useful in psychology. It lets us know what is normal or 'average' for a set of data. It also condenses the data set down to one representative value, which is useful when you are working with large amounts of data.
What Is the Difference Between Mean and Median as measures of Central Tendency?
The mean is the average (or arithmetic mean) of the values of a data set, whereas the median is the middlemost value of the data.
How Do you Find the Measures of Central Tendency?
The measures of central tendency can be found using the formulas of mean, median or mode in most of the cases. As we know, the mean is the average of a given data set, the median is the middlemost data value and the mode represents the most frequently occurring data value in the set.