Chapter 16 – My Economics

Chapter 16 :-

Plus One Economics Notes on Chapter 16 Measures of Dispersion

Introduction

If we analyse two or more observations the central value may be the same but still there can be wide disparities in the formation of the distribution. For example, the AM of 2, 5 and 8 is 5; AM of 4, 5 and 6 is 5; AM of 1, 2 and 12 is 5; AM of 0, 1 and 14 is 5. Measures of dispersion will help us in understanding the important characteristics of a distribution.

This is explained with the help of another example.

Runs scored by three batsmen in a series of 5 one day matches are as given below:

Table 6.1 Cricket Scores
Days	Batsman 1	Batsman 2	Batsman 3
1	100	70	0
2	100	80	0
3	100	100	300
4	100	120	180
5	100	130	20
Total	500	500	500
Mean	100	100	100

Since the average is the same in all the three cases, one Is likely to conclude that these three batsmen are alike, But a close examination shall reveal that the distributions differ widely from one another. In case of the first batsman, each and every item is perfectly represented by the AM and there is no dispersion. In case of the second batsman, only one item is perfectly represented by the AM and the other items vary, but the variation is not too much. In case of the third batsman, not a single item is represented by the AM. All the items vary, and the variation is too large. Here we can see that the first batsman is consistent, while the third is inconsistent.

Now it is quite obvious that averages try to tell only the representative size of a distribution. To understand it better, we need to know the spread of various items also. So in order to express the data correctly, it becomes necessary to describe the deviation of the observations from the central value. This deviation of items-from the central value is called dispersion.

” The degree to which numerical data tend to spread about an average value is called the variation or dispersion of the data.” – Spiegel

The word dispersion means deviation or difference. In statistics, dispersion refers to deviation of various items of the series from its central value. Dispersion is the degree to which a numerical data tend to spread about an average value. Measure of dispersion is the method of measuring the dispersion or deviation of the different values from a designated value of the series. These measure, are also called averages of second order as they are averages of deviation taken from an average.

Objects of measuring variation

Measures of dispersion are useful in following respects:

To test the reliability of an average: Measures of dispersion enable us to know whether an average is really representative of the series. If the variability in the values of various items in a series is large the average is not so typical. On the other hand, if the variability is small, the average would be a representative value.
To serve as a basis for the control of the variability: A study of dispersion helps in identifying the causes of variability and in taking remedial measures.
To compare the variability of two or more series: We can compare the variability of two or more series by calculating relative measures of dispersion. The higher the degree of variability the lesser is the consistency or uniformity and vice versa.
To serve as a basis for further statistical analysis: Many powerful analytical tools in statistics such as correlation, regression, testing of hypothesis, analysis of fluctuations in time series, techniques of production control, cost control, etc., are based on measures of dispersion.

Methods of studying Dispersion

The following are the important methods:

Range
Quartile Deviation
Mean Deviation
Standard Deviation
Lorenz Curve

Range and quartile deviation measure the dispersion by calculating the spread within which the values lie. Mean deviation and standard deviation calculate the extent to which the values differ from the average. Lorenz curve is a graphical method of finding dispersion.

Absolute and Relative Measures of Dispersion

Absolute measures of dispersion are expressed in the same statistical unit in which the original data are given. In case two sets of data are expressed in different units, absolute measures of dispersion are not comparable. In such cases, relative measures are used.

A measure of relative dispersion is the ratio of measure of absolute dispersion to an appropriate average. It is also called coefficient of dispersion, as it is independent of the unit.

Range

Range is the simplest method of studying dispersion. It is the difference between the highest and the lowest values in a series.

$$ Range = L – S $$

where L= largest item; S = smallest item.

The relative measure corresponding to range, called the coefficient of range is obtained by applying the following formula:

$$ Coefficient \,of \,Range \,= \,{{\frac{L – S }{L + S}}} $$

Individual Series

Let us find Range and Coefficient of Range. The profits of a company for the last 8 years are given below.

Table 6.2
Year	Profit (in 000 Rs)
1985	40
1986	30
1987	80
1988	100
1989	115
1990	85
1991	210
1992	230

$$ Range = L – S $$

Here L = 230; S = 30.

Range = 230 – 30 = 200

$$ Coefficient \,of \,Range \,= \,{{\frac{L – S }{L + S}}} $$ $$ = \,{{\frac{230 – 30 }{230 + 30}}} $$ $$ = \,{{\frac{200 }{260}}} $$ $$ = \,{{0.77}} $$

Discrete Series

Let us find Range and Coefficient of Range for a discrete series.

Table 6.3
Size	Frequency
5	7
10	8
15	12
20	16
25	21
30	17
35	12
40	4

In order to find Range and Coefficient of Range, we should take the highest and the lowest values of size of items.

$$ Range = L – S $$

Here L = 40; S = 5.

Range = 40 – 5 = 35

$$ Coefficient \,of \,Range \,= \,{{\frac{L – S }{L + S}}} $$ $$ = \,{{\frac{40 – 5 }{40 + 5}}} $$ $$ = \,{{\frac{35}{45}}} $$ $$ = \,{{0.78}} $$

Continuous Series

For continuous series, range is calculated either by subtracting the lower limit of the lowest class from the upper limit of the highest class or by subtracting the mid-value of the lowest class from the midvalue of the highest class.

Let us find the range and coefficient of range of the following series:

Table 6.4
Daily Wage	Number of Workers
80 – 100	12
100 – 120	18
120 – 140	24
140 – 160	27
160 – 180	32
180 – 200	20

$$ Range = L – S $$

Here L = 200; S = 80.

Range = 200 – 80 = 120

$$ Coefficient \,of \,Range \,= \,{{\frac{L – S }{L + S}}} $$ $$ = \,{{\frac{200 – 80 }{200 + 80}}} $$ $$ = \,{{\frac{120}{280}}} $$ $$ = \,{{0.43}} $$

Let us find the range and coefficient of range of the following series where only midpoints are given:

Table 6.5
Class midpoints	Frequency
2	3
5	5
8	6
11	8
14	6
17	4
20	1

$$ Range = L – S $$

Here L = 20; S = 2.

Range = 20 – 2 = 18

$$ Coefficient \,of \,Range \,= \,{{\frac{L – S }{L + S}}} $$ $$ = \,{{\frac{20 – 2 }{20 + 2}}} $$ $$ = \,{{\frac{18}{22}}} $$ $$ = \,{{0.82}} $$

MERITS OF RANGE

Easy to compute.
It gives the maximum spread of data.
Easy to understand.

DEMERITS OF RANGE

It is affected greatly by sampling fluctuations.
It is not based on all the observations.
It cannot be used in case of open-end distribution.

Quartile Deviation

We have seen that range is the simplest to understand and easiest to compute. But range as a measure of dispersion has certain limitations. The presence of even one extreme item (high or low) in a distribution can reduce the utility of range as a measure of dispersion. Since it is based on two extreme items (highest and lowest) it fails to take into account the scatter within the range. Hence we need a measure of dispersion to overcome these limitations of range. Such a measure of dispersion is called quartile deviation. In the previous chapter we studied quartiles. Quartiles are those values which divide the series into four equal parts. Hence we have three quartiles-Q₁, Q₂, and Q₃. Q₁ is the lower quartile wherein $ { \frac{{1}}{{4}}} $^th of the total observations lie below it and $ { \frac{{3}}{{4}}} $^th above it. Q₂ is same as median which divides the series into two equal parts. Q₃ is the upper quartile, $ { \frac{{3}}{{4}}} $^th of the value falls below it and $ { \frac{{1}}{{4}}} $^th above.

We have already studied the value of Q₁ and Q₃ for individual, discrete and continuous series, hence not repeated.

Upper and lower quartile ( Q₁ and Q₃ ) are used to calculate inter-quartile range.

$$ \mathbf {Inter-quartile\, range \,= Q_3\,-\,Q_1} $$ Half of inter-quartile range is called quartile deviation.

Quartile deviation (semi inter-quartile range) is defined as half the distance between the third and first quartiles.

Quartile Deviation and inter quartile range are absolute measures of dispersion. The relative measure is coefficient of Quartile Deviation (Q.D)