Statistics is the area of study dealing with the presentation, analysis and interpretation of data. Facts or figures, collected with a definite purpose, are called data. Data can be presented graphically in the form of bar graphs, histograms and frequency polygons.
Central tendency means you have to find value at the "center". There are three ways of measuring central value.
Add up the numbers and divide by how many numbers.
Example: What is the mean of numbers: 3, 8, 5, 13, 20, 23?
The sum of these numbers is 72. There are 6 numbers. Mean is equal to 72/6 = 12.
Mean of Grouped Data
List all numbers in ascending order and select the middle one. If there are two middle numbers, take average of both.
Example: What is the median of 13, 7, 21, 23, 40, 23, 12, 29?
Arrange these numbers in order, you'll get: 7, 12, 13, 21, 23, 23, 29, 40. There are eight numbers, 21 and 23 are two middle numbers. Median will be average of both i.e. 22.
Median of Grouped Data
The Mode is the value that occurs most often. If more than one value occur most often, then there can be more than one Mode. When there are two modes it is called bimodal, when there are three or more modes is is called multimodal.
Example: Find the mode of 13, 7, 21, 23, 23, 40, 23, 12, 29?
Arrange these numbers in order and you will find 23 appears most often. Hence, mode is 23.
Mode of Grouped Data
When there is large amount of data, it is grouped together and frequency distribution table is created. A frequency distribution is defined when the following two information is specified:
The cumulative frequency of a class is the frequency obtained by adding the frequencies of all the classes preceding the given class.
Class Limit: The starting and end values of each class are called “lower limit” and “upper limit” of that class respectively.
Class Interval: The difference between the upper and lower boundary of a class is called the class interval or size of the class. It can also be defined as the difference between the lower or upper limits or boundaries of two consecutive classes.
Histogram: Pertaining to a frequency distribution, if the true limits of the classes are taken on the x-axis and the corresponding frequencies on the y-axis and adjacent rectangles are drawn, the diagram is called histogram.
Frequency Polygon and Frequency Curve: If the points pertaining to the mid values of the classes of a frequency distribution and the corresponding frequencies are plotted on a graph sheet and these points are joined by straight lines, the figure formed is called frequency polygon. If these points are joined by a smooth curve, the figure formed is called frequency curve.
Cumulative Frequency Curves: If the points pertaining to the boundaries of the classes of a frequency distribution and the corresponding cumulative frequencies are plotted on a graph sheet and they are joined by a smooth curve, the figure formed is called cumulative frequency curve.
In case of symmetrical distribution, Mean = Median = Mode
In case of a moderately asymmetrical distribution, Mode = 3 Median - 2 Mean
The dispersion of data is the measure of spreading (scatter) of the data about some central tendency. It is measured in the following types:
The Range is the difference between the lowest and highest values. It is the difference between two extreme observations of a distribution.
Range = Maximum value - Minimum value
Example: What is the range of 4, 6, 9, 3, 4, 11?
The lowest value is 3 and the highest value is 11. Range = 11-3 = 8.
After arranging the data in the ascending order of magnitude you find
QD = (Q3 - Q1)/2
The Standard Deviation is a measure of how spread out numbers are. It is represented by the greek letter sigma, σ. The standard deviation is the square root of the Variance.
Variance is defined as the average of the squared differences from the Mean. To calculate variance
To find the standard deviation, take square root of the variance.