S7-SA3-0203
What is the Interpretation of Variance?
Grade Level:
Class 12
AI/ML, Physics, Biotechnology, FinTech, EVs, Space Technology, Climate Science, Blockchain, Medicine, Engineering, Law, Economics
Definition
What is it?
Variance tells us how much the numbers in a dataset are spread out from their average (mean). A high variance means the numbers are very spread out, while a low variance means they are clustered close to the average.
Simple Example
Quick Example
Imagine two cricket teams. Team A scores 10, 100, 20 runs in three matches. Team B scores 40, 45, 35 runs. Both teams have an average score of 43.3 runs. Team A's scores are very different from each other (high variance), while Team B's scores are quite similar (low variance).
Worked Example
Step-by-Step
Let's find the variance for the daily mobile data usage (in GB) of a student: 1 GB, 2 GB, 1 GB, 4 GB, 2 GB.
Step 1: Find the Mean (Average). Sum of data = 1+2+1+4+2 = 10. Number of data points = 5. Mean = 10 / 5 = 2 GB.
---Step 2: Subtract the Mean from each data point and square the result.
(1-2)^2 = (-1)^2 = 1
(2-2)^2 = (0)^2 = 0
(1-2)^2 = (-1)^2 = 1
(4-2)^2 = (2)^2 = 4
(2-2)^2 = (0)^2 = 0
---Step 3: Sum these squared differences. Sum = 1 + 0 + 1 + 4 + 0 = 6.
---Step 4: Divide the sum by the number of data points (for population variance) or (number of data points - 1) for sample variance. For simplicity, let's use the number of data points here (N). Variance = 6 / 5 = 1.2.
---Answer: The variance of the mobile data usage is 1.2 GB^2.
Why It Matters
Understanding variance is crucial in many fields. In AI/ML, it helps models understand data spread. In FinTech, it's used to measure investment risk. Engineers use it to ensure product quality, like making sure car parts have consistent sizes. Data scientists and researchers use it daily to make sense of information.
Common Mistakes
MISTAKE: Confusing variance with standard deviation. | CORRECTION: Variance is the average of the squared differences from the mean. Standard deviation is the square root of the variance, making it easier to interpret in the original units of the data.
MISTAKE: Forgetting to square the differences from the mean. | CORRECTION: Squaring ensures all differences are positive and gives more weight to larger differences, which is key to how variance measures spread.
MISTAKE: Using N-1 in the denominator for population variance. | CORRECTION: Use N (total number of data points) in the denominator when calculating population variance. Use N-1 when calculating sample variance (when your data is only a part of a larger group).
Practice Questions
Try It Yourself
QUESTION: If the scores of students in a math test are 80, 85, 90, 95, 100, what is the mean score? | ANSWER: Mean = (80+85+90+95+100)/5 = 450/5 = 90.
QUESTION: Two auto-rickshaw drivers record their daily earnings (in Rupees) for 3 days. Driver A: 500, 510, 520. Driver B: 400, 500, 600. Which driver has more consistent earnings (lower variance)? | ANSWER: Driver A's earnings are more consistent (lower variance). (Mean for both is 510. For A: (500-510)^2 + (510-510)^2 + (520-510)^2 = 100 + 0 + 100 = 200. Variance = 200/3 = 66.67. For B: (400-510)^2 + (500-510)^2 + (600-510)^2 = 12100 + 100 + 8100 = 20300. Variance = 20300/3 = 6766.67).
QUESTION: The number of samosas sold at a stall each hour for 4 hours was 10, 12, 8, 14. Calculate the variance of samosas sold. | ANSWER: Mean = (10+12+8+14)/4 = 44/4 = 11. Squared differences: (10-11)^2=1, (12-11)^2=1, (8-11)^2=9, (14-11)^2=9. Sum = 1+1+9+9 = 20. Variance = 20/4 = 5.
MCQ
Quick Quiz
What does a high variance value in a dataset indicate?
The data points are very close to each other.
The data points are widely spread out from the mean.
The mean of the data is very high.
There are no outliers in the data.
The Correct Answer Is:
B
A high variance value means that the individual data points are far from the average (mean), indicating a large spread. Low variance means they are close to the mean.
Real World Connection
In the Real World
Imagine you're buying vegetables from a local mandi. If one vendor's potato prices vary wildly day-to-day (e.g., 20 Rs/kg, then 50 Rs/kg, then 30 Rs/kg), their price variance is high. Another vendor might have very consistent prices (e.g., 25 Rs/kg, 26 Rs/kg, 24 Rs/kg), showing low variance. Consumers often prefer low variance in prices for predictability.
Key Vocabulary
Key Terms
MEAN: The average of a set of numbers | SPREAD: How far apart the numbers in a dataset are | DATASET: A collection of related data | STANDARD DEVIATION: The square root of variance, giving spread in original units | OUTLIER: A data point far away from other data points
What's Next
What to Learn Next
Next, you should explore 'Standard Deviation.' It's directly related to variance and is often preferred because it's in the same units as your original data, making it easier to understand the actual spread. Mastering these will boost your data analysis skills!


