S7-SA3-0358
What is a Chi-Square Test (Introductory)?
Grade Level:
Class 12
AI/ML, Physics, Biotechnology, FinTech, EVs, Space Technology, Climate Science, Blockchain, Medicine, Engineering, Law, Economics
Definition
What is it?
The Chi-Square Test (pronounced 'kai-square') is a statistical tool used to see if there's a significant difference between what we expect to happen and what actually happens. It helps us decide if observed results are just by chance or if there's a real pattern.
Simple Example
Quick Example
Imagine you expect your favourite cricket team to win 8 out of 10 matches. But after 10 matches, they only win 5. A Chi-Square Test can help you figure out if this difference (expected 8 vs. actual 5 wins) is just bad luck or if something has genuinely changed with the team's performance.
Worked Example
Step-by-Step
Let's say a school principal expects 50% of students to choose Science, 30% Commerce, and 20% Arts. Out of 200 students, 120 chose Science, 60 Commerce, and 20 Arts. Is this difference significant?
---Step 1: Calculate Expected Frequencies.
Science: 50% of 200 = 100 students
Commerce: 30% of 200 = 60 students
Arts: 20% of 200 = 40 students
---Step 2: Note Observed Frequencies.
Science: 120 students
Commerce: 60 students
Arts: 20 students
---Step 3: Calculate (Observed - Expected)^2 / Expected for each category.
Science: (120 - 100)^2 / 100 = (20)^2 / 100 = 400 / 100 = 4
Commerce: (60 - 60)^2 / 60 = (0)^2 / 60 = 0 / 60 = 0
Arts: (20 - 40)^2 / 40 = (-20)^2 / 40 = 400 / 40 = 10
---Step 4: Sum these values to get the Chi-Square value.
Chi-Square = 4 + 0 + 10 = 14
---Answer: The calculated Chi-Square value is 14.
Why It Matters
This test is super useful in many fields! In medicine, doctors use it to see if a new drug works better than an old one. In marketing, companies check if a new ad campaign changes customer choices. Data scientists and researchers use it to make important decisions based on data, helping improve everything from climate models to financial predictions.
Common Mistakes
MISTAKE: Using percentages directly in the calculation instead of actual counts. | CORRECTION: Always convert percentages or proportions into the actual number of observations (frequencies) before starting the calculation.
MISTAKE: Assuming a high Chi-Square value always means the difference is 'good'. | CORRECTION: A high Chi-Square value means there's a significant difference between observed and expected. Whether that difference is 'good' or 'bad' depends on the specific problem you're studying.
MISTAKE: Applying the Chi-Square test when expected frequencies are very low (e.g., less than 5). | CORRECTION: The Chi-Square test works best when expected frequencies in each category are reasonably large. If they are too small, the results might not be reliable.
Practice Questions
Try It Yourself
QUESTION: A coin is tossed 100 times. You expect 50 heads and 50 tails. You get 60 heads and 40 tails. Calculate the Chi-Square value. | ANSWER: Heads: (60-50)^2/50 = 100/50 = 2. Tails: (40-50)^2/50 = 100/50 = 2. Total Chi-Square = 2 + 2 = 4.
QUESTION: A survey expects 70% of people to prefer tea and 30% coffee. Out of 200 people, 130 prefer tea and 70 prefer coffee. Calculate the Chi-Square value. | ANSWER: Expected Tea: 0.70 * 200 = 140. Expected Coffee: 0.30 * 200 = 60. Tea: (130-140)^2/140 = (-10)^2/140 = 100/140 = 0.714. Coffee: (70-60)^2/60 = (10)^2/60 = 100/60 = 1.667. Total Chi-Square = 0.714 + 1.667 = 2.381 (approx).
QUESTION: A mobile company claims 40% of its users are in Mumbai, 30% in Delhi, and 30% in Bengaluru. A random check of 500 users finds 180 in Mumbai, 170 in Delhi, and 150 in Bengaluru. Calculate the Chi-Square value. What does a higher value generally indicate? | ANSWER: Expected Mumbai: 0.40 * 500 = 200. Expected Delhi: 0.30 * 500 = 150. Expected Bengaluru: 0.30 * 500 = 150. Mumbai: (180-200)^2/200 = (-20)^2/200 = 400/200 = 2. Delhi: (170-150)^2/150 = (20)^2/150 = 400/150 = 2.67. Bengaluru: (150-150)^2/150 = 0. Total Chi-Square = 2 + 2.67 + 0 = 4.67. A higher Chi-Square value generally indicates a larger difference between the observed and expected results, meaning the observed results are less likely due to random chance.
MCQ
Quick Quiz
What does a Chi-Square test primarily help us to compare?
Mean values of two groups
Observed frequencies with expected frequencies
Standard deviations of a dataset
The sum of all data points
The Correct Answer Is:
B
The Chi-Square test is specifically designed to compare observed frequencies (what actually happened) with expected frequencies (what we thought would happen). It assesses if the differences are statistically significant.
Real World Connection
In the Real World
Imagine a food delivery app like Swiggy or Zomato wants to know if their new discount offer actually makes more people order biryani than before. They can use a Chi-Square test to compare the number of biryani orders before the offer (expected) with the number of orders after the offer (observed) to see if the discount made a real difference or if it was just a random fluctuation.
Key Vocabulary
Key Terms
OBSERVED FREQUENCY: The actual count of how many times something happened. | EXPECTED FREQUENCY: The predicted count of how many times something should happen based on a theory or hypothesis. | STATISTICAL SIGNIFICANCE: A result is statistically significant if it is unlikely to have occurred by chance. | HYPOTHESIS: An educated guess or assumption that can be tested.
What's Next
What to Learn Next
Great job learning about the Chi-Square test! Next, you can explore 'Degrees of Freedom' and 'P-value'. These concepts will help you understand how to interpret your Chi-Square value and truly decide if your observed results are statistically significant or just random luck.


