top of page
Inaugurated by IN-SPACe
ISRO Registered Space Tutor

S7-SA3-0358

What is a Chi-Square Test (Introductory)?

Grade Level:

Class 12

AI/ML, Physics, Biotechnology, FinTech, EVs, Space Technology, Climate Science, Blockchain, Medicine, Engineering, Law, Economics

Definition
What is it?

The Chi-Square Test (pronounced 'kai-square') is a statistical tool used to see if there's a significant difference between what we expect to happen and what actually happens. It helps us decide if observed results are just by chance or if there's a real pattern.

Simple Example
Quick Example

Imagine you expect your favourite cricket team to win 8 out of 10 matches. But after 10 matches, they only win 5. A Chi-Square Test can help you figure out if this difference (expected 8 vs. actual 5 wins) is just bad luck or if something has genuinely changed with the team's performance.

Worked Example
Step-by-Step

Let's say a school principal expects 50% of students to choose Science, 30% Commerce, and 20% Arts. Out of 200 students, 120 chose Science, 60 Commerce, and 20 Arts. Is this difference significant?
---Step 1: Calculate Expected Frequencies.
Science: 50% of 200 = 100 students
Commerce: 30% of 200 = 60 students
Arts: 20% of 200 = 40 students
---Step 2: Note Observed Frequencies.
Science: 120 students
Commerce: 60 students
Arts: 20 students
---Step 3: Calculate (Observed - Expected)^2 / Expected for each category.
Science: (120 - 100)^2 / 100 = (20)^2 / 100 = 400 / 100 = 4
Commerce: (60 - 60)^2 / 60 = (0)^2 / 60 = 0 / 60 = 0
Arts: (20 - 40)^2 / 40 = (-20)^2 / 40 = 400 / 40 = 10
---Step 4: Sum these values to get the Chi-Square value.
Chi-Square = 4 + 0 + 10 = 14
---Answer: The calculated Chi-Square value is 14.

Why It Matters

This test is super useful in many fields! In medicine, doctors use it to see if a new drug works better than an old one. In marketing, companies check if a new ad campaign changes customer choices. Data scientists and researchers use it to make important decisions based on data, helping improve everything from climate models to financial predictions.

Common Mistakes

MISTAKE: Using percentages directly in the calculation instead of actual counts. | CORRECTION: Always convert percentages or proportions into the actual number of observations (frequencies) before starting the calculation.

MISTAKE: Assuming a high Chi-Square value always means the difference is 'good'. | CORRECTION: A high Chi-Square value means there's a significant difference between observed and expected. Whether that difference is 'good' or 'bad' depends on the specific problem you're studying.

MISTAKE: Applying the Chi-Square test when expected frequencies are very low (e.g., less than 5). | CORRECTION: The Chi-Square test works best when expected frequencies in each category are reasonably large. If they are too small, the results might not be reliable.

Practice Questions
Try It Yourself

QUESTION: A coin is tossed 100 times. You expect 50 heads and 50 tails. You get 60 heads and 40 tails. Calculate the Chi-Square value. | ANSWER: Heads: (60-50)^2/50 = 100/50 = 2. Tails: (40-50)^2/50 = 100/50 = 2. Total Chi-Square = 2 + 2 = 4.

QUESTION: A survey expects 70% of people to prefer tea and 30% coffee. Out of 200 people, 130 prefer tea and 70 prefer coffee. Calculate the Chi-Square value. | ANSWER: Expected Tea: 0.70 * 200 = 140. Expected Coffee: 0.30 * 200 = 60. Tea: (130-140)^2/140 = (-10)^2/140 = 100/140 = 0.714. Coffee: (70-60)^2/60 = (10)^2/60 = 100/60 = 1.667. Total Chi-Square = 0.714 + 1.667 = 2.381 (approx).

QUESTION: A mobile company claims 40% of its users are in Mumbai, 30% in Delhi, and 30% in Bengaluru. A random check of 500 users finds 180 in Mumbai, 170 in Delhi, and 150 in Bengaluru. Calculate the Chi-Square value. What does a higher value generally indicate? | ANSWER: Expected Mumbai: 0.40 * 500 = 200. Expected Delhi: 0.30 * 500 = 150. Expected Bengaluru: 0.30 * 500 = 150. Mumbai: (180-200)^2/200 = (-20)^2/200 = 400/200 = 2. Delhi: (170-150)^2/150 = (20)^2/150 = 400/150 = 2.67. Bengaluru: (150-150)^2/150 = 0. Total Chi-Square = 2 + 2.67 + 0 = 4.67. A higher Chi-Square value generally indicates a larger difference between the observed and expected results, meaning the observed results are less likely due to random chance.

MCQ
Quick Quiz

What does a Chi-Square test primarily help us to compare?

Mean values of two groups

Observed frequencies with expected frequencies

Standard deviations of a dataset

The sum of all data points

The Correct Answer Is:

B

The Chi-Square test is specifically designed to compare observed frequencies (what actually happened) with expected frequencies (what we thought would happen). It assesses if the differences are statistically significant.

Real World Connection
In the Real World

Imagine a food delivery app like Swiggy or Zomato wants to know if their new discount offer actually makes more people order biryani than before. They can use a Chi-Square test to compare the number of biryani orders before the offer (expected) with the number of orders after the offer (observed) to see if the discount made a real difference or if it was just a random fluctuation.

Key Vocabulary
Key Terms

OBSERVED FREQUENCY: The actual count of how many times something happened. | EXPECTED FREQUENCY: The predicted count of how many times something should happen based on a theory or hypothesis. | STATISTICAL SIGNIFICANCE: A result is statistically significant if it is unlikely to have occurred by chance. | HYPOTHESIS: An educated guess or assumption that can be tested.

What's Next
What to Learn Next

Great job learning about the Chi-Square test! Next, you can explore 'Degrees of Freedom' and 'P-value'. These concepts will help you understand how to interpret your Chi-Square value and truly decide if your observed results are statistically significant or just random luck.

bottom of page