top of page
Inaugurated by IN-SPACe
ISRO Registered Space Tutor

S7-SA3-0235

What is the Total Sum of Squares?

Grade Level:

Class 12

AI/ML, Physics, Biotechnology, FinTech, EVs, Space Technology, Climate Science, Blockchain, Medicine, Engineering, Law, Economics

Definition
What is it?

The Total Sum of Squares (TSS) measures the total variation present in a set of data points around their average value. It tells us how much the individual data points differ from the overall mean of the entire dataset. Think of it as a way to quantify how 'spread out' your numbers are.

Simple Example
Quick Example

Imagine you are tracking the daily mobile data usage (in GB) of your family for 5 days: 1, 2, 1, 3, 3. The average usage is 2 GB. The Total Sum of Squares would tell you how much each day's usage varies from this average of 2 GB, added up for all days.

Worked Example
Step-by-Step

Let's find the Total Sum of Squares for the daily mobile data usage: 1, 2, 1, 3, 3 GB.

1. First, find the average (mean) of the data.
Mean = (1 + 2 + 1 + 3 + 3) / 5 = 10 / 5 = 2.
So, the average data usage is 2 GB.

---2. Now, subtract the mean from each data point.
(1 - 2) = -1
(2 - 2) = 0
(1 - 2) = -1
(3 - 2) = 1
(3 - 2) = 1

---3. Square each of these differences.
(-1)^2 = 1
(0)^2 = 0
(-1)^2 = 1
(1)^2 = 1
(1)^2 = 1

---4. Finally, add all these squared differences together.
Total Sum of Squares = 1 + 0 + 1 + 1 + 1 = 4.

---The Total Sum of Squares for this data is 4.

Why It Matters

Understanding Total Sum of Squares is crucial in fields like AI/ML, where it helps evaluate how well a model predicts outcomes. In Climate Science, it can measure variation in temperature data. Engineers use it to analyze data spread in experiments, helping them build better products or systems, making it a foundation for many exciting careers.

Common Mistakes

MISTAKE: Forgetting to square the differences before adding them up. | CORRECTION: Always square each difference (data point - mean) before summing them. This ensures positive values and gives more weight to larger differences.

MISTAKE: Calculating the mean incorrectly, which then makes all subsequent steps wrong. | CORRECTION: Double-check your mean calculation by summing all data points and dividing by the count of data points.

MISTAKE: Not subtracting the mean from each individual data point. | CORRECTION: The core idea is to find the variation FROM the average, so subtracting the mean from each point is a critical first step after finding the mean.

Practice Questions
Try It Yourself

QUESTION: Calculate the Total Sum of Squares for the following marks obtained in a test: 5, 7, 9. | ANSWER: Mean = (5+7+9)/3 = 7. Differences: (5-7)=-2, (7-7)=0, (9-7)=2. Squared differences: (-2)^2=4, (0)^2=0, (2)^2=4. TSS = 4+0+4 = 8.

QUESTION: A chai shop sold these number of cups over 4 hours: 20, 25, 15, 30. What is the Total Sum of Squares? | ANSWER: Mean = (20+25+15+30)/4 = 90/4 = 22.5. Differences: (20-22.5)=-2.5, (25-22.5)=2.5, (15-22.5)=-7.5, (30-22.5)=7.5. Squared differences: (-2.5)^2=6.25, (2.5)^2=6.25, (-7.5)^2=56.25, (7.5)^2=56.25. TSS = 6.25+6.25+56.25+56.25 = 125.

QUESTION: Five auto-rickshaws travelled these distances (in km) in an hour: 10, 12, 8, 10, 15. If a new auto-rickshaw travels 20 km, how does the Total Sum of Squares change? | ANSWER: Original Mean = (10+12+8+10+15)/5 = 55/5 = 11. Original Squared differences: (10-11)^2=1, (12-11)^2=1, (8-11)^2=9, (10-11)^2=1, (15-11)^2=16. Original TSS = 1+1+9+1+16 = 28. New Mean = (10+12+8+10+15+20)/6 = 75/6 = 12.5. New Squared differences: (10-12.5)^2=6.25, (12-12.5)^2=0.25, (8-12.5)^2=20.25, (10-12.5)^2=6.25, (15-12.5)^2=6.25, (20-12.5)^2=56.25. New TSS = 6.25+0.25+20.25+6.25+6.25+56.25 = 95.5. The TSS increases from 28 to 95.5.

MCQ
Quick Quiz

What is the first step in calculating the Total Sum of Squares?

Square each data point

Find the average (mean) of the data

Subtract 1 from each data point

Add all data points together

The Correct Answer Is:

B

The first step is always to find the average (mean) of the data. This mean is the central point from which we measure the variation of all other data points. Options A, C, and D are either later steps or incorrect operations.

Real World Connection
In the Real World

Total Sum of Squares is a foundational concept in data analysis. For example, in cricket analytics, experts use it to understand how much a player's score varies from their average, helping coaches identify consistent performers or areas for improvement. Data scientists at companies like Swiggy or Zomato might use similar concepts to understand variation in delivery times or customer ratings.

Key Vocabulary
Key Terms

VARIATION: How much data points differ from each other or from an average. | MEAN: The average of a set of numbers, calculated by summing all values and dividing by the count of values. | SQUARED DIFFERENCE: The result of subtracting the mean from a data point and then multiplying that result by itself. | DATASET: A collection of related data points.

What's Next
What to Learn Next

Once you understand Total Sum of Squares, you can explore 'Variance' and 'Standard Deviation'. These concepts build directly on TSS to give you even more detailed insights into the spread and consistency of data, which is super useful for understanding statistics and data science.

bottom of page