top of page
Inaugurated by IN-SPACe
ISRO Registered Space Tutor

S3-SA3-0245

What is a Positively Skewed Distribution?

Grade Level:

Class 8

AI/ML, Data Science, Physics, Economics, Cryptography, Computer Science, Engineering

Definition
What is it?

A positively skewed distribution is a type of data distribution where most of the data points are clustered towards the left side (lower values), and the 'tail' of the distribution extends towards the right side (higher values). This means the average (mean) is usually greater than the middle value (median) because the higher values on the right pull the mean in that direction.

Simple Example
Quick Example

Imagine a small village where most people earn a moderate income, but a few rich landowners earn a very high income. If you plot everyone's income, most data points will be on the left (lower income), and a long 'tail' will stretch to the right because of those few high incomes. This is a positively skewed distribution.

Worked Example
Step-by-Step

Let's look at the scores (out of 10) of 10 students in a surprise quiz:
2, 3, 4, 4, 5, 6, 7, 8, 9, 100 (one student got an extra credit!)
---1. Arrange the data in ascending order: 2, 3, 4, 4, 5, 6, 7, 8, 9, 100
---2. Calculate the Mean (Average): Sum of all scores / Number of scores
(2+3+4+4+5+6+7+8+9+100) / 10 = 148 / 10 = 14.8
---3. Find the Median (Middle value): Since there are 10 values (an even number), the median is the average of the 5th and 6th values.
(5 + 6) / 2 = 11 / 2 = 5.5
---4. Compare Mean and Median: Mean (14.8) > Median (5.5)
---5. Observe the data: Most scores are low (2 to 9), but one very high score (100) pulls the average up, creating a 'tail' on the right.
This shows it's a positively skewed distribution.

Why It Matters

Understanding skewed data is crucial in fields like AI/ML, where it helps in making fair predictions, and in Economics, to analyze income disparities. Data scientists and financial analysts use this to interpret market trends and customer behavior, helping businesses make better decisions.

Common Mistakes

MISTAKE: Thinking 'positive' means good or high values are more frequent. | CORRECTION: 'Positive' skew refers to the direction of the tail, which is towards the higher (positive) numbers, not the frequency of high values.

MISTAKE: Confusing the mean and median values in a skewed distribution. | CORRECTION: In a positively skewed distribution, the mean is generally greater than the median because the higher values in the tail pull the mean up.

MISTAKE: Believing that a skewed distribution is always 'bad' data. | CORRECTION: Skewed distributions are very common in real-world data and simply tell us about the pattern of the data, like income or wait times.

Practice Questions
Try It Yourself

QUESTION: If the average height of people in a room is 165 cm, and the median height is 160 cm, is the distribution of heights likely positively or negatively skewed? | ANSWER: Positively skewed (Mean > Median)

QUESTION: A small shop owner records daily sales in rupees: 100, 150, 120, 110, 130, 200, 500. Calculate the mean and median. Is this data positively or negatively skewed? | ANSWER: Mean = (100+150+120+110+130+200+500)/7 = 1310/7 approx 187.14. Median (ordered: 100, 110, 120, 130, 150, 200, 500) = 130. Since Mean (187.14) > Median (130), it's positively skewed.

QUESTION: The waiting times (in minutes) for customers at an ATM are: 1, 2, 2, 3, 3, 3, 4, 4, 5, 20. Sketch a rough shape of this distribution. Where would the 'tail' be? Is it positively or negatively skewed? | ANSWER: The tail would be on the right side due to the '20' minute wait time. It is positively skewed because most waiting times are low, and a few high values pull the distribution to the right.

MCQ
Quick Quiz

Which statement is true for a positively skewed distribution?

The mean is usually less than the median.

The tail of the distribution points towards the left.

Most data points are clustered on the right side.

The mean is usually greater than the median.

The Correct Answer Is:

D

In a positively skewed distribution, higher values in the tail pull the mean towards the right, making it greater than the median. Options A, B, and C describe negatively skewed or symmetrical distributions.

Real World Connection
In the Real World

Think about the prices of houses in a growing Indian city. Most houses might be in a moderate price range, but a few luxury bungalows or penthouses will have very high prices. If you plot these prices, you'll see a positively skewed distribution, with a long tail stretching to the right due to those expensive properties. Real estate analysts use this to understand market dynamics.

Key Vocabulary
Key Terms

MEAN: The average of all values in a dataset. | MEDIAN: The middle value when data is arranged in order. | TAIL: The part of the distribution that extends thinly towards one side. | DISTRIBUTION: How data points are spread out or clustered.

What's Next
What to Learn Next

Great job understanding positively skewed distributions! Next, you should explore 'Negatively Skewed Distributions'. This will help you compare and contrast different data shapes, which is a fundamental skill in data analysis and statistics.

bottom of page