top of page
Inaugurated by IN-SPACe
ISRO Registered Space Tutor

S3-SA3-0221

What is Categorical Data?

Grade Level:

Class 8

AI/ML, Data Science, Physics, Economics, Cryptography, Computer Science, Engineering

Definition
What is it?

Categorical data is a type of data that can be divided into groups or categories. Instead of numbers, it represents qualities, characteristics, or labels. Think of it like putting things into different 'boxes' based on what they are.

Simple Example
Quick Example

Imagine you are asking your friends about their favourite ice cream flavour. Their answers could be 'Vanilla', 'Chocolate', 'Strawberry', or 'Butterscotch'. These are categories, not numbers, so 'favourite ice cream flavour' is categorical data.

Worked Example
Step-by-Step

Let's say a teacher wants to record the favourite sport of students in Class 8.

Step 1: The teacher asks 5 students: Rohan, Priya, Amit, Sanjana, and Vivek.
---
Step 2: Rohan says 'Cricket', Priya says 'Badminton', Amit says 'Football', Sanjana says 'Cricket', and Vivek says 'Badminton'.
---
Step 3: We can list these responses as: Cricket, Badminton, Football, Cricket, Badminton.
---
Step 4: Now, we group them into categories: 'Cricket', 'Badminton', and 'Football'.
---
Step 5: We can count how many students chose each category.
---
Result: Cricket: 2 students, Badminton: 2 students, Football: 1 student. Here, 'Cricket', 'Badminton', and 'Football' are the categories, making this categorical data.

Why It Matters

Understanding categorical data is super important in fields like Data Science and AI/ML, where you analyse information to make smart predictions. Doctors use it to group patient symptoms, and economists use it to classify consumer behaviour. It helps build smarter apps and solve real-world problems!

Common Mistakes

MISTAKE: Thinking that any data with words is categorical. | CORRECTION: Categorical data must be able to be grouped into distinct, non-overlapping categories. A student's essay, though words, isn't categorical data.

MISTAKE: Confusing categorical data with numerical data that has been assigned labels (like '1' for 'male' and '2' for 'female'). | CORRECTION: Even if numbers are used as codes, if those numbers don't represent a quantity or measurement, the original data type is still categorical. The number '1' here isn't a quantity.

MISTAKE: Assuming categorical data always has a natural order. | CORRECTION: Some categorical data, like 'small, medium, large' (ordinal data), has an order, but most, like 'red, green, blue' (nominal data), does not. Don't assume an order unless it's clear.

Practice Questions
Try It Yourself

QUESTION: Is the 'brand of mobile phone' (e.g., Samsung, Apple, Xiaomi) used by your friends an example of categorical data? | ANSWER: Yes

QUESTION: A survey asks students to rate a new school canteen dish as 'Very Good', 'Good', 'Average', or 'Poor'. Is this categorical data? If yes, does it have an order? | ANSWER: Yes, it is categorical data. Yes, it has an order (from 'Very Good' to 'Poor').

QUESTION: Your school records the number of hours each student spends studying for an exam. Is 'number of hours' categorical data? Explain why or why not. | ANSWER: No, 'number of hours' is not categorical data. It is numerical data because it represents a measurable quantity (hours) and can be counted or measured on a scale.

MCQ
Quick Quiz

Which of the following is an example of categorical data?

Height of students in cm

Number of siblings a student has

Favourite colour of students

Weight of school bags in kg

The Correct Answer Is:

C

Favourite colour of students (like Red, Blue, Green) represents categories or labels, not quantities, making it categorical data. The other options (height, number of siblings, weight) are all numerical measurements.

Real World Connection
In the Real World

When you use a shopping app like Flipkart or Amazon, the 'category' filters (like 'Electronics', 'Fashion', 'Home & Kitchen') are based on categorical data. E-commerce companies use this to organise products and help you find what you need quickly. Even your school's attendance register, marking 'Present' or 'Absent', uses categorical data!

Key Vocabulary
Key Terms

CATEGORY: A group or class into which data can be divided | NOMINAL DATA: Categorical data without any natural order (e.g., colours) | ORDINAL DATA: Categorical data with a natural order or ranking (e.g., small, medium, large) | QUALITATIVE DATA: Another name for categorical data, focusing on qualities rather than numbers

What's Next
What to Learn Next

Great job understanding categorical data! Next, you should explore 'Numerical Data' to see how it differs from what you've learned. Then, you can dive into 'Data Representation' to learn how to show both types of data using graphs and charts.

bottom of page