S3-SA3-0221
What is Categorical Data?
Grade Level:
Class 8
AI/ML, Data Science, Physics, Economics, Cryptography, Computer Science, Engineering
Definition
What is it?
Categorical data is a type of data that can be divided into groups or categories. Instead of numbers, it represents qualities, characteristics, or labels. Think of it like putting things into different 'boxes' based on what they are.
Simple Example
Quick Example
Imagine you are asking your friends about their favourite ice cream flavour. Their answers could be 'Vanilla', 'Chocolate', 'Strawberry', or 'Butterscotch'. These are categories, not numbers, so 'favourite ice cream flavour' is categorical data.
Worked Example
Step-by-Step
Let's say a teacher wants to record the favourite sport of students in Class 8.
Step 1: The teacher asks 5 students: Rohan, Priya, Amit, Sanjana, and Vivek.
---
Step 2: Rohan says 'Cricket', Priya says 'Badminton', Amit says 'Football', Sanjana says 'Cricket', and Vivek says 'Badminton'.
---
Step 3: We can list these responses as: Cricket, Badminton, Football, Cricket, Badminton.
---
Step 4: Now, we group them into categories: 'Cricket', 'Badminton', and 'Football'.
---
Step 5: We can count how many students chose each category.
---
Result: Cricket: 2 students, Badminton: 2 students, Football: 1 student. Here, 'Cricket', 'Badminton', and 'Football' are the categories, making this categorical data.
Why It Matters
Understanding categorical data is super important in fields like Data Science and AI/ML, where you analyse information to make smart predictions. Doctors use it to group patient symptoms, and economists use it to classify consumer behaviour. It helps build smarter apps and solve real-world problems!
Common Mistakes
MISTAKE: Thinking that any data with words is categorical. | CORRECTION: Categorical data must be able to be grouped into distinct, non-overlapping categories. A student's essay, though words, isn't categorical data.
MISTAKE: Confusing categorical data with numerical data that has been assigned labels (like '1' for 'male' and '2' for 'female'). | CORRECTION: Even if numbers are used as codes, if those numbers don't represent a quantity or measurement, the original data type is still categorical. The number '1' here isn't a quantity.
MISTAKE: Assuming categorical data always has a natural order. | CORRECTION: Some categorical data, like 'small, medium, large' (ordinal data), has an order, but most, like 'red, green, blue' (nominal data), does not. Don't assume an order unless it's clear.
Practice Questions
Try It Yourself
QUESTION: Is the 'brand of mobile phone' (e.g., Samsung, Apple, Xiaomi) used by your friends an example of categorical data? | ANSWER: Yes
QUESTION: A survey asks students to rate a new school canteen dish as 'Very Good', 'Good', 'Average', or 'Poor'. Is this categorical data? If yes, does it have an order? | ANSWER: Yes, it is categorical data. Yes, it has an order (from 'Very Good' to 'Poor').
QUESTION: Your school records the number of hours each student spends studying for an exam. Is 'number of hours' categorical data? Explain why or why not. | ANSWER: No, 'number of hours' is not categorical data. It is numerical data because it represents a measurable quantity (hours) and can be counted or measured on a scale.
MCQ
Quick Quiz
Which of the following is an example of categorical data?
Height of students in cm
Number of siblings a student has
Favourite colour of students
Weight of school bags in kg
The Correct Answer Is:
C
Favourite colour of students (like Red, Blue, Green) represents categories or labels, not quantities, making it categorical data. The other options (height, number of siblings, weight) are all numerical measurements.
Real World Connection
In the Real World
When you use a shopping app like Flipkart or Amazon, the 'category' filters (like 'Electronics', 'Fashion', 'Home & Kitchen') are based on categorical data. E-commerce companies use this to organise products and help you find what you need quickly. Even your school's attendance register, marking 'Present' or 'Absent', uses categorical data!
Key Vocabulary
Key Terms
CATEGORY: A group or class into which data can be divided | NOMINAL DATA: Categorical data without any natural order (e.g., colours) | ORDINAL DATA: Categorical data with a natural order or ranking (e.g., small, medium, large) | QUALITATIVE DATA: Another name for categorical data, focusing on qualities rather than numbers
What's Next
What to Learn Next
Great job understanding categorical data! Next, you should explore 'Numerical Data' to see how it differs from what you've learned. Then, you can dive into 'Data Representation' to learn how to show both types of data using graphs and charts.


