Grouped data is a frequency distribution of the variable (whose values are given in the raw dataset). Such a frequency table is often referred to as grouped data.^{[1]}
Contents

Example 1

Mean of grouped data 2

See also 3

Notes 4

References 5
Example
The idea of grouped data can be illustrated by considering the following raw dataset:
20

25

24

33

13

26

8

19

31

11

16

21

17

11

34

14

15

21

18

17

Table 1: Time taken (in seconds) by a group of students to
answer a simple math question
The above data can be organised into a frequency distribution (or a grouped data) in several ways. One method is to use intervals as a basis.
The smallest value in the above data is 8 and the largest is 34. The interval from 8 to 34 is broken up into smaller subintervals (called class intervals). For each class interval, the amount of data items falling in this interval is counted. This number is called the frequency of that class interval. The results are tabulated as a frequency table as follows:
Time taken (in seconds)

Frequency

5 ≤ t < 10

1

10 ≤ t < 15

4

15 ≤ t < 20

6

20 ≤ t < 25

4

25 ≤ t < 30

2

30 ≤ t < 35

3

Table 2: Frequency distribution of the time taken (in seconds) by the group of students to
answer a simple math question
Another method of grouping the data is to use some qualitative characteristics instead of numerical intervals. For example, suppose in the above example, there are three types of students: 1) Below normal, if the response time is 5 to 14 seconds, 2) normal if it is between 15 and 24 seconds, and 3) above normal if it is 25 seconds or more, then the grouped data looks like:

Frequency

Below normal

5

Normal

10

Above normal

5

Table 3: Frequency distribution of the three types of students
Mean of grouped data
An estimate, \bar{x}, of the mean of the population from which the data are drawn can be calculated from the grouped data as:

\bar{x}=\frac{\sum{f\,x}}{\sum{f}} .
In this formula, x refers to the midpoint of the class intervals, and f is the class frequency. Note that the result of this will be different from the sample mean of the ungrouped data. The mean for the grouped data in the above example, can be calculated as follows:
Class Intervals

Frequency ( f )

Midpoint ( x )

f x

5 and above, below 10

1

7.5

7.5

10 ≤ t < 15

4

12.5

50

15 ≤ t < 20

6

17.5

105

20 ≤ t < 25

4

22.5

90

25 ≤ t < 30

2

27.5

55

30 ≤ t < 35

3

32.5

97.5

TOTAL

20


405

Thus, the mean of the grouped data is

\bar{x}=\frac{\sum{f\,x}}{\sum{f}} = \frac{405}{20} = 20.25
See also
Notes

^ Newbold et al., 2009, pages 14 to 17
References

Newbold, P.; Carlson, W.; Thorne, B. (2009). Statistics for Business and Economics (Seventh ed.). Pearson Education.
This article was sourced from Creative Commons AttributionShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and USA.gov, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for USA.gov and content contributors is made possible from the U.S. Congress, EGovernment Act of 2002.
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a nonprofit organization.