Popular American Beer in Statistical Data Analysis

by

in

Need help with assignments?

Our qualified writers can create original, plagiarism-free papers in any format you choose (APA, MLA, Harvard, Chicago, etc.)

Order from us for quality, customized work in due time of your choice.

Click Here To Order Now

Executive Summary

This paper supplies the results of a descriptive statistical analysis of data related to 71 best-selling brands of beer in the U.S. in 2007. The data is comprised of information about alcohol percentage, the number of calories, and the number of carbohydrates (in grams) per 12 ounces of beer. The main descriptives (means, including 95% confidence intervals; medians; variances; standard deviations; ranges; values of skewness; and values of kurtosis), as well as the results of normality tests for these variables, are reported. Also, histograms (with normal curves) and box-and-whiskers plots are included. All the tables and charts can be found in appendices.

Quantitative Methods Midterm

This paper provides the results of a statistical analysis of data about 71 most popular domestic beer in the U.S. This data was originally taken from “Beer Brewpub” (n.d.) on May 4, 2007. The data includes information about alcohol percentage in beer, as well as the number of calories for every 12 ounces of beer, and the number of carbohydrates (in grams) per 12 ounces of beer. Relevant descriptive statistics, as well as normality tests, histograms, and box-and-whisker plots, are supplied.

Methods Used

To analyze the data, IBM SPSS Statistics v. 23 software was utilized. To calculate descriptive statistics and normality tests, and to create box-and-whisker plots, the Analyze → Descriptive statistics → Explore procedure was used; consequently, the means, variances, and standard deviations are calculated according to the formulae that estimate the respective population parameters from the given sample (Warner, 2013). For histograms, the procedure Graphs → Legacy dialogs → Histogram was employed to obtain histograms with normal curves (George & Mallery, 2016).

Analysis

The results for Alcohol Percentage are as follows. It is easy to see from Appendix 1 that for Alcohol Percentage, Mean=4.776 (95% CI: 4.587-4.965); Median=4.7; Variance=.635; SD=.797; Range=6.1. Also, Skewness=-2.01 (skewed to the right), Kurtosis=12.079 (most values are close to the mean). From the histogram (Appendix 2), it is unclear whether the distribution is normal, but, as Appendix 3 shows, both Kolmogorov-Smirnov and Shapiro-Wilk tests are statistically significant at p<.001, suggesting that the data is not normally distributed. The box-and-whiskers chart (Appendix 4) displays that there is one significant outlier: case 66, “Red Hook IPA,” alcohol=6.5%; and one extreme outlier: case 58, “O’Doul’s,” alcohol=.4% (Field, 2013).

The results for Calories are described next. From Appendix 1, it can be seen that for Calories, Mean=139.25 (95% CI: 132.84-145.66); Median=145.0; Variance=733.135; SD=27.076; Range=125. Also, Skewness=-.364 (slightly skewed to the right); Kurtosis=-.664 (many values are not very close to the mean). From the histogram (Appendix 2), it is unclear whether the distribution is normal, but, as Appendix 3 shows, both Kolmogorov-Smirnov and Shapiro-Wilk tests are statistically significant (p<.001 and p=.010, respectively), indicating that the data is not normally distributed. The box-and-whiskers chart (Appendix 4) suggests that there are no significant outliers.

Finally, the results for Carbohydrates are as follows. From Appendix 1, it is evident that for Carbohydrates, Mean=10.896 (95% CI: 9.835-11.957); Median=11.4; Variance=20.09; SD=4.4822; Range=21.3. Also, Skewness=.122 (slightly skewed to the left); Kurtosis=-.101 (several values are not very close to the mean). From the histogram (Appendix 2), it might be assumed that the data is approximately normally distributed. Indeed, as Appendix 3 displays, both Kolmogorov-Smirnov and Shapiro-Wilk tests are non-significant (p=.200 and p=.266, respectively), which confirms that the data is approximately normally distributed. The box-and-whiskers chart (Appendix 4) allows for concluding that there is one significant outlier (Field, 2013): case 71, “Sam Adams Cream Stout,” carbohydrates=23.9.

Conclusion

On the whole, it should be stressed that IBM SPSS Statistics v. 23 software was used to calculate the relevant descriptive statistics and create the main charts for the data about 71 most popular types of beer in the U.S. The paper reports the results of the analysis which was conducted for this purpose.

References

Beer brewpub and microbrewery guide by beer 100. (n.d.). Web.

Field, A. (2013). Discovering statistics using IBM SPSS Statistics (4th ed.). Thousand Oaks, CA: SAGE Publications.

George, D., & Mallery, P. (2016). IBM SPSS Statistics 23 step by step: A simple guide and reference (14th ed.). New York, NY: Routledge.

Warner, R. M. (2013). Applied statistics: From bivariate through multivariate techniques (2nd ed.). Thousand Oaks, CA: SAGE Publications.

Appendix 1

The descriptives for the data are as follows:

Descriptives
Statistic Std. Error
alc_percent Mean 4.776 .0946
95% Confidence Interval for Mean Lower Bound 4.587
Upper Bound 4.965
5% Trimmed Mean 4.801
Median 4.700
Variance .635
Std. Deviation .7970
Minimum .4
Maximum 6.5
Range 6.1
Interquartile Range .8
Skewness -2.010 .285
Kurtosis 12.079 .563
calories Mean 139.25 3.213
95% Confidence Interval for Mean Lower Bound 132.84
Upper Bound 145.66
5% Trimmed Mean 139.55
Median 145.00
Variance 733.135
Std. Deviation 27.076
Minimum 70
Maximum 195
Range 125
Interquartile Range 50
Skewness -.364 .285
Kurtosis -.664 .563
carbohydr Mean 10.896 .5319
95% Confidence Interval for Mean Lower Bound 9.835
Upper Bound 11.957
5% Trimmed Mean 10.828
Median 11.400
Variance 20.090
Std. Deviation 4.4822
Minimum 2.6
Maximum 23.9
Range 21.3
Interquartile Range 6.8
Skewness .122 .285
Kurtosis -.101 .563

Appendix 2

The histograms for the given variables are as follows:

Alcohol
Calories
Carbohydrates

Appendix 3

The tests of normality for the three given variables are as follows:

Tests of Normality
Kolmogorov-Smirnova Shapiro-Wilk
Statistic df Sig. Statistic df Sig.
alc_percent .170 71 .000 .814 71 .000
calories .161 71 .000 .953 71 .010
carbohydr .082 71 .200* .979 71 .266
*. This is a lower bound of the true significance.
a. Lilliefors Significance Correction

Appendix 4

The box-and-whiskers charts for the three variables are as follows:

Alcohol
Calories
Carbohydrates

Need help with assignments?

Our qualified writers can create original, plagiarism-free papers in any format you choose (APA, MLA, Harvard, Chicago, etc.)

Order from us for quality, customized work in due time of your choice.

Click Here To Order Now