Summary of ‘How to Lie with Statistics’ by Darrell Huff
Introduction
‘How to Lie with Statistics’ was written and published by Darrell Huff. In the book, the author focuses on outlining various errors that arise as a result of statistical interpretation. He further states how often the interpretation errors lead tacticians to make an incorrect conclusion. The author has tried to assert the aspects under different topics within the book. Therefore, the current paper will provide a summary of the book based on the different topics within each chapter.
Chapter One
In this chapter, Huff mostly focuses on justifying his claim that sampling is the origin of any statistical problem. The author states that every statistic is based on a given sample since the whole population cannot be subjected to a statistical test, and every sample that is derived from the given population contains some aspect of bias (Huff). To assert his claim, he gave an example of the Statistics of the Yale graduate annual earnings of $25111 that had followed all the statistic standards but had concealed the whole truth either intentionally or unintentionally. Darrell’s assertion in this chapter is that the aspect of the built-in bias comes as a result of the failure of the respondents in giving honest replies, Market Research selecting samples that provide better numbers and personal biases that originate from the perception of the researcher. The author gives an example of a survey that required the respondents to state which book they read the most between Harper and True love story. The feedback from the survey indicated that most respondents preferred Harper. However, the figures from the publisher showed that there was more circulation of a True love story than Harper, which further refuted the sampling results (Huff). The main reason for the discrepancy according to Huff was that most of the respondents had not told the truth when responding to the survey questions. The author further gives an example of a cancer patient to indicate that a wrong sample or sample selection process often leads the surveyor towards making wrong decisions and wrong direction. Finally Darrell stresses on the aspect of selecting an interviewer and other aspects that should be kept under consideration in the study environment to ensure data collection becomes flawless and smooth.
Chapter Two
The Well Chosen Average
In this chapter, Huff talks about the various tricks that a researcher can use for manipulation when using the average to describe any statistical fact. The author's main idea in this chapter is that any individual who uses an average must clearly understand all three types of averages since a similar set of data can produce three sets of values when the three different types of averages are used for calculations (Huff). Darrell states that if a neighborhood has people on pension and retirees, then the income of the two or three millionaires within the same neighborhood is likely to boost the average when we only calculate the arithmetic mean of the neighborhoods income. On the other hand, the median will give the exact value that lies in the middle. Darrell, therefore, claims in this chapter that the median provides a more precise reflection of the sample than the mean since the mean tends to conceal information. To further demonstrate how a published fact can be manipulated from the real facts when the average is not qualified, the author used an example of the average pay of an employee in a corporation that can be interpreted to mean different things to various people. In other words, every scenario within any given context requires an individual to quantify the type of average used in its description.
Chapter Three
The Little Figures That Are Not There
The chapter mainly discusses the process through which sample data is often picked in a way likely to prove a given result especially in the advertising world of consumer products. According to Huff, even though the statistics given favors a particular product, it also reveals some underneath tricks. In the first instance, the sample sizes are small and undergo particular treatments to change the expected treatment outcomes to make the test results fascinating (Huff). According to the author, the result of any study is likely to be diverted to the researcher’s desire by hiding the prevailing condition of the environment. Huff gave an example of tossing a coin whereby he states that if a coin is tossed ten times then there is an 80% probability of getting a head but if it is done severally then one can get a probability of 50% for both head and tail. Therefore to determine if the results have been collected in a valid way, the author has suggested the use of a significance test that is ideal for indicating if the result is based on real change and not on some probability. The author further discusses how incorrectly labeled axes lead to misleading charts.
Chapter Four
Much Ado about Practically Nothing
In this chapter, Huff discusses the need of expressing a sample result in range and error in measurement. Darrell illustrates that at times the sample result may be close, and the difference between the results may not make any meaning since the probable error range may be far much greater than the difference that exists between the sample results. For example, the ranking of the over 600 American colleges by Forbes magazine was achieved by a complex combination of different factors that were weighted for more or less influence (Vedder & Ewalt). The chapter also discusses the process of data collection and states that when the collected data are all combined, then there is a likelihood of increasing the probable error, an aspect he has illustrated with an example of measuring a corn field.
Chapter Five
The Gee-Whiz Graph
In the chapter, the author discusses the aspect of survey presentation and findings. Darrell tries to explain that the use of numbers is not suitable for making report worthy to comprehend or even read, and the figures may not make meaning to readers (Huff). The author, therefore, discusses a way through which the statistician devises a graphical method to deceive people and exaggerate facts.
Chapter Six
The One-Dimensional Picture
In this chapter, Huff focuses on the aspect of the one-dimensional picture of a type or kind of graph known as the pictorial graph that is established by a symbol trick such as using a money bag or factory symbols. The graphs have eye-catching characteristics and tend to be explored by more readers. The author further illustrates how the money bag tricks are used to manipulate reports to give them the desired direction
Chapter Seven
The Semiattached Figure
Chapter Eight
Post Hoc Rides Again
The chapter discusses the type of correlation in which a relationship exists but there is no clarity between the cause and the effect. The author gives an example of how smoking is related to achieving bad grades, but it is not clear whether smoking causes bad grades or whether the individual who decided to smoke is getting bad grades. Huff’s main idea is that there may be a correlation between factors however other underlying factors might also be having an influence. The trick behind the statistical manipulation is to relate issues that are not exclusive with other concerned issues and make a claim towards what has influenced the result (Huff).
Chapter Nine
How to Statisticulate
In this chapter, the author has coined a term called statistics late to refer to the aspect of statistical manipulation. Huff further lists various tricks that are used in the manipulation process such as profit on cost price measurements and using a graph that has a fine scale of the y-axis to demonstrate the steepness of a given growth. According to the author, using a map to describe federal government spending demonstrates statistical manipulations that are well illustrated by the example of the family income calculation.
Chapter Ten
How to Talk Back to Statistic
The chapter discusses ways to avoid the deceptions that the author has discussed in the other chapters. Huff admits even though statistical analysis cannot be subjected to the same test like chemistry analysis to determine its authenticity, there are other means that can be used to achieve the test through asking the five main questions whereby the main aim will be to determine whether any statistical information presented is authentic or not.
Conclusion
In conclusion, the author brings out the aspect of how statisticians and researchers use different manipulation tricks and methods to deceive and conceive the readers into believing the statistical facts presented.
Works Cited
Huff, Darrell. How to Lie with Statistics. New York: WW Norton & Company, 1993.
Vedder, Richard, and Ewalt, David M. “America's Best Colleges 2009.” Forbes Magazine, 5 Aug. 2009. Web. 13 Dec. 2015.