This exercise is designed to help you explore how outliers affect standard deviation (a measure of spread).

Here is a data set (n=47\displaystyle {n}={47}) that is nearly normal (as can be seen in the histogram provided after the data set).
28.6 29.1 30 31 31.9 32.6 32.6 36
38.8 38.8 38.8 39.2 39.3 40.3 40.6 41.3
41.9 42.3 42.3 43 43 43.2 43.8 44.6
46.2 47 47 47.2 47.7 48.1 48.7 49
49.2 49.4 49.9 50.7 51.5 51.7 53.6 55
56.2 56.2 59 60 60.9 61.4 61.4  

246810data253035404550556065Frequency[Graphs generated by this script: setBorder(54,40,20,15); initPicture(20.5,65,0,11);axes(122.8,2,1,null,2); fill="blue"; stroke="black"; textabs([165,0],"data","above");line([25,-0.22],[25,0.22]); text([25,0],"25","below");line([30,-0.22],[30,0.22]); text([30,0],"30","below");line([35,-0.22],[35,0.22]); text([35,0],"35","below");line([40,-0.22],[40,0.22]); text([40,0],"40","below");line([45,-0.22],[45,0.22]); text([45,0],"45","below");line([50,-0.22],[50,0.22]); text([50,0],"50","below");line([55,-0.22],[55,0.22]); text([55,0],"55","below");line([60,-0.22],[60,0.22]); text([60,0],"60","below");line([65,-0.22],[65,0.22]); text([65,0],"65","below");textabs([0,115],"Frequency","right",90);rect([25,0],[30,2]);rect([30,0],[35,5]);rect([35,0],[40,6]);rect([40,0],[45,11]);rect([45,0],[50,11]);rect([50,0],[55,4]);rect([55,0],[60,4]);rect([60,0],[65,4]);]

Part 1
Use Excel or StatDisk to find the sample standard deviation for this data set. You should be able to select the table and copy it directly to Excel or other such programs. Report the standard deviation using the rounding rules suggested in class (two more decimal places than the original data).

SD =

Part 2
When you collected this data, you knew there was an extreme value that seemed a little out of the ordinary (as demonstrated in the histogram). This value is 94.2.
246810data253035404550556065707580859095Frequency[Graphs generated by this script: setBorder(54,40,20,15); initPicture(20.5,95,0,11);axes(188.4,2,1,null,2); fill="blue"; stroke="black"; textabs([165,0],"data","above");line([25,-0.22],[25,0.22]); text([25,0],"25","below");line([30,-0.22],[30,0.22]); text([30,0],"30","below");line([35,-0.22],[35,0.22]); text([35,0],"35","below");line([40,-0.22],[40,0.22]); text([40,0],"40","below");line([45,-0.22],[45,0.22]); text([45,0],"45","below");line([50,-0.22],[50,0.22]); text([50,0],"50","below");line([55,-0.22],[55,0.22]); text([55,0],"55","below");line([60,-0.22],[60,0.22]); text([60,0],"60","below");line([65,-0.22],[65,0.22]); text([65,0],"65","below");line([70,-0.22],[70,0.22]); text([70,0],"70","below");line([75,-0.22],[75,0.22]); text([75,0],"75","below");line([80,-0.22],[80,0.22]); text([80,0],"80","below");line([85,-0.22],[85,0.22]); text([85,0],"85","below");line([90,-0.22],[90,0.22]); text([90,0],"90","below");line([95,-0.22],[95,0.22]); text([95,0],"95","below");textabs([0,115],"Frequency","right",90);rect([25,0],[30,2]);rect([30,0],[35,5]);rect([35,0],[40,6]);rect([40,0],[45,11]);rect([45,0],[50,11]);rect([50,0],[55,4]);rect([55,0],[60,4]);rect([60,0],[65,4]);rect([65,0],[70,0]);rect([70,0],[75,0]);rect([75,0],[80,0]);rect([80,0],[85,0]);rect([85,0],[90,0]);rect([90,0],[95,1]);]
Upon careful consideration, you decide to include the outlier 94.2.

Calculate the standard deviation of the original data set with this outlier included (n=48\displaystyle {n}={48}).

SD =

Part 3
Because the value was so extreme, you decided to return to the original data collected to make sure this extreme value was not incorrectly entered or calculated wrong. When you do this, you actually find that the value was incorrect, and the correct value appears to be even more extreme; the value should be 127 (not 94.2). (Histogram with the correct outlier shown below.)
246810data253035404550556065707580859095100105110115120125130Frequency[Graphs generated by this script: setBorder(54,40,20,15); initPicture(20.5,130,0,11);axes(254,2,1,null,2); fill="blue"; stroke="black"; textabs([165,0],"data","above");line([25,-0.22],[25,0.22]); text([25,0],"25","below");line([30,-0.22],[30,0.22]); text([30,0],"30","below");line([35,-0.22],[35,0.22]); text([35,0],"35","below");line([40,-0.22],[40,0.22]); text([40,0],"40","below");line([45,-0.22],[45,0.22]); text([45,0],"45","below");line([50,-0.22],[50,0.22]); text([50,0],"50","below");line([55,-0.22],[55,0.22]); text([55,0],"55","below");line([60,-0.22],[60,0.22]); text([60,0],"60","below");line([65,-0.22],[65,0.22]); text([65,0],"65","below");line([70,-0.22],[70,0.22]); text([70,0],"70","below");line([75,-0.22],[75,0.22]); text([75,0],"75","below");line([80,-0.22],[80,0.22]); text([80,0],"80","below");line([85,-0.22],[85,0.22]); text([85,0],"85","below");line([90,-0.22],[90,0.22]); text([90,0],"90","below");line([95,-0.22],[95,0.22]); text([95,0],"95","below");line([100,-0.22],[100,0.22]); text([100,0],"100","below");line([105,-0.22],[105,0.22]); text([105,0],"105","below");line([110,-0.22],[110,0.22]); text([110,0],"110","below");line([115,-0.22],[115,0.22]); text([115,0],"115","below");line([120,-0.22],[120,0.22]); text([120,0],"120","below");line([125,-0.22],[125,0.22]); text([125,0],"125","below");line([130,-0.22],[130,0.22]); text([130,0],"130","below");textabs([0,115],"Frequency","right",90);rect([25,0],[30,2]);rect([30,0],[35,5]);rect([35,0],[40,6]);rect([40,0],[45,11]);rect([45,0],[50,11]);rect([50,0],[55,4]);rect([55,0],[60,4]);rect([60,0],[65,4]);rect([65,0],[70,0]);rect([70,0],[75,0]);rect([75,0],[80,0]);rect([80,0],[85,0]);rect([85,0],[90,0]);rect([90,0],[95,0]);rect([95,0],[100,0]);rect([100,0],[105,0]);rect([105,0],[110,0]);rect([110,0],[115,0]);rect([115,0],[120,0]);rect([120,0],[125,0]);rect([125,0],[130,1]);]
Calculate the sample standard deviation of the original data set with the correct outlier included (n=48\displaystyle {n}={48}).

SD =

Part 4
Briefly describe how an outlier affects the standard deviation of a data set. (This part of the exercise is not automatically graded by the computer.)