MS 150 Statistics fx spring 2006 • Name:


Part I: Basic Statistics

Systolic blood pressures were measured for twenty-two caffeine consuming students in exercise sport science courses this spring. The systolic blood pressure data is in the table and in an on line data sheet. Systolic blood pressure is the blood pressure generated by the contraction of the heart. Normal human systolic blood pressure is usually 120.

Data sheet

For the systolic blood pressure data given in the table:

  1. _________ What level of measurement is the systolic blood pressure data?
  2. _________ Determine the sample size n.
  3. _________ Calculate the sample mean x.
  4. _________ Determine the median.
  5. _________ Determine the mode.
  6. _________ Determine the minimum.
  7. _________ Determine the maximum.
  8. _________ Calculate the range.
  9. _________ Calculate the sample standard deviation sx.
  10. _________ Calculate the sample Coefficient of Variation.
  11. _________ Determine the class width. Use five bins (classes or intervals)
  12. Fill in the following table with the class upper limits in the first column, the frequencies in the second column, and the relative frequencies in the third column
    BinsFrequencyRelative Frequency f/n
  13. Sketch a histogram of the relative frequency data on the bottom of page three.
  14. __________________ What is the shape of the distribution?
  15. __________________ What is the probability of rolling a four on an eight-sided die that has faces numbered from one to eight?

Part II: Estimated mean from a distribution

Systolic blood pressure measurements were made on 48 drug free students. The results are recorded in the table below.

Bin upper limitFreq fRel Freq_______
  1. __________________ What the the estimated mean systolic blood pressure for these 48 drug-free students?

Part III: Confidence Intervals

A sample size n of eight students who consume caffeinated beverages but who do not use any other drugs had a sample mean x systolic blood pressure of 128.63 with a sample standard deviation sx of 19.21. Construct a 95% confidence interval for the population mean systolic blood pressure for these eight caffeine users. Note that n is less than 30.

  1. ____________ What would be the point estimate for the population mean for all students who drink caffeinated beverages on campus?
  2. df = __________ Find the number of degrees of freedom.
  3. tc = __________ Find tcritical.
  4. E = _______________ Find the error tolerance E.
  5. The 95% confidence interval for the population mean caffeine users systolic blood pressure is:
    ____________ ≤ ≤ ____________
  6. __________ The drug free population mean systolic blood pressure for students at the College of Micronesia-FSM is 119.41. Is a population mean systolic blood pressure of 119.41 a possible mean systolic blood pressure for the caffeine using students based on the 95% confidence interval?

Part IV: Hypothesis Testing

Betelnut contains the simulant arecoline, coffee and cola drinks contain the stimulant caffeine, tobacco contains the stimulant nicotine. A group of students who do not chew betelnut, consume caffeinated beverages, smoke, nor chew tobacco had a population mean systolic blood pressure of 119.41. There are a number of students who consume all three stimulants, often at the same time. A sample size n of seven students who chew betelnut with tobacco and consume caffeinated beverages had a sample mean x systolic blood pressure of 129.29 with a sample standard deviation sx of 7.95. Could this sample of seven students include a population mean systolic blood pressure of 119.41? Test the hypothesis that the population mean systolic blood pressure for the sample could or could not be 119.41. Use an alpha of α = 0.05 for the hypothesis test.

  1. ________________________________________ Write the null hypothesis in formal statistical format.
  2. ________________________________________ Write the alternate hypothesis in formal statistical format.
  3. α = __________ Write down the level of significance alpha.
  4. tc = __________ Determine tcritical.
  5. t = __________ Calculate the t-statistic.
  6. p = __________ Determine the p-value using the t-distribution.
  7. __________ What is the largest confidence interval c for which this difference is statistically significant?
  8. ________________________________________ Would we reject the null hypothesis or fail to reject the null hypothesis that the sample mean is statistically significantly different from the population mean at a 5% level of significance?
  9. __________ If we reject the null hypothesis, what is the risk of a type I error based on the p-value?
  10. __________ Do the students using all three stimulants have a statistically significantly higher systolic blood pressure than the drug-free students?
  11. __________ Is high blood pressure good for your body?

Part V: Linear Regression

Age vs systolic

The data in this section examines whether there is a trend in the systolic blood pressure for students with age.

  1. _________ Calculate the slope of the best fit (least squares) line for the data.
  2. _________ Calculate the y-intercept of the best fit (least squares) line.
  3. _________ Is the correlation positive, negative, or neutral?
  4. _________ Use the equation of the best fit line to calculate the predicted systolic blood pressure for a 24 year old student.
  5. _________ Use the inverse of the best fit line to calculate the predicted age for a systolic blood pressure of 140.
  6. _________ Calculate the linear correlation coefficient r for the data.
  7. _________ Is the correlation none, low, moderate, high, or perfect?
  8. _________ Calculate the coefficient of determination.
  9. _________ What percent of the variation in the age data explains the variation in the blood pressure data?
  10. _________ Can we use a linear regression to predict blood pressure from age?
  11.      Why or why not?

Tables of Formulas and Excel functions

Basic Statistics
Statistic or ParameterSymbolEquationsExcel
Square root=SQRT(number)
Sample standard deviationsx or s=STDEV(data)
Sample Coefficient of VariationCV sx/x =STDEV(data)/AVERAGE(data)
Confidence interval statistics
Statistic or ParameterSymbolEquationsExcel
Degrees of freedomdf= n-1=COUNT(data)-1
Find a zc value from a confidence level c zc=ABS(NORMSINV((1-c)/2))
Find a tc value from a confidence level c tc=TINV(1-c,df)
Calculate an error tolerance E of a mean for n < 30 using sx. Should also be used for n ≥ 30. E error_tolerance_tc.gif (989 bytes) =tc*sx/SQRT(n)
Calculate a confidence interval for a population mean µ from a sample mean x and an error tolerance E x - E ≤ ≤ x + E
Hypothesis Testing
Calculate t-critical for a two-tailed test tc=TINV(α,df)
Calculate a t-statistic t xbartot.gif (1028 bytes) =(x - )/(sx/SQRT(n))
Calculate a two-tailed p-value from a t-statisticp = TDIST(ABS(t),df,2)
Linear Regression Statistics
Statistic or ParameterSymbolEquationsExcel
Slopeb=SLOPE(y data, x data)
Intercepta=INTERCEPT(y data, x data)
Correlationr=CORREL(y data, x data)
Coefficient of Determinationr2 =(CORREL(y data, x data))^2