MS 150 Statistics 4.4 linear regression review
Malware 2005 - 2006
|Month||Month num (x)||Threats in 100,000s (y)|
According to Trend Micro, 2006 saw a dramatic resurgence in malware threats. The number of new malware threats (computer viruses, worms, exploits, and rootkits) appearing per month climbed steadily throughout 2006. Use the third column, the y data, to answer the following questions.
- __________ Find the mode of the threats column.
- __________ Find the median of the threats column.
- __________ Find the mean of the threats column.
- __________ Find the standard deviation of the threats column.
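As a reminder of how these four statistics are computed, here is a minimal Python sketch. The list `threats` holds made-up placeholder values, not the values from the table, so substitute the y column before relying on any result. The sketch also assumes the sample standard deviation (dividing by n - 1), which is what a spreadsheet's STDEV function computes.

```python
import statistics

# Hypothetical placeholder values, NOT the worksheet's threat data.
threats = [2, 3, 3, 4, 5, 7, 8, 10]

mode = statistics.mode(threats)      # most frequently occurring value
median = statistics.median(threats)  # middle value (mean of the two middle values when n is even)
mean = statistics.mean(threats)      # sum of the values divided by n
stdev = statistics.stdev(threats)    # sample standard deviation (divides by n - 1)

print(mode, median, mean, round(stdev, 4))
```

For the placeholder list the mode is 3, the median is 4.5, and the mean is 5.25.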
Use the second and third columns in the table above to find the linear regression (best fit) line through the data and to answer the questions below.
- ______________ Use the computer to plot the data. Does the relationship appear to be linear (roughly a straight line) or non-linear (curved)?
- ______________ Determine the slope of the linear regression for the data.
- ______________ Determine the y-intercept of the linear regression for the data.
- ______________ Determine the correlation coefficient r.
- ______________ Is the correlation positive or negative?
- ______________ Is the correlation none, weak, moderate, strong, or perfect?
- ______________ Determine the coefficient of determination.
- ______________ What percent of the variation in the malware threats is "explained" by the variation in month number?
- ______________ Given that the trend has held to date, use the slope and intercept above to calculate the predicted number of malware threats in September 2007 (for the month number use 22).
- ______________ Presume that the trend will continue. Use the slope and intercept to calculate the month number in which the threats will reach 20.
- ______________ Toughie: What month name and year does the above month number correspond to?
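The regression questions above can be sketched in Python as follows. The `xs` and `ys` lists are made-up placeholder pairs, not the table's data, and the helper names `predict_y` and `solve_for_x` are hypothetical names chosen for illustration. The sketch uses the least-squares formulas directly: slope = Sxy / Sxx, intercept = mean(y) - slope × mean(x), and r = Sxy / sqrt(Sxx × Syy).

```python
import math

# Hypothetical placeholder (x, y) pairs, NOT the worksheet's data.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n

# Sums of squared deviations and cross products.
sxx = sum((x - mean_x) ** 2 for x in xs)
syy = sum((y - mean_y) ** 2 for y in ys)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

slope = sxy / sxx                      # change in y per unit change in x
intercept = mean_y - slope * mean_x    # predicted y when x = 0
r = sxy / math.sqrt(sxx * syy)         # correlation coefficient
r_squared = r ** 2                     # coefficient of determination


def predict_y(x):
    """Predicted y for a given x on the best-fit line."""
    return slope * x + intercept


def solve_for_x(y):
    """The x at which the best-fit line reaches a given y."""
    return (y - intercept) / slope


print(slope, intercept, r, r_squared)
print(predict_y(22), solve_for_x(20))
```

The sign of `r` answers the positive-or-negative question, and `r_squared` multiplied by 100 gives the percent of variation explained. Solving for x generally yields a non-integer month number, which then has to be mapped to a calendar month.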
Data based on the white paper The trend of threats today: 2006 Annual Threat Roundup and 2007 Forecast ©2007 by Trend Micro Incorporated.