Perry________________________________________ Section:
________1_____
STATISTICS 101 - Module 1c Written Homework
Scatterplots and Correlation
1.) Many of us may have a sweet tooth. Suppose you enjoy candy every now and then, but you
want to be careful to not overdo it on the calories. You go into a store and pick out a candy
bar. You notice that the calories are not listed on the package. But, there are other
variables listed for the candy bar like fat, carbs, sugar etc. Posted with this homework is a
data set called CandyBars.jmp. The data set contains 75 different candies along with many
variables on each type. Your goal is to find the variable that is most strongly related to
calories.
a.) We will begin by looking at scatterplots of Calories versus a few variables. Use the data set
and JMP to produce a scatterplot of Calories versus Total fat, Carbohydrate, Sugars, and
Protein. Copy and paste your graphic to turn in with this homework. See the JMP guide for
this section if you need any help with the commands. Note: Put Calories in first, then the
other variables of interest. This way you only have to look at the top row.
b.) Find the scatterplot for Calories and Sugars. Interpret the scatterplot.
There is a very weak correlation between calories and sugar. It is positive and
there are several notable outliers
c.) Find the scatterplot for Calories and Total fat. Interpret the scatterplot.
There is a strong, positive linear relationship between calories and total fat
d.) Based on all the scatterplots, which variable appears to have the strongest relationship with
Calories? Why?
Calories and total fat appear to have the strongest linear relationship, there
are no outliers and they have a very strong correlation coefficient
1. Using the correlation values, what variable has the strongest correlation with Calories?
What variable has the weakest correlation with Calories? What are the values of each
correlation?
Strongest variable: _______Total fat_______________________________
Correlation value: ___0.8071________
Weakest variable: ______________Carbohydrate_________________________
Correlation value: _______0.3992_____