Small bits of data can yield interesting patterns, such as the
spirals in nature that give rise to the golden mean.
We will use our creative inquiry
techniques
to analyze larger data sets with many dimensions of data
for patterns. Work with someone else for the class data
portions of the lab, but
your stock market answers will differ.
Class Data
Go to this lab (under the class highlights page)
and then click on this Class Data Excel File.
The computer should then download the file.
Open the file classdata.xls from Excel.
You will see an Excel table filled in - Dr. Sarah took the data
from the sheets you filled in, and put it into Excel.
Average or Mean
To calculate the average of a bunch of numbers, we
add them all up, and then divide by how many numbers we have.
Mental Rotations Test (MRT) I graded these on a scale of 0, 1, or 2.
A 2 means that you scored both choices correctly (choices a and d),
while 1 means that one of your
choices was correct, while the other was incorrect.
- Click on Q1, which is empty. Type in
=average(n2:n26)
and hit Enter. This will calculate the average of the female
scores on the Timed MRT. What is this average?
- Click on Q2. Type in
=average(n27:n31)
and hit Enter. This will calculate the average of the male scores on the
Timed MRT. What is this average?
- Click on Q3, which is empty. Calculate the average of the female
scores on the Untimed MRT. What is this average?
- Click on Q4. Calculate the average of the male scores on the
Untimed MRT. What is this average?
- Many studies have shown that
there are gender differences on the Timed MRT in that men perform better,
but that women perform just as well on the untimed MRT.
It makes sense that people (men or women)
would perform better under less time pressure
(although it just depends on the person).
Does our class data follow these patterns?
- Search the web for
Lawrence Summers. Then find at least two different perspectives about the
performance of women in mathematics (for example, one from Summers and the
other from someone else). Summarize the perspectives in your own words.
Real Words Remembered versus Nonsense Words Remembered
- Click on Q5. Type in
=average(f2:f31)
and hit Enter. This will calculate the class average of the real words
remembered. What is this average?
- Click on Q6. Calculate the class average of the
nonsense words
remembered. What is this average?
- Notice that the average for the real words remembered is larger than
for the nonsense words remembered. Explain
why we would have expected to obtain these results.
Height and Armspan
- Click on Q7. Calculate the class average of armspan.
What is this average?
- Click on Q8. Calculate the class average of height.
What is this average?
-
Search the web and/or texts
to find information about Leonardo da Vinci's speculation
about the relationship between armspan and height. Summarize what you found
in your own words.
- Search the web to find information about the ape index and its
relationship to rock climbing and summarize what you found.
Then identify yourself on the class data list (but don't tell
anyone else which row you are in!) and calculate your own index:
Mean, Median and Mode as Measures of Center
Mean/AverageTo calculate the average of a bunch of numbers, we
add them all up, and then divide by how many numbers we have.
MedianTo calculate the median,
we first put our data in increasing order, and then find the middle number
and place. If there is no such middle number, we take the 2 numbers
next to the middle place, and take the average of them.
For example,
0,0,1,3 has an average of 1, since we would take (0+0+1+3)/4 = 1.
It has a median of .5, since the middle place is between the 2nd 0 and the 1.
Since there is no actual number in this place, we take the average of 0 and 1,
which is .5.
The median is both a place and a number.
ModeTo calculate the mode, we take the most frequently occurring
value. For example, 0 would be the mode of 0,0,1,3 since it occurs
more frequently than any other value.
Mean, median and mode are all different measures of the data's
"center", and
certain circumstances make each better to use at times
(as we'll see in the Heart of Mathematics readings and in class).
Number of Siblings
- Click on Q9. Type in
=median(c2:c31)
and hit Enter. Excel can quickly calculate the class median of the
number of siblings even when the data is not in increasing order.
What is this median?
- Notice that
the mode of the number of siblings is 1 because it occurs the most times.
- Click on Q10. Calculate the class average of the
number of siblings. What is this average?
Notice that the average is higher
since the data is not weighted evenly about the middle place
If you think of the median place as the middle of a scale, then the lower
numbers (0s and 1s) pull the average down
while the higher numbers pull it up. While there are an equal number
of items on
either side of this scale, the average equals
the median only when these pulls cancel each other out, if the
data is evenly weighted about the median.
This does not happen in this case.
The 0s and 1s pull the average down
less than the higher numbers pull it up. Here I have reordered the sibling
data.
0
0
0
1
1
1
1
1
1
1
1
1
1
1
1
(the median place and number lies here between two numbers)
1
1
(the average lies here between two numbers)
2
2
2
2
2
2
2
2
3
3
3
4
5
What is a reasonable measure of center in this case? Well, the median
is the actual center of the data and so it
is a good measure of center. In this case it tells us that half the class
has that many or less siblings.
Notice that while the average is not a number in our data set,
and it is not the center of the data, it does tell us about the
distribution of the data.
Finally, the mode just tells us the most frequently occurring
number in the data set.
Distance from Home
Take out your paper copy of the class data and turn to the side where
it is ordered by distance from home.
- We know the median will separate the ordered data in half.
Click on Q11. Calculate the class median of the
distance from home. Verify that this matches what you thought the median
should be.
What is this median?
- Click on Q12. Type in
=mode(d2:d31)
and hit Enter. This will calculate the class mode of the
distance from home. What is this mode?
- Calculate the class average of the distance
from home in Q13. What is this average?
-
Write out the distance from home data in increasing order and mark where the
average and median are,
(just like Dr. Sarah did for the median and average with the
number of siblings).
- Explain why the average is higher than the median by using the idea of
a scale balancing about the median.
Make sure we have checked over your numerical answers.
Then, under File, release on Close, and click on Don't Save.
Stock Market Data
go into ASULearn and to the personal file storage space.
Click on View Single, and then the various numbers until you find your
stock file.
For example, my stock is titled DrSarahMSFT.xls
Download the file by using ctrl and then click on the filename, and then
save the link.
Excel should still be open. Open the file using File/Open.
The Mean and Median of Volume
Volume is the number of shares
bought or sold in a given day.
-
What Excel box contains the first data entry for Volume?
- What Excel box contains the last data entry for Volume?
- Click on g2, which is empty, and calculate the
average of Volume. What is this value?
- Calculate the median of Volume
in G3. What is this value?
- On the stock update packet
turn to the page with the graph on it. The volume is represented as
a bar graph on the bottom. To read the volume on a given day, you go to the
top of the corresponding
rectangle, go over to the left of the graph, and then read the
number from there.
Draw a horizontal line
at the point on the y-axis which matches the
median of Volume on this graph. This may or may not actually
correspond to a specific day that attained this volume.
Show us so that we
can mark off that you have done this correctly.
- DISCUSSION POINTS (leave until later)
Recall that the median is the middle place after we put the data in
increasing
order and that if we think of this as the center of a scale, then
we can see whether the mean is higher or lower by seeing whether the data
tips the scale to one side or another (you may wish to review my
explanation in the Sibling section above).
Even if the data is not in increasing order (on your graph it is not),
we can still use the idea of the median as the center of a scale to see
whether the data tips the scale higher or lower than the median.
Use only the bars on your graph and your knowledge of
mean compared to median to discuss why you can see from these bars whether
the mean is above or below the median.
The Mean and Median of High
High is the highest price that
a stock hits each day.
- What is the average of the data in the High column?
- What is the median of the data in the High column?
- What is the smallest High value? You can use a command like
=QUARTILE(D2:D66,0) where you change 66 to the number of rows of data you have.
The quartiles are the 25% markers, so the 0th quartile is the lowest
data point, the 1st quartile = q1 is the 25% marker, etc.
- What is the largest High value?
You can use a command like =QUARTILE(D2:D66,4)
- DISCUSSION POINTS (leave until later)
Many of you may have very similar
numbers here, even though your stock fluctuated up and down from one
day to the next. Explain what is going on using your knowledge of the
difference between mean and median, the relatively short time span
of the data, generalities about the price ranges of stocks in this kind of
time span,
and your knowledge of the stock market from past hw readings
on the stock market (if you don't recall the readings,
review these so that you can
incorporate them here).
Representations
- Skim through Dr. Sarah's MSFT
conflicting graph viewpoints
web page.
- Look for 2 graphs
which chart your stocks progress
that show that different representations of similar
data can be used to illustrate conflicting viewpoints:
Go to
http://stocks.usatoday.com/custom/usatoday-com/html-ichart.asp?symb=MSFT
but change MSFT to your stock symbol in the "Enter Symbol" box.
Under Time Frame: change the time frame to 5 years and then click on
Draw Chart to look at different graphs of the performance of your stock.
Split the 5 year graph up into two graphs with conflicting
viewpoints (for example, one, that might spans a few months,
with the prices increasing, and another
with the prices decreasing) - then roughly sketch the general shapes and
include the time frames.
- Do stock market update #1 on your stock packet [What if we had sold today]
and show us to check off and then work on the remaining discussion
questions above. When you are finished, work on homework or upcoming
projects until we come back together.