MATH 1280-01 Assignment Unit 4

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 6

MATH 1280 --1 Introduction to Statistics

Assignment Unit 4
University of the People
22nd, February 2022
Data
1. Using the list of 17 numbers at the top of the page, the median of this data, rounded
to two decimal places, is: _____.
Arranging data -1.9 -1.5 -1.1 -0.1 0.5 1.2 1.8 2.1 2.8 3.2 3.9 4.1 6.5 7 7.2 13 18
Selecting 0.5*17th value
Median = 2.82
2. If you find the median using the original method (paper and pencil), you have to arrange
the values into numeric order (True/False).______________________ True
3. (The calculations MUST be done manually, do not use R) The interquartile rang for
this data is (round each value to 3 decimal places):_______.
Rounding off to 3 decimal places and arranging: -1.9 -1.5 -1.1 -0.1 0.5 1.2 1.8 2.1 2.8 3.2
3.9 4.1 6.5 7 7.2 13 18
Getting the lower quartile 0.25*17value = 0.500
Getting the upper 0.75*17 value = 6.500
Getting the interquartile range = 6.500-0.500= 6.000
4. The formula for calculating the interquartile range is_____________ (show the
formula and a citation to the source that you used).
Interquartile range = upper quartile – lower quartile (Yakir, 2011)
Or
IQR= Q3-Q1

5. (The calculations MUST be done manually, do not use R) Using techniques that we
studied in this course, the upper and the lower cutoff points (rounded to three
decimal places) for identifying outliers in the given data sample are: -8.5 and 15.5
(this is not a request to show any outliers—just the cutoff points that would
determine what constitutes an outlier.) You may round to three decimal place
Lower cutoff = lower quartile – 1.5*interquartile range = 0.500 – 1.5*6.000 = -8.500
Upper cutoff = upper quartile + 1.5*interquartile range = 6.500 + 1.5*6.000 = 15.500
6. The summary() command shows a list of outliers, if there are any
(True/False):______________________
False
7. The list of outlier values is:_____________ (if there are none, write "NA").
18
8. The standard deviation of the list of 17 numbers is (round to 3 decimal places):
______________
Finding the average (mean) 1/17*(sum of all values) =1/17*66.700 = 3.924
Finding the sum of squared deviations (x -mean)^2 - variance 1/16*(x-mean)^2 = 27.557
Standard deviation = square root of variance = 5.249
Random Variable

9. The missing probability value (under the number 4) in the random variable table
above is:_______

=1-0.85= 0.15

10. The sum of the probabilities in the second row of any random variable table like the
one above should equal (round to 3 decimal places):
_______________________________________________1.000

11. Read section 4.4.1 in the book (Yakir, 2011).  Do the numbers in the table above (for
the random variable) represent a data sample (Yes/No)? No

12. In the random variable table shown above, the value in the second row represents
the cumulative probability of the corresponding values in the first row (True/False)
_________False

13. The probability that a randomly selected value from this random value will be less than
or equal to 3 is:_____.

10 + .15 + .25 + .10 =0.6

14. What is the probability that a randomly selected value from the random variable would be
exactly 1.5? ___________ . 0 (since -1.5 is not part of the sample space)

15. Review section 4.4 in the book (Yakir, 2011), especially pages 57—58.  The
expectation of the random variable is:______.

E(X)= 0*.10 + 1*.15 + 2*.25 + 3*.10 + 4*.15 +5*.10 + 6*.15 =2.95


16. To find the expectation of a random variable by using a relative frequency table,
you can add the values in the first row of the table and divide by the number of
columns in the table (True/False)_________. False
17. Study Yakir (2011) pp. 57-59 and solved problems 4.1.6-4.1.8.  The (population)
standard deviation of the random variable above is (round to 3 decimal
places):_______ (hint, you can not put values from the table into the sd() function
because the sd() function does not adjust for the probabilities).

1.910

v <- c (0, 1, 2, 3, 4, 5, 6)

> p <- c (0.1, 0.15, 0.25, 0.10, 0.15, 0.10, 0.15)

> variance <- sum((v-E) ^2 * p)

> variance [1] 3.6475

> std.dev = sqrt(variance)

> std.dev

[1] 1.909843

> round(std.dev, 3)

[1] 1.910

18. If you have already calculated the standard deviation of a data sample, what is the
next thing to do to find the variance: ______________________________.

Square the standard deviation and raise to the second power

>1.91^2Variance

[1] 3.6481

A Population:

19. Determine how many observations in the pop3.csv file are of type a:  ____49949.

> getwd()

[1] "/Users/Desktop/IntroStat"
> pop3<-read.csv("pop3.csv")

>table(pop3$type)

a b c

49949 33410 16641

Or

nrow(subset(pop3,pop3$type=='a'))
[1] 49949
>nrow(subset(pop3,pop3$type=='a'))

[1] 49949

20. Using the appropriate R function with the defaul options, what is the median of the
time column of pop3 (round to 3 decimal places):
______________________________.4.473

> median (pop3$time)

[1] 4.472685

Which is 3.473 when rounded off to 3 d.p

21. What is the variance of the time column of pop3 (rounded to three decimal places)?
_____54.916

>var(pop3$time)

[1] 54.91635

> round(var(pop3$time),3)

[1] 54.916
References:

Yakir, B. (2011). Introduction to Statistical Thinking (With R, Without Calculus). Retrieved on


April 19, 2021, from https://my.uopeople.edu/pluginfile.php/57436/mod_book/chapter/37629/
MATH1280.IntroStat.pdf

You might also like