1.Explain the significance of: James Madison, Great compromise, Three-Fifths compromised, popular sovereignty, federalism, separation of powers, checks and balances, veto,
reignty, federalism, separation of powers, checks and balances, veto, amendment.
2.Can domestic dogs understand human body cues such as leaning? The experimenter leaned toward one of two objects and recorded
two objects and recorded whether or not the dog being tested correctly chose the object indicated. A four-year-old male beagle named Augie participated in this study. He chose the correct object 42 out of 70 times when the experimenter leaned towards the correct object.
(a) (2 points) Let the parameter of interest, π, represent the probability that the long-run probability that Augie chooses correctly. Researches are interested to see if Augie understands human body cues (better than gussing).
Fill in the blanks for the null and alternative hypotheses.
H0 : Ha :
(b) (6 points) Based on the above context, conduct a test of significance to determine the p-value to investigate if domestic dogs understand human body cues. What conclusion will you draw with significance level of 10%? (If you use an applet, please specify which applet you use, and the inputs.)
(c) (5 points) Based on the above context, conduct a test of significance to determine the p-value to investigate if domestic dogs understand human body cues. What conclusion will you draw with significance level of 5%? (If you use an applet, please specify which applet you use, and the inputs.)
(d) (2 points) Are your conclusions from part (b) and (c) the same? If they are different, please provide an explanation.
(e) (5 points) Shown below is a dotplot from a simulation of 100 sample proportions under the assump- tion that the long-run probability that Augie chooses correct is 0.50. Based on this dotplot, would a 90% confidence interval for π contain the value 0.5? Explain your answer.
(f) (4 points) Compute the standard error of the sample proportion of times that Augie chose the object correctly.
(g) (5 points)
(h) (3 points) question?
(i) (4 points)
(j) (4 points) A.
Construct an approximate 95% confidence interval for π using the 2SD method. What is the margin of error of the confidence interval that you found in the previous
How would you interpret the confidence interval that you found in part (g)?
Which of the following is a correct interpretation of the 95% confidence level?
If Augie repeats this process many times, then about 95% of the intervals produced will capture the true proportion of times of choosing the correct objective.
About 95% times Augie chooses the correct objective.
If Augie repeats this process and constructs 20 intervals from separate independent sam- ples, we can expect about 19 of those intervals to contain the true proportion Augie chooses the correct objective.
(k) (4 points)
object 21 out of 35 times.
Conjecture how, if at all, the center and the width of a 99% confidence interval would change with these data, compared to the original 2SD 95% confidence interval.
The center of the confidence interval would . The width of the confidence interval would .
(l) (4 points) Suppose that we repeated the same study with Augie, and this time he chose the correct object 17 out of 35 times, and we also change the confidence level from 95% to 99%. Conjecture how, if at all, the center and the width of a 99% confidence interval would change with these data, compared to the original 2SD 95% confidence interval.
Suppose that we repeated the same study with Augie, and this time he chose the correct
The center of the confidence interval would The width of the confidence interval would
3.Problem 4 (20 points). Consider the Newmarket 5K dataset. A dataset with certain rows deleted (where there was no Age
there was no Age and/or Sex entered) has been created. It is called Newmarket_Cleaned.csv.
Data file: Newmarket_Cleaned.csv
(Links to an external site.)
. Direct HTTP link: https://unh.box.com/shared/static/p8x4xlbean3rlslmfskfu74fe89yjui8.csv
Please use it for this problem. Consider the “Model 4” developed in class, which contained Age, Age^2, and Sex as independent variables. First re-create this model with the cleaned dataset (you will need to create the Age^2 column; you can verify your model summary against the one from class), and then proceed with this problem.
Make the necessary changes to the model to incorporate Year as a categorical explanatory variable (you may need to change how R interprets the variable, before running this model). Does the year of the race seem to be related to mean finishing time, after taking into account the other variables in Model 4? Use the 0.05 significance level. Explain your work/logic, and key output to support your answer.
Using the updated model, after accounting for age and sex, what is the expected difference in finishing times for runners from 2008 versus runners from 2004?
Using the updated model, after accounting for age and sex, what is the expected difference in finishing times for runners from 2014 versus 2011?
In the above, the directions are to treat Year as a categorical variable. Explain how the regression model would be different if it instead were treated as a numeric variable. What are the assumptions being made about the effect of Year in the two different approaches?
4.ATTACHED ARE GRAPHS
Q1 to Q4
The job satisfaction for the four occupational groups were used (lawyer, physical therapist, cabinetmakers, and
oups were used (lawyer, physical therapist, cabinetmakers, and system analysts). The results obtained for a sample of 5 individuals from each groups. Using the "ANOVA Output" below, please answer the following questions ( Use the significance level 5%).
Q1. The value of the test statistic is ____________
Q2. The p- value of the test is _________________
Q3. At the 5% significance level, the null hypothesis is rejected if the value of the F statistics is >= _________________
Q4. Interpret the ANOVA result at the 5% significance level. Is there any difference in the job satisfaction among the four occupational groups? Answer either yes or no. Explain the reason of your answer statistically.
Data from a Trucking Company is Southern California were utilized to examine the relationship among total daily travel time (y), miles to traveled (X1), and the number of deliveries (x2). Based on the "Regression Output" below, please answer the following questions.
Q5. The number of sample used in this regression analysis is______________
Q6. What is the value of the coefficient of determination?
Q7. What is the F test statistic value for the regression model significane test?
Q8. What is the predicted travel time for X1 =95, and X2= 6?
Q9. Is X2 (number of deliveries) related to Y (travel time)? Answer either yes or no. Explain the reason of your answer statistically.
ATTACHED ARE GRAPHS