MAE256 T1 2023 – ASSIGNMENT
代写statistics作业 This is an INDIVIDUAL Assignment. We strongly discourage plagiarism, as it will be penalized as much as possible.
Due Date: 8:00 pm, Friday, 28 April 2023
Word Limit: 1500 words max excluding appendices, figures and tables.
Weight: 20% of the overall final grade.
General Details 代写statistics作业
(1) This is an INDIVIDUAL Assignment. We strongly discourage plagiarism, as it will be penalized as much as possible. However, it is not collusion if you discuss the questions with other students, but you need to submit your own original work. Note that we may request that you come in and explain your assignment in person if we feel your assignment is too similar to another student’s work.
(2) This assignment corresponds to 20% of your final grade.
(3) Once completed, you will need to submit your ‘Microsoft Word document via CloudDeakin. You must submit a single file only that contains a cover page with your name and student ID.
If you are submitting your assignment as a PDF document, please ensure that you are also submitting it as a Word document to enable word counting.
Please ensure the Word document is self-contained (i.e. all Excel output tables for summary statistics and regressions, and all figures should be in the word document). You will not need to submit a hard copy.
(4) Whenever you are asked to estimate a regression model, please provide your summary output estimation results in a tabular format from Excel in your Word document (using the copy/paste tool) to evidence the actual regression you run.
(5) Round results to the second decimal.
Regression Analysis using Cross Section Data 代写statistics作业
We will study the 2022 Australian election for the House of Representatives. There are 151 members in the House of Representatives—one for each of Australia’s 151 electoral districts.
When voting, voters rank their preferences over candidates. We will look at the two-party-preferred vote,which results after preferences have been distributed to the highest two candidates. We aim to explain the share of votes the Australian Labor Party received at the two-party-preferred vote using the electorate’s demographic.
The excel file “Assignement T1 2023 voting” includes data for 150 electorates. 1 The variables are:
Division: Name of the electoral district.
Laborpartyper: Percentage of the vote the Labor party received in the two-party-preferred vote in that electoral district.
Medianweekinc: The median weekly household income in that electoral districts
Languageper: Percentage of households who speak a language other than English at home in that electoral district.
Ownedoutright: Percentage of households who own their property without a mortgage in that electoral districts.
State: State in which the electoral district is located.
1) Present a scatter plot with the percentage of votes for the Labor party on the Y-axis and the median weekly income on the X-axis. Similarly, create two more scatter plots by keeping the same Y-axis variable but replacing the X-axis variable first with the percentage of households who speak a language other than English at home and then with the percentage of households who own their property without a mortgage.
For each of these three scatter plots, do you observe any correlation between the variables plotted? Explain. (3 Points) 代写statistics作业
2) Estimate the following regression models and interpret the estimates:
a.𝐋𝐚𝐛𝐨𝐫𝐩𝐚𝐫𝐭𝐲𝐩𝐞𝐫 = β0 + β1𝐌𝐞𝐝𝐢𝐚𝐧𝐰𝐞𝐞𝐤𝐢𝐧𝐜 + 𝐮
b.𝐋𝐚𝐛𝐨𝐫𝐩𝐚𝐫𝐭𝐲𝐩𝐞𝐫 = β0 + β1𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞𝐩𝐞𝐫 + 𝐮
c.𝐋𝐚𝐛𝐨𝐫𝐩𝐚𝐫𝐭𝐲𝐩𝐞𝐫 = β0 + β1𝐎𝐰𝐧𝐞𝐝𝐨𝐮𝐭𝐫𝐢𝐠𝐡𝐭 + 𝐮
(6 Points)
3) Which of these three models best fits the data? Explain.(2 Points)
4) Estimate the following regression model and interpret the estimated coefficient
𝐋𝐚𝐛𝐨𝐫𝐩𝐚𝐫𝐭𝐲𝐩𝐞𝐫 = β0 + β1𝐌𝐞𝐝𝐢𝐚𝐧𝐰𝐞𝐞𝐤𝐢𝐧𝐜 + 𝛃𝟐𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞𝐩𝐞𝐫 + 𝛃𝟑𝐎𝐰𝐧𝐞𝐝𝐨𝐮𝐭𝐫𝐢𝐠𝐡𝐭 + 𝐮 (5 Points)
5) Among the four models you estimated in question 2 and 4 which one best fit the data. Explain.(2 Points)
1 The electorate of Hawke has been created recently and we do not have demographic data for it. We therefore dropped it.
6) Burwood is in the electorate of Chisholm. Predict the share of vote Labour will get in Chisholm using the model estimated in (4).(3 Points)
7) Geelong is in the electorate of Corio. Predict the share of vote Labour will get in Corio using the model estimated in (4).(3 Points)
8) Assuming that there are 52 weeks in a year, create a new variable 𝐌𝐞𝐝𝐢𝐚𝐧𝐲𝐞𝐚𝐫𝐥𝐲𝐢𝐧𝐜 with the household yearly income. Estimate the model:
𝐋𝐚𝐛𝐨𝐫𝐩𝐚𝐫𝐭𝐲𝐩𝐞𝐫= β0 + β1𝐌𝐞𝐝𝐢𝐚𝐧𝐰𝐞𝐞𝐤𝐢𝐧𝐜 + 𝛃𝟐𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞𝐩𝐞𝐫 + 𝛃𝟑𝐎𝐰𝐧𝐞𝐝𝐨𝐮𝐭𝐫𝐢𝐠𝐡𝐭+ 𝛃𝟒𝐌𝐞𝐝𝐢𝐚𝐧𝐲𝐞𝐚𝐫𝐥𝐲𝐢𝐧𝐜 + 𝐮
Was there an error in the estimation? If so, why?(2 Points)
9) Estimate the following regression model and interpret the estimated slope coefficients (no need to interpret the intercept) 代写statistics作业
Log(𝐋𝐚𝐛𝐨𝐫𝐩𝐚𝐫𝐭𝐲𝐩𝐞𝐫) = 𝛽0 + 𝛽1 𝐥𝐨𝐠(𝐌𝐞𝐝𝐢𝐚𝐧𝐰𝐞𝐞𝐤𝐢𝐧𝐜) + 𝜷𝟐𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞𝐩𝐞𝐫 + 𝜷𝟑𝐎𝐰𝐧𝐞𝐝𝐨𝐮𝐭𝐫𝐢𝐠𝐡𝐭 +𝐮 (4 Points)
10)Test whether the coefficient of log(𝐌𝐞𝐝𝐢𝐚𝐧𝐰𝐞𝐞𝐤𝐢𝐧𝐜) is statistically significant at the 5% level using a two sided test.(4 Points)
11)Re-estimate the model of question (9) but now by including dummy variable for each states and using NSW as the baseline.(2 Points)
12) Are the dummy variable controlling for each states jointly significant at the 5% level?(4 Points)