Predictive Analytics Assignment Writing

Posted: 2022-01-14 09:09 (Friday)

TBA2104 Predictive Analytics 2018/19 Sem 2

Individual Assignment 2

Submission Information

  • This assignment contributes 15% to the final course grade. The total mark for this assignment is 15.
  • Deadline: 12 Apr 2019, 2359hrs (Friday, 11:59 pm)
  • Late submission will be penalized.
  • Upload the softcopy (a single pdf or doc/docx) to IVLE workbin (Deliverables Submission > Assignment2)
  • Please ensure that you have written your name and matric number in the document.

1. This question is based on the diabetes dataset (arff). This dataset consists of 768 observations and 9 attributes. Brief descriptions of the attributes are as follows:

  • preg: Number of times the patient has been pregnant
  • plas: Plasma glucose concentration
  • pres: Diastolic blood pressure (mm Hg)
  • skin: Triceps skin fold thickness (mm)
  • insu: 2-hour serum insulin (µU/ml)
  • mass: Body mass index (weight in kg / (height in m)^2)
  • pedi: Diabetes pedigree function
  • age: Age (years)
  • class: Class variable (either tested_negative or tested_positive)

a) Provide the R code for loading the data into a variable Diabetes. (1 mark)

b) Provide the R code for generating the CSV equivalent of the diabetes dataset (csv). (1 mark)

c) Compare and contrast the ARFF format and the CSV format, noting their similarities and differences. (2 marks)

d) Provide the R code for generating a logistic regression model (model) using class as the response and the other attributes as predictors. (1 mark)

e) Using the logistic regression results of the model, write down the equation of the log-odds of the model. Please round all the coefficient estimates to 4 decimal places. (1 mark)
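For orientation, the general shape of a log-odds equation with the eight predictors listed above can be sketched as follows. The symbols are assumptions introduced here for illustration: $p$ stands for the probability the fitted model assigns to the class it models, and $\beta_0, \dots, \beta_8$ stand for the coefficient estimates, which must be read off the fitted model and are not given in this document.

$$
\log\frac{p}{1-p} = \beta_0 + \beta_1\,\text{preg} + \beta_2\,\text{plas} + \beta_3\,\text{pres} + \beta_4\,\text{skin} + \beta_5\,\text{insu} + \beta_6\,\text{mass} + \beta_7\,\text{pedi} + \beta_8\,\text{age}
$$

Writing $z$ for the right-hand side, the probability is recovered by inverting the log-odds:

$$
p = \frac{1}{1 + e^{-z}}
$$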

f) We learned that logistic regression uses a logistic function: Pr(Y = REFERENCE_CLASS | data), i.e. the probability of class = REFERENCE_CLASS given a data point. It turns out that R uses the first level value of a factor-type attribute as the reference class.

> Diabetes$class

[1] tested_positive …

[764] tested_negative …

Levels: tested_negative tested_positive

That is, from the above output, we can tell that the REFERENCE_CLASS is tested_negative. Using this model, determine whether the following testing data point should be classified as tested_negative or tested_positive. Show the step-by-step (mathematical) working for how you arrive at this conclusion.

preg  plas  pres  skin  insu  mass  pedi  age
1     123   60    20    0     30    0.3   40

(4 marks)
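The statement above about factor levels can be checked on a toy factor. This is a minimal sketch using made-up values (the variable name outcome and its three entries are hypothetical and are not taken from the diabetes dataset); it only shows how R orders levels and which level comes first.

```r
# Toy factor with hypothetical values (NOT the diabetes data)
outcome <- factor(c("tested_positive", "tested_negative", "tested_positive"))

# By default, factor() sorts the levels alphabetically,
# regardless of the order in which the values appear in the data
all_levels <- levels(outcome)
print(all_levels)

# The first level is the one R treats as the baseline
first_level <- all_levels[1]
print(first_level)
```

Here the first observation is tested_positive, yet the first level is still tested_negative, which matches the Levels line in the console output shown earlier.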

g) Provide the R code for verifying the probability value of f) using the predict() function in R. (2 marks)

h) Suppose you want to change the reference class in R to tested_positive. You could use the relevel() function. Read the help pages and provide the R command to change the reference class to tested_positive so that predict() will be based on tested_positive. (1 mark)

i) If you were to generate a new model (model2) using tested_positive as the reference class, what is the difference in the regression model of model2 compared to model? (2 marks)
