当前位置:天才代写 > Python代写 > 机器学习代写 > Big Data代写 Economics代写 Economics代写 Machine Learning代写

Big Data代写 Economics代写 Economics代写 Machine Learning代写

2021-01-29 17:42 星期五 所属: 机器学习代写 浏览:53

Big Data代写

Assignment 3

Big Data代写 Consider the two variables in the dataset Assign3.csv. We are interested in predicting the second variable Y given the first variable X.

Machine Learning and Big Data for Economics and Finance

Consider the two variables in the dataset Assign3.csv.Big Data代写

We are interested in predicting the second variable Y given the first variable X.

  1. Fit a linear regression model to the data. Show the data scatter plot on the samefigure with the values predicted by the linear model.
  2. Fit a quadratic regression model to the data. Show the data scatter plot on the same figure with the values predicted by the quadratic.
  3. We are interested in constructing a step function learner asfollows:

First draw a random number U uniformly on the interval spanned by the minimum and maximum values of the inputs (x1; :::; xn) and then use it to construct the following function whose purpose is to give the prediction of Y given X x:

f(x) = a1I(U x) + a2I(U > x);Big Data代写 

where a1 and a2 are just unknown constants to be learned. It goes without saying that I(some statement) is the indicator function that equals 1 when the statement is true and 0 otherwise.

a.Usetwo  different  methods  to  compute the  estimate  f^(x) = a^1I(U 6x)+a^2I(U > x).  Is f^ a strong learner?

b.Use one of the previous two methods to write an Rfunction that takes as input x and the data (x1; :::; xn; y1; :::; yn) and gives as output f^(x).Make sure the function is capable of dealing with the case where x conatains more than one number.Big Data代写

c.Usingthree different runs of the previous function, create three dif- ferent plots where, on each, f^ is shown together with the scatter plot of the

Big Data代写
Big Data代写

4.Write an R function that applies boosting to the previous step function learner.Big Data代写

That R function should take as inputs: the data, B the number of boosting iterations, L the learning rate and an optional argument indicating thesize of the test subsample in case a validation set approach is needed.

As output the function should give:  f^boost  the boosted learner evaluated

at the training data and the training mean squared error evaluated for each iteration b = 1; :::; B of the boosting algorithm. Also, in case the size of the test subsample is greater than zero, the function should output:  f^boost  evaluated at the test sample and the test MSE evaluated for each iteration b = 1; :::; B.Big Data代写

a.Use that  function  to  plot  f^boost   on  top  of  the  data  scatter  plot  for

L = 0.01 and for B = 10000. Show the same with different values of B.

b.Plot the training MSE vs. the number of

c.Was there overfitting when B =10000?Big Data代写

Even though the algorit

NOTE:

hm is described in detail in both the slides and textbook, for the sake of making the implementation easier, its special case per- taining to the questions in the assignment is presented here.

Boosting algorithm:

  1. Inputs:
  • A sample of covariates (i.e. inputs) x1; :::; xn and responses (i.e. out- puts) y1; :::; yn.
  • A (weak) learnerf^.
  • A learning rate L >0.
  1. Initialize:
    • Set f^boost(x  0.
  • Compute  the  first  learner  f^0(x) = a^1I(U x) + a^2I(U > x)  on  the original data.
  • Setri   yi ¡ Lf^0(xi) for i = 1; :::; n.Big Data代写
  1. Do the following for b = 1; :::;B:

a.Givenx1; :::; xn as covariates and r1; :::; rn as responses, fit a learner f^b by first sampling U and then estimating f^b(x) = a^1I(U x) + a^2I(U > x).

b.Set f^boost(x  f^boost(x) + Lf^b(x).

c.Setri   ri ¡ Lf^b(xi).

4.Output: f^boost(x).

Big Data代写
Big Data代写

其他代写:考试助攻 计算机代写 java代写 function代写 paper代写  web代写 编程代写 report代写 数学代写 algorithm代写 python代写 java代写 code代写 project代写 Exercise代写 dataset代写 analysis代写 C++代写 代写CS 金融经济统计代写 C语言代写 北美代写

合作平台:天才代写 幽灵代写 写手招聘 Essay代写

 

天才代写-代写联系方式