Uploading assignment1#1
Conversation
PatelVishakh
left a comment
There was a problem hiding this comment.
Assignment 1:Pending Resubmission. Needs a few changes. Required Changes:
Q1)III) The type of variable is categorical. In a data science setting, this question is asking whether the variable is numerical or categorial (Integer, continuous, ordinal are for distinguishing further) to assess whether classification or regression methods should be used. The complete statement should Class is a Categorical Variable, represented here in integers (0,1,2) stored in our Dataframe as int64.
Q2)I)The more precise explanation is that Knn relies heavily on comparing distances between data points to fit a model to the data. Hence Variables with large scales sway the DISTANCE computation more, influencing the estimation quite incorrectly.
Q2)II) We do not standardize the variable Class not only because it is categorical variable, but also rather because that is the quantity of interest and we would have to rescale our predictions for them to be interpretable.
Q2)IV) need to use predictors_standardized rather than original dataset. This has caused incorrect models for the 3) and 4)
Vishakh Patel [LS]
|
@PatelVishakh Uploaded corrections as per feedback. Thank you. |
PatelVishakh
left a comment
There was a problem hiding this comment.
Assignment 1 Complete! Remaining Changes:
Q3)II)range(1:51) as python doesn’t include last index/ no need to do skip since our grid is not very big.
Vishakh Patel
No description provided.