Difference between revisions of "Come back after 30 days!/Findings"
Jump to navigation
Jump to search
Line 59: | Line 59: | ||
Week 5 | Week 5 | ||
|align="left"| | |align="left"| | ||
− | *Exploratory data analysis: Thought of the ways to handle 2 variables that have missing values: proposed regression imputation for the variable that has 40% values missing but was cautioned that it may introduce errors. | + | *Exploratory data analysis: Thought of the ways to handle 2 variables that have missing values: proposed regression imputation for the variable that has 40% values missing but was cautioned that it may introduce errors. Literature was obtained mostly from [http://www.uvm.edu/~dhowell/StatPages/More_Stuff/Missing_Data/Missing.html David Howell's page] |
+ | |||
*Consultation with Prof. Kam: | *Consultation with Prof. Kam: | ||
**For the variable that has 40% missing values, we were advised to conduct a two-pronged approach (i.e. a model without 40% of the data and a model without the variable entirely) in which the eventual models can be used to compare predictive power and therefore, able to make a judgment as to whether the variable was considered a predictor. Were advised that this sort of judgment can be considered as an eventual recommendation. | **For the variable that has 40% missing values, we were advised to conduct a two-pronged approach (i.e. a model without 40% of the data and a model without the variable entirely) in which the eventual models can be used to compare predictive power and therefore, able to make a judgment as to whether the variable was considered a predictor. Were advised that this sort of judgment can be considered as an eventual recommendation. |
Revision as of 14:50, 7 February 2015
![]() |
![]() |
![]() |
![]() |
![]() |
Summary of findings by week
This is the summary of pointers and major decisions the team have taken in this data mining project. Visit our documentation page for slides for more information.
Week 2 |
|
Week 3 |
|
Week 4 |
|
Week 5 |
|