Difference between revisions of "ZAN Project Findings"

From Analytics Practicum
Jump to navigation Jump to search
 
(9 intermediate revisions by the same user not shown)
Line 24: Line 24:
  
 
| style="padding:0.3em; font-family:Helvetica; font-size:110%; border-bottom:2px solid #228B22; border-top:2px solid #228B22; background:#228B22; text-align:center;" width="10%" |
 
| style="padding:0.3em; font-family:Helvetica; font-size:110%; border-bottom:2px solid #228B22; border-top:2px solid #228B22; background:#228B22; text-align:center;" width="10%" |
[[ZAN_Team|<font  face ="Lucida Grande" color="#FFFFFF"><strong>ABOUT US </strong></font>]]
+
[[Team_ZAN|<font  face ="Lucida Grande" color="#FFFFFF"><strong>ABOUT US </strong></font>]]
 +
| style="border-bottom:2px solid #228B22; border-top:2px solid #228B22; background:#228B22;" width="1%" | &nbsp;
 +
 
 +
| style="padding:0.3em; font-family:Helvetica; font-size:110%; border-bottom:2px solid #228B22; border-top:2px solid #228B22; background:#228B22; text-align:center;" width="10%" |
 +
[[ANLY482_AY2016-17_Term_2|<font  face ="Lucida Grande" color="#FFFFFF"><strong>BACK TO MAIN ANLY82 </strong></font>]]
 
| style="border-bottom:2px solid #228B22; border-top:2px solid #228B22; background:#228B22;" width="1%" | &nbsp;
 
| style="border-bottom:2px solid #228B22; border-top:2px solid #228B22; background:#228B22;" width="1%" | &nbsp;
 
|}
 
|}
Line 31: Line 35:
 
<!--Sub Navigation-->
 
<!--Sub Navigation-->
 
{|style="background-color:#ffffff; color:#000000; font-size:10pt" padding: 5px 0 0 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"|
 
{|style="background-color:#ffffff; color:#000000; font-size:10pt" padding: 5px 0 0 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"|
|style="padding:0.4em; text-align:center; border-top:1px solid #ffffff; border-bottom:1.5px solid #005ae6; " width="10%" | [[Mid-Term Progress |<font color="#000a1a"><b>Mid-Term Progress</b></font>]]
+
|style="padding:0.4em; text-align:center; border-top:1px solid #ffffff; border-bottom:1.5px solid #005ae6; " width="10%" | [[ZAN_Project Findings |<font color="#000a1a"><b>Mid-Term Progress</b></font>]]
|style="padding:0.4em; text-align:center; border-top:1px solid #ffffff; border-bottom:1.5px solid #005ae6; " width="10%" | [[ANLY482 AY1516 G1 Team Skulptors - Project Description |<font color="#005ae6"><b>Description</b></font>]]
+
 
|style="padding:0.4em; text-align:center; border-top:1px solid #ffffff; border-bottom:1.5px solid #005ae6; " width="10%" | [[ANLY482 AY1516 G1 Team Skulptors - Warehouse Tour |<font color="#005ae6"><b>Warehouse Tour </b></font>]]
+
 
|style="padding:0.4em; text-align:center; border-top:1px solid #ffffff; border-bottom:1.5px solid #005ae6; " width="10%" | [[ANLY482 AY1516 G1 Team Skulptors - Methodology |<font color="#005ae6"><b>Methodology </b></font><sup><font style="font-size:80%" color="#ff0000">new!</font></sup>]]
+
|style="padding:0.4em; text-align:center; border-top:1px solid #ffffff; border-bottom:1.5px solid #005ae6; " width="10%" | [[Final Progress |<font color="#005ae6"><b> Final Progress</b><sup><font style="font-size:80%" color="#ff0000">new!</font></sup></font>]]
|style="padding:0.4em; text-align:center; border-top:1px solid #ffffff; border-bottom:1.5px solid #005ae6; " width="10%" | [[ANLY482 AY1516 G1 Team Skulptors - Technology |<font color="#005ae6"><b>Technology </b></font><sup><font style="font-size:80%" color="#ff0000">new!</font></sup>]]
 
|style="padding:0.4em; text-align:center; border-top:1px solid #ffffff; border-bottom:1.5px solid #005ae6; " width="10%" | [[Final Progress |<font color="#005ae6"><b> Final ProgressI</b><sup><font style="font-size:80%" color="#ff0000">new!</font></sup></font>]]
 
 
|}
 
|}
 
<br/>
 
<br/>
  
 
<!--End of Navigation Bar-->
 
<!--End of Navigation Bar-->
 +
 +
<div align="left">
 +
<div style="background: #F5FFFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #2E8B57 solid 32px;"><font color="##4682B4">Data Cleaning</font></div>
 +
<br/>
 +
The data had 77,205 records initially. The following diagram shows our team's general data cleaning procedures.
 +
<center>
 +
[[Image:AY2017_ZAN_Data_Cleaning.png|700px]]
 +
</center>
 +
<br/>
 +
After the the data cleaning, the data now has 63,511 records.
 +
<br/>
 +
 +
<div align="left">
 +
<div style="background: #F5FFFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #2E8B57 solid 32px;"><font color="##4682B4">Data Exploration</font></div>
 +
Due to the sensitivity and confidentiality of the data, please refer to the elearn dropbox or send us an email.
 +
<br/>
 +
 +
<div align="left">
 +
<div style="background: #F5FFFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #2E8B57 solid 32px;"><font color="##4682B4">Data Modelling</font></div>
 +
Due to the nature of the data, our team has decided to prepare 3 separate analytical sandboxes for the models.
 +
<br/>
 +
# Sandbox 1 (Per episode): The dependant variable has only 2 levels. Thus, we will run logistic regression and decision tree.
 +
# Sandbox 2 (Per episode): The dependant variable has 3 levels. Thus, we will run multinomial logistic regression and decision tree.
 +
# Sandbox 3 (Per patient): The dependant variable has 3 levels. Thus, we will run multiple linear regression
 +
<br/>

Latest revision as of 13:54, 23 April 2017


HOME

 

PROJECT OVERVIEW

 

PROJECT FINDINGS

 

PROJECT MANAGEMENT

 

DOCUMENTATION

 

ABOUT US

 

BACK TO MAIN ANLY82

 


Mid-Term Progress


Final Progressnew!



Data Cleaning


The data had 77,205 records initially. The following diagram shows our team's general data cleaning procedures.

AY2017 ZAN Data Cleaning.png


After the the data cleaning, the data now has 63,511 records.

Data Exploration

Due to the sensitivity and confidentiality of the data, please refer to the elearn dropbox or send us an email.

Data Modelling

Due to the nature of the data, our team has decided to prepare 3 separate analytical sandboxes for the models.

  1. Sandbox 1 (Per episode): The dependant variable has only 2 levels. Thus, we will run logistic regression and decision tree.
  2. Sandbox 2 (Per episode): The dependant variable has 3 levels. Thus, we will run multinomial logistic regression and decision tree.
  3. Sandbox 3 (Per patient): The dependant variable has 3 levels. Thus, we will run multiple linear regression