Difference between revisions of "ANLY482 AY2016-17 T2 Group11: Project Findings"

From Analytics Practicum
Jump to navigation Jump to search
 
(17 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 +
<div align="right">
 +
[[ANLY482_AY2016-17_Term_2|<font color="#f9660e" font-family:helvetica><b>Return to ANLY482 AY2016-17 Home Page</b></font>]]
 +
</div>
 
<!-- Navigation bar -->
 
<!-- Navigation bar -->
 +
<!--[[File:T11 Banner.png|840px|center]]-->
 +
[[File:T11 logo.png|center|250px]]
 +
<div align="center">
 +
[[File:T11 home.png|135px||link=ANLY482_AY2016-17_T2_Group11]]
 +
[[File:T11 about us.png|135px|link=ANLY482_AY2016-17_T2_Group11: About Us]]
 +
[[File:T11 overview.png|135px|link=ANLY482_AY2016-17_T2_Group11: Project Overview]]
 +
[[File:T11 mgmt.png|135px|link=ANLY482_AY2016-17_T2_Group11: Project Management]]
 +
[[File:T11 findings 2.png|border|135px|link=ANLY482_AY2016-17_T2_Group11: Project Findings]]
 +
[[File:T11 documentation.png|135px|link=ANLY482_AY2016-17_T2_Group11: Documentation]]
 +
</div>
  
 
{| style="background-color:#FFFFFF; color:#000000 padding: 5px 0 0 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0" |
 
{| style="background-color:#FFFFFF; color:#000000 padding: 5px 0 0 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0" |
|style="font-size:100%; text-align:center; border-left:1px solid #ffffff; border-right:1px solid #ffffff;background:linear-gradient(#F9660E, #EE3D10); padding:12px;" width="20%" |[[ANLY482_AY2016-17_T2_Group11 | <font color="#FFF"><b>HOME</b></font>]]
+
|style="font-size:100%; text-align:center; border-left:1px solid #ffffff; border-right:1px solid #ffffff;background-color:#203470; padding:12px;" width="20%" |[[ANLY482_AY2016-17_T2_Group11: Project Findings | <font color="#FFF"><b>Interim</b></font>]]  
|style="font-size:100%; text-align:center; border-left:1px solid #ffffff; border-right:1px solid #ffffff;background:linear-gradient(#F9660E, #EE3D10); " width="20%" |[[ANLY482_AY2016-17_T2_Group11: About Us |<font color="#FFFFFF"><b>ABOUT US</b></font>]]
+
|style="font-size:100%; text-align:center; border-left:1px solid #ffffff; border-right:1px solid #ffffff;background-color:#5478E4; " width="20%" |[[ANLY482_AY2016-17_T2_Group11: Project Findings Final |<font color="#FFFFFF"><b>Final</b></font>]]  
|style="font-size:100%; text-align:center;border-left:1px solid #ffffff; border-right:1px solid #ffffff; background:linear-gradient(#F9660E, #EE3D10); " width="20%" |[[ANLY482_AY2016-17_T2_Group11: Project Overview |<font color="#ffffff"><b>PROJECT OVERVIEW</b></font>]]  
 
|style="font-size:100%; text-align:center;border-left:1px solid #ffffff; border-right:1px solid #ffffff; background:linear-gradient(#F9660E, #EE3D10); " width="20%" |[[ANLY482_AY2016-17_T2_Group11: Project Management |<font color="#ffffff"><b>PROJECT MANAGEMENT</b></font>]]
 
|style="font-size:100%; text-align:center; border-left:1px solid #ffffff; border-right:1px solid #ffffff;background:#444; " width="20%" |[[ANLY482_AY2016-17_T2_Group11: Project Findings |<font color="#ffffff"><b>PROJECT FINDINGS</b></font>]]
 
|style="font-size:100%; text-align:center;border-left:1px solid #ffffff; border-right:1px solid #ffffff; background:linear-gradient(#F9660E, #EE3D10); " width="20%" |[[ANLY482_AY2016-17_T2_Group11: Documentation |<font color="#ffffff"><b>DOCUMENTATION</b></font>]]  
 
 
|}
 
|}
  
 +
[[Media:Group11_Interim_Slides.pdf|Interim Slides]]
 
<br>
 
<br>
 +
[[Media:ANLY482_Team11_InterimReport.pdf|Interim Report]]
  
<div style="background: linear-gradient(#F9660E, #EE3D10); padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 0px; font-size:20px; font-family:helvetica"><font color= #FFFFFF>Coming Soon</font></div>
+
= Data Source=
 +
All the data was taken from the 2015 PISA Database which is available from the PISA website (http://www.oecd.org/pisa/data/2015database/). The 2015 dataset contains the full set of responses from individual students, school principals, teachers, and parents. For this project, the team will be using the questionnaire, codebook, and compendia data from the PISA website.
 +
 
 +
[[File:T11 interim data source.png|center|840px]]
 +
 
 +
=Proposed Phases=
 +
From the start of the project up to the interim presentation, phases 1 to 4 were done and the team's progress and findings are documented in the sections below.
 +
[[File:T11 interim phases.png|center|840px]]
 +
 
 +
= Data Preparation=
 +
6 steps were done in preparing the data before the team could progress further in doing analysis of the data.
 +
 
 +
[[File:T11 interim prep 1.png|center|840px]]
 +
[[File:T11 interim prep 2.png|center|840px]]
 +
 
 +
= Preliminary Findings =
 +
 
 +
==Descriptive Analysis (Cognitive)==
 +
 
 +
===Booklet===
 +
From the booklets which the students answered, there were 3 subjects. These were: Reading, Math, and Science. The team discovered that different sets of booklets consist of different sets of questions from all or some of the 3 subjects and each booklet consists of different number of questions.
 +
[[File:T11 interim booklet.png|center|500px]]
 +
 
 +
===Mean Score===
 +
There was a wide range of mean scores across all questions and a similar pattern could be observed for all 3 subjects.
 +
[[File:T11 interim mean scores.png|center|840px]]
 +
 
 +
===Segmentation ===
 +
The questions were split into 3 segments based on difficulty. The bulk of the questions were placed in the middle segment which mean score falls in the range of 0.4 to 0.7.
 +
[[File:T11 interim segment.png|center|840px]]
 +
 
 +
==Descriptive Analysis (School)==
 +
 
 +
===Uniform Response ===
 +
The team observed that there were questions where all schools answered with the same answer. An insight for this is that there is a potential loophole that schools may be biased or afraid to answer truthfully.
 +
[[File:T11 interim uniform.png|center|840px]]
 +
 
 +
===Science===
 +
The team observed that there were questions which were science specific which could be used together with the mean scores of the science questions. Science related questions asked to schools were related with how prepared and equipped the schools were to improve the learning of students.
 +
[[File:T11 interim science .png|center|840px]]
 +
 
 +
==Descriptive Analysis (Student)==
 +
 
 +
===Country of Birth===
 +
Most of the students who took the PISA test in Singapore were born in Singapore and both parents were also born in Singapore. There is a rather sizeable group of students who have parents born overseas.
 +
[[File:T11 interim students.png|center|840px]]
 +
 
 +
==Overall Challenge==
 +
[[File:T11 interim challenge.png|center|840px]]
 +
 
 +
=Exploratory Analysis=
 +
 
 +
==Standardisation of Booklets==
 +
===Challenge ===
 +
* Different booklets were used for the test
 +
* Each booklet consists of different sets of questions
 +
 
 +
===Proposed Solution===
 +
* Find out the mean percentage of Easy, Medium, Hard questions for Reading, Mathematics and Science
 +
* Use score percentage of students * mean percentage
 +
 
 +
===Assumptions ===
 +
* Mean scores are used as a proxy to determine level of difficulty
 +
* Level of difficulty is perceived to be the same across all students
 +
 
 +
==Overall Cognitive Analysis==
 +
The team did an overview of the performance of all the students based on the booklets.
 +
[[File:T11 interim overall 1.png|center|840px]]
 +
[[File:T11 interim overall 2.png|center|840px]]
 +
 
 +
==School Performance Analysis==
 +
=== Class Size===
 +
*Private schools performed better than public schools in all class sizes except for classes with less than 15 students
 +
[[File:T11 interim class size.png|center|840px]]
 +
 
 +
=== Spending===
 +
* Schools which spent money on new equipment generally performed better than schools which did not.
 +
[[File:T11 interim spending.png|center|840px]]
 +
 
 +
=== Quality of Teaching===
 +
* Schools which had inadequate or poorly qualified teaching staff performed poorer than the rest (34 public schools)
 +
[[File:T11 interim quality.png|center|840px]]
 +
 
 +
=== Autonomy ===
 +
* Fully autonomous schools have the highest average scores compared to less autonomous schools
 +
* The least autonomous schools have the lowest average score compared to the rest
 +
[[File:T11 interim autonomy.png|center|840px]]
 +
 
 +
==Student Performance Analysis==
 +
=== Language===
 +
* Similar trends observed for all languages
 +
* To get more specific insights, further breakdown of students’ score required
 +
[[File:T11 interim language.png|center|840px]]
 +
 
 +
=== Education of Parents ===
 +
* GENERAL UPWARD TREND: Higher the parent’s education level, the higher the scores
 +
* None of the students from private schools have parents with no education
 +
[[File:T11 interim parents.png|center|840px]]
 +
 
 +
=== Support from Parents ===
 +
* GENERAL UPWARD TREND: Higher the support from parents, higher the score
 +
[[File:T11 interim support.png|center|840px]]
 +
 
 +
== Overall Insight ==
 +
[[File:T11 interim overall.png|center|840px]]
 +
 
 +
 
 +
= Further Analyses =
 +
== Regression Analysis==
 +
* Regression Analysis on factors affecting school’s performance
 +
* Regression Analysis on factors affecting student’s performance in school
 +
 
 +
== Cluster Analysis ==
 +
* Cluster Analysis on schools based on their performance
 +
 
 +
<br>  
 +
= Appendix=
 +
== School Performance Analysis ==
 +
[[File:T11 interim appendix 1.png|center|840px]]
 +
[[File:T11 interim appendix 2.png|center|840px]]

Latest revision as of 21:08, 23 April 2017

Return to ANLY482 AY2016-17 Home Page

T11 logo.png

T11 home.png T11 about us.png T11 overview.png T11 mgmt.png T11 findings 2.png T11 documentation.png

Interim Final

Interim Slides
Interim Report

Data Source

All the data was taken from the 2015 PISA Database which is available from the PISA website (http://www.oecd.org/pisa/data/2015database/). The 2015 dataset contains the full set of responses from individual students, school principals, teachers, and parents. For this project, the team will be using the questionnaire, codebook, and compendia data from the PISA website.

T11 interim data source.png

Proposed Phases

From the start of the project up to the interim presentation, phases 1 to 4 were done and the team's progress and findings are documented in the sections below.

T11 interim phases.png

Data Preparation

6 steps were done in preparing the data before the team could progress further in doing analysis of the data.

T11 interim prep 1.png
T11 interim prep 2.png

Preliminary Findings

Descriptive Analysis (Cognitive)

Booklet

From the booklets which the students answered, there were 3 subjects. These were: Reading, Math, and Science. The team discovered that different sets of booklets consist of different sets of questions from all or some of the 3 subjects and each booklet consists of different number of questions.

T11 interim booklet.png

Mean Score

There was a wide range of mean scores across all questions and a similar pattern could be observed for all 3 subjects.

T11 interim mean scores.png

Segmentation

The questions were split into 3 segments based on difficulty. The bulk of the questions were placed in the middle segment which mean score falls in the range of 0.4 to 0.7.

T11 interim segment.png

Descriptive Analysis (School)

Uniform Response

The team observed that there were questions where all schools answered with the same answer. An insight for this is that there is a potential loophole that schools may be biased or afraid to answer truthfully.

T11 interim uniform.png

Science

The team observed that there were questions which were science specific which could be used together with the mean scores of the science questions. Science related questions asked to schools were related with how prepared and equipped the schools were to improve the learning of students.

T11 interim science .png

Descriptive Analysis (Student)

Country of Birth

Most of the students who took the PISA test in Singapore were born in Singapore and both parents were also born in Singapore. There is a rather sizeable group of students who have parents born overseas.

T11 interim students.png

Overall Challenge

T11 interim challenge.png

Exploratory Analysis

Standardisation of Booklets

Challenge

  • Different booklets were used for the test
  • Each booklet consists of different sets of questions

Proposed Solution

  • Find out the mean percentage of Easy, Medium, Hard questions for Reading, Mathematics and Science
  • Use score percentage of students * mean percentage

Assumptions

  • Mean scores are used as a proxy to determine level of difficulty
  • Level of difficulty is perceived to be the same across all students

Overall Cognitive Analysis

The team did an overview of the performance of all the students based on the booklets.

T11 interim overall 1.png
T11 interim overall 2.png

School Performance Analysis

Class Size

  • Private schools performed better than public schools in all class sizes except for classes with less than 15 students
T11 interim class size.png

Spending

  • Schools which spent money on new equipment generally performed better than schools which did not.
T11 interim spending.png

Quality of Teaching

  • Schools which had inadequate or poorly qualified teaching staff performed poorer than the rest (34 public schools)
T11 interim quality.png

Autonomy

  • Fully autonomous schools have the highest average scores compared to less autonomous schools
  • The least autonomous schools have the lowest average score compared to the rest
T11 interim autonomy.png

Student Performance Analysis

Language

  • Similar trends observed for all languages
  • To get more specific insights, further breakdown of students’ score required
T11 interim language.png

Education of Parents

  • GENERAL UPWARD TREND: Higher the parent’s education level, the higher the scores
  • None of the students from private schools have parents with no education
T11 interim parents.png

Support from Parents

  • GENERAL UPWARD TREND: Higher the support from parents, higher the score
T11 interim support.png

Overall Insight

T11 interim overall.png


Further Analyses

Regression Analysis

  • Regression Analysis on factors affecting school’s performance
  • Regression Analysis on factors affecting student’s performance in school

Cluster Analysis

  • Cluster Analysis on schools based on their performance


Appendix

School Performance Analysis

T11 interim appendix 1.png
T11 interim appendix 2.png