Difference between revisions of "Team Shooting Stars: Proposal"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
 
(24 intermediate revisions by 2 users not shown)
Line 2: Line 2:
 
<p></p><br/>
 
<p></p><br/>
  
{| style="background-color:black; color:white padding: 5px 0 0 0;" width="100%" height=50px cellspacing="0" cellpadding="0" valign="top" border="0" |
+
{| style="background-color:white; color:white padding: 5px 0 0 0;" width="100%" height=50px cellspacing="0" cellpadding="0" valign="top" border="0" |
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Team Shooting Stars| <b>Home</b>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #000000; border-top:1px solid #000000; font-family:Britannic Bold"> [[Team Shooting Stars: Proposal | <b>Proposal</b>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Team Shooting Stars: Proposal | <b>Proposal</b>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #000000; border-top:1px solid #000000; font-family:Britannic Bold"> [[Team Shooting Stars: Poster | <b>Poster</b>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Team Shooting Stars: Poster | <b>Poster</b>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #000000; border-top:1px solid #000000; font-family:Britannic Bold"> [[Team Shooting Stars: Application | <b>Application</b>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Team Shooting Stars: Application | <b>Application</b>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #000000; border-top:1px solid #000000; font-family:Britannic Bold"> [[Team Shooting Stars: Research Paper | <b>Report</b>]]
 +
|}
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Team Shooting Stars: Research Paper | <b>Report</b>]]
+
<br/><br/>
|}
 
 
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Background & Motivation</font></div>==
 
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Background & Motivation</font></div>==
 
The National Basketball Association (NBA) is one of the most famous sports league in the world. It consists 30 men basketball teams (where 29 in the US, 1 in Canada) which was founded in 1940s, named BAA (Basketball Association of America). Then it changed to the current name of NBA after merging with NBL (National Basketball League) in 1949. NBA plays are generally fast-paced, physically intensive where audience find it fascinating to watch. NBA also represents the best basketball play standard in the world. Joining NBA is the ultimate dream for a professional basketball player. <br/><br/>
 
The National Basketball Association (NBA) is one of the most famous sports league in the world. It consists 30 men basketball teams (where 29 in the US, 1 in Canada) which was founded in 1940s, named BAA (Basketball Association of America). Then it changed to the current name of NBA after merging with NBL (National Basketball League) in 1949. NBA plays are generally fast-paced, physically intensive where audience find it fascinating to watch. NBA also represents the best basketball play standard in the world. Joining NBA is the ultimate dream for a professional basketball player. <br/><br/>
Line 20: Line 20:
 
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Project Description</font></div>==
 
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Project Description</font></div>==
 
The aim of this project is to get deeper knowledge into the current trend in NBA and what makes a team succeed, so the questions we are going to answer are:<br/>
 
The aim of this project is to get deeper knowledge into the current trend in NBA and what makes a team succeed, so the questions we are going to answer are:<br/>
1. Is the role of centre becoming less and less important NBA?<br/>
+
1. Is the role of centre becoming less and less important in NBA?<br/>
2. Are 3-point-shooting teams more likely to win through the past ten years?<br/>
+
2. How do the defense and offense factors of a team vary and determines a team's success? <br/>
3. What is the most important quality of a championship team?<br/>
+
3. What are the important quality that leads to a player's success? <br/>
4. Is there a indicator of a player’s best performance in career?<br/>
+
4. Does the presence of a superstar and a bigger budget lead to the success of the team?<br/>
5. What make 2011 Dallas mavericks and 1996 Houston Rockets win the championship?<br/>
 
 
<br/>
 
<br/>
  
  
 
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Data Set Selection</font></div>==
 
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Data Set Selection</font></div>==
 +
We retrieved our data from [http://www.basketball-reference.com Basketball Reference]. The data is in CSV format where each game contains two CSVs files. For example, the following two CSVs represent the box scores of Cleveland Cavaliers vs Golden States Warriors on June 19, 2016:
 +
[[File:raw_data_01.jpg|center|1050px]]
 +
<br/>
 +
[[File:raw_data_02.jpg|center|1050px]]
 +
<br/>
 +
Moreover, we also can retrieve a specific player's game statistics in a certain timeline from this site:
 +
[[File:raw_data_03.jpg|center|1050px]]
 +
<br/>
 +
For our VA project, we plan to retrieve all players data in the past 10 years. We would also categorize the game statistics according to the game type (normal season, playoff, finals). The data size is quite large so we will use JMP to do data transformation and combination.
 +
 
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Schedule </font></div>==
 
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Schedule </font></div>==
 
{| border="1" cellpadding="1"
 
{| border="1" cellpadding="1"
Line 55: Line 64:
 
|width='300px'|2
 
|width='300px'|2
 
|width='600px'|Consulting Prof
 
|width='600px'|Consulting Prof
|width='200px'|Everyone
+
|width='200px'|Wu Wei, Wang Ziteng
 
|width='100px' style="background:lime"|'''Completed'''
 
|width='100px' style="background:lime"|'''Completed'''
 
|-
 
|-
 
|width='300px'|3
 
|width='300px'|3
 
|width='600px'|Deciding on tools/techniques to use
 
|width='600px'|Deciding on tools/techniques to use
|width='200px'|Everyone
+
|width='200px'|Wang Ziteng, Manas Mohapatra
 
|width='100px' style="background:lime"|'''Completed'''
 
|width='100px' style="background:lime"|'''Completed'''
 
|-
 
|-
 
|width='300px'|4
 
|width='300px'|4
 
|width='600px'|Upload detail project proposal
 
|width='600px'|Upload detail project proposal
|width='200px'|Everyone
+
|width='200px'|Wu Wei
 
|width='100px' style="background:lime"|'''Completed'''
 
|width='100px' style="background:lime"|'''Completed'''
  
Line 77: Line 86:
 
|width='600px'|Data preparation, consolidation,  preprocessing and cleaning
 
|width='600px'|Data preparation, consolidation,  preprocessing and cleaning
 
|width='200px'|Everyone
 
|width='200px'|Everyone
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
  
 
|- style="background:gold"  
 
|- style="background:gold"  
Line 84: Line 93:
 
|width='300px'|1
 
|width='300px'|1
 
|width='600px'|Update Wiki page
 
|width='600px'|Update Wiki page
|width='200px'|Everyone
+
|width='200px'|Wu Wei, Wang Ziteng
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
 
|-
 
|-
 
|width='300px'|2
 
|width='300px'|2
 
|width='600px'|Study Treemap
 
|width='600px'|Study Treemap
|width='200px'|
+
|width='200px'|Wang Ziteng
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
 
|-
 
|-
 
|width='300px'|3
 
|width='300px'|3
 
|width='600px'|Study Multi-Series Line Chart
 
|width='600px'|Study Multi-Series Line Chart
|width='200px'|
+
|width='200px'|Manas Mohapatra
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
  
  
Line 104: Line 113:
 
|width='600px'|Web App Developer
 
|width='600px'|Web App Developer
 
|width='200px'|Everyone
 
|width='200px'|Everyone
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
  
 
|- style="background:gold"  
 
|- style="background:gold"  
Line 112: Line 121:
 
|width='600px'|Do poster, presentation preparation
 
|width='600px'|Do poster, presentation preparation
 
|width='200px'|Everyone
 
|width='200px'|Everyone
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
  
 
|- style="background:gold"  
 
|- style="background:gold"  
Line 120: Line 129:
 
|width='600px'|Presentation  
 
|width='600px'|Presentation  
 
|width='200px'|Everyone
 
|width='200px'|Everyone
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
  
 
|- style="background:gold"  
 
|- style="background:gold"  
Line 127: Line 136:
 
|width='300px'|1
 
|width='300px'|1
 
|width='600px'|Submission of project poster
 
|width='600px'|Submission of project poster
|width='200px'|Everyone
+
|width='200px'|Wu Wei
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
 
|-
 
|-
 
|width='300px'|2
 
|width='300px'|2
 
|width='600px'|Submission of final project paper and artifacts
 
|width='600px'|Submission of final project paper and artifacts
|width='200px'|Everyone
+
|width='200px'|Wang Ziteng, Manas Mohapatra
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
  
 
|- style="background:gold"  
 
|- style="background:gold"  
Line 141: Line 150:
 
|width='600px'|Visual Analytics Poster Night
 
|width='600px'|Visual Analytics Poster Night
 
|width='200px'|Everyone
 
|width='200px'|Everyone
|width='100px' style="background:lime"|'''Not Completed'''
+
|width='100px' style="background:lime"|'''Completed'''
 
|-
 
|-
 
|}
 
|}
  
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> TOOLS </font></div>==
+
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Tools </font></div>==
Our team decided to use the following tools for analysis:
+
Our team decided to use the tools such as JMP, Tableau, d3,js for doing the following analysis:
# Tableau:
+
 
*#* Treemap
+
{| class="wikitable"
*#* Sunburst Charts
+
|-
*#*Parallel Plots
+
! Illustration !! Analytical Methods
*# QGIS:  
+
|-
*#* Geospatial Analysis: To see if there’s any correlation between
+
| [[File:TSS2.png|center|400px]]
*# JMP:
+
||
*#* Clustering Analysis
+
 
*#* Logistic Regression: The outcome of the analysis will usually be based on a success or failure of the game. Hence to do a regression analysis on factors for success of a game, we need to use logistic regression as the dependent variable is categorical in nature
+
 
*#* Time Series Analysis: Basketball Reference consists of 5 years’ worth of dataset. Thus it’s worth doing a time series analysis on player performance, team performance, change in player’s role and several other factors
+
* Spider Charts
 +
* Logistic Regression: The outcome of the analysis will usually be based on a success or failure of the game. Hence to do a regression analysis on factors for success of a game, we need to use logistic regression as the dependent variable is categorical in nature
 +
 
 +
|-
 +
| [[File:TSS3.png|center|400px]]
 +
||
 +
 +
* Time Series Analysis: Basketball Reference consists of 5 years’ worth of dataset. Thus it’s worth doing a time series analysis on player performance, team performance, change in player’s role and several other factors
 +
 
 +
|-
 +
| [[File:TSS1.png|center|400px]]
 +
||
 +
 
 +
* Treemap
 +
* Sunburst Charts
 +
*Heatmaps: Heatmaps are a two-dimensional representation of data in which values are represented by colors. It can provide an instantaneous glance of the summary of information. Given the hyper-dimensional data sets of NBA players and teams, Heatmaps is a effective tool to visualise the dataset
 +
|}
  
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> PRIOR WORK & REFERENCES </font></div>==
+
==<div style="background: #000000; padding: 13px; font-weight: bold; text-align:center; line-height: 0.3em; text-indent: 20px;font-size:26px; font-family:Britannic Bold"><font color= #ffffff> Prior Work & References </font></div>==
# Success Factors in NBA:
+
# Success Factors in NBA: http://www.basketball-reference.com/about/factors.html
*#* http://www.basketball-reference.com/about/factors.html
+
# The length and success of NBA careers: Coates, Dennis; Oguntimein, Babatunde. International Journal of Sport Finance 5.1  (Feb 2010): 4-26.
*# The length and success of NBA careers:
+
# Racial Discrimination among NBA referees: http://ww2.amstat.org//Chapters/boston/nessis07/presentation_material/Justin_Wolfers.pdf
*#* Coates, Dennis; Oguntimein, Babatunde. International Journal of Sport Finance 5.1  (Feb 2010): 4-26.
 
*# Racial Discrimination among NBA referees:  
 
*#* http://ww2.amstat.org//Chapters/boston/nessis07/presentation_material/Justin_Wolfers.pdf
 

Latest revision as of 19:54, 2 December 2016

Group4 teamlogo.jpg




Background & Motivation

The National Basketball Association (NBA) is one of the most famous sports league in the world. It consists 30 men basketball teams (where 29 in the US, 1 in Canada) which was founded in 1940s, named BAA (Basketball Association of America). Then it changed to the current name of NBA after merging with NBL (National Basketball League) in 1949. NBA plays are generally fast-paced, physically intensive where audience find it fascinating to watch. NBA also represents the best basketball play standard in the world. Joining NBA is the ultimate dream for a professional basketball player.

As NBA fans, our group would like to analyse on player’s statistics and team’s performance to clear our doubts like how the play styles of NBA basketball have been changed over the last 10 years. We would apply different visualization tools and graphics to gain in-depth analysis.

Project Description

The aim of this project is to get deeper knowledge into the current trend in NBA and what makes a team succeed, so the questions we are going to answer are:
1. Is the role of centre becoming less and less important in NBA?
2. How do the defense and offense factors of a team vary and determines a team's success?
3. What are the important quality that leads to a player's success?
4. Does the presence of a superstar and a bigger budget lead to the success of the team?


Data Set Selection

We retrieved our data from Basketball Reference. The data is in CSV format where each game contains two CSVs files. For example, the following two CSVs represent the box scores of Cleveland Cavaliers vs Golden States Warriors on June 19, 2016:

Raw data 01.jpg


Raw data 02.jpg


Moreover, we also can retrieve a specific player's game statistics in a certain timeline from this site:

Raw data 03.jpg


For our VA project, we plan to retrieve all players data in the past 10 years. We would also categorize the game statistics according to the game type (normal season, playoff, finals). The data size is quite large so we will use JMP to do data transformation and combination.

Schedule

Academic Studying Week Task Done By Status
Week 7
1 Brainstorm project topic and scope Everyone Completed
Week 8
1 Formulate ideas Everyone Completed
2 Consulting Prof Wu Wei, Wang Ziteng Completed
3 Deciding on tools/techniques to use Wang Ziteng, Manas Mohapatra Completed
4 Upload detail project proposal Wu Wei Completed



Week 9
1 Data preparation, consolidation, preprocessing and cleaning Everyone Completed
Week 10
1 Update Wiki page Wu Wei, Wang Ziteng Completed
2 Study Treemap Wang Ziteng Completed
3 Study Multi-Series Line Chart Manas Mohapatra Completed


Week 11 & 12
1 Web App Developer Everyone Completed
Week 13
1 Do poster, presentation preparation Everyone Completed
Week 14
1 Presentation Everyone Completed
Week 15
1 Submission of project poster Wu Wei Completed
2 Submission of final project paper and artifacts Wang Ziteng, Manas Mohapatra Completed
Week 16
1 Visual Analytics Poster Night Everyone Completed

Tools

Our team decided to use the tools such as JMP, Tableau, d3,js for doing the following analysis:

Illustration Analytical Methods
TSS2.png


  • Spider Charts
  • Logistic Regression: The outcome of the analysis will usually be based on a success or failure of the game. Hence to do a regression analysis on factors for success of a game, we need to use logistic regression as the dependent variable is categorical in nature
TSS3.png
  • Time Series Analysis: Basketball Reference consists of 5 years’ worth of dataset. Thus it’s worth doing a time series analysis on player performance, team performance, change in player’s role and several other factors
TSS1.png
  • Treemap
  • Sunburst Charts
  • Heatmaps: Heatmaps are a two-dimensional representation of data in which values are represented by colors. It can provide an instantaneous glance of the summary of information. Given the hyper-dimensional data sets of NBA players and teams, Heatmaps is a effective tool to visualise the dataset

Prior Work & References

  1. Success Factors in NBA: http://www.basketball-reference.com/about/factors.html
  2. The length and success of NBA careers: Coates, Dennis; Oguntimein, Babatunde. International Journal of Sport Finance 5.1 (Feb 2010): 4-26.
  3. Racial Discrimination among NBA referees: http://ww2.amstat.org//Chapters/boston/nessis07/presentation_material/Justin_Wolfers.pdf