Difference between revisions of "JobSage: Proposal"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
 
(10 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
[[File:Group9.png|frameless|center]]
 
<p></p><br/>
 
<p></p><br/>
 
<!--Sub Header Start-->
 
<!--Sub Header Start-->
 
{| style="background-color:white; color:white padding: 5px 0 0 0;" width="100%" height=50px cellspacing="0" cellpadding="0" valign="top" border="0" |
 
{| style="background-color:white; color:white padding: 5px 0 0 0;" width="100%" height=50px cellspacing="0" cellpadding="0" valign="top" border="0" |
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage| <font color="#F5F5F5" size=3><b>Home</b></font>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:20px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage| <font color="#F5F5F5" size=3><b>Home</b></font>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage: Proposal| <font color="#F5F5F5" size=3><b>Proposal</b></font>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:20px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage: Proposal| <font color="#F5F5F5" size=3><b>Proposal</b></font>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage: Poster | <font color="#F5F5F5" size=3><b>Poster</b></font>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:20px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage: Poster | <font color="#F5F5F5" size=3><b>Poster</b></font>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage: Application | <font color="#F5F5F5" size=3><b>Application</b></font>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:20px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage: Application | <font color="#F5F5F5" size=3><b>Application</b></font>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage: Research Paper | <font color="#F5F5F5" size=3><b>Research Paper</b></font>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:20px; background-color: #3A5A74; font-family:helvetica;"> [[JobSage: Research Paper | <font color="#F5F5F5" size=3><b>Research Paper</b></font>]]
 
|}
 
|}
 
<!--/Header-->
 
<!--/Header-->
Line 36: Line 37:
 
<li>Growth trends allows us to understand whether the industry is a sunrise or sunset market, and provides insights on job stability</li>
 
<li>Growth trends allows us to understand whether the industry is a sunrise or sunset market, and provides insights on job stability</li>
 
</ul>
 
</ul>
 
+
<br/>
 
<div style="background: #FFFFF ; margin-top: 10px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#3A5A74 face="Helvetica">Company Level</font></div>
 
<div style="background: #FFFFF ; margin-top: 10px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#3A5A74 face="Helvetica">Company Level</font></div>
 
1. Can sentiment analysis (positive v negative) provide a good representation of the company? More specifically, is a "positive v negative" scale too general to provide viewers with good understanding of company?
 
1. Can sentiment analysis (positive v negative) provide a good representation of the company? More specifically, is a "positive v negative" scale too general to provide viewers with good understanding of company?
Line 44: Line 45:
 
2. Can a company's stock price (against time) provide a good representation on its current and future prospects?
 
2. Can a company's stock price (against time) provide a good representation on its current and future prospects?
 
<ul>
 
<ul>
<li>Stock price represents company's valuation. With comparison to time, we can tell the direction of the company</li>
+
<li>Stock price represents company's valuation. With comparison to time, we can tell the direction of the company's growth</li>
 
</ul>
 
</ul>
  
Line 138: Line 139:
 
| Far and away the best job I have had to date. Worked on leading edge projects with some seriously talented and fun people. The culture is outstanding, but being a contractor often left me on the outside looking in at all the fun and benefits, despite the team's efforts to include me as much as allowable. I would work here again in a heartbeat, but preferably as a direct hire.
 
| Far and away the best job I have had to date. Worked on leading edge projects with some seriously talented and fun people. The culture is outstanding, but being a contractor often left me on the outside looking in at all the fun and benefits, despite the team's efforts to include me as much as allowable. I would work here again in a heartbeat, but preferably as a direct hire.
 
|}
 
|}
 +
 +
==<div style="background: #3A5A74; padding: 15px; line-height: 0.3em; text-indent: 15px; font-size:18px; font-family:Helvetica"><font color= #FFFFF><b>Related Works</b></font></div>==
 +
{| class="wikitable"
 +
|-
 +
! Related Work !! Description
 +
|-
 +
| <center>
 +
'''Title''': Tax Rates across U.S Companies (2013)
 +
[[File:g9rw01.png|500px|frameless|center]]
 +
'''Source''':https://archive.nytimes.com/www.nytimes.com/interactive/2013/05/25/sunday-review/corporate-taxes.html
 +
</center>
 +
| The New York Times published this visualisation showing the tax rates of the S&P 500 companies, and their effective tax rates between 2007 and 2012. The author used circles to represent companies, with the position of the circle representing the amount of effective tax the company is liable to. Size of the circles was used to represent the market capitalization of the company. The author used color to represent intervals of tax rates (eg. dark purple is 0% to 10%). Information of each bracket of tax rates was also provided for the reader to understand the intuition and reasoning behind the spread of the chart. Companies with missing data were also shown, with the author explaining and justifying the reasons for absence of data. The chart can also be filtered by company, industry, or market capitalization.
 +
|-
 +
|  <center>
 +
'''Title''': Tweet Sentiment Visualization Web App
 +
[[File:g9rw02.png|500px|frameless|center]]
 +
'''Source''':https://www.csc2.ncsu.edu/faculty/healey/tweet_viz/tweet_app/
 +
</center>
 +
| The Tweet Sentiment Visualization Web App is a powerful visual that allows the user to query tweets based on keywords, and understand the sentiment of the twitter community based on various emotions. Some emotions are: Stressed, Nervous, Active, Alert, Elated, Happy, Serene, Relaxed, Subdued, etc. This example provides a good way on how to visualize emotions on a multi-dimensional plot, and gives insights on the possible algorithms that we can use for sentiment analysis of the various companies we are analyzing.
 +
 +
The web app also has a zooming tool, but it has limited flexibility and a relative poor user experience. Interactivity must be improved on to provide more exploratory functions to the user.
 +
|-
 +
|  <center>
 +
'''Title''': What is the best job for you?
 +
[[File:g9rw03.png|500px|frameless|center]]
 +
'''Source''':https://public.tableau.com/profile/payscale#!/vizhome/2015-04-15BestJobs/Desktop
 +
</center>
 +
| Created using Tableau, this visualization by PayScale, Inc provides an interesting take on filters, as the filters are structured in the form of a quiz. Upon completion of the quiz, a list of jobs that fits all the inputs will be generated. The visualization that is generated provides useful insights that would bring immediate value to an individual who is in the midst of his or her job search. Insights that are generated are as follows:
 +
<ol>
 +
<li>National Median Pay provides insights on Salary</li>
 +
<li>Job Stress provides insights on Stress Levels</li>
 +
<li>Median Years of Experience required by individual to pursue job</li>
 +
<li>The Degree Level Breakdown and Most Common Major allows users to understand the education requirements of the job</li>
 +
</ol>
 +
|}
 +
 +
==<div style="background: #3A5A74; padding: 15px; line-height: 0.3em; text-indent: 15px; font-size:18px; font-family:Helvetica"><font color= #FFFFF><b>Technical Challenges</b></font></div>==
 +
'''Data Sourcing'''
 +
<ul>
 +
<li>Data for job reviews are highly unstructured and non-collated. As such, web crawling must be conducted to compile large sets of data based on the various industries identified.</li>
 +
</ul>
 +
 +
'''Difficulty in Sentiment Analysis'''
 +
<ul>
 +
<li>Sentiment is a highly difficult variable to capture and analyse. As such, time must be invested to understand the intricacies and to optimize the algorithmic model that we will develop for this project.</li>
 +
<li>Our visualization must be able to display a large range of sentiments effectively</li>
 +
</ul>
 +
 +
'''Technical Knowledge'''
 +
<ul>
 +
<li>Proficiency in programming in R</li>
 +
<li>Learning packages such as ggPlot, R Shiny App</li>
 +
<li>Deploying the R Shiny App live</li>
 +
</ul>
 +
 +
==<div style="background: #3A5A74; padding: 15px; line-height: 0.3em; text-indent: 15px; font-size:18px; font-family:Helvetica"><font color= #FFFFF><b>Storyboard</b></font></div>==
 +
{| class="wikitable"
 +
|-
 +
! Visualisation !! Description
 +
|-
 +
| <center>
 +
[[File:g9sb01.png|700px|frameless|center]]
 +
</center>
 +
| '''Scatter Plot of Industry Attributes'''
 +
A scatter plot of the industries animated over time allows the user to understand how the industries have changed over the years. Whether the industry has grown, or has shrunk, and how much demand there is in the industry.
 +
|-
 +
|  <center>
 +
[[File:g9sb02.png|700px|frameless|center]]
 +
</center>
 +
| '''Circle Plot of Industry Attributes, by Cluster'''
 +
A cluster analysis is performed, and the industries can be classified into groups that are based on their outstanding attributes. The scale can be dynamically changed, allowing the user to select the attribute interactively. This allows the user to understand the nature of the industry, while being able to assess the factors (such as number of job vacancies, or average overtime hours) that matters most to him or her.
 +
|-
 +
|  <center>
 +
[[File:g9sb03.png|700px|frameless|center]]
 +
</center>
 +
|'''Radar Plot of Company Attributes '''
 +
A radar plot allows the reader to make comparisons of the various attributes across companies. Each user has different priorities when it comes to their career, being able to compare companies across multiple attributes simultaneously will allow the user to make an educated choice.
 +
|}
 +
 +
==<div style="background: #3A5A74; padding: 15px; line-height: 0.3em; text-indent: 15px; font-size:18px; font-family:Helvetica"><font color= #FFFFF><b>Project Timeline</b></font></div>==
 +
[[File:g9tl.png|800px|frameless|center]]
 +
 +
==<div style="background: #3A5A74; padding: 15px; line-height: 0.3em; text-indent: 15px; font-size:18px; font-family:Helvetica"><font color= #FFFFF><b>References</b></font></div>==
 +
'''Visualisations'''
 +
<ul>
 +
<li>https://archive.nytimes.com/www.nytimes.com/interactive/2013/05/25/sunday-review/corporate-taxes.html</li>
 +
<li>https://www.csc2.ncsu.edu/faculty/healey/tweet_viz/tweet_app/</li>
 +
<li>ttps://public.tableau.com/profile/payscale#!/vizhome/2015-04-15BestJobs/Desktop</li>
 +
</ul>
 +
 +
'''Web Crawling'''
 +
<ul>
 +
<li>https://www.crummy.com/software/BeautifulSoup/bs4/doc/</li>
 +
</ul>

Latest revision as of 20:16, 15 October 2018

Group9.png



Introduction


Being in the penultimate year of our tertiary education, we witness the hectic job rush that impact many of our peers. In reality, most students face a great amount of uncertainty in making major decisions related to their future career prospects. For instance, freshmen are unsure of which majors to enroll in while graduating students are unsure of the ideal company to work for. We aim to provide an insightful data visualization on industries, companies, and jobs, enabling students and working adults alike to find their ideal job.

Problem Statement


1. Graduating students are unsure of which company they'd want to work for, hence, are seeking a tool to visualise and compare important attributes between companies.
2. Freshmen are unsure which majors they would want to enrol in, hence, are seeking a tool to visualize and compare important attributes across industries.

Research Questions


Industry Level

1. Can salary, growth trends, job vacancies and working hours required provide a good representation of the industry characteristics?

  • Salary allows one to gauge how much money he or she can potentially make
  • Working hours provides insights on work-life balance
  • Job vacancies provides an indicator of the demand for personnel in the industry
  • Growth trends allows us to understand whether the industry is a sunrise or sunset market, and provides insights on job stability


Company Level

1. Can sentiment analysis (positive v negative) provide a good representation of the company? More specifically, is a "positive v negative" scale too general to provide viewers with good understanding of company?

  • Sentiment analysis provides a good and reliable method to help people quickly understand the general thoughts on the company

2. Can a company's stock price (against time) provide a good representation on its current and future prospects?

  • Stock price represents company's valuation. With comparison to time, we can tell the direction of the company's growth

Data Sources

1. Job Listing data from Indeed.com

  • Description: Indeed is a worldwide employment-related search engine for job listings. Indeed is used by many large corporations in Singapore, and has a massive database of active and expired job listings.
Attribute Description Example
Job Title Title of Job Listing Communications Intern
Category Industry that job is classified in Marketing
Company Name Company that job listing belongs to Ubisoft
Job Location Country or Region of the job Singapore
Job Description Job description posted on indeed A background in either Management, Marketing or Communications. The Communications Intern will play a key role in boosting our employer branding efforts, staff...
Overall Rating Cumulative average rating of the company 3.9
Work-Life Balance Rating Community rating of the work-life balance in the company 3.9
Benefits Rating Community rating of the benefits that the company provides to its employees 3.2
Job Security Rating Community rating of job security in the company 3.4
Management Rating Community rating of the management team 3.5
Culture Rating Community rating of the company culture 4.1

2. Company Reviews Indeed.com

  • Description: Indeed is a worldwide employment-related search engine for job listings. Many corporations have a profile page on Indeed, where current and past employees or vendors provide reviews for future potential employees.
Attribute Description Example
Company Company Name Google
Job Title Title of the Current/Former Employee Support Engineer
Rating Overall rating that was given by the employee 5
Title Title of the review Amazing company with great people
Location Country or Region that the employee resides in Seattle, WA
Date Date that review was posted October 7, 2018
Review Content Full content of the review Far and away the best job I have had to date. Worked on leading edge projects with some seriously talented and fun people. The culture is outstanding, but being a contractor often left me on the outside looking in at all the fun and benefits, despite the team's efforts to include me as much as allowable. I would work here again in a heartbeat, but preferably as a direct hire.

Related Works

Related Work Description

Title: Tax Rates across U.S Companies (2013)

G9rw01.png

Source:https://archive.nytimes.com/www.nytimes.com/interactive/2013/05/25/sunday-review/corporate-taxes.html

The New York Times published this visualisation showing the tax rates of the S&P 500 companies, and their effective tax rates between 2007 and 2012. The author used circles to represent companies, with the position of the circle representing the amount of effective tax the company is liable to. Size of the circles was used to represent the market capitalization of the company. The author used color to represent intervals of tax rates (eg. dark purple is 0% to 10%). Information of each bracket of tax rates was also provided for the reader to understand the intuition and reasoning behind the spread of the chart. Companies with missing data were also shown, with the author explaining and justifying the reasons for absence of data. The chart can also be filtered by company, industry, or market capitalization.

Title: Tweet Sentiment Visualization Web App

G9rw02.png

Source:https://www.csc2.ncsu.edu/faculty/healey/tweet_viz/tweet_app/

The Tweet Sentiment Visualization Web App is a powerful visual that allows the user to query tweets based on keywords, and understand the sentiment of the twitter community based on various emotions. Some emotions are: Stressed, Nervous, Active, Alert, Elated, Happy, Serene, Relaxed, Subdued, etc. This example provides a good way on how to visualize emotions on a multi-dimensional plot, and gives insights on the possible algorithms that we can use for sentiment analysis of the various companies we are analyzing.

The web app also has a zooming tool, but it has limited flexibility and a relative poor user experience. Interactivity must be improved on to provide more exploratory functions to the user.

Title: What is the best job for you?

G9rw03.png

Source:https://public.tableau.com/profile/payscale#!/vizhome/2015-04-15BestJobs/Desktop

Created using Tableau, this visualization by PayScale, Inc provides an interesting take on filters, as the filters are structured in the form of a quiz. Upon completion of the quiz, a list of jobs that fits all the inputs will be generated. The visualization that is generated provides useful insights that would bring immediate value to an individual who is in the midst of his or her job search. Insights that are generated are as follows:
  1. National Median Pay provides insights on Salary
  2. Job Stress provides insights on Stress Levels
  3. Median Years of Experience required by individual to pursue job
  4. The Degree Level Breakdown and Most Common Major allows users to understand the education requirements of the job

Technical Challenges

Data Sourcing

  • Data for job reviews are highly unstructured and non-collated. As such, web crawling must be conducted to compile large sets of data based on the various industries identified.

Difficulty in Sentiment Analysis

  • Sentiment is a highly difficult variable to capture and analyse. As such, time must be invested to understand the intricacies and to optimize the algorithmic model that we will develop for this project.
  • Our visualization must be able to display a large range of sentiments effectively

Technical Knowledge

  • Proficiency in programming in R
  • Learning packages such as ggPlot, R Shiny App
  • Deploying the R Shiny App live

Storyboard

Visualisation Description
G9sb01.png
Scatter Plot of Industry Attributes

A scatter plot of the industries animated over time allows the user to understand how the industries have changed over the years. Whether the industry has grown, or has shrunk, and how much demand there is in the industry.

G9sb02.png
Circle Plot of Industry Attributes, by Cluster

A cluster analysis is performed, and the industries can be classified into groups that are based on their outstanding attributes. The scale can be dynamically changed, allowing the user to select the attribute interactively. This allows the user to understand the nature of the industry, while being able to assess the factors (such as number of job vacancies, or average overtime hours) that matters most to him or her.

G9sb03.png
Radar Plot of Company Attributes

A radar plot allows the reader to make comparisons of the various attributes across companies. Each user has different priorities when it comes to their career, being able to compare companies across multiple attributes simultaneously will allow the user to make an educated choice.

Project Timeline

G9tl.png

References

Visualisations

Web Crawling