Difference between revisions of "ANLY482 AY2017-18T2 Group30 Instagram"

From Analytics Practicum
Jump to navigation Jump to search
m (removed bold font)
Line 52: Line 52:
  
 
<div align="left">
 
<div align="left">
<div style=" width: 85%; padding:75px; font-family: Arimo; font-size: 14px; font-weight: bold; line-height: 1em;">
+
<div style=" width: 85%; padding:75px; font-family: Arimo; font-size: 14px; font-weight: normal; line-height: 1em;">
 
<font>
 
<font>
 
To retrieve data from the company's instagram, we made use of a web-scraping script from [https://github.com/timgrossmann/instagram-profilecrawl Github]. We made modifications to the script to include timestamp as well as caption, the data includes:
 
To retrieve data from the company's instagram, we made use of a web-scraping script from [https://github.com/timgrossmann/instagram-profilecrawl Github]. We made modifications to the script to include timestamp as well as caption, the data includes:
Line 67: Line 67:
 
</div>
 
</div>
 
<div align="left">
 
<div align="left">
<div style=" width: 85%; padding:75px; font-family: Arimo; font-size: 14px; font-weight: bold; line-height: 1em;">
+
<div style=" width: 85%; padding:75px; font-family: Arimo; font-size: 14px; font-weight: normal; line-height: 1em;">
 
<font>
 
<font>
 
After scraping the data, we realised that the data needed cleaning. The indexes of the column values were off as seen here:
 
After scraping the data, we realised that the data needed cleaning. The indexes of the column values were off as seen here:

Revision as of 18:12, 10 February 2018

APex Logo.PNG


HOME ABOUT US PROJECT OVERVIEW PROJECT FINDINGS PROJECT MANAGEMENT DOCUMENTATION MAIN PAGE
Facebook Post Facebook Video Youtube Instagram Blog Post


Data Source

To retrieve data from the company's instagram, we made use of a web-scraping script from Github. We made modifications to the script to include timestamp as well as caption, the data includes:

  • Caption
  • Timestamp
  • Img URL
  • Tags
  • No. of Likes
  • No. of Comments
Data Preparation

After scraping the data, we realised that the data needed cleaning. The indexes of the column values were off as seen here: (image) We also concatenated the "tags" into a single column.

Exploratory Data Analysis

Final Application: Learning Dashboard