Difference between revisions of "ANLY482 AY2017-18T2 Group30 Instagram"
Ystan.2014 (talk | contribs) m (removed bold font) |
Revision as of 18:12, 10 February 2018
Data Source
To retrieve data from the company's Instagram account, we used a web-scraping script from GitHub (https://github.com/timgrossmann/instagram-profilecrawl). We modified the script to also capture the timestamp and caption of each post. The data includes:
- Caption
- Timestamp
- Img URL
- Tags
- No. of Likes
- No. of Comments
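As an illustration of the fields listed above, one scraped post could be represented as follows. This is only a minimal sketch: the class name `InstagramPost`, the helper `parse_post`, and the dictionary keys are hypothetical and are not taken from the scraping script, whose actual output format may differ.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass
class InstagramPost:
    """One scraped post, holding the six fields listed above."""
    caption: str
    timestamp: datetime
    img_url: str
    tags: List[str]
    likes: int
    comments: int


def parse_post(raw: dict) -> InstagramPost:
    """Build a post record from a raw dictionary.

    The key names used here are assumptions made for illustration only.
    """
    return InstagramPost(
        caption=raw.get("caption", ""),
        # Assumes an ISO 8601 string such as "2018-02-10T18:12:00"
        timestamp=datetime.fromisoformat(raw["timestamp"]),
        img_url=raw["img_url"],
        tags=list(raw.get("tags", [])),
        likes=int(raw["likes"]),
        comments=int(raw["comments"]),
    )
```

Converting the counts to integers and the timestamp to a `datetime` at load time should make later sorting and aggregation simpler.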
Data Preparation
After scraping, we realised that the data needed cleaning: the column values were misaligned with their column headers, as seen here: (image) We also concatenated the tags of each post into a single column.
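The tag concatenation step could look roughly like the sketch below. It is not the code we ran: the function names, the space separator, and the dictionary keys are assumptions chosen for illustration. Writing rows by field name with `csv.DictWriter` is one way to keep each value under its intended header.

```python
import csv
import io
from typing import Dict, List


def join_tags(tags: List[str], sep: str = " ") -> str:
    """Concatenate a post's tags into one string, skipping blank entries."""
    cleaned = [t.strip() for t in tags if t and t.strip()]
    return sep.join(cleaned)


def to_rows(posts: List[Dict]) -> List[Dict]:
    """Copy each post and replace its list of tags with a single string."""
    rows = []
    for post in posts:
        row = dict(post)
        row["tags"] = join_tags(post.get("tags", []))
        rows.append(row)
    return rows


def write_csv(rows: List[Dict], fieldnames: List[str]) -> str:
    """Serialise rows to CSV text, matching values to headers by name."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fieldnames, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

For example, `join_tags(["#food", " #sg ", ""])` returns `"#food #sg"`, so each post occupies exactly one cell in the tags column.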
Exploratory Data Analysis
Final Application: Learning Dashboard