Difference between revisions of "ISSS608 2017-18 T1 Assign FOO CELONG RAYMOND/MakingSenseOfTheChatter"
Jump to navigation
Jump to search
Line 14: | Line 14: | ||
[[File:RaymMicroblogPerHour.png|500px|center]] | [[File:RaymMicroblogPerHour.png|500px|center]] | ||
− | <p>The data will need to be organised in some manner so that they can be easily analysed later. The obvious choice is by city zones.</p> | + | <p>The data will need to be organised in some manner so that they can be easily analysed later. The obvious choice is by city zones. I carefully started to group the microblog by the zones by creating a column to store the zone in which the microblog was transmitted.</p> |
+ | [[File:RaymMicroblogByZone.png|800px|center]] | ||
+ | <p>Next, I will also create an indicator column for microblogs that were transmitted area near the various points of interest.</p> | ||
+ | [[File:RaymMicroblogByPlaceOfInterest.png|800px|center]] | ||
</div> | </div> | ||
</div> | </div> |
Revision as of 20:50, 13 October 2017
Exploring and Organising the Data
First thing first, the number of microblog is massive. I looked through to see the distribution of posts over time.
The data will need to be organised in some manner so that they can be easily analysed later. The obvious choice is by city zones. I carefully started to group the microblog by the zones by creating a column to store the zone in which the microblog was transmitted.
Next, I will also create an indicator column for microblogs that were transmitted area near the various points of interest.