Difference between revisions of "ISSS608 2016-17 T3 Assign ZHENG BIJUN"
(edit by kishan added feedback) |
|||
(9 intermediate revisions by 2 users not shown) | |||
Line 139: | Line 139: | ||
= Interactive Visualization = | = Interactive Visualization = | ||
You may have your own investigation here: [https://public.tableau.com/profile/zheng.bijun#!/vizhome/VAST2017MC1/StoryPatternsofLifeAnalysis Link to interactive visualization] | You may have your own investigation here: [https://public.tableau.com/profile/zheng.bijun#!/vizhome/VAST2017MC1/StoryPatternsofLifeAnalysis Link to interactive visualization] | ||
− | + | * Please be noticed that the link is not working well due to some unknown tableau server issue; please download the workbook via tableau public landing page. | |
<table> | <table> | ||
<tr> | <tr> | ||
Line 282: | Line 282: | ||
<table border=1 style="width:90%;border-collapse: collapse; font-family: Calibri;"> | <table border=1 style="width:90%;border-collapse: collapse; font-family: Calibri;"> | ||
<tr> | <tr> | ||
− | <td>comment1: Hi,Bijun. Amazing work! From you analysis, I can see the level of efforts that you have devoted into this assignment. Yet, I have the following suggestions that hopefully are useful in further improving your work: | + | <td>comment1: Hi,Bijun. Amazing work! From you analysis pack, I can see the level of efforts that you have devoted into this assignment. Yet, I have the following suggestions that hopefully are useful in further improving your work: |
<ul> | <ul> | ||
<b>Aesthetic</b><br> | <b>Aesthetic</b><br> | ||
<li>1. I love the way you present your analysis. However, a big part of the analysis findings are demonstrated by line graphs. I am wondering if you can try other types of graphs to make the findings more visually clear?</li> | <li>1. I love the way you present your analysis. However, a big part of the analysis findings are demonstrated by line graphs. I am wondering if you can try other types of graphs to make the findings more visually clear?</li> | ||
− | <li>2. For the first graph of daily pattern. You are trying to say that the rangers do not work from | + | <li>2. For the first graph of daily pattern. You are trying to say that the rangers do not work from 4-5 am. Yet the graphs x axis only shows hour of 3,9,15,21. I think the pattern will be more obvious if you construct the graph in a way that X axis displays every hour of the day.</li> |
+ | <li>3. the story structure in your tableau workbook is clear. Look forward to viewing it interactively on line soon.</li> | ||
</ul> | </ul> | ||
− | + | <ul> | |
+ | <b>Clarity</b><br> | ||
+ | <li>1. the tableau workbook contains lots of useful information, yet it is a bit complex and confusing. There are quite a few selectors and parameters defined and the graphs are controlled by different selectors, which is not straightforward. Audience probably need to spend quite some time in understanding the dashboard, particularly for those who do not know the background of vast challenge.</li> | ||
+ | <li>2. the legends are not closely attached to their corresponding graphs, which also adds confusion to the dashboard.</li> | ||
+ | </ul> | ||
+ | <b>Overall, fantastic work! Hopefully my comments can add value</b> | ||
+ | <br> | ||
+ | Best Regards | ||
+ | <br> | ||
+ | Yunna | ||
</td> | </td> | ||
</tr> | </tr> | ||
<tr> | <tr> | ||
− | <td> | + | <td>Hi Joyce, <br> |
+ | Great overall effort and very engaging. Some of my feedback as below 😉<br> | ||
+ | <b>Aesthetics:</b><br> | ||
+ | <ol> | ||
+ | * Though it is very interactive with a lot of interactive filters and legends, I feel that it is abit overwhelming, confusing and took me a while to understand and link them together; which I believe also is causing some dashboard performance issue to load slowly. Selecting some of the values also caused the whole dashboard to blank out and getting lost in visualization, can consider to reduce the number of variables of filters for interactive visualization. | ||
+ | * I guess storybook should be story-telling and easy to follow for anyone. Currently, it is designed for exploratory purposes. | ||
+ | * The colors of the titles, filter and legends are well-designed and implemented and all well-linked across the various graphs! Only thing is that descriptions fonts are a bit small for old folks. | ||
+ | </ol> | ||
+ | <b>Clarity:</b><br> | ||
+ | <ol> | ||
+ | * I don’t understand the “top x and bottom x” & “Timestamp Slector” (typo? Sector or Selector?) but I guess you are trying to compare the traffic at each gate with the arrival time, though I can’t tell anything obvious from the arrival time from pattern detection. In such case, it may be interesting to include and look at their departure time as well. | ||
+ | * Currently it only shows the route on the map; I think you can also consider the intensity of the path traveled illustrating by thickness of line. | ||
+ | * Coordinates of the checkpoints are offset from the actual 200x200 grid or actual distance/area and may cause confusion. Zooming feature in the map is good as it allows for better visibility similar to “How long did they spend”. Only point is to consider expanding the box to allow complete view of it; currently it requires scrolling. | ||
+ | </ol> | ||
+ | Cheers, <br> | ||
+ | Zac | ||
+ | </td> | ||
</tr> | </tr> | ||
<tr> | <tr> | ||
− | <td>comment3</td> | + | <td>comment3 |
+ | <p>Hi Zheng Bijun,</p> | ||
+ | <p>Please find my feedback comments as follows. You present a very nice analysis, which is answering the questions of the challenge. With regard to the clarity and aesthetics aspect, I have the following to add.</p> | ||
+ | <p><strong>Clarity :</strong></p> | ||
+ | <p><strong> </strong></p> | ||
+ | <ul> | ||
+ | *In the daily pattern plots, you have used the ‘select’ feature effectively to illustrate clearly the trend you wish to explain. This is a good practice, as it help to retain the background information clearly, whilst projecting the focus for the user.</li> | ||
+ | *When using map images, you might want to use the Cartesian coordinates more effectively. You can use Tableau to import the map as a background image, and then geocode it so that you will be able to use annotations. The current texts you have indicated does help the user identify the locations inside the preserve, such as entrance 0, entrance 3, etc. but having annotations will help them to pop out of the plane, thereby presenting better clarity.</li> | ||
+ | *In your 2<sup>nd</sup> plot inside longer period patterns, I assume the x axis shows the days of the week (1-7). You might want to add an axis label, or you might want to use aliases for the days of the week and label the axis. (for e.g. 1-Sunday, 2-Monday, etc.).</li> | ||
+ | *On most of the plots, you have a well defined title, so you might not want to show the headers on the Y axis (# of cars) since it can already be known that the chart shows the trends of traffic.</li> | ||
+ | </ul> | ||
+ | <p><strong>Aesthetics: </strong></p> | ||
+ | <ul> | ||
+ | *I notice that you have tweaked the background colour. Maybe, you would want to also explore format axis feature in Tableau that might change the text to more bolder and visible formats. This would lend more readability to the plots.</li> | ||
+ | *On the arrival time calendar plot you have developed, when you try to visualize the number of episodes, the gradation in colors is good, and helps to quickly infer, which times of the day have higher episodes.</li> | ||
+ | </ul> | ||
+ | <p style="margin-left: .25in;"> Hope the feedback helps, and please leave out a feedback on my page as well. You may access it [[ISSS608_2016-17_T3_Assign_KISHAN_BHARADWAJ_SHRIDHAR|here]]. <br /> | ||
+ | Navigate to the bottom of the main page, after reading the 3 sub pages.</p> | ||
+ | <p style="margin-left: .25in;"> </p> | ||
+ | <p style="margin-left: .25in;">Thank You,</p> | ||
+ | <p style="margin-left: .25in;">Kishan Bharadwaj Shridhar</p> | ||
+ | </td> | ||
</tr> | </tr> | ||
<tr> | <tr> |
Latest revision as of 11:41, 13 July 2017
Contents
VAST Challenge 2017: Mini-Challenge 1
Mistford is a mid-size city is located to the southwest of a large nature preserve. It has been discovered that the number of nesting pairs of the Rose-Crested Blue Pipit, a popular local bird due to its attractive plumage and pleasant songs, is decreasing! Provided several datasets, it is required to investigate the reason for the decrease of Rose-Crested Blue Pipit. In Mini-Challenge 1, traffic movement dataset is given to analyze patterns of life of vehicles through out the reserve, and detect unusual patterns that are potentially harmful to the birds. Questions raised are:
- Describe up to six daily patterns of life by vehicles traveling through and within the park.
- Describe up to six patterns of life that occur over multiple days (including across the entire data set) by vehicles traveling through and within the park.
- Describe up to six unusual patterns (either single day or multiple days) and highlight why you find them unusual.
- What are the top 3 patterns you discovered that you suspect could be most impactful to bird life in the nature preserve?
This webpage will guide you through my investigations and help to save Rose-Crested Blue Pipit!
Data Description
Entry gates are positioned at the Preserve entrances. Each vehicle receives an entry ticket at the gate and is assigned a vehicle class; the entry is recorded. The entry ticket contains an RF-tag that enables the Preserve sensors to pick up the passage of a vehicle through the Preserve. Each vehicle surrenders their entry ticket when exiting the Preserve and the exit is recorded. A .csv file containing data recorded from sensors around the Boonsong Lekagul Nature Preserve. A map containing the locations of roadways and sensors throughout the Preserve is also provided.
Dataset Overview
1 CSV Traffic Movement Data | 2 Roadway Map |
More Detailed Data Facts
1 CSV Traffic Movement Data
- Entrances:
- All vehicles pass through an Entrance when entering or leaving the Preserve.
- General-gates:
- All vehicles may pass through these gates. These sensors provide valuable information for the Preserve Rangers trying to understand the flow of traffic through the Preserve.
- Gates:
- These are gates that prevent general traffic from passing. Preserve Ranger vehicles have tags that allow them to pass through these gates to inspect or perform work on the roadway beyond.
- Ranger-stops:
- These sensors represent working areas for the Rangers, so you will often see a Ranger-stop sensor at the end of a road managed by a Gate. Some Ranger-stops are in other locations however, so these sensors record all traffic passing by.
- Camping:
- These sensors record visitors to the Preserve camping areas. Visitors pass by these entering and exiting a campground.
2 Roadway Map
The contractors working with the Nature Preserve rangers have provided a map that presents the Preserve in terms of a 200x200 gridded area. The grid is oriented with north at the top of the map. Grid location (0,0) is at the lower left corner of the map (the SW corner). They have superimposed both the roadways and the sensor locations on this grid. The map shows an area 12 miles x 12 miles.
3 Others
- Traffic either passes through the Preserve, stay as day campers, or stay as extended campers.
- Preserve Rangers stay at the ranger-base toward the southeast of the Preserve when they are not working in the Preserve.
- The speed limit through the Preserve is 25 mph.
- The Preserve area does not observe "Daylight Savings Time".
- The roadways traveling southward from Entrances 3 and 4 do continue to other roadways outside of the Preserve area, but these are not shown on the map. Vehicle data will not reflect travel beyond the Preserve in this direction.
Data Preparation
1 Define episodes
Intuitively, each car-id (except for park service vehicles 2p) should possess two entrance records with one serves as entering and the other for exiting. However, it is noticed that many car-ids had more than two entrances recorded. It is rational to assume that these cars did made multiple visits to the preserve. Therefore, I split the trip with multiple entering and exiting of the same car-id into different episodes. Each episode can be seen as a complete trip and analysis is performed based on episodes.
2 Exclude incomplete trips By looking at the maximum timestamp of the entire dataset, it is safe to conclude that the data provided is generated before June 2016. Therefore, there are vehicles with incomplete trips (the maximum timestamps of these car-ids are around end of May) in the dataset, and I exclude car-ids with only one entrance record. The figure on the right shows the car-ids with only one entrance record and their maximum timestamp. |
3 Remove duplicate records
It is noticed that some sensor records are duplicated in the dataset, and these duplicates all have three entrance records for a car-id - the first two have the same timestamp (or very close timestamp), and the third one has a different timestamp and is coherent with the following activities. Hence, I remove the first two records assuming there were something wrong the data entry and keeping the third one makes more sense when interpreting the trip. The figure on the right shows an example of this kind of data anomaly. |
4 Label the sequence of gates of each episode In order to plot the entire routes of the vehicles, I create a new column named 'Sequence' to |
5 Concatenate the routes To concatenate the gate-names of each episode to form a route. |
6 Extract the gate-to-gate directions |
7 Calculate the gate-to-gate duration |
8 Extract the arrival timestamp of each episode |
9 Segment the visitor types
|
10 Map coordinates of gates
|
Interactive Visualization
You may have your own investigation here: Link to interactive visualization
- Please be noticed that the link is not working well due to some unknown tableau server issue; please download the workbook via tableau public landing page.
Patterns of Life Analysis
Daily Patterns
Images | Interpretations |
---|---|
|
|
The two types of buses and 4+ axle trucks, all large vehicles, had no appearance in any camping areas. It might represent that these three car-types can only be passing through the preserve, and camping area is not allowed for large vehicles. | |
Majority of traffics through camping areas only happened between 5am to 22pm, except for one car-id 20154519024544-322, which is discussed in later section. It might indicate that traffics were not allowed in camping areas after 22pm to ensure the safety and rest of overnight campers. | |
2 axle car/motorcycle, 2 axle truck, and 3 axle truck were most active vehicles in the preserve. Their activities started to increase at 6am and started to flatten out at around 18pm. 7am to 17pm had most vehicle activities. | |
There were vehicles that simply passed the preserved without making any stops and looking around. These vehicles can be identified by investigating the number of gates they passed through. This pattern only applies to non-campers and happened within a short time period. The graph on the left shows all the possible routes for trespassing.
|
Longer-Period Patterns
Images | Interpretations |
---|---|
Traffic increased since May and reached highest in July, then started to decrease. November to March were the least popular months for visitors and it is possible that these are winter months. | |
Activities of 2 axle car/motorcycle, 2 axle truck and 3 axle truck increased on Friday and decreased on Monday. This can be explained by the overnight camping during weekends. | |
The duration rangers spent at ranger-stops and camping areas were less than 1 hour. | |
The graph shows the route that had most rangers' episodes. It was the most frequent patrol route of the rangers, and it was almost twice as frequent as the second most frequent patrol route. It is possible that the east side of the preserve required more care and protection.
|
|
Campers arrived at the preserve between 5am and 17pm. Friday to Sunday were more popular as expected. |
Unusual Patterns
Images | Interpretations |
---|---|
This table displayed the route of car-id 20154519024544-322 (a 2 axle truck), which passed through camping gates after 22pm. This vehicle had 16 episodes, and each episode had exact same route except for the first episode. This vehicle came to the preserve each Friday and left the on the following Monday. | |
Apart from 20154519024544-322, there were other car-ids that had multi-episodes, which means they did not render their car-id by the time they exited the preserve. And every time they came to the preserve, they followed the same routes and went for overnight camping. This group of visitors might hold a regular pass for their visits. | |
|
|
|
|
Apart from the gate-skippers mentioned above, there were another 3 episodes made a simple round-trip in the preserve: they entered the preserve, passed through general-gate, then made the same route back to the entrance. |
Top 3 Possible Causes
- Long term visitor with car-id 20154519024544-322 and his behavior to travel pass camping areas during midnight.
- Unauthorized 4+ axle trucks invading restricted areas because the route they traveled was part of the most frequent patrol route of the rangers. This area could be where pipits resided, and therefore needed more care and protection from rangers. And the fact that they went through the restricted area when the rangers were off-duty makes them extremely suspicious.
- Possible over speeding which requires further investigations, especially for trespassing routes.
Comments & Discussions
comment1: Hi,Bijun. Amazing work! From you analysis pack, I can see the level of efforts that you have devoted into this assignment. Yet, I have the following suggestions that hopefully are useful in further improving your work:
Overall, fantastic work! Hopefully my comments can add value
|
Hi Joyce, Great overall effort and very engaging. Some of my feedback as below 😉
Clarity:
Cheers, |
comment3
Hi Zheng Bijun, Please find my feedback comments as follows. You present a very nice analysis, which is answering the questions of the challenge. With regard to the clarity and aesthetics aspect, I have the following to add. Clarity :
Aesthetics:
Hope the feedback helps, and please leave out a feedback on my page as well. You may access it here.
Thank You, Kishan Bharadwaj Shridhar |
comment4 |
comment5 |