Difference between revisions of "ISSS608 2017-18 T3 Assign Li Hongxin Answers"

From Visual Analytics and Applications
Jump to navigation Jump to search
 
(18 intermediate revisions by the same user not shown)
Line 33: Line 33:
  
 
[[File:Pipit gird.gif]]
 
[[File:Pipit gird.gif]]
<hr/>
 
 
===Growth Pattern===
 
Area plot is a straightforward way of indicating the growth trend. From the figures below, we can conclude that the number of birds around this Preserve has a large growth from 1983 to 2017, so does the Rose-crested Blue Pipit. <b>However, the number of Rose-crested Blue Pipit has a sharp decrease from 2016</b>. This might be another evidence to show that Rose-crested Blue Pipit was influenced by the dumping site. But we still need further proof because the total number of 19 species decreased as well at the same time.
 
 
[[File:Trend_mark.png|800px]]
 
 
<hr/>
 
<hr/>
  
 
===Kernel Density Plot===
 
===Kernel Density Plot===
 
Kernel density plot was used to visualize the geographical distribution more clearly. <b>Figure below shows that during 1983-2018, there were two centers with high density distribution of Rose-crested Blue Pipits.</b>
 
Kernel density plot was used to visualize the geographical distribution more clearly. <b>Figure below shows that during 1983-2018, there were two centers with high density distribution of Rose-crested Blue Pipits.</b>
 +
 
[[File:Kernel_all.png|800px]]
 
[[File:Kernel_all.png|800px]]
  
Line 52: Line 47:
  
 
[[File:pipit_kernal.gif]]
 
[[File:pipit_kernal.gif]]
 +
<br><br>
 +
<hr/>
 +
 +
===Growth Pattern===
 +
Area plot is a straightforward way of indicating the growth trend. From the figures below, we can conclude that the number of birds around this Preserve has a large growth from 1983 to 2017, so does the Rose-crested Blue Pipit. <b>However, the number of Rose-crested Blue Pipit has a sharp decrease from 2016</b>. This might be another evidence to show that Rose-crested Blue Pipit was influenced by the dumping site. But we still need further proof because the total number of 19 species decreased as well at the same time.
 +
 +
[[File:Trend_mark.png|800px]]
 
<hr/>
 
<hr/>
  
Line 61: Line 63:
 
[[File:Jianloude_dashboard.png|800px]]
 
[[File:Jianloude_dashboard.png|800px]]
  
==Question 2:Does the set of bird calls are Pipits==
+
 
 +
 
 +
==Question 2:Does the set of bird calls are Pipits?==
 
===Audio Visualization===
 
===Audio Visualization===
 
First, we apply visualization to get the pattern of songs all 19 species and try to figure out whether the test birds belongs to the Rose-crested Blue Pipit. <b>That means we only care about Rose-crested Blue Pipit and we don't have to differentiate the other 18 classes</b>.
 
First, we apply visualization to get the pattern of songs all 19 species and try to figure out whether the test birds belongs to the Rose-crested Blue Pipit. <b>That means we only care about Rose-crested Blue Pipit and we don't have to differentiate the other 18 classes</b>.
====1. Wave plot and Spectrogram====
+
====I. Wave plot and Spectrogram====
 
Wave plot and spectrogram were used to visualize the sound waves of bird species. For each species, we selected one "Call" and one "Song" from the observations with Quality A as training set. Wave plot is more intuitive than the spectogram. Thus, we compared the patterns of wave plot between training birds and test birds to classify the test birds first. If the wave plot was ambiguous, we would tried to compare the spectrogram.
 
Wave plot and spectrogram were used to visualize the sound waves of bird species. For each species, we selected one "Call" and one "Song" from the observations with Quality A as training set. Wave plot is more intuitive than the spectogram. Thus, we compared the patterns of wave plot between training birds and test birds to classify the test birds first. If the wave plot was ambiguous, we would tried to compare the spectrogram.
  
Line 70: Line 74:
  
 
<br>
 
<br>
====2. Comparison Results for the Test Birds ====
+
====II. Comparison Results for the Test Birds ====
<hr/>
+
By comparing the pattern of test birds and training birds, <b>test no.1,11, and 15 might belong to the Rose-crested Blue Pipit.</b>
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
|-
 
|-
Line 93: Line 97:
 
|-
 
|-
 
|
 
|
1.
+
2.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 102: Line 106:
 
|-
 
|-
 
|
 
|
1.
+
3.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 111: Line 115:
 
|-
 
|-
 
|
 
|
1.
+
4.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 120: Line 124:
 
|-
 
|-
 
|
 
|
1.
+
5.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 129: Line 133:
 
|-
 
|-
 
|
 
|
1.
+
6.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 138: Line 142:
 
|-
 
|-
 
|
 
|
1.
+
7.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 147: Line 151:
 
|-
 
|-
 
|
 
|
1.
+
8.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 156: Line 160:
 
|-
 
|-
 
|
 
|
1.
+
9.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 165: Line 169:
 
|-
 
|-
 
|
 
|
1.
+
10.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 174: Line 178:
 
|-
 
|-
 
|
 
|
1.
+
11.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 180: Line 184:
 
[[File:Test11.png|800x244px]]
 
[[File:Test11.png|800x244px]]
 
||
 
||
Yes
+
Maybe
 
|-
 
|-
 
|
 
|
1.
+
12.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 192: Line 196:
 
|-
 
|-
 
|
 
|
1.
+
13.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 201: Line 205:
 
|-
 
|-
 
|
 
|
1.
+
14.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 210: Line 214:
 
|-
 
|-
 
|
 
|
1.
+
15.
 
||
 
||
 
[[File:17.png|800px]]
 
[[File:17.png|800px]]
Line 220: Line 224:
 
<br>
 
<br>
  
 +
<hr/>
 
===Audio Classification===
 
===Audio Classification===
 +
Now pay attention to machine learning methods to classify the bird species. Similar with the visualization, we labeled the samples into 2 classes:0(not Rose-crested Blue Pipit) and 1(Rose-crested Blue Pipit).
  
 +
For the three methods, random forest has the best performance - 98.7% for the training set and 92.3% for test set. The predicted result was also shown below. <b>By taking all three results together, test no.7, 11 and 13 were more likely to be Rose-crested Blue Pipit</b>.
 +
 +
[[File:Accuracy.png|600px]]
 +
 +
[[File:ML_result.png|600px]]
 +
<br><br>
 
<hr/>
 
<hr/>
  
 
===Discussion===
 
===Discussion===
 +
Audio visualization can provide some additional evidence for the confirmation of machine learning results. For example, by comparing the result of visualization and of machine learning, only test no.11 was confirmed by both methods. For the test no.7 and test no.13 which were also picked by logistic regression and SVM, their patterns of waveplot were quite different with the pattern of Rose-crested Blue Pipit. And they might be allocated into wrong class.
 +
 +
In addition, going back to the location, the test no.11 also located at the area where Rose-crested Blue Pipit was thriving in this past.
 +
Other test birds, such as test no.1 and test no.15 picked by audio visualization were close to test no.11; whereas test no.7 and test no.13 which were picked by machine learning methods were far from the thriving area, which indicated that they had slight chance to be Rose-crested Blue Pipit.
 +
 +
In conclusion, only one observations provided by Kasios can be confirmed as Rose-crested Blue Pipit, and two observations were similar with the Rose-crested Blue Pipit. <b>And we can say that Pipits were not found across preserve.</b>
  
 
[[File:Test Birds Location.png|500px]]
 
[[File:Test Birds Location.png|500px]]

Latest revision as of 11:57, 8 July 2018

Pipits hx.jpg VAST Mini Challenge 1: "Cheep" Shots?

Background

Methodology

Answers

Conclusions

 

Question 1: Are Pipits actually thriving across the Preserve?

Geographical Distribution Overview

Birds were found throughout the preserve as shown in the left figure. The darker, the more birds were found around the area. Figure in the right indicates the geographical distribution of Rose-crested Blue Pipit. That is, at the northeast of the Preserve, and near to the Dumping Site(shown as red triangle).

Grid.png

From the dynamic graph, we can find that the gathering place of Rose-crested Blue Pipit moved around the northeast corner of the Preserve from 2007 to 2017. And the path of the moving went to the red triangle direction, where the dumping site located in the recent years, and then to the opposite direction. This may provide an evidence that the dumping site has some effect on the Rose-crested Blue Pipit.

Pipit gird.gif


Kernel Density Plot

Kernel density plot was used to visualize the geographical distribution more clearly. Figure below shows that during 1983-2018, there were two centers with high density distribution of Rose-crested Blue Pipits.

Kernel all.png


If we take look at the kernel density plot over the recent years(from 2012 to 2017), it is obvious that there were 2 centers at the beginning, and then the center near the location where dumping site was built later gathered more Pipits. But eventually, the density of this center kept decreasing from 2015 and now only one center exists for the Pipits.

Kernel density.jpg


You may refer to the dynamic graph to have an intuitive feelings. If the dumping site was built around 2014 or 2015, it is more likely that the Rose-crested Blue Pipit was influenced by the dumping site.

Pipit kernal.gif


Growth Pattern

Area plot is a straightforward way of indicating the growth trend. From the figures below, we can conclude that the number of birds around this Preserve has a large growth from 1983 to 2017, so does the Rose-crested Blue Pipit. However, the number of Rose-crested Blue Pipit has a sharp decrease from 2016. This might be another evidence to show that Rose-crested Blue Pipit was influenced by the dumping site. But we still need further proof because the total number of 19 species decreased as well at the same time.

Trend mark.png


Interactive Dashboard

Interactive dashboard is created for users to find out some interesting patterns of bird species by themselves.
The upper side of the dashboard is an area plot, which shows the growth trend of one bird species or all birds in the Preserve. The bottom of the dashboard is a scatter density plot, where we change the grid from 1x1 to 5x5 to show the density of birds around the Preserve.
The controller in at the right side allow users to select Year/Bird Species/Season/Timeslot.

Jianloude dashboard.png


Question 2:Does the set of bird calls are Pipits?

Audio Visualization

First, we apply visualization to get the pattern of songs all 19 species and try to figure out whether the test birds belongs to the Rose-crested Blue Pipit. That means we only care about Rose-crested Blue Pipit and we don't have to differentiate the other 18 classes.

I. Wave plot and Spectrogram

Wave plot and spectrogram were used to visualize the sound waves of bird species. For each species, we selected one "Call" and one "Song" from the observations with Quality A as training set. Wave plot is more intuitive than the spectogram. Thus, we compared the patterns of wave plot between training birds and test birds to classify the test birds first. If the wave plot was ambiguous, we would tried to compare the spectrogram.

Train wave&spectrogram small.png


II. Comparison Results for the Test Birds

By comparing the pattern of test birds and training birds, test no.1,11, and 15 might belong to the Rose-crested Blue Pipit.

Bird No.

Wave plot of Rose-crested Blue Pipit

Wave plot of Test Bird

Similar with Rose-crested Blue Pipit?

1.

17.png

Test1.png

Yes

2.

17.png

Test2.png

No

3.

17.png

Test3.png

No

4.

17.png

Test4.png

No

5.

17.png

Test5.png

No

6.

17.png

Test6.png

No

7.

17.png

Test7.png

No

8.

17.png

Test8.png

No

9.

17.png

Test9.png

No

10.

17.png

Test10.png

No

11.

17.png

Test11.png

Maybe

12.

17.png

Test12.png

No

13.

17.png

Test13.png

No

14.

17.png

Test14.png

No

15.

17.png

Test15.png

Yes



Audio Classification

Now pay attention to machine learning methods to classify the bird species. Similar with the visualization, we labeled the samples into 2 classes:0(not Rose-crested Blue Pipit) and 1(Rose-crested Blue Pipit).

For the three methods, random forest has the best performance - 98.7% for the training set and 92.3% for test set. The predicted result was also shown below. By taking all three results together, test no.7, 11 and 13 were more likely to be Rose-crested Blue Pipit.

Accuracy.png

ML result.png


Discussion

Audio visualization can provide some additional evidence for the confirmation of machine learning results. For example, by comparing the result of visualization and of machine learning, only test no.11 was confirmed by both methods. For the test no.7 and test no.13 which were also picked by logistic regression and SVM, their patterns of waveplot were quite different with the pattern of Rose-crested Blue Pipit. And they might be allocated into wrong class.

In addition, going back to the location, the test no.11 also located at the area where Rose-crested Blue Pipit was thriving in this past. Other test birds, such as test no.1 and test no.15 picked by audio visualization were close to test no.11; whereas test no.7 and test no.13 which were picked by machine learning methods were far from the thriving area, which indicated that they had slight chance to be Rose-crested Blue Pipit.

In conclusion, only one observations provided by Kasios can be confirmed as Rose-crested Blue Pipit, and two observations were similar with the Rose-crested Blue Pipit. And we can say that Pipits were not found across preserve.

Test Birds Location.png