Difference between revisions of "The Indian Story Report"
Line 80: | Line 80: | ||
<br/> | <br/> | ||
=Future Work= | =Future Work= | ||
− | |||
1) The bar plot that gives us the distribution of population across states in India. R's plotly was used for this purpose. | 1) The bar plot that gives us the distribution of population across states in India. R's plotly was used for this purpose. | ||
The package has a limitation that the bars cannot be sorted in ascending or descending order. This can be considered for future work. | The package has a limitation that the bars cannot be sorted in ascending or descending order. This can be considered for future work. | ||
[[File:newplot.png| 600 px | left]] | [[File:newplot.png| 600 px | left]] | ||
+ | <br/> | ||
2) Micromap package has a limitation that the micromap cannot be embedded into the Shiny app. However, it can be displayed as pop-up from the Shiny app. | 2) Micromap package has a limitation that the micromap cannot be embedded into the Shiny app. However, it can be displayed as pop-up from the Shiny app. |
Revision as of 22:40, 6 August 2017
Contents
Motivation of the Application
In this era of increasing openness, the importance of information created or held by the government has become impossible to deny. Government is one of the largest producers of information in many areas, such as business information, health data, geographic data, census data, and legal information. Allowing the public to access public information has become an important objective in many of the latest data.gov initiative of countries worldwide. Despite the increasing availability of government data, the use of these data by the public is often hampered by a general lack of appropriate and unaffordable data exploratory and analysis tool. This is particularly true if the data is geospatial and high-dimensional in nature. In view of this, our project aims to design and develop a geo visual analytics tool for data discovery from geographically reference statistical data.
The application we developed is called CenViz. It is developed using R Shinny framework and several R data visualization packages such as tmpa, micromap and treemap. This presentation consists of four sections.First, the motivation and objectives of the project will be discussed.This is followed by a detailed discussion on the principles and concepts of micromap. After which, the R packages used to develop the application and the user-interface designed will be discussed. Using the latest census data of India, we will demonstrate how the functions of CenViz can be used to detect the geospatial patterns and attribute distributions of literacy in the country.
Review and critic on past works
Design Framework
Totally there are three tabs which will provide various data visualisations from different perspective with the user in our application.
Tmap
Choropleth maps gave context to our story. Even the audience not aware of India and its states could easily visualize the geographical map clearly displaying states. We were able to encode population parameters into color hue - Minimal colors were used so the map is clear enough. These maps helped visualise which state is dominant for a particular category like “number of Male literate” or “number of female illiterate”. We built the choropleth maps using R's tmap package along with QGIS for simplifying shape files. A sample of a choropleth map is as shown below:
The tab in the Shiny app that shows Choropleth is as shown:
As seen above in the diagram, the maps show the education level performances of each state by age group which can be further filtered by Gender. The bar plot gives us the distribution of population by States in India.
Treemap
Treemap is most appropriate for hierarchical data. Certain packages in R related to treemap adds interactivity to the data and helps drill down through the hierarchy.
Treemap in the given context allows users to select a given state and then drill down further to town level literacy rates for various categories (e.g. Gender - Male, Education Level - Graduates). Size of the treemap shows population, colour is represents literacy rates(Education Level). R’s d3TreeR package for creating interactive tree maps were used to build the treemaps, and Tidyverse and tidy r packages for modifying the data.
A sample treemap that shows all states in India which can be further drilled down to city level is as shown below. Cenviz tab (on the right) that helps interact with treemap.
Micromap
Micromaps helps understand the distribution of education levels across cities per state. It not only adds statistical inferences, but also gives a geographical context. In the given context, for example,for “Males”, “Females” and “all” it plots the box plot of literacy rates based on town-wise literacy rate data. It gives a look “into” the states themselves that helps us understand what cities cause a skewed distribution, thereby portraying different results than the actual. R’s micromap package for creating the choropleth, QGIS for simplifying the shape files were used to build micromap.
A sample micromap is as shown below.The tab on Cenviz that helps interact with the micromap is shown on the right.
Demonstration
- Sample test cases
Discussion
- What has the audience learned from your work? What new insights or practices has your system enabled? A full blown user study is not expected, but informal observations of use that help evaluate your system are encouraged.
Future Work
1) The bar plot that gives us the distribution of population across states in India. R's plotly was used for this purpose. The package has a limitation that the bars cannot be sorted in ascending or descending order. This can be considered for future work.
2) Micromap package has a limitation that the micromap cannot be embedded into the Shiny app. However, it can be displayed as pop-up from the Shiny app.
Installation Guide
The installation guide is introduced in detail at:
User Guide
- Step-by-step guide on how to use the data visualisation functions designed.
Sample User Guide