Group04 Report
Water For Life: India's Rainfall & Crop Analysis Through Visualizations
|
|
|
|
|
Contents
Introduction
This study focuses on exploratory analysis of rainfall pattern, crop productivity and effect of rainfall pattern changes on crop productivity across different meteorological subdivisions of India. Considering a wide range of weather conditions across a vast geographic scale and varied topography, it won’t be wise to generalise climate changes and its effect on crop productivity in varied regions of India, that is why we decided to explore and visualise rainfall pattern changes and its effect on crop productivity for every meteorological subdivision.
Our research focuses on 34 out of 36 meteorological subdivisions rainfall and crop productivity. Two subdivision excluded are Lakshadweep and Andaman & Nicobar subdivisions. Rainfall data used for research is at meteorological subdivisions level however crop data used is at administrative district level. We have maintained same granularity level for visualisation of separate data. For analysing effect of rainfall over crop productivity, district data is aggregated to subdivision level based on mapping of subdivisions and districts.
Crop growing season in India is classified into two main seasons – (i) Kharif and (ii) Rabi based on monsoon. The Kharif cropping season if from July- Oct during south– west monsoon and Rabi cropping season is from October- March (Winter). Crop grown between March- June are Summer Crops. Apart from these seasonal crops there are few crops which are grown throughout year and are classified as Whole year crops. So, we have considered total Four seasons as Kharif, Rabi, Summer and Whole Year for analysis.
We have used multiple visualisation technics such as Time series, Heat Map, Tree Map, Parallel Coordinate Plot, Geo Facet and Bar charts for easy visualisation of varying rainfall pattern and changing crop productivity for 15 years across different subdivisions of India. R is used for data visualisation and application as it offers satisfactory set of inbuilt functions and libraries for both data mining and visualisation.
|
Objective & Motivation
Climate plays a significant role in economic development of India. Because large population of India depends on climate sensitive sectors like agriculture and forestry for livelihood. Climate change could lower the farmer’s income by up to 25% (Economic Survey 2018: http://mofapp.nic.in:8080/economicsurvey/pdf/082-101_Chapter_06_ENGLISH_Vol_01_2017-18.pdf).This is because agriculture in India is vulnerable to the vagaries of whether as close to 52% farm land is still unirrigated and depends on rainfall. This project is honest endeavor in gaining deeper knowledge into the impact of increasingly changing rainfall patterns, so that we can be prepared to mitigate the risk of these uncontrollable factors and seek remedies that would help sustain such drastic natural phenomenon.
Considering crops cultivation period and water requirement for crop during different stages of its lifecycle, it is important to analyse effect of monthly rainfall on crop productivity during cultivation period rather than simply considering yearly/seasonal average rainfall. Our objective is to provide single view to analyse monthly rainfall pattern changes, crop productivity changes and correlation between every month’s rainfall and crop productivity.
|
Data Sources
Rainfall data was obtained from “IITM Indian subdivision Monthly Rainfall data set” available on http://www.tropmet.res.in/. Data set consists of monthly, season wise and annual rainfall (in mm) for 36 meteorological subdivisions of India from year 1871 to 2016. Our research only used monthly data from year 2000 to 2014 for 34 subdivisions. This data set is chosen based on available crop production data. Crop Production data is obtained from “State-wise, season-wise crop production statistics from 1997” available on https://data.gov.in/. Data set contains season wise crop production data for 646 districts from 1997 to 2015 for 113 different crops. For this research, only those crops are selected for which districtwise data is available for more than 10 years. So final selected subset contains data for 56 crops for 595 districts. Crop productivity is calculated using below formula –
List of meteorological subdivisions and districts covered under these subdivisions is obtained from India Meteorological Department’s website (http://www.imd.gov.in). This data is used to map subdivision wise rainfall data and district wise crop production data. Grid file for Geo facet graph is prepared based on geographic location of every subdivisions in India.
|
Data Preparation
|
Past Work and Critique
There has been wide range of analysis for India’s rainfall pattern changes and its effect on Crop productivity. One of these papers is “The Impact of Climate Change on Crop Yields in India from 1961 to 2010 by Aravind Moorthy, Wolfgang Buermann, and Deepak Rajagopal June 12, 2012(http://hpccc.gov.in/PDF/Agriculture/Climate%20Change%20and%20Crop%20Yields%20in%20India.pdf). Most of these studies are focused on seasonal or yearly rainfall pattern and its effect on crop productivity. However, there has been limited analysis for monthly rainfall effect on crop productivity which is more important. Visualization provided is static and limited to line graphs as shown below:
|
Dashboard Design & Visualization Methodology
1. Rainfall Analysis Separate visualization for only rainfall analysis has been provided by Rainfall-geo facet and Rainfall-cyclic plot.
From Geo-facet plot monthly rainfall pattern for all subdivisions of India can be compared for selected year. Different subdivisions have different rainfall pattern based on geo-graphic location. South west coastal subdivisions and north east subdivisions observe wide rainfall season due monsoon wind pattern.
From Cyclic plot rainfall monthly rainfall fluctuations for last 15 years can be visualized. Below cyclic plot is for Assam and Meghalaya subdivisions. Wide variation in rainfall in all the months has extreme effect on crop productivity.
In this section, we analyze how crop production is distributed across various subdivisions and districts in India from year 2000 to 2014. As India’s crop cultivation is highly dependent on various seasons, we are also analyzing crop’s productivity over various seasons. In the crop data set, details for Crop’s total production and cultivation area are provided. Using these two fields we have calculated Crop Productivity as a ratio of crop’s production over cultivation area and used this parameter for our analysis.
Parallel coordinate plot allows user to view high dimensional data with a facility to visualize categorical and numerical variables together. The plot is created using R parcoords package. Parallel coordinate plot shows how crop productivity is distributed for various crops over various season for particular subdivision. User can select one or multiple subdivisions as well as one or multiple years to make a fair comparison among different subdivisions over the years.
As given crop data has a prominent subdivision and district hierarchy, Treemap is the first choice for visualization as treemap gives the hierarchical view of data to user and user can drill down or up in the data. We have used treemap and d3treeR packages to create interactive treemap where user can hover and click on the subdivision to drill down and show various district inside that subdivision. Clicks from the treemap are used as input to bar plot which shows the high yielding crops arranged in descending order of their productivity for that subdivision or district.
After exploring the rainfall and crop data individually for better understanding of the data, we advance further to analyze the interrelationship between precipitation and crop productivity, if there exists any. For our application, we especially focus on discovering correlation between monthly variations in the rainfall precipitation with crop productivity. We have created a separate page in our application for viewing these graphs. We created two visualizations to facilitate the user to gain deeper understanding of the interdependence of these variables.
The first tab on this page showcases geo-spatial visualization of India’s map on left side of the page and corresponding crop productivity values for subdivisions on right side of the page. Users can select the year and crop for configuring the graphs as per their requirement. We utilized leaflet package to plot an interactive chloropleth allowing the end user to view the distribution of annual rainfall precipitation across the 34 subdivisions of interest. For plotting the subdivisions, we obtained the shapefile to add those polygons to the map. The map also provides interactivity wherein on hover on individual subdivisions, a tooltip appears showing the value for the precipitation for that subdivision.
As correlation can be better viewed with a diverging scale ranging from negative to positive values, we used heatmap function form plotly package to create a correlation plot as shown. We used the viridis color scheme which is available by default with plotly package. This heatmap plots negative values as hues of dark blue color transitioning to the lighter yellow hue for positive values of the correlation. Such diverging color scheme provides instant insight on how crop productivity moves with respect to rainfall and which months have adverse effect on crop productivity
https://wiki.smu.edu.sg/1718t3isss608/Group04_Application
|
Key Insights
Conclusion/Future Work
Given time constraints and the nature of data we gathered, this application is only limited to show the correlation between crop productivity and rainfall pattern changes. We cannot conclude rainfall pattern change is the causation for crop productivity change in Indian agricultural sector, as there are several other factors impacting the cultivation and harvesting of various crops in different regions of India such as temperature, wind, soil as well as capital and government support.
|
Acknowledgement
We would like to extend our gratitude towards Dr Kam Tin Seong (Singapore Management University) for his guidance on analytical techniques and R packages that may be used and feedback on visualisation techniques. Without his encouragement and technical assistance, this project would not be as it is today.
|
R Packages Used
We have used the following R packages to come up with our visualizations: |