Difference between revisions of "GeViz"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
 
(73 intermediate revisions by the same user not shown)
Line 22: Line 22:
 
| style="padding:0.2em; font-size:100%; background-color:#143c67;  border-bottom:0px solid #3D9DD7; text-align:center; color:#F5F5F5" width="10%" |  
 
| style="padding:0.2em; font-size:100%; background-color:#143c67;  border-bottom:0px solid #3D9DD7; text-align:center; color:#F5F5F5" width="10%" |  
 
[[1718t1is428T7_Research Paper|<font color="#F5F5F5" size=3 face="Lato">Research Paper</font>]]
 
[[1718t1is428T7_Research Paper|<font color="#F5F5F5" size=3 face="Lato">Research Paper</font>]]
 +
 +
|style="background:none;" width="1%" | &nbsp;
 +
| style="padding:0.2em; font-size:100%; background-color:#143c67;  border-bottom:0px solid #3D9DD7; text-align:center; color:#F5F5F5" width="10%" |
 +
[[Project_Groups|<font color="#F5F5F5" size=3 face="Lato">Back to Project Groups</font>]]
 
|}  
 
|}  
 
</div>
 
</div>
Line 28: Line 32:
 
<br/>
 
<br/>
 
==<div style="background:#143c67; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>PROBLEM & MOTIVATION</center></font></div>==
 
==<div style="background:#143c67; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>PROBLEM & MOTIVATION</center></font></div>==
<br/>
+
GeBIZ is a Singapore Government’s one-stop e-procurement portal which facilitates tender activities between Singapore government and local and overseas suppliers. Currently, there is no available tool to aid the public and ministries to understand and gain insights on the procurement made by the government under each ministry. Hence, we are motivated to create an interactive visualisation tool on government's procurement spending to allow the public and ministries to identify spending patterns and gain insights into procurement spending under each ministry.  
GeBIZ is a Singapore Government’s one-stop e-procurement portal which facilitates the entire procurement lifecycle and revenue tender activities between Singapore government and local and overseas supplies electronically since June 2000. The purpose of the portal is to be transparency, encourage fair and open competition and to generate demands / quotations. With this portal, more companies are tendering for projects hence increasing the competitiveness.
 
 
 
We aim to create a visualisation which benefits businesses in crafting bidding prices and increasing their chances in winning the government contracts and tender. In addition, we hope to give an overview and identify interesting patterns of government spending.  
 
 
<br/>
 
<br/>
  
 
==<div style="background:#143c67; padding: 15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>OBJECTIVES</center></font></div>==
 
==<div style="background:#143c67; padding: 15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>OBJECTIVES</center></font></div>==
<br/>
 
 
In this project, we are creating a visualisation that is able to show the following:
 
In this project, we are creating a visualisation that is able to show the following:
* Government investment analysis
+
* Gain an overview of procurement spending made by each ministry and agency
* The tender distribution of projects to different type of business
+
* Identify the relationships between ministries, agencies and suppliers
* Identify the price range when bidding for projects
+
* Identify what are the goods and services procured by ministries and agencies under each category
<br/>
+
 
 
==<div style="background:#143c67; padding: 15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>SELECTED DATASETS</center></font></div>==
 
==<div style="background:#143c67; padding: 15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>SELECTED DATASETS</center></font></div>==
<b>The following datasets will be used for analysis , as elaborated below:</b>
+
The following datasets will be used for analysis , as elaborated below:
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
Line 68: Line 68:
 
<br/>
 
<br/>
  
===Text Classification using Support Vector Classifier (SVC) ===
+
==<div style="background:#143c67; padding: 15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>APPROACH</center></font></div>==
 +
[[File:Approach 2.png|800px|frameless|center]]
 +
<b> Exploratory Data Analytics </b></br>
 +
We used Tableau to perform EDA to better understand our dataset and to aid us in the conceptualization of our story board.
 +
<br/>
 +
 
 +
<b> Data Cleaning and Feature Creation </b></br>
 +
We used Excel and Python to create a new column showing the Ministry that each agency belongs to by merging with data obtained from the Singapore Government Directory.
 +
<br/>
 +
 
 +
<b> Text Classification using Support Vector Classifier (SVC) </b></br>
 
One of the key challenges of working with the provided procurement dataset is the absence of categorization of each procurement transaction. Instead of labelling manually, we applied <b>machine learning</b> to classify the tender descriptions into different categories. We firstly scraped the procurement descriptions and categories from GeBiz website using <b> Selenium</b> and <b> BeautifulSoup </b> libraries in Python to be used as the training and validation dataset in our Support Vector Classifier model. We were able to achieve <b> 90% for training accuracy </b> before performing the categorization prediction. <br/><br/>
 
One of the key challenges of working with the provided procurement dataset is the absence of categorization of each procurement transaction. Instead of labelling manually, we applied <b>machine learning</b> to classify the tender descriptions into different categories. We firstly scraped the procurement descriptions and categories from GeBiz website using <b> Selenium</b> and <b> BeautifulSoup </b> libraries in Python to be used as the training and validation dataset in our Support Vector Classifier model. We were able to achieve <b> 90% for training accuracy </b> before performing the categorization prediction. <br/><br/>
  
<b> Dataset after Text Classification </b>
+
<b> Government Procurement Dataset after Text Classification </b>
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
! Dataset/Source
+
! style = "width: 50%;" | Dataset/Source
! Data Attributes
+
! style = "width: 50%;" | Data Attributes
! Rationale of Usage
 
 
|-
 
|-
 
| Government Procurement Data  
 
| Government Procurement Data  
|  
+
|
 
* Tender No  
 
* Tender No  
 
* Agency  
 
* Agency  
Line 91: Line 100:
 
|}
 
|}
  
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>BACKGROUND SURVEY</center></font></div>==
 
<br/>
 
Some of these visualizations that we draw inspiration from, are as follows:
 
To be filled!
 
 
<br/>
 
<br/>
 +
<b> Visualization in R </b><br/>
 +
The web application will be built in R and deployed to Shinyapps.io
  
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>CONSIDERATION & VISUAL SELECTION</center></font></div>==
+
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>BACKGROUND SURVEY OF RELATED WORKS</center></font></div>==
<br/>
+
Some of these visualizations that we draw inspiration from, are as follows: <br/>
To be filled!
+
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
<br/>
+
|-
 +
! style="font-weight: bold;width: 50%;" | Reference of Other Interactive Visualization
 +
! style="font-weight: bold;" | What We Can Learn
 +
|-
 +
| <p><center>'''Title : Pareto Analysis of Suppliers'''  </center></p>
 +
[[File:Pareto Analysis Reference.png|400px|frameless|center]]
 +
<p><center>'''Source''': https://goo.gl/P9RjHk </center></p>
 +
||
 +
*  The use of pareto chart allows us to identify the component(s) which is/are contributing significantly and how the categorises value sums up cumlatively.
 +
|-
 +
| <p><center>'''Title : Word Cloud on Procurement Details ''' </center></p>
 +
[[File:Word cloud reference.png|400px|frameless|center]]
 +
<p><center>'''Source''': https://linpack-for-tableau.com/data-visualizations/tableau-dashboards/procurement-dashboard/procurement-cockpit/</center></p>
 +
||
 +
* Based on the size of the words, we can identify the keywords quickly. We can learn that by making frequency used keywords stand out can prevent us from overlooking it if it is presented in a tabular format.
 +
|-
 +
| <p><center>'''Title : Breakdown on UK Government Spending ''' </center></p>
 +
[[File:Treemap ref 2.png|400px|frameless|center]]
 +
<p><center>'''Source''': http://www.nickmalleson.co.uk/2014/02/uk-government-spending-treemap.html</center></p>
 +
||
 +
* Based on the chart, we can gain an overview of how the spending is breakdown according to the size and colour of the box.
 +
* From this, we are able to identify outliers or signficant contribution quickly at a glance.
 +
|-
 +
| <p><center>'''Title : Team Budget Breadown ''' </center></p>
 +
[[File:Sankey diagram reference.png|400px|frameless|center]]
 +
<p><center>'''Source''':https://acquireprocure.com/spend-analysis-visualisation/3-reasons-procurement-professionals-use-sankey-diagrams</center></p>
 +
||
 +
* Based on the path of the sankey chart, we are able to identify the cash flow from one end to the other end.
 +
* The size of the path allows us to to identify how signficant it is in terms of value.
 +
|-
 +
| <p><center>'''Title : Analyzing Involved Authorities, Tenders and Companies ''' </center></p>
 +
[[File:Graph network references.png|400px|frameless|center]]
 +
<p><center>'''Source''': https://linkurio.us/blog/exploring-e1-3-trillion-in-public-contracts-with-graph-visualization/#!prettyPhotooard</center></p>
 +
||
 +
* Based on the charts, we are able to identify key relationship between objects quickly based on the nodes and edges.
 +
* From this, we can find common nodes as well.
 +
|}
  
 
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>BRAINSTORMING SESSIONS</center></font></div>==
 
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>BRAINSTORMING SESSIONS</center></font></div>==
<br/>
+
[[File:Brainstorm2.png|800px|frameless|center]]
 +
During our brainstorming session, we came out a list of visualization which are able to achieve our objectives and eventually shortlisted 4 visualization - Tree map, Network Graph, Sankey Graph and Word Cloud. After rounds of refinement and consultations with our Professor, the image above is the final draft for our visualization.
 +
 
 +
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>PROPOSED STORYBOARD</center></font></div>==
 +
Our group has proposed the following storyboard in the use of our visual application:
  
<br/>
+
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 +
|-
 +
! style="font-weight: bold;width: 50%;" | Proposed Layout
 +
! style="font-weight: bold;" | What We Can Analyse
 +
|-
 +
| <p><center>'''Overview of Procurement Spending'''  </center></p>
 +
[[File:Storyboard treemap.png|400px|frameless|center]]
 +
<p><center>'''Source''':https://www.r-graph-gallery.com/treemap/</center></p>
 +
||
 +
* Treemap to show the spending breakdown for each category of all agencies under the selected ministry.
 +
* The interactive filters are year and ministry.
 +
|-
 +
| <p><center>'''Relationship between Ministry and Suppliers''' </center></p>
 +
[[File:Storyboard network.png|400px|frameless|center]]
 +
<p><center>'''Source''': https://cran.r-project.org/web/packages/visNetwork/vignettes/Introduction-to-visNetwork.html</center></p>
 +
||
 +
* Network diagram to show the relationship of agencies and suppliers of the selected ministry.
 +
* The interactive filters are year, ministry and range filter which allows the user to filter the suppliers based on the procurement amount.
 +
|-
 +
| <p><center>'''Cashflow from Agency to Suppliers ''' </center></p>
 +
[[File:Storyboard sankey.png|400px|frameless|center]]
 +
<p><center>'''Source''': https://www.r-graph-gallery.com/sankey-diagram/</center></p>
 +
||
 +
* Sankey diagram to show the cash flow between selected agency and suppliers for the selected category.
 +
* The interactive filters are year, ministry, agency and category.
 +
|-
 +
| <p><center>'''Word Cloud on Goods and Services Procured''' </center></p>
 +
[[File:Storyboard wordcloud.png|400px|frameless|center]]
 +
<p><center>'''Source''':http://www.sthda.com/english/wiki/text-mining-and-word-cloud-fundamentals-in-r-5-simple-steps-you-should-know</center></p>
 +
||
 +
* Word cloud to show an overview of the tender description for the selected agency and selected category.
 +
* The filters are year, ministry, agency and category.
 +
* We added a searchable table below the word cloud to allow the user to search for keywords and view the exact tender description
 +
|}
  
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>TECHNOLOGIES</center></font></div>==
+
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>TOOLS & TECHNOLOGIES</center></font></div>==
 
<b>Tools and technologies</b>
 
<b>Tools and technologies</b>
 
[[File:Tools used.png|800px|frameless|center]]
 
[[File:Tools used.png|800px|frameless|center]]
Line 113: Line 193:
  
 
<b>Data Architecture</b>
 
<b>Data Architecture</b>
[[File:Data architecture.png|800px|frameless|center]]
+
[[File:Data architecture 2.png|800px|frameless|center]]
 
<br/>
 
<br/>
  
Line 119: Line 199:
  
 
The following are some of the key technical challenges that we may face throughout the course of the project:
 
The following are some of the key technical challenges that we may face throughout the course of the project:
<center>
+
 
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
! Key Challenges  
+
! style = "width: 50%;" | Key Challenges  
! Mitigation Plan
+
! style="width: 50%;"| Mitigation Plan
 
|-
 
|-
| Unfamiliarity with Javascript Rshiny libraries )
+
| Unfamiliarity with R and Rshiny Libraries
 
|  
 
|  
 
* Attend R Shiny Workshop
 
* Attend R Shiny Workshop
* Independent learning via online resources  
+
* Independent learning via online resources such as Datacamp
 
* Ask team mates for help
 
* Ask team mates for help
 
|-
 
|-
| Unfamiliarity with R Libraries for Machine Learning and Selenium
+
| Unfamiliarity with Libraries for Machine Learning and Web Crawling
 
|  
 
|  
 +
* Clean, transform and analyse data together
 
* Independent learning via online resources  
 
* Independent learning via online resources  
* Clean, transform and analyse data together
 
 
|-
 
|-
 
| Data Cleaning and Transformation  
 
| Data Cleaning and Transformation  
 
|
 
|
* Need to crawl data on website to obtain company information
+
* Need to crawl data on website to obtain training data for text classification
 
* Clean, transform the data together  
 
* Clean, transform the data together  
|-
 
 
|}
 
|}
</center>
+
 
 
<br/>
 
<br/>
  
 
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>TIMELINE</center></font></div>==
 
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>TIMELINE</center></font></div>==
<br/>
+
[[File:Timeline 2.png|800px|frameless|center]]
To be filled!
 
  
 +
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>REFERENCES</center></font></div>==
 +
* https://www.dsta.gov.sg/docs/default-source/dsta-about/dh02200602-gebiz-from-vision-to-reality.pdf?sfvrsn=2
 +
* https://support.rstudio.com/hc/en-us/articles/201057987-Quick-list-of-useful-R-packages
 +
* http://enhancedatascience.com/2017/07/06/pick-best-r-packages-interactive-plot-visualisation-22/
 +
* http://www.sthda.com/english/wiki/text-mining-and-word-cloud-fundamentals-in-r-5-simple-steps-you-should-know
 +
* https://www.r-graph-gallery.com/
 +
* https://rpubs.com/brandonkopp/creating-a-treemap-in-r
 +
* https://cran.r-project.org/web/packages/visNetwork/vignettes/Introduction-to-visNetwork.html
 +
* https://www.displayr.com/sankey-diagrams-r/
 +
* https://towardsdatascience.com/using-networkd3-in-r-to-create-simple-and-clear-sankey-diagrams-48f8ba8a4ace
 +
* Datacamp Building Web Applications in R with Shiny Course
 +
* https://www.r-graph-gallery.com/the-wordcloud2-library/
 +
* https://rstudio.github.io/shinythemes/
 +
* https://rstudio.github.io/DT/shiny.html
 +
* https://rstudio-pubs-static.s3.amazonaws.com/72023_670962b57f444c04999fd1a0a393e113.html
  
<br/>
 
 
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>COMMENTS</center></font></div>==
 
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Lato"><center>COMMENTS</center></font></div>==
<br/>
+
Feel free to leave us some comments so that we can improve!
 
 
Feel free to leave us some comments so that we can improve! We dont bite :)
 
  
 
<center>
 
<center>

Latest revision as of 15:41, 25 November 2018

Geviz.png


Team

 

Proposal

 

Poster

 

Application

 

Research Paper

 

Back to Project Groups


PROBLEM & MOTIVATION

GeBIZ is a Singapore Government’s one-stop e-procurement portal which facilitates tender activities between Singapore government and local and overseas suppliers. Currently, there is no available tool to aid the public and ministries to understand and gain insights on the procurement made by the government under each ministry. Hence, we are motivated to create an interactive visualisation tool on government's procurement spending to allow the public and ministries to identify spending patterns and gain insights into procurement spending under each ministry.

OBJECTIVES

In this project, we are creating a visualisation that is able to show the following:

  • Gain an overview of procurement spending made by each ministry and agency
  • Identify the relationships between ministries, agencies and suppliers
  • Identify what are the goods and services procured by ministries and agencies under each category

SELECTED DATASETS

The following datasets will be used for analysis , as elaborated below:

Dataset/Source Data Attributes Rationale of Usage
Government Procurement Data (https://data.gov.sg/dataset/government-procurement)
  • Tender No
  • Agency
  • Tender Description
  • Award Date
  • Tender Detail Status
  • Supplier Name
  • Awarded Amount
To gain information on government procurement such as tender description, amount and supplier information
Ministry and Agencies List
  • Ministry
  • Agency
We will be looking through the Singapore Government Directory (https://www.gov.sg/sgdi/ministries) to categorise the agencies into their respective ministries. This will allow us to visualise the procurement spending on a ministry level.


APPROACH

Approach 2.png

Exploratory Data Analytics
We used Tableau to perform EDA to better understand our dataset and to aid us in the conceptualization of our story board.

Data Cleaning and Feature Creation
We used Excel and Python to create a new column showing the Ministry that each agency belongs to by merging with data obtained from the Singapore Government Directory.

Text Classification using Support Vector Classifier (SVC)
One of the key challenges of working with the provided procurement dataset is the absence of categorization of each procurement transaction. Instead of labelling manually, we applied machine learning to classify the tender descriptions into different categories. We firstly scraped the procurement descriptions and categories from GeBiz website using Selenium and BeautifulSoup libraries in Python to be used as the training and validation dataset in our Support Vector Classifier model. We were able to achieve 90% for training accuracy before performing the categorization prediction.

Government Procurement Dataset after Text Classification

Dataset/Source Data Attributes
Government Procurement Data
  • Tender No
  • Agency
  • Tender Description
  • Award Date
  • Tender Detail Status
  • Supplier Name
  • Awarded Amount
  • Category
  • Sub Category


Visualization in R
The web application will be built in R and deployed to Shinyapps.io

BACKGROUND SURVEY OF RELATED WORKS

Some of these visualizations that we draw inspiration from, are as follows:

Reference of Other Interactive Visualization What We Can Learn

Title : Pareto Analysis of Suppliers

Pareto Analysis Reference.png

Source: https://goo.gl/P9RjHk

  • The use of pareto chart allows us to identify the component(s) which is/are contributing significantly and how the categorises value sums up cumlatively.

Title : Word Cloud on Procurement Details

Word cloud reference.png

Source: https://linpack-for-tableau.com/data-visualizations/tableau-dashboards/procurement-dashboard/procurement-cockpit/

  • Based on the size of the words, we can identify the keywords quickly. We can learn that by making frequency used keywords stand out can prevent us from overlooking it if it is presented in a tabular format.

Title : Breakdown on UK Government Spending

Treemap ref 2.png

Source: http://www.nickmalleson.co.uk/2014/02/uk-government-spending-treemap.html

  • Based on the chart, we can gain an overview of how the spending is breakdown according to the size and colour of the box.
  • From this, we are able to identify outliers or signficant contribution quickly at a glance.

Title : Team Budget Breadown

Sankey diagram reference.png

Source:https://acquireprocure.com/spend-analysis-visualisation/3-reasons-procurement-professionals-use-sankey-diagrams

  • Based on the path of the sankey chart, we are able to identify the cash flow from one end to the other end.
  • The size of the path allows us to to identify how signficant it is in terms of value.

Title : Analyzing Involved Authorities, Tenders and Companies

Graph network references.png

Source: https://linkurio.us/blog/exploring-e1-3-trillion-in-public-contracts-with-graph-visualization/#!prettyPhotooard

  • Based on the charts, we are able to identify key relationship between objects quickly based on the nodes and edges.
  • From this, we can find common nodes as well.

BRAINSTORMING SESSIONS

Brainstorm2.png

During our brainstorming session, we came out a list of visualization which are able to achieve our objectives and eventually shortlisted 4 visualization - Tree map, Network Graph, Sankey Graph and Word Cloud. After rounds of refinement and consultations with our Professor, the image above is the final draft for our visualization.

PROPOSED STORYBOARD

Our group has proposed the following storyboard in the use of our visual application:

Proposed Layout What We Can Analyse

Overview of Procurement Spending

Storyboard treemap.png

Source:https://www.r-graph-gallery.com/treemap/

  • Treemap to show the spending breakdown for each category of all agencies under the selected ministry.
  • The interactive filters are year and ministry.

Relationship between Ministry and Suppliers

Storyboard network.png

Source: https://cran.r-project.org/web/packages/visNetwork/vignettes/Introduction-to-visNetwork.html

  • Network diagram to show the relationship of agencies and suppliers of the selected ministry.
  • The interactive filters are year, ministry and range filter which allows the user to filter the suppliers based on the procurement amount.

Cashflow from Agency to Suppliers

Storyboard sankey.png

Source: https://www.r-graph-gallery.com/sankey-diagram/

  • Sankey diagram to show the cash flow between selected agency and suppliers for the selected category.
  • The interactive filters are year, ministry, agency and category.

Word Cloud on Goods and Services Procured

Storyboard wordcloud.png

Source:http://www.sthda.com/english/wiki/text-mining-and-word-cloud-fundamentals-in-r-5-simple-steps-you-should-know

  • Word cloud to show an overview of the tender description for the selected agency and selected category.
  • The filters are year, ministry, agency and category.
  • We added a searchable table below the word cloud to allow the user to search for keywords and view the exact tender description

TOOLS & TECHNOLOGIES

Tools and technologies

Tools used.png


Data Architecture

Data architecture 2.png


KEY CHALLENGES

The following are some of the key technical challenges that we may face throughout the course of the project:

Key Challenges Mitigation Plan
Unfamiliarity with R and Rshiny Libraries
  • Attend R Shiny Workshop
  • Independent learning via online resources such as Datacamp
  • Ask team mates for help
Unfamiliarity with Libraries for Machine Learning and Web Crawling
  • Clean, transform and analyse data together
  • Independent learning via online resources
Data Cleaning and Transformation
  • Need to crawl data on website to obtain training data for text classification
  • Clean, transform the data together


TIMELINE

Timeline 2.png

REFERENCES

COMMENTS

Feel free to leave us some comments so that we can improve!

No. Name Date Comments
1. Insert your name here Insert date here Insert comment here
2. Insert your name here Insert date here Insert comment here
3. Insert your name here Insert date here Insert comment here