Difference between revisions of "ANLY482 AY2017-18 T1 Group2 Project EZLin Project Data"

From Analytics Practicum
Jump to navigation Jump to search
Line 47: Line 47:
 
== Mapping of Data ==
 
== Mapping of Data ==
 
<div style="font-size: 16px;">
 
<div style="font-size: 16px;">
The initial 20 excels sheets have different format and include various information. There are sheets on the exchange rate between different countries, BOM of the plants and markets in various APAC countries, and etc. In order to identify: <br/>
+
The initial 22 excels sheets have different format and include various information. There are sheets on the exchange rate between different countries, BOM of the plants and markets in various APAC countries, and etc. In order to identify: <br/>
 
1) the flow of manufacturing and distributing, <br/>
 
1) the flow of manufacturing and distributing, <br/>
 
2) the location of the manufacturing plant and markets and <br/>
 
2) the location of the manufacturing plant and markets and <br/>
 
3) the cost spending on raw materials and overhead, <br/>
 
3) the cost spending on raw materials and overhead, <br/>
 
we need to map all these sheets together, through identifying primary and secondary key in each sheet before linking all the sheets together.<br/>
 
we need to map all these sheets together, through identifying primary and secondary key in each sheet before linking all the sheets together.<br/>
'''Combine all excel files for BOM into one csv file'''<br/>
 
* Necessary for data extraction like getting data for different plant at one time.
 
'''Combine Purchase Information excel sheet with Purchase Information Linkage '''<br/>
 
* As there are too many useless columns in these two files, we decided to select the useful ones and combine them together
 
'''Combine material master excel sheet with Special procurement type description
 
* These two files are closely related. Adding the description of special procurement type to material master can help us to understand the data better.
 
</div>
 
<br/>
 
 
== Variable Transformation ==
 
<div style="font-size: 16px;">
 
 
Due to the large number of files received, we had to integrate all the files into a single master copy for our analysis and visualisation. The tool we used to consolidate the files is JMP. <br/>
 
Due to the large number of files received, we had to integrate all the files into a single master copy for our analysis and visualisation. The tool we used to consolidate the files is JMP. <br/>
 
'''Integrating the purchase information files''' <br/>
 
'''Integrating the purchase information files''' <br/>
Line 77: Line 66:
 
* Before we standardize any unit of amount, we need to join the consolidated table with Dimension Conversion file to ensure the amount stands for the price of one single piece of material instead of multiple
 
* Before we standardize any unit of amount, we need to join the consolidated table with Dimension Conversion file to ensure the amount stands for the price of one single piece of material instead of multiple
 
* We will use outer join to join the table and retrieve the UoM value for each material in order to identify the amount for one piece of material. <br/>
 
* We will use outer join to join the table and retrieve the UoM value for each material in order to identify the amount for one piece of material. <br/>
 +
'''Integrating finishing products details''' <br/>
 +
* Consolidate all the internal manufacturing and subcontracting plant data into a single file
 +
* Consisted of the various cost and components for all levels of the Bills of Material (BOM), which together constitute the Finish Good that will be sent to the respective plant <br/>
 +
</div>
 +
<br/>
 +
 +
== Variable Transformation ==
 +
<div style="font-size: 16px;">
 +
The Amount which stands for the price of the materials are not standardized, in different currencies at different unit pieces. Therefore we need to make transformation before we make any visualization and analysis. <br/>
 
'''Transform the value of Amount''' <br/>
 
'''Transform the value of Amount''' <br/>
 
* Transfer and standardize the Amount based on UoM, per and Currency
 
* Transfer and standardize the Amount based on UoM, per and Currency
Line 82: Line 80:
 
* Standardize all the per value to 100
 
* Standardize all the per value to 100
 
* Standardize all the currency to USD to view the distribution of cost of materials across country or region <br/>
 
* Standardize all the currency to USD to view the distribution of cost of materials across country or region <br/>
'''Integrating finishing products details''' <br/>
 
* Consolidate all the internal manufacturing and subcontracting plant data into a single file
 
* Consisted of the various cost and components for all levels of the Bills of Material (BOM), which together constitute the Finish Good that will be sent to the respective plant <br/>
 
 
</div>
 
</div>
 
<br/>
 
<br/>

Revision as of 09:10, 16 October 2017


HOME

 

ABOUT US

 

PROJECT OVERVIEW

 

PROJECT MANAGEMENT

 

DOCUMENTATION

 

 


We received various data files for our exploration and analysis. There are three main internal manufacturing plants and other plants are subcontracting plants that do not produce any goods in house.

Mapping of Data

The initial 22 excels sheets have different format and include various information. There are sheets on the exchange rate between different countries, BOM of the plants and markets in various APAC countries, and etc. In order to identify:
1) the flow of manufacturing and distributing,
2) the location of the manufacturing plant and markets and
3) the cost spending on raw materials and overhead,
we need to map all these sheets together, through identifying primary and secondary key in each sheet before linking all the sheets together.
Due to the large number of files received, we had to integrate all the files into a single master copy for our analysis and visualisation. The tool we used to consolidate the files is JMP.
Integrating the purchase information files

  • Joined two purchase information files together. We join the tables through their private key attributes conditional record number using outer join.
  • Remove the unnecessary columns and select the columns that identify the identical purchase information and the columns that provides the details for a certain purchase information.

Integrating the Vendor and Plant files

  • Joined the consolidated PIR file with external vendor description file with Vendor number using inner join
  • Join plant description file with plant number using left outer join to filter out the plants which do not produce goods
  • The consolidated file consists of details of all the external vendors and the details of all the manufacturing plants, internal vendors and Asia Pacific Supply Chain(APSC) plants.

Integrating material master file

  • Include material master file with inner join to understand which plants or subcontract the materials are produced internally or purchased from external vendor
  • Identify the procurement type and special procurement type which will link the flow of the purchase of a material in our future visualization and analysis.

Integrating dimension conversion

  • Before we standardize any unit of amount, we need to join the consolidated table with Dimension Conversion file to ensure the amount stands for the price of one single piece of material instead of multiple
  • We will use outer join to join the table and retrieve the UoM value for each material in order to identify the amount for one piece of material.

Integrating finishing products details

  • Consolidate all the internal manufacturing and subcontracting plant data into a single file
  • Consisted of the various cost and components for all levels of the Bills of Material (BOM), which together constitute the Finish Good that will be sent to the respective plant


Variable Transformation

The Amount which stands for the price of the materials are not standardized, in different currencies at different unit pieces. Therefore we need to make transformation before we make any visualization and analysis.
Transform the value of Amount

  • Transfer and standardize the Amount based on UoM, per and Currency
  • Divide the Amount by UoM to get the cost per piece of material (Amount/UoM)
  • Standardize all the per value to 100
  • Standardize all the currency to USD to view the distribution of cost of materials across country or region


Missing Variable

In order to track the relationship between excel sheets and help with further analysis.
Add Bom_Id, SerielCode column to all excel files about BOM.

  • Adding Bom_Id: easier to extract one BOM for future analysis
  • Adding SerialCode: To keep track of the BOM level relationship & For converting to JSON (explain later)

Add SubconOrNot column to all excel files about BOM

  • Adding SubconOrNot: Easier to filter data for future analysis


Multi Entry Data

We identified that some excel sheets include multiple entry of the same product, with different variable values.
This occurs due to various issues such as human errors, or generating reports in new data format using the old systems in the company. After talking to the sponsors, it is believed that the multi entry of the data need to be combined before we start working on any analysis.


Data Visualization Transformation

Exploratory Data Analysis was conducted to understand which variables were meaningful variables to be included in the analysis and that will allow us to achieve the objective of mapping out the end-to-end supply chain.

  • Some of the meaningful variables included the vendor, vendor country, CnTy, number of level, number of Finished Goods (Level 1), number of finished goods per plant, aggregated cost for each component for each plant.
  • From the summary and initial exploratory data analysis, it can be seen that Adult Wash has 6 different BOL levels and is being supplied from 10 different countries with 3 countries being the company’s internal manufacturing site.
  • In total, there are 158 Level 1 Finished Products coming out from the internal manufacturing plant and subcontracting plant. However, each L1 finished product does not equate a unique product. Some finish products may consist of other L1 product from another plant that has been repackaged into a new product, with a new Material No, either for promotional or repackaging purpose.
  • Based on the Net Trade Sales data, initial exploratory data analysis was also done. As seen from the chart above, China has the largest Net Trade Sales (NTS) by dollar value for the period from January to August 2017. Even though no in-depth analysis has been done yet, the chart shows a clear representation of the top countries by NTS value.