Difference between revisions of "Group12 Report"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 28: Line 28:
  
  
Cross shareholding is a situation in which a corporation owns stock in another company. So, technically, corporations own securities issued by other corporations. Cross shareholding can lead to double counting, whereby the equity of each company is counted twice when determining value. When double counting occurs, the security's value is counted twice, which can result in estimating the wrong value of the two companies.  
+
Cross shareholding is a situation in which a corporation owns stock in another company. So, technically, corporations own securities issued by other corporations. The good thing about cross shareholding is it allows the owner to set up a new company to start any business easily. The newly created entity will be backed by strong equity from existing companies and it gives a massive advantage. However, cross shareholding can lead to double counting, whereby the equity of each company is counted twice when determining value. When double counting occurs, the security's value is counted twice, which can result in estimating the wrong value of the two companies.  
  
Cross shareholding is very common in corporate world. Sometimes, there can be more than 10 companies involved and it is very difficult for investors and regulators to track who owns how much.  
+
Cross shareholding is very common in the corporate world. Sometimes, there can be more than 10 companies involved and it is very difficult for investors and regulators to track who owns how much.  
  
In this project, our group choose 1 or 2 big groups of companies from Korea and China with heavy cross shareholding between each other and conduct visualization and relationship analysis on their networks using R-Shiny so that people can have better picture of these companies’ network and easier to understand relationship between companies.
+
In this project, our group choose 1 or 2 big groups of companies from Korea and China with heavy cross shareholding between each other and conduct visualization and relationship analysis on their networks using R-Shiny so that people can have a better picture of these companies’ network and easier to understand the relationship between companies.
 +
 
 +
[[File:Asia.jpg|350px]]
  
 
=== Background ===
 
=== Background ===
  
The stock markets are not the same across the countries. There is only one stock market in Korea. However, in China, there are three stock markets, one is in Shanghai, one is in Shenzhen and the other one is in Hong Kong.   
+
The stock markets are not the same across the countries. There is only one stock market in Korea. However, in China, there are three stock markets, one is in Shanghai, one is in Shenzhen and Hong Kong.   
According to what learnt from visualization class, our group decided to do the visualization analysis for the top stocks listed in Korea and China. We choose Samsung group in Korea.
+
Our group was inspired by one of a lecture on network visualization and decided to do the visualization analysis for the top stocks listed in Korea and China. We choose Samsung group in Korea.
  
 
For Korea, why our group choose Samsung group as the analysed target. There are four major reasons to choose Samsung group as our final selection.  
 
For Korea, why our group choose Samsung group as the analysed target. There are four major reasons to choose Samsung group as our final selection.  
Line 43: Line 45:
 
1. Samsung group is the largest company group controlled by single owner family in Korea
 
1. Samsung group is the largest company group controlled by single owner family in Korea
  
2. Samsung group has 62 companies, 16 listed companies and 46 unlisted companies, these companies are holding each other
+
2. Samsung group has 62 companies, 16 listed companies, and 46 unlisted companies, these companies are holding each other
  
3. Samsung Electronics in Samsung group reached the market value of USD 200 billion, which is about 22% of KOSPI capital
+
3. Samsung Electronics in Samsung group reached the market value of USD 200 billion, which is about 22% of KOSPI capital in 2017 Q2
  
4. Samsung Group takes around 32% of KOSPI capital
+
4. Samsung Group takes around 32% of KOSPI capital in 2017 Q2
  
For China stock market, our group decide to use the top 5 companies listed in the stock market which are Industrial and Commercial Bank, PetroChina, Agricultural Bank, Bank of China, and PingAn insurance.
+
For China stock market, our group decided to use the top 5 companies listed in the stock market which are Industrial and Commercial Bank, PetroChina, Agricultural Bank, Bank of China, and PingAn insurance.
  
However, after go though all the data, our group found that the Stocks in China could not show things straight forward to us. There are only two level. Several big national companies hold the share of the top 5 companies. In contrast, Samsung group provide a better view of the visualization, which we can find many loops between several stocks.
+
However, after going through all the data, our group found that the company network in China is very straightforward with most of the groups comprise of two level. Several big national companies hold the share of the top 5 companies. In contrast, Samsung group has more complex network due to its large number of listed companies, as well as many loops between several stocks.
  
In additionally, our group also had done the graphs for top 5 companies in China. There are only two levels for the whole view. The shareholders and the stock. We can't find any loop. Only several big companies hold many stocks in the market. Such as China security finance, Central Huijin Asset and so on.
+
In addition, our group also had done the graphs for top 5 companies in China. There are only two levels of the whole view. The shareholders and the stock. We can't find any loop. Only several big companies hold many stocks in the market. Such as China security finance, Central Huijin Asset and so on.<br/>
<br>[[File:Group12China2017Q1.jpg|center|900px]]
+
[[File:Group12China2017Q1.jpg|900px]]
  
 
Finally, our group will only focus on the Samsung group visualization.
 
Finally, our group will only focus on the Samsung group visualization.
  
 
=== Chaebol ===
 
=== Chaebol ===
 +
 
A chaebol (from Korean jaebeol) is a large industrial conglomerate that is run and controlled by an owner or family in South Korea. A chaebol often consists of a large number of diversified affiliates, controlled by an owner whose power over the group often exceeds legal authority. The term is often used in a context similar to that of the English word "conglomerate". The term was first used in 1984. There are several dozen large Korean family-controlled corporate groups which fall under this definition.
 
A chaebol (from Korean jaebeol) is a large industrial conglomerate that is run and controlled by an owner or family in South Korea. A chaebol often consists of a large number of diversified affiliates, controlled by an owner whose power over the group often exceeds legal authority. The term is often used in a context similar to that of the English word "conglomerate". The term was first used in 1984. There are several dozen large Korean family-controlled corporate groups which fall under this definition.
  
 
=== Motivation ===
 
=== Motivation ===
Cross shareholding is very important when understanding market behavior. It gets more important when a lot of companies are owned each other which may cause an event on one company result in the others.
 
 
http://www.theborneopost.com/2015/10/09/samsungs-lee-top-asias-richest-families-list/
 
  
 +
Cross shareholding is very important when understanding market behavior. It gets more important when a lot of companies are owned each other which may cause an event on one company result in the others.
 
Due to its market nature in South Korea with Chaebol, it often gets very difficult to understand ownership connection between company groups. Samsung, Hyundai, LG, Lotte, and Hanjin are the biggest company group run by Chaebol family.
 
Due to its market nature in South Korea with Chaebol, it often gets very difficult to understand ownership connection between company groups. Samsung, Hyundai, LG, Lotte, and Hanjin are the biggest company group run by Chaebol family.
 
Also, our group is interested in company structure difference between South Korea and China.
 
Also, our group is interested in company structure difference between South Korea and China.
  
 
=== Objectives ===
 
=== Objectives ===
 +
 
With help of visualization tool in this module, our group will focus company network of one of the biggest company group in Korea, Samsung as well as 5 top companies in China. The objectives are as follows:
 
With help of visualization tool in this module, our group will focus company network of one of the biggest company group in Korea, Samsung as well as 5 top companies in China. The objectives are as follows:
  
Line 79: Line 81:
  
 
== Review and critic on past works ==
 
== Review and critic on past works ==
 +
 
=== Hyundai ===
 
=== Hyundai ===
Nikkei Asian Review posted research on Hyundai Group. Although it contains general overview of Hyundai’s company structure, show much shares, percentage and historical relationship are not defined.
 
  
https://asia.nikkei.com/Business/Companies/The-tower-that-cars-and-family-politics-built
+
Nikkei Asian Review posted research on Hyundai Group. Although it contains a general overview of Hyundai’s company structure, show much shares, percentage and historical relationship are not defined. [https://asia.nikkei.com/Business/Companies/The-tower-that-cars-and-family-politics-built] <br/>
 +
[[File:Critic.png|300px]]
 +
 
 +
 
 
=== Samsung ===
 
=== Samsung ===
Post by Korean media, published in English translated article on Samsung network at the time straight after merger in 2015. It gives ownership but it would be nice to show before and after diagram with historical relationship.
 
  
http://pulsenews.co.kr/view.php?year=2015&no=1216267
+
Post by Korean media, published in English translated article on Samsung network at the time straight after merger in 2015. It gives ownership but it would be nice to show before and after diagram with historical relationship. [http://pulsenews.co.kr/view.php?year=2015&no=1216267]<br/>
 +
[[File:Critic2.jpg|300px]]
 +
 
 +
 
 
=== Lotte ===
 
=== Lotte ===
Lotte also has very complicated structure. Lotte’s network by Korea Herald. It shows network at single given time. Again, ownership and historical relationship is missing.
 
  
http://www.koreaherald.com/view.php?ud=20160707000985
+
Lotte also has very complicated structure. Lotte’s network by Korea Herald. It shows network at single given time. Again, ownership and historical relationship is missing. [http://www.koreaherald.com/view.php?ud=20160707000985]<br/>
 +
[[File:Critic3.jpg|300px]]
 +
 
 +
 
 +
== Data Preparation ==
  
== Data Preparation Data Preparation ==
 
 
=== Samsung Group ===
 
=== Samsung Group ===
It's not easy to get the dataset for Samsung group. It is not only because the language is Korean, but also because the dataset are distributed in many different resources. Lastly, our group download the data from the Korea stock exchange by using python code. The code is as below.
 
<br>[[File:Group12DownloadData.JPG|center|800px]]
 
  
Then the following steps are what we did in order to clean the data.
+
Data on Korean companies are usually only available in the Korean language in DART. Dart is an official Korean website where all Korean listed companies' announcements are. It is compulsory for Korean companies to announce proportions of their shareholdings in the quarterly financial statement. We used python to crawl our target data using Python. Below codes shows code we used to pull data from the financial statement in HTML code. <br/>
<br>[[File:Group12DataPre1.JPG|center|700px]]
+
[[File:Group12DownloadData.JPG|800px]]
 +
 
 +
Once crawled, data is translated into English with Company, Stockholder, proportion, and equity value.  <br/>
 +
 
 +
Then the data need to be cleaned, the final data need to be from the following datasets.<br/>
 +
 
 +
*Raw Node Data<br/>
 +
 
 +
[[File:Group12RawNode.jpg|700px]]<br/>
 +
 
 +
*Raw Edge Data<br/>
 +
 
 +
[[File:Group12RawEdge.jpg|900px]]<br/>
 +
 
 +
*Samsung Group Stock Value in KOSPI<br/>
  
Then the data need to be cleaned, the final data need to be like the following datasets.
+
[[File:Group12KOSPIVALUE.jpg|800px]]<br/>
*Raw Node Data
 
<br>[[File:Group12RawNode.jpg|center|700px]]
 
*Raw Edge Data
 
<br>[[File:Group12RawEdge.jpg|center|900px]]
 
  
*Samsung Group Stock Value in KOSPI
 
<br>[[File:Group12KOSPIVALUE.jpg|center|800px]]
 
 
In the above diagram, the blue ones are the companies in Samsung group, the grey ones are the other companies. it can easily find the top three companies in Samsung group, SAMSUNG ELECTRONICS, SAMSUNG C&T and Samsung Life Insurance. Especially, SAMSUNG ELECTRONICS takes almost 22% of KOSPI capital. And also the Samsung Group takes 32% of KOSPI capital.
 
In the above diagram, the blue ones are the companies in Samsung group, the grey ones are the other companies. it can easily find the top three companies in Samsung group, SAMSUNG ELECTRONICS, SAMSUNG C&T and Samsung Life Insurance. Especially, SAMSUNG ELECTRONICS takes almost 22% of KOSPI capital. And also the Samsung Group takes 32% of KOSPI capital.
*Trend of the KOSPI
+
 
<br>[[File:Group12KOSPITrend.jpg|center|800px]]
+
*Trend of the KOSPI<br/>
In the above diagram, the blue color is the trend of market value for Samsung group and the grey color is the trend of market value for other companies in KOSPI. Comparing the Samsung with other companies, it can easily find that the market value is more stable for Samsung group. You can also find a sharply drop in 2008 that was mainly caused by the global economic crisis in 2008.
+
 
 +
[[File:Group12KOSPITrend.jpg|800px]]<br/>
 +
 
 +
In the above diagram, the blue color is the trend of market value for Samsung group and the grey color is the trend of market value for other companies in KOSPI. Comparing the Samsung with other companies, it can easily find that the market value is more stable for Samsung group. You can also find a sharp drop in 2008 that was mainly caused by the global economic crisis in 2008.
  
 
=== China Top 10 Companies Data Preparation ===
 
=== China Top 10 Companies Data Preparation ===
 +
 
There are more than one hundred thousands of stocks list in China Stock Market. For the analysis purpose, our group just do the top 5 companies in China.
 
There are more than one hundred thousands of stocks list in China Stock Market. For the analysis purpose, our group just do the top 5 companies in China.
The distribution of the top 10 stocks in China displayed as below screenshot. The top five stocks list in China are Industrial and Commercial Bank, PetroChina, Agricultural Bank, Bank of China, and PingAn insurance. Three out of five are banks(ICBC, ABC, BOC). The other two is insurance and petro. The proportions of these five companies are around 17% out of all China stock market values. The top 10 companies proportion is around 25% out of all China stock market value.
+
The distribution of the top 10 stocks in China displayed as below screenshot. The top five stocks list in China are Industrial and Commercial Bank, PetroChina, Agricultural Bank, Bank of China, and PingAn insurance. Three out of five are banks(ICBC, ABC, BOC). The other two is insurance and petrol. The proportions of these five companies are around 17% out of all China stock market values. The top 10 companies proportion is around 25% out of all China stock market value.
  
*Top 10 companies proportion in China stock market
+
*Top 10 companies proportion in China stock market<br/>
<br>[[File:Group12top10.jpg|center|800px]]
 
  
*Industries trend in China stock market
+
[[File:Group12top10.jpg|800px]]<br/>
<br>[[File:Group12ChinaStockIndustrialOverview.JPG|center|1100px]]
 
  
 +
*Industries trend in China stock market<br/>
  
For the data preparation of the China stock, we got the datasets from OneDriver in SMU library. We download the stock profiles
+
[[File:Group12ChinaStockIndustrialOverview.JPG|1100px]]<br/>
, shareholders, equity, and other datasets and the time period is about 20 years. In the dataset, we joint all the table together and make the final sheet like following table.
+
 
 +
 
 +
For the data preparation of the China stock, we got the datasets from OneDriver in SMU library. We download the stock profiles, shareholders, equity, and other datasets and the time period is about 20 years. In the dataset, we joint all the table together and make the final sheet like the following table.
  
 
== Design Framework of Cross Shareholding Application ==
 
== Design Framework of Cross Shareholding Application ==
 +
 
=== Overview of the Samsung Group in the application ===
 
=== Overview of the Samsung Group in the application ===
The following diagram is the overview of Samsung group network diagram.
 
  
[[File:Group12OverviewOfSamsung.jpg|center|1200px]]
+
The following diagram is the overview of Samsung group network diagram.<br/>
 +
 
 +
[[File:Group12OverviewOfSamsung.jpg|1200px]]
  
 
=== R Packages ===
 
=== R Packages ===
Our group use several Rshiny packages. And also we use Tableau and JMP to do the cleaning for the datasets.  
+
 
 +
Our group uses several R shiny packages. And also we use Tableau and JMP to do the cleaning for the datasets.  
 
The following packages are the R packages used in the coding part.
 
The following packages are the R packages used in the coding part.
 
*igraph  : Fundamental package allowing us to convert network data to ‘graph’ and then to perform Centrality-based network analysis.  
 
*igraph  : Fundamental package allowing us to convert network data to ‘graph’ and then to perform Centrality-based network analysis.  
Line 143: Line 166:
  
 
=== Application Overview ===
 
=== Application Overview ===
There are three layers and two levels in the application, the first level contains Data Manipulation Layer, Filter Interactive Layer, and Graph Render Layer.
+
 
The second level is the Shiny Dashboard.  
+
There are three layers and two levels of the application, the first level contains Data Manipulation Layer, Filter Interactive Layer, and Graph Render Layer.
 +
The second level is the Shiny Dashboard. <br/>
 
[[File:Group12ApplicationOverview.JPG|center|950px]]
 
[[File:Group12ApplicationOverview.JPG|center|950px]]
  
 
=== Application Workflow ===
 
=== Application Workflow ===
  
For the different layers in the first level, the features are different. Please refer to following diagram to understand the functions of each layer.
+
For the different layers in the first level, the features are different. Please refer to the following diagram to understand the functions of each layer.<br/>
[[File:Group12ApplicationWorkflow.JPG|center|1200px]]
+
[[File:Group12ApplicationWorkflow.JPG|1200px]]
  
 
=== Application Features ===
 
=== Application Features ===
  
There are many tool bars in the application. Such as zoom in/out, sliders, textinput and so on. Please refer to the following table to get the knowledge of the functions in the tool bar.
+
There are many toolbars in the application. Such as zoom in/out, sliders, text input and so on. Please refer to the following table to get the knowledge of the functions in the toolbar.<br/>
[[File:Group12ApplicationFeatures.JPG|center|700px]]
+
[[File:Group12ApplicationFeatures.JPG|700px]]
  
 
=== Shiny Dashboard ===
 
=== Shiny Dashboard ===
 +
 
The following is the design features of the dashboard.  
 
The following is the design features of the dashboard.  
  
 
*Dashboard Page
 
*Dashboard Page
The colour of the Skin of the dashboard is blue, and the header is the fixed title "CrossHolding Dashboard"   
+
The color of the Skin of the dashboard is blue, and the header is the fixed title "CrossHolding Dashboard"   
 
*SideBar
 
*SideBar
The sliders are the filter which can filter data by Year and Quarter. The Text can use to filter data by Holder and Stock names. The SelectInput can filter data Centrality Type.
+
The sliders are the filter which can filter data by Year and Quarter. The Text can use to filter data by Holder and Stock names. The select input can filter data Centrality Type.
The radioButtion can filter data PreAssignedLevel Or All.
+
The radio button can filter data PreAssignedLevel Or All.
 
*Body
 
*Body
The tableItem is Graph, the fluidRow is network graph and the tableItem you can refer to the Rawdata.
+
The tableItem is Graph, the fluidRow is network graph and the tableItem you can refer to the Raw data.
 
*rawData
 
*rawData
There are Nodes and Edges in the rawdata.
+
There are Nodes and Edges in the raw data.
  
 
== Demonstration ==
 
== Demonstration ==
 +
 
== Discussion & Future Work ==
 
== Discussion & Future Work ==
 +
 
Although the application can display the overview of the whole Samsung group and the shareholders between the companies in the Samsung group. But there are several things need to be discussed, and also there are a few improvement for future work.  
 
Although the application can display the overview of the whole Samsung group and the shareholders between the companies in the Samsung group. But there are several things need to be discussed, and also there are a few improvement for future work.  
 +
 
=== Discussion ===
 
=== Discussion ===
 +
 
The following details you can get from the application.
 
The following details you can get from the application.
 
*Introduce company structure in Korea where most of the information is available in Korean only
 
*Introduce company structure in Korea where most of the information is available in Korean only
Line 180: Line 209:
  
 
=== Future Work ===
 
=== Future Work ===
 +
 
*Share ownership line to show indirect holdings
 
*Share ownership line to show indirect holdings
*Able to calculate actual value in ownership
+
*Able to calculate actual value of ownership
 
*Able to highlight and detect loop ownership
 
*Able to highlight and detect loop ownership
  
 
== Installation guide ==
 
== Installation guide ==
 +
 
'''<big>For online user:</big>'''  
 
'''<big>For online user:</big>'''  
 
* Recommend using Chrome
 
* Recommend using Chrome
* User just needs to launch the application from https://koreastockcrossholding.shinyapps.io/ksch1203/ without the need to install anything locally.
+
* User just needs to launch the application from [https://koreastockcrossholding.shinyapps.io/ksch1203/] without the need to install anything locally.
  
 
'''<big>For local user or developer:</big>'''
 
'''<big>For local user or developer:</big>'''
Line 195: Line 226:
 
* Intel® Core™ i5-7500 Processor
 
* Intel® Core™ i5-7500 Processor
 
* 10GB free space
 
* 10GB free space
* MicroSoft Windows 10 Home,64-bit Operating System
+
* Microsoft Windows 10 Home,64-bit Operating System
  
 
2.R :
 
2.R :
Line 201: Line 232:
  
 
3.R Studio :
 
3.R Studio :
R Studio Version 1.0.153 or later shall be installed from: https://www.rstudio.com/  
+
R Studio Version 1.0.153 or later shall be installed from [https://www.rstudio.com/]
(At the point of the project,R Studio 1.0.153 is used for creating this application)
+
(At the point of the project, R Studio 1.0.153 is used for creating this application)
  
 
4.R Packages:
 
4.R Packages:
The following R packages are required,hence need to be installed through R Studio ->Tools -> Install Packages...  
+
The following R packages are required, hence need to be installed in R Studio ->Tools -> Install Packages...  
* shinydashboard
+
* shiny dashboard
 
* igraph
 
* igraph
 
* visNetwork
 
* visNetwork
Line 223: Line 254:
  
 
== User Guide ==
 
== User Guide ==
Users can use the application freely by click the url:https://koreastockcrossholding.shinyapps.io/ksch1203/. you’ll see in the ‘Graph’ tab, a default graph based on data from 2017 Quarter 1. Then you can refer to the following steps to change what you want in the application.
+
Users can use the application freely by clicking the url:[https://koreastockcrossholding.shinyapps.io/ksch1203/]. you’ll see in the ‘Graph’ tab, a default graph based on data from 2017 Quarter 1. Then you can refer to the following steps to change what you want in the application.
 
The overview of the application is as below screenshot.
 
The overview of the application is as below screenshot.
[[File:Group12ApplicationView.jpg|center|1100px]]
+
[[File:Group12ApplicationView.jpg|1100px]]
 
*Step 1
 
*Step 1
Set Year Filter : Move the slider
+
Set Year Filter: Move the slider
[[File:Group12Userguide1.jpg|center|400px]]
+
[[File:Group12Userguide1.jpg|400px]]
  
 
*Step 2
 
*Step 2
Set Quarter Filter : Move the slider
+
Set Quarter Filter: Move the slider
[[File:Group12Userguide2.jpg|center|400px]]
+
[[File:Group12Userguide2.jpg|400px]]
  
 
*Step 3
 
*Step 3
 
Search by Stock Name :  
 
Search by Stock Name :  
Type full or partial Stock name,for example, SAMSUNG, will render the graph with any Stock containing SAMSUNG in the name and the direct links
+
Type full or partial Stock name, for example, SAMSUNG, will render the graph with any Stock containing SAMSUNG in the name and the direct links
[[File:Group12Userguide3.jpg|center|800px]]
+
[[File:Group12Userguide3.jpg|800px]]
  
you can search multiple Stock Names delimited by |, for instance, LIFE|ELEC will filter out any Stock containing ‘LIFE’ or ‘ELEC’ in the name,and all the direct links from the filtered nodes
+
you can search multiple Stock Names delimited by |, for instance, LIFE|ELEC will filter out any Stock containing ‘LIFE’ or ‘ELEC’ in the name, and all the direct links from the filtered nodes
[[File:Group12Userguide4.jpg|center|800px]]
+
[[File:Group12Userguide4.jpg|800px]]
  
 
*Step 4
 
*Step 4
 
Search by Holder Name :  
 
Search by Holder Name :  
Type full or partial Holder name,for example, NPS, will render the graph with any Holder containing NPS in the name and the direct links
+
Type full or partial Holder name, for example, NPS, will render the graph with any Holder containing NPS in the name and the direct links
[[File:Group12Userguide5.jpg|center|800px]]
+
[[File:Group12Userguide5.jpg|800px]]
  
  
Similarly, you can also search by multiple Holder Names delimited by |, for instance, LIFE|ELEC will filter out any Stock containing ‘LIFE’ or ‘ELEC’ in the name,and all the direct links from the filtered nodes
+
Similarly, you can also search by multiple Holder Names delimited by |, for instance, LIFE|ELEC will filter out any Stock containing ‘LIFE’ or ‘ELEC’ in the name, and all the direct links from the filtered nodes
[[File:Group12Userguide6.jpg|center|800px]]
+
[[File:Group12Userguide6.jpg|800px]]
  
 
*Step 5
 
*Step 5
Render graph by NodeLevel: NodeLevel in the Hierarchy is determined by Centrality degree per different types(out,in,all), or determined by PreAssigned Level in Edge table.
+
Render graph by NodeLevel: NodeLevel in the Hierarchy is determined by Centrality degree per different types(out, in, all), or determined by PreAssigned Level in Edge table.
[[File:Group12Userguide7.jpg|center|300px]]
+
[[File:Group12Userguide7.jpg|300px]]
Swtich NodeLevel, you’ll see the graph redraw and nodes re-positioned to different level based on their centrality degree.
+
Swtich NodeLevel, you’ll see the graph redraw and nodes re-positioned to a different level based on their centrality degree.
  
 
*Step 6
 
*Step 6
 
Render graph by ‘EdgeDisplay’:  Default ‘EdgeDisplay’ is ‘All’ that means Edges are drawn solely as per the actual ownership between filtered Holders and Stocks.
 
Render graph by ‘EdgeDisplay’:  Default ‘EdgeDisplay’ is ‘All’ that means Edges are drawn solely as per the actual ownership between filtered Holders and Stocks.
[[File:Group12Userguide8.jpg|center|300px]]
+
[[File:Group12Userguide8.jpg|300px]]
If select ‘Per PreAssignedFlag’, some Edges will be hidden from the graph per Dataprovider’s intension.
+
If select ‘Per PreAssignedFlag’, some Edges will be hidden from the graph per Data providers intention.
  
 
*Step 7
 
*Step 7
‘Color-Degree’ legend:  At the left side of the graph,you’ll see an auto-generated legend, hich shows  you the actual Centrality Degree value per different colors:
+
‘Color-Degree’ legend:  At the left side of the graph, you’ll see an auto-generated legend, which shows  you the actual Centrality Degree value per different colors:
[[File:Group12Userguide9.jpg|center|80px]]
+
[[File:Group12Userguide9.jpg|80px]]
  
 
*Step 8
 
*Step 8
Line 272: Line 303:
  
 
== Reference ==
 
== Reference ==
 +
 
<references/>
 
<references/>
*Application URL: https://koreastockcrossholding.shinyapps.io/ksch1203/
+
*Application URL: [https://koreastockcrossholding.shinyapps.io/ksch1203/]
*Rhsiny coding part:https://www.rstudio.com/products/shiny/
+
*Rhsiny coding part: [https://www.rstudio.com/products/shiny/]
*Dataset for shareholders in Korea: http://dart.fss.or.kr/
+
*Dataset for shareholders in Korea: [http://dart.fss.or.kr/]
*Dataset for shareholders in China:http://us.gtadata.com.libproxy.smu.edu.sg/SingleTable/Index
+
*Dataset for shareholders in China: [http://us.gtadata.com.libproxy.smu.edu.sg/SingleTable/Index]
*Sunsang Group: http://www.theborneopost.com/2015/10/09/samsungs-lee-top-asias-richest-families-list/
+
*Sunsang Group: [http://www.theborneopost.com/2015/10/09/samsungs-lee-top-asias-richest-families-list/]
*Hyundai: https://asia.nikkei.com/Business/Companies/The-tower-that-cars-and-family-politics-built  
+
*Hyundai: [https://asia.nikkei.com/Business/Companies/The-tower-that-cars-and-family-politics-built]
*Samsung: http://pulsenews.co.kr/view.php?year=2015&no=1216267  
+
*Samsung: [http://pulsenews.co.kr/view.php?year=2015&no=1216267]
*Lotte: http://www.koreaherald.com/view.php?ud=20160707000985
+
*Lotte: [http://www.koreaherald.com/view.php?ud=20160707000985]

Revision as of 22:22, 3 December 2017

Group12Title.JPG Group 12 - Cross Shareholding

Proposal

Poster

Application

Report

 


Motivation of the application

Cross shareholding is a situation in which a corporation owns stock in another company. So, technically, corporations own securities issued by other corporations. The good thing about cross shareholding is it allows the owner to set up a new company to start any business easily. The newly created entity will be backed by strong equity from existing companies and it gives a massive advantage. However, cross shareholding can lead to double counting, whereby the equity of each company is counted twice when determining value. When double counting occurs, the security's value is counted twice, which can result in estimating the wrong value of the two companies.

Cross shareholding is very common in the corporate world. Sometimes, there can be more than 10 companies involved and it is very difficult for investors and regulators to track who owns how much.

In this project, our group choose 1 or 2 big groups of companies from Korea and China with heavy cross shareholding between each other and conduct visualization and relationship analysis on their networks using R-Shiny so that people can have a better picture of these companies’ network and easier to understand the relationship between companies.

Asia.jpg

Background

The stock markets are not the same across the countries. There is only one stock market in Korea. However, in China, there are three stock markets, one is in Shanghai, one is in Shenzhen and Hong Kong. Our group was inspired by one of a lecture on network visualization and decided to do the visualization analysis for the top stocks listed in Korea and China. We choose Samsung group in Korea.

For Korea, why our group choose Samsung group as the analysed target. There are four major reasons to choose Samsung group as our final selection.

1. Samsung group is the largest company group controlled by single owner family in Korea

2. Samsung group has 62 companies, 16 listed companies, and 46 unlisted companies, these companies are holding each other

3. Samsung Electronics in Samsung group reached the market value of USD 200 billion, which is about 22% of KOSPI capital in 2017 Q2

4. Samsung Group takes around 32% of KOSPI capital in 2017 Q2

For China stock market, our group decided to use the top 5 companies listed in the stock market which are Industrial and Commercial Bank, PetroChina, Agricultural Bank, Bank of China, and PingAn insurance.

However, after going through all the data, our group found that the company network in China is very straightforward with most of the groups comprise of two level. Several big national companies hold the share of the top 5 companies. In contrast, Samsung group has more complex network due to its large number of listed companies, as well as many loops between several stocks.

In addition, our group also had done the graphs for top 5 companies in China. There are only two levels of the whole view. The shareholders and the stock. We can't find any loop. Only several big companies hold many stocks in the market. Such as China security finance, Central Huijin Asset and so on.
Group12China2017Q1.jpg

Finally, our group will only focus on the Samsung group visualization.

Chaebol

A chaebol (from Korean jaebeol) is a large industrial conglomerate that is run and controlled by an owner or family in South Korea. A chaebol often consists of a large number of diversified affiliates, controlled by an owner whose power over the group often exceeds legal authority. The term is often used in a context similar to that of the English word "conglomerate". The term was first used in 1984. There are several dozen large Korean family-controlled corporate groups which fall under this definition.

Motivation

Cross shareholding is very important when understanding market behavior. It gets more important when a lot of companies are owned each other which may cause an event on one company result in the others. Due to its market nature in South Korea with Chaebol, it often gets very difficult to understand ownership connection between company groups. Samsung, Hyundai, LG, Lotte, and Hanjin are the biggest company group run by Chaebol family. Also, our group is interested in company structure difference between South Korea and China.

Objectives

With help of visualization tool in this module, our group will focus company network of one of the biggest company group in Korea, Samsung as well as 5 top companies in China. The objectives are as follows:

1. Introduce company structure in Korea where most of the information is available in Korean only

2. Visualize complex company structure in Samsung Group and some Groups in China

3. Visualize changes in ownership and elaborate potential impact and interesting facts

Review and critic on past works

Hyundai

Nikkei Asian Review posted research on Hyundai Group. Although it contains a general overview of Hyundai’s company structure, show much shares, percentage and historical relationship are not defined. [1]
Critic.png


Samsung

Post by Korean media, published in English translated article on Samsung network at the time straight after merger in 2015. It gives ownership but it would be nice to show before and after diagram with historical relationship. [2]
Critic2.jpg


Lotte

Lotte also has very complicated structure. Lotte’s network by Korea Herald. It shows network at single given time. Again, ownership and historical relationship is missing. [3]
Critic3.jpg


Data Preparation

Samsung Group

Data on Korean companies are usually only available in the Korean language in DART. Dart is an official Korean website where all Korean listed companies' announcements are. It is compulsory for Korean companies to announce proportions of their shareholdings in the quarterly financial statement. We used python to crawl our target data using Python. Below codes shows code we used to pull data from the financial statement in HTML code.
Group12DownloadData.JPG

Once crawled, data is translated into English with Company, Stockholder, proportion, and equity value.

Then the data need to be cleaned, the final data need to be from the following datasets.

  • Raw Node Data

Group12RawNode.jpg

  • Raw Edge Data

Group12RawEdge.jpg

  • Samsung Group Stock Value in KOSPI

Group12KOSPIVALUE.jpg

In the above diagram, the blue ones are the companies in Samsung group, the grey ones are the other companies. it can easily find the top three companies in Samsung group, SAMSUNG ELECTRONICS, SAMSUNG C&T and Samsung Life Insurance. Especially, SAMSUNG ELECTRONICS takes almost 22% of KOSPI capital. And also the Samsung Group takes 32% of KOSPI capital.

  • Trend of the KOSPI

Group12KOSPITrend.jpg

In the above diagram, the blue color is the trend of market value for Samsung group and the grey color is the trend of market value for other companies in KOSPI. Comparing the Samsung with other companies, it can easily find that the market value is more stable for Samsung group. You can also find a sharp drop in 2008 that was mainly caused by the global economic crisis in 2008.

China Top 10 Companies Data Preparation

There are more than one hundred thousands of stocks list in China Stock Market. For the analysis purpose, our group just do the top 5 companies in China. The distribution of the top 10 stocks in China displayed as below screenshot. The top five stocks list in China are Industrial and Commercial Bank, PetroChina, Agricultural Bank, Bank of China, and PingAn insurance. Three out of five are banks(ICBC, ABC, BOC). The other two is insurance and petrol. The proportions of these five companies are around 17% out of all China stock market values. The top 10 companies proportion is around 25% out of all China stock market value.

  • Top 10 companies proportion in China stock market

Group12top10.jpg

  • Industries trend in China stock market

Group12ChinaStockIndustrialOverview.JPG


For the data preparation of the China stock, we got the datasets from OneDriver in SMU library. We download the stock profiles, shareholders, equity, and other datasets and the time period is about 20 years. In the dataset, we joint all the table together and make the final sheet like the following table.

Design Framework of Cross Shareholding Application

Overview of the Samsung Group in the application

The following diagram is the overview of Samsung group network diagram.

Group12OverviewOfSamsung.jpg

R Packages

Our group uses several R shiny packages. And also we use Tableau and JMP to do the cleaning for the datasets. The following packages are the R packages used in the coding part.

  • igraph : Fundamental package allowing us to convert network data to ‘graph’ and then to perform Centrality-based network analysis.
  • visNetwork : Provides an interactive visualization of networks and offering very granular configurations,such as Node Size,Node Selection,Edge Color,Shadow etc. for making our graph neater and prettier.
  • dplyr : massging data using left_join
  • data.table : get data frame from graph for further enrichment
  • Shinydashboard : Create dashboards with 'Shiny'.

Application Overview

There are three layers and two levels of the application, the first level contains Data Manipulation Layer, Filter Interactive Layer, and Graph Render Layer. The second level is the Shiny Dashboard.

Group12ApplicationOverview.JPG

Application Workflow

For the different layers in the first level, the features are different. Please refer to the following diagram to understand the functions of each layer.
Group12ApplicationWorkflow.JPG

Application Features

There are many toolbars in the application. Such as zoom in/out, sliders, text input and so on. Please refer to the following table to get the knowledge of the functions in the toolbar.
Group12ApplicationFeatures.JPG

Shiny Dashboard

The following is the design features of the dashboard.

  • Dashboard Page

The color of the Skin of the dashboard is blue, and the header is the fixed title "CrossHolding Dashboard"

  • SideBar

The sliders are the filter which can filter data by Year and Quarter. The Text can use to filter data by Holder and Stock names. The select input can filter data Centrality Type. The radio button can filter data PreAssignedLevel Or All.

  • Body

The tableItem is Graph, the fluidRow is network graph and the tableItem you can refer to the Raw data.

  • rawData

There are Nodes and Edges in the raw data.

Demonstration

Discussion & Future Work

Although the application can display the overview of the whole Samsung group and the shareholders between the companies in the Samsung group. But there are several things need to be discussed, and also there are a few improvement for future work.

Discussion

The following details you can get from the application.

  • Introduce company structure in Korea where most of the information is available in Korean only
  • Visualize complex company structure in Samsung Group
  • Visualize changes in ownership and elaborate potential impact and interesting facts

Future Work

  • Share ownership line to show indirect holdings
  • Able to calculate actual value of ownership
  • Able to highlight and detect loop ownership

Installation guide

For online user:

  • Recommend using Chrome
  • User just needs to launch the application from [4] without the need to install anything locally.

For local user or developer:

1.Hardware Configuration:

  • Minimum 4GB RAM
  • Intel® Core™ i5-7500 Processor
  • 10GB free space
  • Microsoft Windows 10 Home,64-bit Operating System

2.R : R version 3.4.2 or later shall be installed

3.R Studio : R Studio Version 1.0.153 or later shall be installed from [5] (At the point of the project, R Studio 1.0.153 is used for creating this application)

4.R Packages: The following R packages are required, hence need to be installed in R Studio ->Tools -> Install Packages...

  • shiny dashboard
  • igraph
  • visNetwork
  • data.table
  • dplyr

5.R source code file & Data files: the source code file : <<V_Group12_CrossHolding_ShinyDashboard.R>> shall be saved in an individual folder,for example : CrossHoldingAnalytics,and the following two data files <<Final_Samsung_connection.csv>> <<Final_Samsung_holdinglevel_withht.csv>> shall be saved into the subfolder : data, for example : CrossHoldingAnalytics\data folder.


6.After the packages have finished installing, open the project files and click on "Run App".

User Guide

Users can use the application freely by clicking the url:[6]. you’ll see in the ‘Graph’ tab, a default graph based on data from 2017 Quarter 1. Then you can refer to the following steps to change what you want in the application. The overview of the application is as below screenshot. Group12ApplicationView.jpg

  • Step 1

Set Year Filter: Move the slider Group12Userguide1.jpg

  • Step 2

Set Quarter Filter: Move the slider Group12Userguide2.jpg

  • Step 3

Search by Stock Name : Type full or partial Stock name, for example, SAMSUNG, will render the graph with any Stock containing SAMSUNG in the name and the direct links Group12Userguide3.jpg

you can search multiple Stock Names delimited by |, for instance, LIFE|ELEC will filter out any Stock containing ‘LIFE’ or ‘ELEC’ in the name, and all the direct links from the filtered nodes Group12Userguide4.jpg

  • Step 4

Search by Holder Name : Type full or partial Holder name, for example, NPS, will render the graph with any Holder containing NPS in the name and the direct links Group12Userguide5.jpg


Similarly, you can also search by multiple Holder Names delimited by |, for instance, LIFE|ELEC will filter out any Stock containing ‘LIFE’ or ‘ELEC’ in the name, and all the direct links from the filtered nodes Group12Userguide6.jpg

  • Step 5

Render graph by NodeLevel: NodeLevel in the Hierarchy is determined by Centrality degree per different types(out, in, all), or determined by PreAssigned Level in Edge table. Group12Userguide7.jpg Swtich NodeLevel, you’ll see the graph redraw and nodes re-positioned to a different level based on their centrality degree.

  • Step 6

Render graph by ‘EdgeDisplay’: Default ‘EdgeDisplay’ is ‘All’ that means Edges are drawn solely as per the actual ownership between filtered Holders and Stocks. Group12Userguide8.jpg If select ‘Per PreAssignedFlag’, some Edges will be hidden from the graph per Data providers intention.

  • Step 7

‘Color-Degree’ legend: At the left side of the graph, you’ll see an auto-generated legend, which shows you the actual Centrality Degree value per different colors: Group12Userguide9.jpg

  • Step 8

“RawData’ tab: This tab just lists all raw Edge and Node data from Korea Stock market data source.

Special Thanks

Our group members would like to thank Prof. Kam Tin Seong for his patience, his kindness, and knowledge to offer the team his consultation advice. And also appreciate for the Prof's great help during the whole course.

Reference

  • Application URL: [7]
  • Rhsiny coding part: [8]
  • Dataset for shareholders in Korea: [9]
  • Dataset for shareholders in China: [10]
  • Sunsang Group: [11]
  • Hyundai: [12]
  • Samsung: [13]
  • Lotte: [14]