Difference between revisions of "Group05 Proposal"
Ivygoh.2017 (talk | contribs) |
Ivygoh.2017 (talk | contribs) |
||
Line 30: | Line 30: | ||
==Background on Time Series Clustering== | ==Background on Time Series Clustering== | ||
− | + | ===What is Time Series Clustering?=== | |
Clustering is a data analysis technique for organizing observed data (e.g. people, things, events, brands, companies) into meaningful taxonomies, groups or clusters without advanced knowledge of the groups’ definition. Clusters are formed based on combinations of input variable, which maximizes the similarity of cases within each cluster while maximizing the dissimilarity between groups that are initially unknows. | Clustering is a data analysis technique for organizing observed data (e.g. people, things, events, brands, companies) into meaningful taxonomies, groups or clusters without advanced knowledge of the groups’ definition. Clusters are formed based on combinations of input variable, which maximizes the similarity of cases within each cluster while maximizing the dissimilarity between groups that are initially unknows. | ||
A special type of clustering is time-series clustering, which is essentially dynamic data as its feature values changes as a function of time. | A special type of clustering is time-series clustering, which is essentially dynamic data as its feature values changes as a function of time. | ||
− | + | ===Key Parameters of Time-Series Clustering=== | |
− | |||
{| class="wikitable" | {| class="wikitable" | ||
|- | |- | ||
Line 53: | Line 52: | ||
* Shape Averaging (Shape) | * Shape Averaging (Shape) | ||
|} | |} | ||
− | |||
− | |||
− | |||
==Packages Used== | ==Packages Used== | ||
This dashboard mainly uses dtwclust package from R. | This dashboard mainly uses dtwclust package from R. | ||
<b><big>dtwclust:</big></b><br> | <b><big>dtwclust:</big></b><br> | ||
− | |||
XXXX | XXXX | ||
==Objective== | ==Objective== | ||
− | + | ===To build an application for Time Series Clustering=== | |
− | |||
Time-series data are of interest due to their ubiquity in various areas ranging from science, engineering, business, economics, healthcare, to government. This dashboard aims to allow user to do time series clustering on time series related data to uncover patterns which have potential use case in the respective domain. | Time-series data are of interest due to their ubiquity in various areas ranging from science, engineering, business, economics, healthcare, to government. This dashboard aims to allow user to do time series clustering on time series related data to uncover patterns which have potential use case in the respective domain. | ||
− | + | =Reference= |
Revision as of 17:22, 20 November 2018
|
|
|
|
Contents
Abstract
XXXXX
Background on Time Series Clustering
What is Time Series Clustering?
Clustering is a data analysis technique for organizing observed data (e.g. people, things, events, brands, companies) into meaningful taxonomies, groups or clusters without advanced knowledge of the groups’ definition. Clusters are formed based on combinations of input variable, which maximizes the similarity of cases within each cluster while maximizing the dissimilarity between groups that are initially unknows. A special type of clustering is time-series clustering, which is essentially dynamic data as its feature values changes as a function of time.
Key Parameters of Time-Series Clustering
Parameters | Algorithm |
---|---|
Type |
|
Distances |
|
Centroid |
|
Packages Used
This dashboard mainly uses dtwclust package from R.
dtwclust:
XXXX
Objective
To build an application for Time Series Clustering
Time-series data are of interest due to their ubiquity in various areas ranging from science, engineering, business, economics, healthcare, to government. This dashboard aims to allow user to do time series clustering on time series related data to uncover patterns which have potential use case in the respective domain.