DATASET
The dataset we have been given covers the period from July 2010 to August 2012. We are also requesting more data from our client.
Currently, Natasha tracks members' purchases (in the form of dance packages) and attendance in a basic Excel spreadsheet that serves as its legacy system. Because every entry is typed in manually, the data contains redundant entries and errors. For example, Hip Hop can be written as Hip Hop or Hip Hop (Kids), although both refer to the same type of sale. Some fields are also left blank. We therefore anticipate that data cleansing will be needed to address these issues. Beyond cleansing the data for our analysis, we also aim to help design a simple data entry system that eliminates the issues mentioned above. This also supports the objective of continuity, which will benefit our client.
There are four main datasets, linked to each other through Excel lookup functions. They are as follows:
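The kind of cleansing described above could be sketched as follows. This is only an illustration, not the procedure we have settled on: the mapping table, the function names and the two sample rows are all hypothetical, and the real list of variant spellings would have to come from the spreadsheet itself.

```python
import re

# Hypothetical mapping from variant spellings to one canonical label.
# The real variants must be collected from the client's spreadsheet.
CANONICAL_TYPES = {
    "hip hop": "Hip Hop",
    "hiphop": "Hip Hop",
    "hip hop (kids)": "Hip Hop",
}


def normalise_type(raw):
    """Return a canonical class type, or None if the cell is blank."""
    if raw is None or not raw.strip():
        return None
    # Collapse repeated spaces and ignore case before looking up the label.
    key = re.sub(r"\s+", " ", raw.strip().lower())
    # Unknown labels are kept (trimmed) so that nothing is silently lost.
    return CANONICAL_TYPES.get(key, raw.strip())


def blank_fields(row):
    """List the column names whose values are missing or blank."""
    return [name for name, value in row.items()
            if value is None or not str(value).strip()]


# Made-up sample rows, shaped like the Attendance List.
rows = [
    {"Date": "2011-03-05", "Type": "Hip Hop (Kids)", "Attendant": "Member A", "Amount": "15"},
    {"Date": "2011-03-05", "Type": " hip  hop ", "Attendant": "", "Amount": "15"},
]

for row in rows:
    row["Type"] = normalise_type(row["Type"])
    print(row["Type"], blank_fields(row))
```

Keeping unknown labels unchanged, rather than dropping them, means new variants show up in the output and can be added to the mapping by hand.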
Members List
Variable Name | Description
Name | Full Name of Participant
IC | NRIC of Participant (S or G)
Mobile Number | Contact Number of Participant
Date Started | Date started attending class at Natasha
Attendance List
Variable Name | Description
Date | Date of class
Type | Type of class
Attendant | Participants attending the class
Amount | Price per participant
Faculty Attendance
Variable Name | Description
Date | Date of class
Time | Time of class
Faculty | Instructor for the class
Genre | Genre of class
Type | Course or Open class
Level | Beginner, Level I, Level II, Level III
Course No. | Internal Course No.
Venue | Studio One or Studio Two
Class size | Total class size
Trial Lesson | No. of participants on trial lesson
Unlimited | No. of participants on Unlimited packages (1 or 3 month)
Actual | No. of participants on a per class basis (Not unlimited)
Pay | Price that participants paid for class
Owe | Price that participants owe for class
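The field descriptions suggest, but do not state, that Class size is the sum of Trial Lesson, Unlimited and Actual. If that assumption holds, a simple consistency check could flag rows that were mistyped. The sketch below is illustrative only: the function names and the sample record are made up, and the rule itself should be confirmed with the client before it is used.

```python
def to_count(value):
    """Convert a spreadsheet cell to an int, treating blank cells as 0."""
    text = "" if value is None else str(value).strip()
    # Excel exports often store whole numbers as "3.0", so go through float.
    return int(float(text)) if text else 0


def class_size_gap(record):
    """Return Class size minus (Trial Lesson + Unlimited + Actual).

    Assumes the three counts partition the class; 0 means the row is consistent.
    """
    parts = (to_count(record.get("Trial Lesson"))
             + to_count(record.get("Unlimited"))
             + to_count(record.get("Actual")))
    return to_count(record.get("Class size")) - parts


# Made-up row, shaped like the Faculty Attendance table.
record = {"Class size": "12", "Trial Lesson": "2", "Unlimited": "4", "Actual": ""}
print(class_size_gap(record))
```

A non-zero result does not say which cell is wrong, only that the row needs a manual look.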
Purchases
Variable Name | Description
Date Purchases | Date of Purchase
Member No. | To extract member details from member list
Mobile Number | Contact Number of Participant
Status | Adult, student or NSF
Package | Type of dance (Hiphop, locking, popping, any)
Course/ Open & Level | Course or Open. If Course, level I or II or III
Comments | Specific type of package e.g. 01 Month Package
Paid/Not Paid | Status of payment
Amount | Amount paid
Date Start | Start Date of Package
Date End | End Date of Package
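The Excel lookup that links a purchase to the Members List through Member No. could be reproduced outside the spreadsheet roughly as follows. This is a minimal sketch under assumptions: it supposes that each member can be identified by a unique member number, and the names `members`, `purchases` and `attach_member`, as well as all sample values, are hypothetical.

```python
# Made-up Members List, keyed by an assumed unique member number.
members = {
    1: {"Name": "Member A", "Mobile Number": "0000 0001"},
    2: {"Name": "Member B", "Mobile Number": "0000 0002"},
}

# Made-up Purchases rows; member 3 is deliberately missing from the list.
purchases = [
    {"Member No.": 1, "Package": "Hiphop", "Amount": 100},
    {"Member No.": 3, "Package": "Locking", "Amount": 80},
]


def attach_member(purchase, members):
    """Return a copy of the purchase with the member's name looked up."""
    member = members.get(purchase["Member No."])
    merged = dict(purchase)
    # None marks a failed lookup, similar to #N/A from an Excel lookup.
    merged["Name"] = member["Name"] if member else None
    return merged


enriched = [attach_member(p, members) for p in purchases]
unmatched = [p["Member No."] for p in enriched if p["Name"] is None]

print(enriched)
print("Member numbers with no match:", unmatched)
```

Listing the unmatched member numbers separately makes broken links between the datasets visible, which a lookup formula in a single cell tends to hide.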