Automotive Angels - References

From Analytics Practicum
Revision as of 15:42, 23 April 2017 by Aaron.mak.2013 (talk | contribs)
Jump to navigation Jump to search

HOME

 

PROJECT OVERVIEW

 

FINDINGS

 

PROJECT DOCUMENTATION

 

PROJECT MANAGEMENT

References

1 Pudaruth, S. (2014). Predicting the Price of Used Cars using Machine Learning Techniques. International Journal of Information & Computation Technology, 4(7), 753-764. Retrieved March 29, 2017, from http://www.ripublication.com/irph/ijict_spl/ijictv4n7spl_17.pdf

2 Peerun S., Chummun N.H., Pudaruth, S. (2015) Predicting the Price of Second-hand Cars using Artificial Neural Networks. Retrieved April 01, 2017, from https://www.academia.edu/13579173/Predicting_the_Price_of_Second-hand_Cars_using_Artificial_Neural_Networks

3 D. (2017, February 14). Car Price Prediction. Retrieved March 28, 2017, from https://github.com/dkeske/CarPrice

4 Chen, P. (2015, March 22). Predicting Car Prices Part 1: Linear Regression. Retrieved March 28, 2017, from http://www.datasciencecentral.com/profiles/blogs/predicting-car-prices-part-1-linear-regression

5 Hirsh, E., Hedlund, S., & Schweizer, M. (2003, November 30). Reality Is Perception: The Truth about Car Brands. Retrieved April 03, 2017, from https://www.strategy-business.com/article/03302?gko=fbb50

6 MarketLine Advantage (February 2017). MarketlLine Industry Profile Automotive Aftermarket in the United States

7 Biostatistics. (2006). www.biostat.jhsph.edu. Retrieved 7 February 2017, from http://www.biostat.jhsph.edu/~iruczins/teaching/jf/ch10.pdf

8 Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. Royal. Statist. Soc B., Vol. 58, No. 1, pages 267-288)

9 Sarma, K. (n.d.). Predictive modeling with SAS Enterprise Miner. 1st ed. SAS Publishing.

10 Kadane, J., & Lazar, N. (2008). Methods and Criteria for Model Selection (1st ed.). Carnegie Mellon University Department of Statistics. Retrieved from https://www.cs.cmu.edu/~tom/10-702/tr759.pdf

11 Ag, B. (2017, January 20). CLAIMS YOUR TERRITORY. Retrieved February 07, 2017, from http://www.bmw.com/com/en/newvehicles/mseries/overview.html

12 List of Chevrolet vehicles. (2017, March 10). Retrieved February 07, 2017, from https://en.wikipedia.org/wiki/List_of_Chevrolet_vehicles#Current_production_vehicles_.28North_America.29

13 List of Ford vehicles. (2017, March 22). Retrieved February 07, 2017, from https://en.wikipedia.org/wiki/List_of_Ford_vehicles

14 G. (2016, March 21). Most (and Least) Expensive States to Own a Car. Retrieved February 1, 2017, from http://www.huffingtonpost.com/gobankingrates/most-and-least-expensive_b_9516846.html


Appendix

Exploratory Data Analysis for Ford and Chevrolet

20fig16.png

Figure 16: Scatterplot of Current Selling Price and Current Suggested Retail Price for Ford

20fig17.png

Figure 17: Scatterplot of Current Selling Price and Current Suggested Retail Price for Chevrolet

20fig18.png

Figure 18: Parameter estimates based on fit of least squares for Ford

20fig19.png

Figure 19: Parameter estimates based on fit of least squares for Chevrolet

20fig20.png

Figure 20: Map of Ford Dealers’ locations by Number of Listings

20fig21.png

Figure 21: Map of Chevrolet Dealers’ locations by Number of Listings

20fig22.png

Figure 22: Box Plot of Current Selling Price by Corrected Vehicle Model for Ford

20fig23.png

Figure 23: Box Plot of Current Selling Price by Corrected Vehicle Model for Chevrolet

20fig24.png

Figure 24: Table of Corrected Vehicle Models by Number of Rows and Mean of Current Selling Price for Ford

20fig25.png

Figure 25: Table of Corrected Vehicle Models by Number of Rows and Mean of Current Selling Price for Chevrolet

Missing Value Tables

20fig26.png

Figure 26: Before removing missing values for “Current Selling Price” (BMW)

20fig27.png

Figure 27: After removing missing values for “Current Selling Price” (BMW)

20fig28.png

Figure 28: Missing Values Table for “Vehicle Make” BMW

20fig29.png

Figure 29: Before removing missing values for Current Selling Price (Ford)

20fig30.png

Figure 30: After removing missing values for Current Selling Price (Ford)

20fig31.png

Table 31: Missing Values Table for “Vehicle Make” Ford

20fig32.png

Figure 32: Before removing missing values for Current Selling Price (Chevrolet)

20fig33.png

Figure 33: After removing missing values for Current Selling Price (Chevrolet)

20fig34.png

Figure 34: Missing Values Table for “Vehicle Make” Chevrolet

Corrected Vehicle Model Formulas

Match( :modelName,
	"318", "3 Series",
	"323", "3 Series",
	"325", "3 Series",
	"325Ic", "3 Series",
	"325iC", "3 Series",
	"328I", "3 Series",
	"3Series", "3 Series",
	"524TD AUTOMATIC", "5 Series",
	"525", "5 Series",
	"525I", "5 Series",
	"528I", "5 Series",
	"530", "5 Series",
	"530It Automatic", "5 Series",
	"535I", "5 Series",
	"5Series", "5 Series",
	"6Series", "6 Series",
	"735I Automatic", "7 Series",
	"740", "7 Series",
	"7Series", "7 Series",
	"X3 25I", "X3",
	"Z4M", "Z4 M",
	If( :modelName == "i3" | :modelName == "i8",
		:modelName,
		Titlecase( :modelName )
	)
)

Figure 35: Code to rectify “Vehicle Model” (BMW)

If(
	Contains( Lowercase( :Vehicle Model ), "police" ), "Police Interceptor",
	Contains( Lowercase( :Vehicle Model ), "taurus" ), "Taurus",
	Contains( Lowercase( :Vehicle Model ), "escape" ), "Escape",
	Contains( Lowercase( :Vehicle Model ), "focus" ), "Focus",
	Contains( Lowercase( :Vehicle Model ), "explorer" ), "Explorer",
	Contains( Lowercase( :Vehicle Model ), "mustang" ), "Mustang",
	Contains( Lowercase( :Vehicle Model ), "f-150" ), "F-150",
	Contains( Lowercase( :Vehicle Model ), "tempo" ), "Tempo",
	Contains( Lowercase( :Vehicle Model ), "f-250" ), "F-250",
	Contains( Lowercase( :Vehicle Model ), "f250" ), "F-250",
	Contains( Lowercase( :Vehicle Model ), "econoline" ), "Econoline",
	Contains( Lowercase( :Vehicle Model ), "econo" ), "Econoline",
	Contains( Lowercase( :Vehicle Model ), "c-max" ), "C-Max",
	Contains( Lowercase( :Vehicle Model ), "crown victoria" ), "Crown Victoria",
	Contains( Lowercase( :Vehicle Model ), "crownvictoria" ), "Crown Victoria",
	Contains( Lowercase( :Vehicle Model ), "crown vic" ), "Crown Victoria",
	Contains( Lowercase( :Vehicle Model ), "club wagon" ), "Club Wagon",
	Contains( Lowercase( :Vehicle Model ), "conventional" ), "Conventional",
	Contains( Lowercase( :Vehicle Model ), "aspire" ), "Aspire",
	Contains( Lowercase( :Vehicle Model ), "bronco" ), "Bronco",
	Contains( Lowercase( :Vehicle Model ), "cab forw" ), "Cab Forw",
	Contains( Lowercase( :Vehicle Model ), "e-series" ), "E-Series",
	Contains( Lowercase( :Vehicle Model ), "escort" ), "Escort",
	Contains( Lowercase( :Vehicle Model ), "festiva" ), "Festiva",
	Contains( Lowercase( :Vehicle Model ), "e150" ), "E-150",
	Contains( Lowercase( :Vehicle Model ), "e250" ), "E-250",
	Contains( Lowercase( :Vehicle Model ), "e350" ), "E-350",
	Contains( Lowercase( :Vehicle Model ), "e450" ), "E-450",
	Contains( Lowercase( :Vehicle Model ), "escort" ), "Escort",
	Contains( Lowercase( :Vehicle Model ), "expedition" ), "Expedition",
	Contains( Lowercase( :Vehicle Model ), "f150" ), "F-150",
	Contains( Lowercase( :Vehicle Model ), "f100" ), "F-100",
	Contains( Lowercase( :Vehicle Model ), "f350" ), "F-350",
	Contains( Lowercase( :Vehicle Model ), "f450" ), "F-450",
	Contains( Lowercase( :Vehicle Model ), "f550" ), "F-550",
	Contains( Lowercase( :Vehicle Model ), "f650" ), "F-650",
	Contains( Lowercase( :Vehicle Model ), "f600" ), "F-600",
	Contains( Lowercase( :Vehicle Model ), "f700" ), "F-700",
	Contains( Lowercase( :Vehicle Model ), "f750" ), "F-750",
	Contains( Lowercase( :Vehicle Model ), "f800" ), "F-800",
	Contains( Lowercase( :Vehicle Model ), "freestar" ), "Freestar",
	Contains( Lowercase( :Vehicle Model ), "fusion" ), "Fusion",
	Contains( Lowercase( :Vehicle Model ), "probe" ), "Probe",
	Contains( Lowercase( :Vehicle Model ), "shelby" ), "Shelby",
	Contains( Lowercase( :Vehicle Model ), "super duty" ), "Super Duty",
	Contains( Lowercase( :Vehicle Model ), "thunderbird" ), "Thunderbird",
	Contains( Lowercase( :Vehicle Model ), "transit" ), "Transit",
	Contains( Lowercase( :Vehicle Model ), "windstar" ), "Windstar",
	Titlecase( :Vehicle Model )
)

Figure 36: Code to rectify “Vehicle Model” (Ford)

If(
	Contains( Lowercase( :modelName ), "van" ), "Van",
	Contains( Lowercase( :modelName ), "v20" ), "V20",
	Contains( Lowercase( :modelName ), "v10" ), "V10",
	Contains( Lowercase( :modelName ), "trax" ), "Trax",
	Contains( Lowercase( :modelName ), "traverse" ), "Traverse",
	Contains( Lowercase( :modelName ), "trailblazer" ), "TrailBlazer",
	Contains( Lowercase( :modelName ), "tahoe" ), "Tahoe",
	Contains( Lowercase( :modelName ), "suburban2500" ), "Suburban",
	Contains( Lowercase( :modelName ), "suburban1500" ), "Suburban",
	Contains( Lowercase( :modelName ), "suburban 2500" ), "Suburban",
	Contains( Lowercase( :modelName ), "suburban 20" ), "Suburban",
	Contains( Lowercase( :modelName ), "suburban 1500" ), "Suburban",
	Contains( Lowercase( :modelName ), "suburban 10" ), "Suburban",
	Contains( Lowercase( :modelName ), "sportvan" ), "Sportvan",
	Contains( Lowercase( :modelName ), "spark" ), "Spark",
	Contains( Lowercase( :modelName ), "sonic" ), "Sonic",
	Contains( Lowercase( :modelName ), "silverado" ), "Silverado",
	Contains( Lowercase( :modelName ), "s-10" ), "S-10",
	Contains( Lowercase( :modelName ), "s10" ), "S-10",
	Contains( Lowercase( :modelName ), "r10" ), "R10",
	Contains( Lowercase( :modelName ), "r/v" ), "R10",
	Contains( Lowercase( :modelName ), "p30" ), "P30",
	Contains( Lowercase( :modelName ), "tiltmaster" ), "P30",
	Contains( Lowercase( :modelName ), "monte" ), "Monte Carlo",
	Contains( Lowercase( :modelName ), "metro" ), "METRO",
	Contains( Lowercase( :modelName ), "malibu" ), "Malibu",
	Contains( Lowercase( :modelName ), "lumina" ), "Lumina",
	Contains( Lowercase( :modelName ), "kodiak" ), "K30",
	Contains( Lowercase( :modelName ), "k20" ), "K20",
	Contains( Lowercase( :modelName ), "k2500" ), "K2500",
	Contains( Lowercase( :modelName ), "k1500" ), "K1500",
	Contains( Lowercase( :modelName ), "k10" ), "K10",
	Contains( Lowercase( :modelName ), "impala" ), "Impala",
	Contains( Lowercase( :modelName ), "hhr" ), "HHR",
	Contains( Lowercase( :modelName ), "g-" ), "Van",
	Contains( Lowercase( :modelName ), "g30" ), "Van",
	Contains( Lowercase( :modelName ), "g20" ), "Van",
	Contains( Lowercase( :modelName ), "express" ), "Express",
	Contains( Lowercase( :modelName ), "elcamino" ), "El Camino",
	Contains( Lowercase( :modelName ), "military" ), "Van",
	Contains( Lowercase( :modelName ), "c-" ), "Van",
	Contains( Lowercase( :modelName ), "cruze" ), "Cruze",
	Contains( Lowercase( :modelName ), "corsica" ), "Corsica",
	Contains( Lowercase( :modelName ), "chevette" ), "Chevette",
	Contains( Lowercase( :modelName ), "cavalier" ), "Cavalier",
	Contains( Lowercase( :modelName ), "captiva" ), "Captiva Sport",
	Contains( Lowercase( :modelName ), "caprice" ), "Caprice",
	Contains( Lowercase( :modelName ), "c8500" ), "Silverado",
	Contains( Lowercase( :modelName ), "c7500" ), "Silverado",
	Contains( Lowercase( :modelName ), "c6500" ), "Silverado",
	Contains( Lowercase( :modelName ), "c5c042" ), "Silverado",
	Contains( Lowercase( :modelName ), "c5500" ), "Silverado",
	Contains( Lowercase( :modelName ), "c4500" ), "Silverado",
	Contains( Lowercase( :modelName ), "c3500" ), "Silverado",
	Contains( Lowercase( :modelName ), "c30" ), "Silverado",
	Contains( Lowercase( :modelName ), "c2500" ), "Silverado",
	Contains( Lowercase( :modelName ), "c20" ), "Silverado",
	Contains( Lowercase( :modelName ), "c1500" ), "Silverado",
	Contains( Lowercase( :modelName ), "c10" ), "Silverado",
	Contains( Lowercase( :modelName ), "c/k" ), "C/K Pickup",
	Contains( Lowercase( :modelName ), "c%2fk" ), "C/K Pickup",
	Contains( Lowercase( :modelName ), "blazer" ), "Blazer",
	Contains( Lowercase( :modelName ), "avalanche" ), "Avalanche",
	Contains( Lowercase( :modelName ), "beretta" ), "Beretta",
	Contains( Lowercase( :modelName ), "aveo" ), "Aveo",
	Contains( Lowercase( :modelName ), "astro" ), "Astro",
	Titlecase( :modelName )
)

Figure 37: Code to rectify “Vehicle Model” (Chevrolet)

Comparisons of Distributions of Vehicle Model and Corrected Vehicle Model

20fig38.png

Figure 38: Before re-categorising values for Vehicle Model (BMW)

20fig39.png

Figure 39: After re-categorising values for Vehicle Model (BMW)

20fig40.png

Figure 40: Before re-categorising values for Vehicle Model (Ford)

20fig41.png

Figure 41: After re-categorising values for Vehicle Model (Ford)

20fig42.png

Figure 42: Before re-categorising values for Vehicle Model (Chevrolet)

20fig43.png

Figure 43: After re-categorising values for Vehicle Model (Chevrolet)

Vehicle Category Formulas

Match( :Corrected Vehicle Model,
	"C650Gt", "Not Car",
	"F700Gs", "Not Car",
	"F800Gs", "Not Car",
	"F800Gt", "Not Car",
	"G650Gs", "Not Car",
	"K1100Rs", "Not Car",
	"K1200Gt", "Not Car",
	"K1200Lt", "Not Car",
	"K1600 Gtl", "Not Car",
	"R1150Gs", "Not Car",
	"R1200 Rs", "Not Car",
	"R1200Gs", "Not Car",
	"R1200Rt", "Not Car",
	"S1000Rr", "Not Car",
	If( :Corrected Vehicle Model == "Null",
		Empty(),
		"Car"
	)
)

Figure 44: Formula to create “Vehicle Category” column (BMW)

If(
:Corrected Vehicle Model == "Taurus" | 
:Corrected Vehicle Model == "Aerostar" | 
:Corrected Vehicle Model == "Aspire" | 
:Corrected Vehicle Model == "Bronco" | 
:Corrected Vehicle Model == "Contour" | 
:Corrected Vehicle Model == "Crown Victoria" | 
:Corrected Vehicle Model == "Edge" | 
:Corrected Vehicle Model == "Escape" | 
:Corrected Vehicle Model == "Escort" | 
:Corrected Vehicle Model == "Excursion" | 
:Corrected Vehicle Model == "Aerostar" | 
:Corrected Vehicle Model == "Explorer" | 
:Corrected Vehicle Model == "Festiva" | 
:Corrected Vehicle Model == "Fiesta" | 
:Corrected Vehicle Model == "Five Hundred" | 
:Corrected Vehicle Model == "Flex" | 
:Corrected Vehicle Model == "Focus" | 
:Corrected Vehicle Model == "Freestar" | 
:Corrected Vehicle Model == "Freestyle" | 
:Corrected Vehicle Model == "Fusion" | 
:Corrected Vehicle Model == "Gt" | 
:Corrected Vehicle Model == "Ltd" | 
:Corrected Vehicle Model == "Probe" | 
:Corrected Vehicle Model == "Shelby" | 
:Corrected Vehicle Model == "Tempo" | 
:Corrected Vehicle Model == "Thunderbird" | 
:Corrected Vehicle Model == "Windstar",
	"Car",
	"Not Car"
)

Figure 45: Formula to create “Vehicle Category” column (Ford)

If(
	:Corrected Vehicle Model == "Volt" | 
:Corrected Vehicle Model == "Venture" |
	:Corrected Vehicle Model == "Uplander" | 
:Corrected Vehicle Model == "Trax" |
	:Corrected Vehicle Model == "Traverse" | 
:Corrected Vehicle Model == "TrailBlazer" |
	:Corrected Vehicle Model == "Tracker" | 
:Corrected Vehicle Model == "Tahoe" |
	:Corrected Vehicle Model == "Suburban" | 
:Corrected Vehicle Model == "Ssr" |
	:Corrected Vehicle Model == "Ss" | 
:Corrected Vehicle Model == "Spark" |
	:Corrected Vehicle Model == "Sonic" | 
:Corrected Vehicle Model == "Prizm" |
	:Corrected Vehicle Model == "Monte Carlo" | 
:Corrected Vehicle Model == "METRO" |
	:Corrected Vehicle Model == "Malibu" | 
:Corrected Vehicle Model == "Lumina" |
	:Corrected Vehicle Model == "HHR" | 
:Corrected Vehicle Model == "Equinox" |
	:Corrected Vehicle Model == "El Camino" | 
:Corrected Vehicle Model == "Cruze" |
	:Corrected Vehicle Model == "Corvette" | 
:Corrected Vehicle Model == "Corsica" |
	:Corrected Vehicle Model == "Cobalt" | 
:Corrected Vehicle Model == "Cavalier" |
	:Corrected Vehicle Model == "Captiva Sport" | 
:Corrected Vehicle Model == "Caprice" |
	:Corrected Vehicle Model == "Camaro" | 
:Corrected Vehicle Model == "Bolt Ev" |
	:Corrected Vehicle Model == "Beretta" | 
:Corrected Vehicle Model == "Aveo",
	"Car",
	"Not Car"
)

Figure 46: Formula to create “Vehicle Category” column (Chevrolet)

SAS Enterprise Miner Process Flows

20fig47.png

Figure 47: Ford Process Flow in SAS Enterprise Miner

20fig48.png

Figure 48: Chevrolet Process Flow in SAS Enterprise Miner

20fig49.png

Figure 49: BMW Process Flow in SAS Enterprise Miner

Residual Plots of Best Performing Regression Model

20fig50.png

Figure 50: Ford Focus Residual Plot of “Current Selling Price” after Forward Regression

20fig51.png

Figure 51: Ford Fusion Residual Plot of “Current Selling Price” after Forward Regression

20fig52.png

Figure 52: Ford Explorer Residual Plot of “Current Selling Price” after Stepwise Regression

20fig53.png

Figure 53: Ford Edge Residual Plot of “Current Selling Price” after Stepwise Regression

20fig54.png

Figure 54: BMW 3 Series Residual Plot of “Current Selling Price” after Forward Regression

20fig55.png

Figure 55: BMW 5 Series Residual Plot of “Current Selling Price” after Stepwise Regression

20fig56.png

Figure 56: BMW X3 Model Residual Plot of “Current Selling Price” after Backward Regression

20fig57.png

Figure 57: BMW X5 Model Residual Plot of “Current Selling Price” after Stepwise Regression

20fig58.png

Figure 58: Chevrolet Equinox Model Residual Plot of “Current Selling Price” after Stepwise Regression

20fig59.png

Figure 59: Chevrolet Malibu Model Residual Plot of “Current Selling Price” after Forward Regression

20fig60.png

Figure 60: Chevrolet Cruze Model Residual Plot of “Current Selling Price” after Forward Regression

Variables Selection Table

20fig61.png

Figure 61: Variables Selected and Excluded from Regression Analysis