ANLY482 AY2016-17 T2 Group7: Methodology
Chart 12: Text analytics data preparation
Firstly, we need to format the search queries to lowercase form for standardization purposes. We do that by using Tableau’s ‘LOWER()’ function, filtering out two data sets: euromonitor’s data being “euromonitor_text_data” and lawnet’s data being “lawnet_text_data”.
After which we use SAS Enterprise Miner 14.1 to carry out text analytics. We import ‘euromonitor_text_data’ and ‘lawnet_text_data’ respectively by using the File Import function and running though the text mining process in Chart 13: Text mining process.
Chart 14: Text Parsing Configuration
We configure text parsing so that Parts of Speech such as ‘Aux’, ‘Conj’, ‘Det’, ‘Interj’, ‘Part’, ‘Prep’, ‘Pron’ and Types of Attributes including ‘Num’ and ‘Punct’ are all ignored.