Data Collection – Source dataset will be collected from publicly available data on [url removed, login to view] etc.
Data Cleansing – Openrefine opensource data cleansing tool will be used to clean and filter the messy semi structured data sets in to clean usable data sets.
Data Loading – Data will be loaded to Hive tables on Big Data platform.
Data Analytics/Data Mining – Data will be analyzed and mined to get insights.
Data Visualization – Final graphical output in an easy way to understand the complex data insights will be presented using data visualization tool
Present consumer complaints by year with in a financial institution?
What is the response time for each financial institution?
Which financial products has more complaints with in a financial institution?
Forecasting complaints trend within a financial institution?