Using Text Mining and Natural Language Processing (NLP) to Extract Actuarial-Related Information from Online Customer Reviews for Businesses

For the insurance industry, social and other online data offer new dimensions for understanding customers and businesses, and can yield significant new insights from both a customer-behavior and a risk perspective. These insights can drive insurance automation, underwriting efficiency, and an enhanced customer experience.

This is a real-life actuarial data science project provided by Carpe Data, an Insurtech company that provides insurance companies with next-generation data solutions for gaining deeper insight into risks.

Carpe Data will share online reviews (text data such as Yelp reviews) with IRisk Lab. Using these text data, we will investigate tasks including, but not limited to, the following:

  • Perform sentiment analysis of customer reviews.
  • Extract possible risk characteristics for businesses.
  • Refine business segmentation.

Students: Irene Chen, Wennan Huang, Stacy Shen, Boyuan Wang, Sophia Wang, Haoming Yang, Annie Zheng

Supervisors: Zhiyu (Frank) Quan, Eli O’Donohue (Carpe Data)

Graduate Supervisor: Yong Xie