Focused Concept Miner (FCM): Interpretable Deep Learning for Text Exploration

47 Pages Posted: 21 Dec 2018 Last revised: 19 May 2020

See all articles by Dokyun (DK) Lee

Dokyun (DK) Lee

Carnegie Mellon University - David A. Tepper School of Business

Emaad Manzoor

Carnegie Mellon University, Students

Zhaoqi Cheng

Carnegie Mellon University - David A. Tepper School of Business

Date Written: May 20, 2018

Abstract

We introduce the Focused Concept Miner (FCM), an interpretable deep learning text mining algorithm to (1) automatically extract interpretable high-level concepts from text data, (2) focus the extracted concepts to correlate with user-provided business outcomes, and (3) quantify the concept correlational importance. FCM incorporates advances in neural language modeling in a supervised geometric learning framework explicitly configured to maximize interpretability. We evaluate FCM using a dataset of online purchases containing the reviews read by each consumer. Compared to 4 interpretable and 4 prediction-focused baselines, FCM attains higher interpretability as quantified by human judgments and automated metric, and higher recall of unique concepts as supported by several experiments. In addition, we find that the concepts extracted by FCM map to dimensions of product quality developed in prior literature, without being explicitly trained to do so. FCM achieves superior predictive performance compared to interpretable benchmarks while maintaining competitive predictive performance compared to uninterpretable blackbox classifiers. In further experiments, we evaluate FCM on textual data from online newsgroups and a crowdfunding platform, investigate the impact of supervision on concept recovery, and study the interpretability-accuracy trade-off. We conclude by discussing managerial implications, potential marketing applications, limitations, and ideas for future development.

Keywords: Interpretable Machine Learning, Deep Learning, Text Mining, Automatic Concept Extraction, Coherence, Transparent Algorithm, Augmented Hypothesis Development, XAI

JEL Classification: C38, C39, M31, M39

Suggested Citation

Lee, Dokyun (DK) and Manzoor, Emaad and Cheng, Zhaoqi, Focused Concept Miner (FCM): Interpretable Deep Learning for Text Exploration (May 20, 2018). Available at SSRN: https://ssrn.com/abstract=3304756 or http://dx.doi.org/10.2139/ssrn.3304756

Dokyun (DK) Lee (Contact Author)

Carnegie Mellon University - David A. Tepper School of Business ( email )

5000 Forbes Avenue
Pittsburgh, PA 15213-3890
United States

Emaad Manzoor

Carnegie Mellon University, Students ( email )

Pittsburgh, PA
United States

Zhaoqi Cheng

Carnegie Mellon University - David A. Tepper School of Business ( email )

Pittsburgh, PA
United States

Here is the Coronavirus
related research on SSRN

Paper statistics

Downloads
4,139
Abstract Views
15,949
rank
2,374
PlumX Metrics