An Augmented Intelligence Model to Extract Pragmatic Markers from Large Content Repositories

10 Pages Posted: 25 Jun 2019

See all articles by Vijay Perincherry

Vijay Perincherry

Ironbridge Systems; Indiggo Associates

David White

Independent

Staci Warden

Independent

Date Written: May 8, 2019

Abstract

This paper presents a novel methodology for automatically extracting pragmatic markers from large streams of texts and repositories of documents. Pragmatic markers typically are implications, innuendos, suggestions, contradictions, sarcasms or references that are difficult to define objectively, but that are subjectively evident.

Our methodology uses a two-stage augmented learning model applied to a specific use case, extracting from a repository of over 1500 Article IV country reports prepared for government officials by International Monetary Fund (IMF) staff. The model uses principles of evidence theory to train a machine to decipher the textual patterns of suggested actions for government officials and to extract those suggestions from the country reports at scale.

We demonstrate the effectiveness of the model with impressive precision and recall metrics that over time outperform even the human trainers.

Keywords: text processing, NLP, augmented intelligence, pragmatics

Suggested Citation

Perincherry, Vijay and Perincherry, Vijay and White, David and Warden, Staci, An Augmented Intelligence Model to Extract Pragmatic Markers from Large Content Repositories (May 8, 2019). Available at SSRN: https://ssrn.com/abstract=3406779 or http://dx.doi.org/10.2139/ssrn.3406779

Vijay Perincherry (Contact Author)

Ironbridge Systems ( email )

PO Box 7747
Silver Spring, MD Maryland 20907
United States

Indiggo Associates ( email )

4600 East-West Highway
Suite 800
Bethesda, MD Maryland 20814
United States

HOME PAGE: http://www.indiggolead.com

David White

Independent

United States

Staci Warden

Independent

United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
78
Abstract Views
601
Rank
816,709
PlumX Metrics