Learning Meta-Prompt with Entity-Enhanced Semantics for Few-Shot Named Entity Recognition

14 Pages Posted: 20 Dec 2023

See all articles by Yuwei Xia

Yuwei Xia

affiliation not provided to SSRN

Zhao Tong

Chinese Academy of Sciences (CAS) - Institute of Information Engineering

Liang Wang

affiliation not provided to SSRN

Qiang Liu

affiliation not provided to SSRN

Shu Wu

affiliation not provided to SSRN

Xiao-Yu Zhang

affiliation not provided to SSRN

Abstract

Recently, prompt-tuning has been proved to be surprisingly effective on few-shot tasks. Intuitively, some studies explore Few-shot Named Entity Recognition (NER) based on prompt-tuning. However, how to properly initialize and effectively learn the prompt under limited training conditions still remains significantly challenging for few-shot NER. To meet these challenges, we propose a novel Meta-Prompt with Entity-Enhanced semantics for Few-shot N}R, MPE^3 for brevity. Specifically, we first explore the importance of the named entities' semantics in the few-shot NER task. And we propose to construct prompts with entity-enhanced semantics which contain much useful prior knowledge for identifying named entities. Furthermore, to address the issue of inadequate training more substantially, we aim to train a meta-prompt that can be more effective and adaptive for few-shot NER scenarios. To achieve this, we divide the training data into many source-domain agnostic meta-tasks tailored to the characteristics of the NER problem for training. And we specially design a prompt meta-learner for training these meta-tasks. This training strategy succeeds in guiding prompts to optimize in a better direction for few-shot scenarios by the learned meta-knowledge from each meta-task. We conduct extensive experiments on three NER datasets under two different few-shot settings. Our method outperforms the current state-of-the-art model by 5.60%~13.34% and 2.41%~7.34% on average in the two different few-shot settings respectively, which validates the effectiveness and superiority of our model.

Keywords: Information Extraction, Few-shot Named Entity Recognition, Prompt-Learning, Meta-Learning

Suggested Citation

Xia, Yuwei and Tong, Zhao and Wang, Liang and Liu, Qiang and Wu, Shu and Zhang, Xiao-Yu, Learning Meta-Prompt with Entity-Enhanced Semantics for Few-Shot Named Entity Recognition. Available at SSRN: https://ssrn.com/abstract=4670639 or http://dx.doi.org/10.2139/ssrn.4670639

Yuwei Xia (Contact Author)

affiliation not provided to SSRN ( email )

No Address Available

Zhao Tong

Chinese Academy of Sciences (CAS) - Institute of Information Engineering ( email )

Liang Wang

affiliation not provided to SSRN ( email )

No Address Available

Qiang Liu

affiliation not provided to SSRN ( email )

No Address Available

Shu Wu

affiliation not provided to SSRN ( email )

No Address Available

Xiao-Yu Zhang

affiliation not provided to SSRN ( email )

No Address Available

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
42
Abstract Views
157
PlumX Metrics