Sure: Screening Unlabeled Samples for Reliable Negative Samples Based on Reinforcement Learning

30 Pages Posted: 29 Jul 2022

See all articles by Ying Li

Ying Li

Jilin University (JLU)

Hang Sun

Jilin University (JLU)

Wensi Fang

Jilin University (JLU)

Qin Ma

Ohio State University (OSU) - Department of Biomedical Informatics

Siyu Han

Technische Universität München (TUM)

Rui Wang-Sattler

affiliation not provided to SSRN

Wei Du

Jilin University (JLU)

Qiong Yu

Jilin University (JLU)

Abstract

For many classification tasks, only experimentally validated positive samples are available, and experimentally validated negative samples are not recorded. The lack of negative samples poses a great challenge for using supervised machine learning. To address this problem, we propose a novel deep reinforcement learning based model to screen reliable negative samples from unlabeled samples, named SURE. SURE has two modules: sample selector and sample inspector. The sample selector screens reliable negative samples from unlabeled samples by two reinforcement strategies. The sample inspector classifies samples and provides rewards to the sample selector. In this paper, we focus on one popular issue in the field of bioinformatics: the ncRNA-protein interaction (NPI) prediction task, which lacks reliable negative samples. Thirty datasets for NPI prediction are used to test the screening effect of SURE. The Experimental results show that our model has a robust negative sample screening capability and is superior to all outstanding sample screening methods used in the NPI prediction task. In addition, we refine 5 NPI datasets containing reliable negative samples screened by SURE, and a webserver (www.csbg-jlu.info/sure) is available offering the NPI prediction refined by SURE.

Keywords: negative sample screening, Deep reinforcement learning, ncRNA-protein interaction

Suggested Citation

Li, Ying and Sun, Hang and Fang, Wensi and Ma, Qin and Han, Siyu and Wang-Sattler, Rui and Du, Wei and Yu, Qiong, Sure: Screening Unlabeled Samples for Reliable Negative Samples Based on Reinforcement Learning. Available at SSRN: https://ssrn.com/abstract=4176480

Ying Li (Contact Author)

Jilin University (JLU) ( email )

China

Hang Sun

Jilin University (JLU) ( email )

China

Wensi Fang

Jilin University (JLU) ( email )

China

Qin Ma

Ohio State University (OSU) - Department of Biomedical Informatics ( email )

Columbus, OH
United States

Siyu Han

Technische Universität München (TUM) ( email )

Rui Wang-Sattler

affiliation not provided to SSRN

Wei Du

Jilin University (JLU) ( email )

China

Qiong Yu

Jilin University (JLU) ( email )

China

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
42
Abstract Views
214
PlumX Metrics