Feature Extraction and Classification Analysis of High-Dimensional Biological Data Based on Dimensionality Reduction Fusion Method

17 Pages Posted: 7 Sep 2023

See all articles by Yulong Liu

Yulong Liu

North China Electric Power University

Yankun Li

North China Electric Power University

Chenyang Wang

Chinese People's Liberation Army

Ziyu Shang

North China Electric Power University

Zhiyu Zheng

Southern University of Science and Technology

Abstract

Identification and extraction of characterized information from complex high-dimensional biological data is a very meaningful issue. The dimensionality reduction fusion method based on random forest, feature extraction and neural network is proposed to recognize and classify two datasets of mRNA and lncRNA. It is shown that the proposed fusion method achieved accurate identification/classification of cancer and non-cancer groups, and simultaneously selected identity variables that have biological relevance to lung cancer (tumor) as potential biomarkers from a large number of variables. It is considered as an effective tool and theoretical support for lung cancer identification in clinical application, and it can be extended to other kinds of cancer or biological data. Ultimately, an advanced method for feature extraction and classification analysis of high-dimensional data is provided.

Note:
Funding declaration: This study is supported by National College Students Innovation and Entrepreneurship Training Program (X2023-097).

Conflict of Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Keywords: High-dimensional data, Classification, Dimensionality reduction, Feature selection, Tumor biomarker

Suggested Citation

Liu, Yulong and Li, Yankun and Wang, Chenyang and Shang, Ziyu and Zheng, Zhiyu, Feature Extraction and Classification Analysis of High-Dimensional Biological Data Based on Dimensionality Reduction Fusion Method. Available at SSRN: https://ssrn.com/abstract=4560680 or http://dx.doi.org/10.2139/ssrn.4560680

Yulong Liu

North China Electric Power University ( email )

Yankun Li (Contact Author)

North China Electric Power University ( email )

Chenyang Wang

Chinese People's Liberation Army ( email )

Ziyu Shang

North China Electric Power University ( email )

Zhiyu Zheng

Southern University of Science and Technology ( email )

No 1088, xueyuan Rd.
Xili, Nanshan District
Shenzhen, 518055
China

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
44
Abstract Views
207
PlumX Metrics