default author photo

Chengshi Zheng

Chinese Academy of Sciences (CAS)

SCHOLARLY PAPERS

7

DOWNLOADS

537

TOTAL CITATIONS

3

Scholarly Papers (7)

1.

X-Tf-Gridnet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion

Number of pages: 18 Posted: 24 Oct 2023
Fengyuan Hao, Xiaodong Li and Chengshi Zheng
affiliation not provided to SSRN, Chinese Academy of Sciences (CAS) and Chinese Academy of Sciences (CAS)
Downloads 157 (474,889)

Abstract:

Loading...

Target Speaker Extraction, End-to-end, Complex spectral mapping, Time-frequency domain, Adaptive speaker embedding fusion

2.

End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation

Number of pages: 13 Posted: 06 Apr 2023
Fengyuan Hao, Xiaodong Li and Chengshi Zheng
Chinese Academy of Sciences (CAS) - State Key Laboratory of Acoustics, Chinese Academy of Sciences (CAS) and Chinese Academy of Sciences (CAS)
Downloads 97 (704,534)

Abstract:

Loading...

Speaker diarization, End-to-end, Adaptive attractor estimation, Iterative refinement, Unified training

3.

Analysis of Trade-Offs between Magnitude and Phase Estimation in Loss Functions for Speech Denoising and Dereverberation

Number of pages: 41 Posted: 13 May 2022
Xiaoxue Luo, Chengshi Zheng, Andong Li, Yuxuan Ke and Xiaodong Li
affiliation not provided to SSRN, Chinese Academy of Sciences (CAS), affiliation not provided to SSRN, affiliation not provided to SSRN and Chinese Academy of Sciences (CAS)
Downloads 90 (747,755)
Citation 2

Abstract:

Loading...

Monaural speech enhancement, time-frequency domain optimization, magnitude-phase estimation, trade-off coefficients, supervised deep learning

4.

Tabe: Decoupling Spatial and Spectral Processing with Taylor's Unfolding Method for Multi-Channel Speech Enhancement

Number of pages: 21 Posted: 04 Jul 2023
affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, Anhui University, Chinese Academy of Sciences (CAS) and Chinese Academy of Sciences (CAS)
Downloads 89 (747,755)
Citation 1

Abstract:

Loading...

Multi-channel speech enhancement, Taylor's series expansion, neural networks, multi-source information fusion

5.

A New Calibration Method for Bone Conduction Transducers Using Electrical Input Impedance

Number of pages: 28 Posted: 06 Mar 2023
Chinese Academy of Sciences (CAS), East China Normal University (ECNU), Chinese Academy of Sciences (CAS), Chinese Academy of Sciences (CAS), National Institute of Metrology, Chinese Academy of Sciences (CAS) and Chinese Academy of Sciences (CAS)
Downloads 57 (989,921)

Abstract:

Loading...

Bone conduction transducer, lumped-parameter model, electrical input impedance, mastoid impedance

6.

Hvqu$^{2}$-Vc: A One Shot Voice Conversion by Integrating Hierarchical Vector Quantization and Nested U-Net Structure

Number of pages: 39 Posted: 18 Apr 2024
Fangkun Liu, Hui Wang, Xiaodong Li and Chengshi Zheng
affiliation not provided to SSRN, Communication University of China, Chinese Academy of Sciences (CAS) and Chinese Academy of Sciences (CAS)
Downloads 27 (1,359,097)

Abstract:

Loading...

One-shot voice conversion, U$^{2}$-Net structure, Time-frequency multi-scale features, Hierarchical vector quantization

7.

Naturall2s: End-to-End High-Quality Multispeaker Lip-to-Speech Synthesis with Differential Digital Signal Processing

Number of pages: 14 Posted: 14 May 2025
affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, Chinese Academy of Sciences (CAS) and Chinese Academy of Sciences (CAS)
Downloads 20 (1,448,301)

Abstract:

Loading...

Lip-to-speech, End-to-end training, Differentiable digital signal process, Speech reconstruction