Convolutional Neural Networks Based Algorithm for Speech Separation

Joseph, Richard; Kalgutkar, Abhishek; Kinage, Chinmayee; Dighe, Soham; Singh, Jaskaran

doi:10.2139/ssrn.3569729

Proceedings of the 3rd International Conference on Advances in Science & Technology (ICAST) 2020

5 Pages Posted: 8 Apr 2020

Richard Joseph

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

Abhishek Kalgutkar

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

Chinmayee Kinage

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

Soham Dighe

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

Jaskaran Singh

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

Date Written: April 8, 2020

Abstract

Vocal separation is the task of recovering a set of source signals from a set of mixed signals without extensive information about the sources or the mixing process. Large amounts of audio data are available across the internet. Machine learning, together with datasets constructed around a single context, can be used to design a system capable of performing vocal separation on such data. Our algorithm uses a convolutional neural network that downsamples the input and then upsamples it to the desired output. A custom dataset composed of mixed conversations is used for training, with the expected output being the two vocal sources correctly separated.
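The downsample-then-upsample idea in the abstract can be illustrated with a minimal sketch. It is not the authors' architecture: the layer count, kernel size, stride, the use of nearest-neighbour upsampling, the two output heads, and every name below (`conv1d`, `separate`, `make_training_pair`, `params`) are assumptions made for illustration. The weights are random and untrained, and sine tones stand in for speech, so the example shows only how a mixture of length n is reduced to n/4 samples and restored to two length-n estimates.

```python
import math
import random


def conv1d(signal, kernel, stride=1):
    """Zero-padded 1-D convolution (cross-correlation form) with a stride.

    Assumes an odd kernel length; output length is ceil(len(signal) / stride).
    """
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(signal) + [0.0] * pad
    return [
        sum(k * padded[start + j] for j, k in enumerate(kernel))
        for start in range(0, len(signal), stride)
    ]


def relu(xs):
    return [max(0.0, x) for x in xs]


def upsample(xs, factor=2):
    """Nearest-neighbour upsampling: repeat each sample `factor` times."""
    return [x for x in xs for _ in range(factor)]


def separate(mixture, params):
    """Hypothetical encoder-decoder pass returning two source estimates."""
    # Encoder: two strided convolutions, each halving the time resolution.
    h = relu(conv1d(mixture, params["enc1"], stride=2))
    h = relu(conv1d(h, params["enc2"], stride=2))
    # Decoder: upsample in two steps back to the input resolution.
    h = relu(conv1d(upsample(h), params["dec1"]))
    h = upsample(h)
    # Two output heads, one per speaker.
    return conv1d(h, params["head_a"]), conv1d(h, params["head_b"])


def make_training_pair(n=64):
    """Build one (mixture, targets) pair from two synthetic sources."""
    a = [math.sin(2 * math.pi * 3 * t / n) for t in range(n)]
    b = [0.5 * math.sin(2 * math.pi * 7 * t / n) for t in range(n)]
    mixture = [x + y for x, y in zip(a, b)]
    return mixture, (a, b)


rng = random.Random(0)
layer_names = ("enc1", "enc2", "dec1", "head_a", "head_b")
params = {name: [rng.uniform(-0.5, 0.5) for _ in range(3)] for name in layer_names}

mixture, (target_a, target_b) = make_training_pair()
est_a, est_b = separate(mixture, params)
print(len(mixture), len(est_a), len(est_b))  # prints: 64 64 64
```

In this sketch the input length must be divisible by 4 for the output to match the input length. Training would adjust `params` so that `est_a` and `est_b` approach `target_a` and `target_b`; that step is omitted here.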

Keywords: Blind source separation, Cocktail party problem, Convolutional Neural Networks

Suggested Citation

Joseph, Richard and Kalgutkar, Abhishek and Kinage, Chinmayee and Dighe, Soham and Singh, Jaskaran, Convolutional Neural Networks Based Algorithm for Speech Separation (April 8, 2020). Proceedings of the 3rd International Conference on Advances in Science & Technology (ICAST) 2020, Available at SSRN: https://ssrn.com/abstract=3569729 or http://dx.doi.org/10.2139/ssrn.3569729

Richard Joseph (Contact Author)

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

India

Abhishek Kalgutkar

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

India

Chinmayee Kinage

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

India

Soham Dighe

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

India

Jaskaran Singh

University of Mumbai - Vivekanand Education Society's Institute of Technology (VESIT)

India

Paper statistics

Downloads: 210

Abstract Views: 1,014

Rank: 362,957

12 References
