A review of Noise Suppression Technology for Real-Time Speech Enhancement

Page 1

International Research Journal of Engineering and Technology (IRJET)

e-ISSN: 2395-0056

Volume: 09 Issue: 07 | July 2022

p-ISSN: 2395-0072

www.irjet.net

A review of Noise Suppression Technology for Real-Time Speech Enhancement Keshav Patta1, Hitesh Tiwari2, Mohit Kumar Tiwari3, Prof. Vaishali Gatty4 1,2,3 PG

Student, Department of M.C.A. VESIT, Mumbai, Maharashtra, India. Vaishali Gatty, Department of M.C.A, VESIT, Mumbai, Maharashtra, India. ---------------------------------------------------------------------***--------------------------------------------------------------------4Professor

Abstract - Despite noise suppression being a mature space

signals (noise, speech, alarms, etc.) from the encompassing setting. These audio signals square measure sent to a digital signal processor (DSP) with algorithms to assist separate and suppressing background signals. Also, with advanced algorithms, this could embody uninflected and enhancing speech thus you'll hear and be detected clearly in spite of the noise around you. Noise suppression AI also achieved prominent results through deep learning which can detect human voice between different noises given as an input, these results show the advancement in Artificial Intelligence in today’s world.

in the signal process, it remains hugely captivated by the fine calibration of reckoner algorithms and parameters. In this paper, we demonstrate a Real-Time learning approach to noise suppression. We tend to focus powerfully on keeping the quality as low as potential, while still achieving high-quality increased speech. A huge number of communication programs or systems have introduced next-generation noise-canceling AI as an alternative to reduce background sounds/noise in their online meetings or work and made this process well organized. Yet, noise suppression is far more prominent than active noise canceling systems out there in existing systems and provides a better result. Currently leading tech companies to choose noise suppression for communication applications to offer better result, even though these system does not yield 100% accuracy it is still far superior than the other conventional systems.

2. Voice Activity Detector (VAD) Voice activity detection (VAD) can be defined as a technique during which the presence or absence of a human voice is detected. The detection is accustomed to triggering a method. VAD has been applied in speech-controlled applications and devices like smartphones, which can be operated by voice commands. Most common VAD algorithms are based on using amplitude and are very efficient namely Short-Time average Energy (STE) and Zero-crossing Rate (ZCR) but these techniques do not work in loud or noisy environments. Using spectral energy for this process is the best approach since it works with high accuracy in high speech noisy environments, some of the spectral energy techniques are long-term spectral envelope, Mel-frequency cepstral coefficients (MFCCs), linear predictive coding coefficients (LPCC).

Key Words: Noise Detection, Noise Suppression, Deep Learning, Voice Detector, Microphone,

1. INTRODUCTION For decades we hear that AI is the future and with its help, Noise suppression has gained much interest in the field. Despite important enhancements in quality, the high-level structure has remained principally equivalent. The noise spectrum estimation technique is the backbone of noise suppression AI, it is derived by a voice activity detector (VAD). It has three component which needs correct estimators and is tough to tune. for instance, the crude initial noise estimators and also spectral estimators that supported spectral subtraction are replaced by a lot of correct noise estimators and spectral amplitude estimators. Despite the enhancements, these estimators have remained tough to style and have needed important manual calibration effort. that's why recent advances in deep learning techniques are appealing for noise suppression.

The detection process has been carried out by the technique chosen by the engineer, in the spectral energy base technique when input comes with speech + noise, it sets the idle frequency range in which speech can be recognized and set the condition to remove everything else when spectral energy is less than zero. In the next part, sum up all the parts and set a threshold for energy to recognize active and nonactive parts in the detection, and create a length filter for removing non-active segments, and in the last part, it creates a buffer on both sides (e.g. 0.5 seconds) to complete the process.

Active noise-canceling technology has been the focus of the world for many years because of its easy implementation and the technology sector see it as a compelling option, but since it has drawback on hazardous level (security of user) noise suppression technology has been seen as an only solution for that problem. Noise suppression in devices is achieved with twin microphones or quad (Omni-directional) microphones, that square measure accustomed pull in audio

© 2022, IRJET

|

Impact Factor value: 7.529

|

ISO 9001:2008 Certified Journal

|

Page 335


Turn static files into dynamic content formats.

Create a flipbook