ADAPTIVE FILTERING AND MACHINE LEARNING METHODS IN NOISE SUPPRESSION SYSTEMS, IMPLEMENTED ON THE SoC

A. S. Shkil; O. I. Filippenko; D. Y. Rakhlis; I. V. Filippenko; A. V. Parkhomenko; V. R. Korniienko

doi:10.15588/1607-3274-2024-4-16

Authors

A. S. Shkil Kharkiv National University of Radio Electronics, Kharkiv, Ukraine
O. I. Filippenko Kharkiv National University of Radio Electronics, Kharkiv, Ukraine
D. Y. Rakhlis Kharkiv National University of Radio Electronics, Kharkiv, Ukraine
I. V. Filippenko Kharkiv National University of Radio Electronics, Kharkiv, Ukraine
A. V. Parkhomenko National University Zaporizhzhia Polytechnic, Zaporizhzhia, Ukraine
V. R. Korniienko Kharkiv National University of Radio Electronics, Kharkiv, Ukraine

DOI:

https://doi.org/10.15588/1607-3274-2024-4-16

Keywords:

embedded systems, system-on-a-chip, FPGA, adaptive filtering, digital signal processing algorithms, noise suppression algorithms, audio signals, machine learning, neural networks

Abstract

Context. Modern video conferencing systems work in different noise environments, so preservation of speech clarity and provision of quick adaptation to changes in this environment are relevant tasks. During the development of embedded systems, finding a balance between resource consumption, performance, and signal quality obtained after noise suppression is necessary. Systems on a chip allow us to use the power of both processor cores available on the hardware platform and FPGAs to perform complex calculations, which contributes to increasing the speed or reducing the load on the central SoC cores.

Objective. To conduct a comparative analysis of the noise suppression quality in audio signals by an adaptive filtering algorithm and a filtering algorithm using machine learning based on the RNNoise neural network in noise suppression devices on the technological platform SoC.

Method. Evaluation using objective metrics and spectrogram analysis using the Librosa library in Python. Neural network training and model design are performed on the basis of Python and Torch tools. The Vitis IDE package was used for the neural network implementation on the platform SoC.

Results. The analysis of two noise suppression methods using the adaptive Wiener filter and the RNNoise neural network was performed. In the considered scenarios, it was determined that the neural network shows better noise suppression results according to the analysis of spectrograms and objective metrics.

Conclusions. A comparative analysis of the effectiveness of noise suppression algorithms based on adaptive filters and a neural network was performed for scenarios with different noise environments. The results of objective SIGMOS metrics were obtained to evaluate the quality of the received audio signal. In addition, the possibility of running the RNNoise neural network on the technological platform SoC ZYNQ 7000 was verified.

Author Biographies

A. S. Shkil, Kharkiv National University of Radio Electronics, Kharkiv

PhD, Associate Professor, Associate Professor of Design Automation Department

O. I. Filippenko, Kharkiv National University of Radio Electronics, Kharkiv

PhD, Associate Professor, Associate Professor of Infocommunication Engineering Department named by V.V. Popovsky

D. Y. Rakhlis, Kharkiv National University of Radio Electronics, Kharkiv

PhD, Associate Professor, Associate Professor of Design Automation Department

I. V. Filippenko, Kharkiv National University of Radio Electronics, Kharkiv

PhD, Associate Professor, Associate Professor of Design Automation Department

A. V. Parkhomenko, National University Zaporizhzhia Polytechnic, Zaporizhzhia

PhD, Associate Professor, Associate Professor of Software Tools Department

V. R. Korniienko, Kharkiv National University of Radio Electronics, Kharkiv

PhD student of Design Automation Department

References

Lahti S., Rintala M., Hamalainen T. D. Leveraging Modern C++ in High-level Synthesis, IEEE Transactions on ComputerAided Design of Integrated Circuits and Systems, 2023, Vol. 42, № 4, pp. 1123–1132. DOI: 10.1109/TCAD.2022.3193646.

Monson J., Wirthlin M., Hutchings B. L. Optimization techniques for a high level synthesis implementation of the Sobel filter, International Conference on reconfigurable computing and FPGAs (ReConFig’13). Cancun, Mexico, 9–11 December 2013, pp. 1–6. DOI: 10.1109/ReConFig.2013.6732315.

Plugariu O., Petrica L., Pirea R., Hobincu R. Hadoop ZedBoard cluster with GZIP compression FPGA acceleration, 11th International Conference on Electronics, Computers and Artificial Intelligence (ECAI’19). Pitesti, Romania, 27–29 June 2019, pp. 1–5. DOI: 10.1109/ ecai46879.2019.9042006.

Rong X., Sun T., Zhang X., Hu Y., Zhu C. , Lu J. GTCRN: A speech enhancement model requiring ultralow computational resources, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’2024). Seoul, Korea, 14–19 April 2024, pp. 971–975. DOI: 10.1109/icassp48485.2024.10448310.

Leiva L., Torrents-Barrena J., Vazquez M. FPGA-based accelerator for AI-toolbox reinforcement learning library, IEEE Embedded Systems Letters, 2023, Vol. 15, № 2, pp. 113–116. DOI: 10.1109/les.2022.3218168.

Fan H., Wang H., Che K., Wu Z. Design of FPGA deep neural network accelerator based on high-level synthesis, 5th International Academic Exchange Conference on Science and Technology Innovation (IAECST’23). Guangzhou, China, 8–10 December 2023, pp. 163–166. DOI: 10.1109/iaecst60924.2023.10502749.

Chang Y.-W., Huang C.-C., Hwang Y.-T. A face mask detection system based on high level synthesis and hardware software codesign, IET International Conference on Engineering Technologies and Applications (IET-ICETA’22). Changhua, Taiwan, 14–16 October 2022, pp. 1–2. DOI: 10.1109/ieticeta56553.2022. 9971488.

Riazati M., Daneshtalab M., Sjödin M. , Lisper B. DeepFlexiHLS: Deep neural network flexible high-level synthesis directive generator, IEEE Nordic Circuits and Systems Conference (NorCAS’22). Oslo, Norway, 25–26 October 2022, pp. 1–6. DOI: 10.1109/norcas 57515.2022.9934617.

Doumanidis C. C. , Anagnostou C. , Arvaniti E.-S., Papadopoulou A. RNNoise-Ex: hybrid speech enhancement system based on rnn and spectral features [Electronic resource], 2021, pp. 1–5. Access mode: https://arxiv.org/pdf/2105.11813. DOI: 10.48550/arXiv. 2105.11813

Cheng B., Zhang G., Tao X. , Wang S., Wu N., Chen M. An improved real-time noise suppression method based on RNN and long-term speech information, 3rd international symposium on automation, information and computing (ICSPCC’22). Beijing, China, China, 9–11 December 2022, pp. 476–481. DOI: 10.1109/ICASSP40776.2020.9054597.

Reddy C.K.A., Beyrami E., Pool J., Cutler R., Srinivasan S., Gehrke J. A scalable noisy speech dataset and online subjective test framework, International Conference “Interspeech 2019”. Graz, Austria, 15–19 September 2019. pp. 1816–1820. DOI: 10.21437/interspeech. 2019-3087.

Chen B. , Zhou Y., Ma Y., Liu H. A new real-time noise suppression algorithm for far-field speech communication based on recurrent neural network, IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC’22). Xi’an, China, 17–19 August 2021, pp. 1–5. DOI: 10.1109/icspcc52875. 2021.9564530.

Zenodo: Python library for audio and music analysis Librosa [Electronic resource], 2024. Access mode: https://zenodo.org/records/11192913.

Scheibler R., Bezzam E., Dokmanić I. Pyroomacoustics: A Python package for audio room simulation and array processing

algorithms, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’18), Calgary. Canada, 15–20 April 2018, pp. 351–355. DOI: doi.org/10.1109/icassp.2018.8461310.

Upadhyay N., Jaiswal R. K. Single channel speech enhancement: using wiener filtering with recursive noise estimation, Proceeding of the Seventh International Conference on Intelligent Human Computer Interaction, 2016, Vol. 84, pp. 22–30. DOI: 10.1016/j.procs.2016.04.061.

Shkil A., Rahlis D., Filippenko I. , Kornijenko V., Rozhnova T. Automated design of embedded digital signal processing systems on SOC platform, Innovative technologies and scientific solutions for industries, 2024, No. 1 (27), pp. 192– 203. DOI: https://doi.org/10.30837/ITSSI.2024.27.192

Naderi B., Cutler R. Subjective evaluation of noise suppression algorithms in crowdsourcing, International Conference “Interspeech 2021”. Brno, Czechia, 30 August –3 September 2021, pp. 2132–2136. DOI: 10.21437/interspeech.2021-343.

Catalin R. N., Saabas A., Cutler R., Naderi B. , Braun S., Branets S. Speech signal improvement challenge [Electronic resource], IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’24), 2024. Access mode: https://arxiv.org/pdf/2401.14444. DOI: 10.48550/arXiv.2401.14444.

ADAPTIVE FILTERING AND MACHINE LEARNING METHODS IN NOISE SUPPRESSION SYSTEMS, IMPLEMENTED ON THE SoC

Authors

DOI:

Keywords:

Abstract

Author Biographies

A. S. Shkil, Kharkiv National University of Radio Electronics, Kharkiv

O. I. Filippenko, Kharkiv National University of Radio Electronics, Kharkiv

D. Y. Rakhlis, Kharkiv National University of Radio Electronics, Kharkiv

I. V. Filippenko, Kharkiv National University of Radio Electronics, Kharkiv

A. V. Parkhomenko, National University Zaporizhzhia Polytechnic, Zaporizhzhia

V. R. Korniienko, Kharkiv National University of Radio Electronics, Kharkiv

References

Downloads

Published

How to Cite

Issue

Section

License

Creative Commons Licensing Notifications in the Copyright Notices

Information

Current Issue

Announcements