Técnica rápida para geração de aurilizações utilizando redes neurais artificiais

Filipe Otsuka Taminato; Roberto  A. Tenenbaum; Viviane S. G. Melo

doi:10.55753/aev.v33e50.84

Authors

Filipe Otsuka Taminato Laboratório de Instrumentação em Dinâmica, Acústica e Vibrações – LIDAV, Programa de Pós-Graduação em Modelagem Computacional, Universidade do Estado do Rio de Janeiro
Roberto A. Tenenbaum Programa de Pós-Graduação em Engenharia Civil, Universidade Federal de Santa Maria https://orcid.org/0000-0002-5268-3849
Viviane S. G. Melo Engenharia Acústica, Programa de Pós-Graduação em Engenharia Civil, Universidade Federal de Santa Maria https://orcid.org/0000-0002-2354-6167

DOI:

https://doi.org/10.55753/aev.v33e50.84

Keywords:

acoustic virtual reality, room auralization, BRIRs generation, artificial neural networks, room acoustic simulation, articulation index

Abstract

The main goal in development of numerical techniques in acoustic virtual reality systems and production of reliable auralizations is to reduce the computational cost and, at the same time, to guarantee the sound quality. In this paper, a new technique for modeling head-related transfer functions are presented. Artificial neural networks of the radial basis functions type are used. A set of these networks is trained and tested to cover the entire auditory space around the head. Each neural network for a given direction has as input the spectrum of the sound ray that reaches the receiver and, as output, the filtered head-related impulse response, for the corresponding direction working directly in the time domain and circumventing the need for convolutions with a computational cost reduction of 90%. The proposed technique is compared with the convolution method in both the time and frequency domain. The results show the efficiency of the proposed technique, with correlation values very close to one. To validate the result, preliminary tests using articulation indices to compare speech intelligibility in an actual and virtual room were conducted, with fully satisfactory results.

References

VORLÄNDER, M. Auralization: Fundamentals of Acoustics, Modelling, Simulation, Algorithms and Acoustic Virtual Reality. Berlin: Springer-Verlag, 2008. doi: 10.1121/1.2908264 DOI: https://doi.org/10.1121/1.2908264

KLEINER, M.; DALENBÄCK, B.I.; SVESSON, P. Auralization - an overview. J. audio Eng. Soc, 41, p.861, 1993.

BLAUERT, J. Spatial Hearing. Cambridge: The MIT Press, 1997. doi: 10.7551/mitpress/6391.001.0001 DOI: https://doi.org/10.7551/mitpress/6391.001.0001

RINDEL, J. The use computer modeling in room acoustics. Journal of Vibro engeneering, 4(3):41–72, 2000.

TENENBAUM, R.A.; CAMILO, T.S.; TORRES, J.C.B. and GERGES, S.Y. Hybrid method for numerical simulation of room acoustic: Part 1 – theorical and numerical aspects. J. Braz. Soc. Mech. Sci. Engin., 29(2):211–221, 2007a. DOI: https://doi.org/10.1590/S1678-58782007000200012

BORK, I. Report on the 3rd Round Robin on room acoustical computer simulation - Part II: Calculations. Acta Acustica united with Acustica, 91(4):753–763, 2005.

SAVIOJA, L.; SVENSSON, U.P. Overview of geometrical room acoustic modeling techniques. J. Acoust. Soc. Am, 138(2):708–730, 2015 doi: 10.1121/1.4926438 DOI: https://doi.org/10.1121/1.4926438

EMBRECHTS, J. Randomly traced sound ray techniques. Acustica, 51:285–295, 1982.

KULOWSKI, A. Algorithmic representation of the ray tracing technique. Applied Acoustics, 18:449–469, 1984. doi: 10.1016/0003-682X(85)90024-6 DOI: https://doi.org/10.1016/0003-682X(85)90024-6

ONDET, M; BARBRY, J.L. Modeling of sound propagation in fitted workshops using ray tracing. J. Acoust. Soc. Am., 85(2):787–796, 1989. doi: 10.1121/1.397551 DOI: https://doi.org/10.1121/1.397551

FARINA, A. RAMSETE – a new pyramid tracer for medium and large-scale acoustic problems. Proceedings of the Euronoise, Lyon, 1995

ALLEN, J.B.; BERKLEY, D.A. Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am., 65, p. 943, 1979. doi: 10.1121/1.382599 DOI: https://doi.org/10.1121/1.382599

DALENBÄCK, B; KLEINER, M.; SVENSON, P. A macroscopic view of diffuse reflection. J. áudio Eng. Soc. 42:793–807, 1994.

KURZINS, E.; FRICKE, F. The prediction of sound fields in non-diffuse spaces by random walk approach. J. Sound and Vib., 81(4):549–564, 1982. doi: 10.1016/0022-460X(82)90296-6 DOI: https://doi.org/10.1016/0022-460X(82)90296-6

ALARCÃO, D.; BENTO COELHO, J.L.; TENENBAUM, R.A. On modeling of room acoustics by a sound energy transition approach. Proceedings of EEA Symposium on Architectural Acoustics, 2000.

GARDNER, B.; MARTIN, K. HRTF Measurements of a KEMAR Dummy-Head Microphone. J. Acoust. Soc. Am., vol. 97, n. 6, pp. 3907– 3908, 1995. DOI: https://doi.org/10.1121/1.412407

BRINKMANN, F.; LINDAU, A.; WEINZIERL, S.; VAN DER PAR, S.; OPDAM, R.; VORLÄNDER, M. The FABIAN head-related transfer function database. doi: 10.14279/depositonce-5718.2, 2017.

MILLS, A. On the minimum audible angle. J. Acoust. Soc. Am., 30:237–246, 1958. doi: 10.1121/1.1909553 DOI: https://doi.org/10.1121/1.1909553

KISTLER, D. J.; WIGHTMAN, F.L. A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. J. Acoust. Soc. Am. 91(3), 1637–1647, 1992. doi: 10.1121/1.402444 DOI: https://doi.org/10.1121/1.402444

TORRES, J.C.B; PETRAGLIA, M.R.; TENENBAUM, R.A. An efficient wavelet based HRTF model for auralization. Acta Acustica united with Acustica, 90(1):108−120, 2004.

HU, H.; ZHOU, L.; MA, H.; WU, Z. HRTF personalization based on artificial neural network in individual virtual auditory space. Applied Acoustics, 69(2):163–172, 2008. doi: 10.1016/j.apacoust.2007.05.007 DOI: https://doi.org/10.1016/j.apacoust.2007.05.007

TENENBAUM, R.A.; TAMINATO, F.O.; MELO,V.S.G.; TORRES, J.C.B. Auralization generated by modeling HRIRs with artificial neural networks and its validation using articulation tests. Applied Acoustics, 130, pp. 260–269, 2018. doi: 10.1016/j.apacoust.2017.09.025 DOI: https://doi.org/10.1016/j.apacoust.2017.09.025

BROOMHEAD, D.; LOWE, D. Multivariable functional interpolation and adaptive networks. Complex Systems, 2:321–355, 1988.

LI, L.; HUANG, Q. HRTF personalization modeling based on RBF neural network. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, p. 3707–3710, Vancouver, Canada, 2013. doi: 10.1109/ICASSP.2013.6638350 DOI: https://doi.org/10.1109/ICASSP.2013.6638350

TENENBAUM, R.A.; CAMILO, T.S.; TORRES, J.C.B. and STUTZ, L.T. Hybrid method for numerical simulation of room acoustic: Part 2 – validation of the computational code RAIOS 3. J. Braz. Soc. Mech. Sci. Engin., 29(2):223–231, 2007b. doi: 10.1590/S1678-58782007000200013 DOI: https://doi.org/10.1590/S1678-58782007000200013

FERNANDES, K.M.; TENENBAUM, R.A.; STUTZ, L.T. Problema inverso em propagação de ondas em sólidos com aplicação de método híbrido de otimização. Em: Técnicas de Inteligência Computacional com Aplicações em Problemas Inversos de Engenharia. Editora Omnipax, Curitiba, pp. 51–66, 2014. DOI: https://doi.org/10.7436/2014.tica.05

RAHIM, M.G., GOODYEAR, C.C. and KLEIJN, W.B. On the use of neural networks in articulatory speech synthesis. J. Acoust. Soc. Am., v. 93(2):1109–1121, 1992. DOI: https://doi.org/10.1121/1.405559

HAYKIN, S. Neural Networks and Learning Machines. 3rd Edition, Prentice Hall, New Jersey, 2009.

TAMINATO, F.O. Redes neurais artificiais aplicadas à modelagem de respostas impulsivas associadas à cabeça humana para gerar aurilização. Tese de doutorado, Programa de Pós-graduação em Modelagem Computacional, Universidade do Estado do Rio de Janeiro, 2018.

TENENBAUM, R.A.; TAMINATO, F.O.; MELO V.S.G. Fast auralization using radial basis functions type of artificial neural network techniques. Applied Acoustics, 157, 106993, 2020. doi: 10.1016/j.apacoust.2019.07.041 DOI: https://doi.org/10.1016/j.apacoust.2019.07.041

A Ground Truth for Room Acoustic Simulation. Outubro 2019. doi.org/10.14279/depositonce-6726

HODGSON, M.; YORK, N.; YANG, W.; BLISS, M. Comparison of predicted, measured, and auralized sound fields with respect to speech intelligibility in classrooms using CATT-acoustic and ODEON. Acta Acustica united with Acustica, 94(6):883–890, 2008. doi: 10.3813/AAA.918106 DOI: https://doi.org/10.3813/AAA.918106

MELO, V. S. G.; TENENBAUM, R. A.; NARANJO, J. F. L. A new approach to validate computer modeling auralizations by using articulation indexes. Acústica & Vibrações, Vol. 46, pp. 1−8, 2014. doi: 10.55753/aev.v29e46.125

POLLOW, M.; BEHLER, G. Variable directivity for platonic sound sources based on spherical harmonics optimization. Acta Acustica united with Acustica, 95(6):1082–1092, 2009. doi: 10.3813/AAA.918240 DOI: https://doi.org/10.3813/AAA.918240

KLEIN, J.; POLLOW, M.; VORLÄNDER, M. Optimized spherical sound source for auralization with arbitrary source directivity. Proceedings of the EAA Joint Symposium on Auralization and Ambisonics, p.56–61, Berlin, 2014. doi: 10.14279/depositonce-10

LINDAU, A.; ERBES, V.; LEPA, S.; MAEMPEL, H.J.; BRINKMANN, F.; WEINZIERL, S. A spatial audio quality inventory for virtual acoustic environments (SAQI). Acta Acustica united with Acustica, 100(5):984–994, 2014. doi: 10.3813/AAA.918778 DOI: https://doi.org/10.3813/AAA.918778

MELO, V. S. G.; LIMA, P. G.; SANTOS, T. C.; TENENBAUM, R. A. Validação de realidade virtual acústica via testes de articulação em salas ruidosas e reverberantes. Acústica & Vibrações, 49, pp. 51–57, 2017. doi: 10.55753/aev.v32e49.95 DOI: https://doi.org/10.55753/aev.v32e49.95

Fast technique for auralization generation using artificial neural networks

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Sobrac

Make a Submission

Language

Information

AeV - Lado direito