Sound image externalization for headphone based real-time 3D audio

Yougen YUAN, Lei XIE, Zhong-Hua FU, Ming XU, Qi CONG

PDF(469 KB)
PDF(469 KB)
Front. Comput. Sci. ›› 2017, Vol. 11 ›› Issue (3) : 419-428. DOI: 10.1007/s11704-016-6182-2
RESEARCH ARTICLE

Sound image externalization for headphone based real-time 3D audio

Author information +
History +

Abstract

3D audio effects can provide immersive auditory experience, but we often face the so-called in-head localization (IHL) problem in headphone sound reproduction. To address this problem, we propose an effective sound image externalization approach. Specifically, we consider several important factors related to sound propagation, which include image-source model based early reflections with distance decay, wall absorption and air absorption, late reverberation and other dynamic factors like head movement. We apply our sound image externalization approach to a headphone based real-time 3D audio system. Subjective listening tests show that the sound image externalization performance is significantly improved and the sound source direction is preserved as well. A/B preference test further shows that, as compared with a recent popular approach, the proposed approach is mostly preferred by the listeners.

Keywords

Keywords 3D audio / in-head localization (IHL) / headphone reproduction / sound image externalization / reflections

Cite this article

Download citation ▾
Yougen YUAN, Lei XIE, Zhong-Hua FU, Ming XU, Qi CONG. Sound image externalization for headphone based real-time 3D audio. Front. Comput. Sci., 2017, 11(3): 419‒428 https://doi.org/10.1007/s11704-016-6182-2

References

[1]
BegaultD, WenzelE M, GodfroyM, Miller J D, AndersonM R . Applying spatial audio to human interfaces: 25 years of nasa experience. In: Proceedings of the 40th International Conference on Spatial Audio: Sense the Sound of Space. 2010
[2]
SekiY, SatoT. A training system of orientation and mobility for blind people using acoustic virtual reality.IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2011, 19(1): 95–104
CrossRef Google scholar
[3]
XieB. Head-Related Transfer Function and Virtual Auditory Display. 2ed ed. Boca Raton, FL: J. Ross Publishing, 2013
[4]
TooleF E. In-head localization of acoustic images. The Journal of the Acoustical Society of America, 1970, 48(4B): 943–949
CrossRef Google scholar
[5]
WightmanF L, Kistler D J.Headphone simulation of free-field listening. II: Psychophysical validation.The Journal of the Acoustical Society of America, 1989, 85(2): 868–878
CrossRef Google scholar
[6]
WeinrichS G. Improved externalization and frontal perception of headphone signals. In: Proceedings of Audio Engineering Society Convention 92. 1992
[7]
HartmannW M, Wittenberg A. On the externalization of sound images. The Journal of the Acoustical Society of America, 1996, 99(6): 3678–3688
CrossRef Google scholar
[8]
DurlachN I, Rigopulos A, PangX D , WoodsW S, Kulkarni A, ColburnH S , WenzelE M. On the externalization of auditory images. Presence: Teleoperators & Virtual Environments, 1992, 1(2): 251–257
CrossRef Google scholar
[9]
LoomisJ M, HebertC, CicinelliJ G . Active localization of virtual sounds. The Journal of the Acoustical Society of America, 1990, 88(4): 1757–1764
CrossRef Google scholar
[10]
BegaultD R. Perceptual effects of synthetic reverberation on threedimensional audio systems. Journal of the Audio Engineering Society, 1992, 40(11): 895–904
[11]
LiitolaT. Headphone sound externalization. Dissertation for the Doctoral Degree. Espoo: Helsinki University of Technology, 2006
[12]
XiaR S, LiJ F, XuC D, Yan Y H. A sound image externalization approach for headphone reproduction by simulating binaural room impulse responses. Chinese Journal of Electronics, 2014, 23(3): 527–532
[13]
PlengeG. On the differences between localization and lateralization. The Journal of the Acoustical Society of America, 1974, 56(3): 944–951
CrossRef Google scholar
[14]
ZhangC Y, XieB S. Platform for dynamic virtual auditory environment real-time rendering system. Chinese Science Bulletin, 2013, 58(3): 316–327
CrossRef Google scholar
[15]
TianX H, FuZ H, XieL. An experimental comparison on KEMAR and BHead210 dummy heads for HRTF-based virtual auditory on Chinese subjects. In: Proceedings of the 3rd IET International Conference on Wireless, Mobile and Multimedia Networks. 2010, 369–372
[16]
MøllerH, Sørensen M F, HammershøiD, JensenC B. Head-related transfer functions of human subjects. Journal of the Audio Engineering Society, 1995, 43(5): 300–321
[17]
MøllerH, JensenC B, HammershøiD , SørensenM F. Using a typical human subject for binaural recording. In: Proceedings of Audio Engineering Society Convention 100. 1996
[18]
AllenJ B, Berkley D A. Image method for efficiently simulating smallroom acoustics. The Journal of the Acoustical Society of America, 1979, 65(4): 943–950
CrossRef Google scholar
[19]
DelanyM E, BazleyE N. Acoustical properties of fibrous absorbent materials. Applied Acoustics, 1970, 3(2): 105–116
CrossRef Google scholar
[20]
HuopaniemiJ, Savioja L, KarjalainenM . Modeling of reflections and air absorption in acoustical spaces: a digital filter design approach. In: Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 1997, 19–22
CrossRef Google scholar
[21]
JonesJr R H, JobseB D. Real-time digital audio reverberation system.US Patent 5,530,762. 1996
[22]
BrowneS. Hybrid reverberation algorithm using truncated impulse response convolution and recursive filtering. Dissertation for the Doctoral Degree. Miami: University of Miami, 2001
[23]
MoorerJ A. About this reverberation business. Computer Music Journal, 1979, 13–28
CrossRef Google scholar
[24]
GardnerW G. A realtime multichannel room simulator. Journal of the Acoustical Society of America, 1992, 92(4): 2395
CrossRef Google scholar
[25]
AlgaziV R, DudaR O, ThompsonD M , AvendanoC. The CIPIC HRTF database. In: Proceedings of IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics. 2001, 99–102
CrossRef Google scholar
[26]
GardnerW G, MartinK D. HRTF measurements of a KEMAR. The Journal of the Acoustical Society of America, 1995, 97(6): 3907–3908
CrossRef Google scholar
[27]
FrigoM, Johnson S G. FFTW: An adaptive software architecture for the FFT. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. 1998, 1381–1384
CrossRef Google scholar
[28]
DavidH A. The method of paired comparisons. In: Kendall M G, ed. Griffin’s Statistical Monographs and Courses, Vol. 12. New York: Hafner, 1963

RIGHTS & PERMISSIONS

2017 Higher Education Press and Springer-Verlag Berlin Heidelberg
AI Summary AI Mindmap
PDF(469 KB)

Accesses

Citations

Detail

Sections
Recommended

/