Deep learning-based approaches to audio telepresence – International Workshop on Acoustic Enhancement 2024

Special Session on “Deep learning-based approaches to audio telepresence”

Thursday 12/9 – 10:00-12:00

Organizers: Mingsian Bai and Boaz Rafaely

Short Description: This special session will explore advances in the theory, implementation, and applications of audio and speech processing for telepresence. Signal processing and deep learning are explored in the context of audio telepresence. The goal is to bring together the core technologies relevant to telepresence using laptops or pads, smart speakers, VR eyeglasses, gaming stations, and others.

The scope of the special session shall include, but not be limited to, the following topics:

Microphone and loudspeaker array signal processing for AT: source counting, localization, beamforming, soundfield synthesis and zone control, etc.
Binaural AT using headphones and global AT using loudspeaker arrays
Signal processing-based systems, deep learning-based systems, and hybrid systems of the two
AT-specific performance metrics and evaluation
Scalability of signal enhancement and ambience preservation
Enhancement techniques, including denoising, dereverberation, acoustic echo cancellation, etc., for AT
Interpolation of array Relative Transfer Functions (RTFs)
Online, real-time, and low-complexity implementations of telepresence systems
Application scenarios of AT

Session Papers

1022: FEASIBILITY OF IMAGLS-BSM – ILD INFORMED BINAURAL SIGNAL MATCHING WITH ARBITRARY MICROPHONE ARRAYS

Or Berebi, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely

1036: RGI-NET: 3D ROOM GEOMETRY INFERENCE FROM ROOM IMPULSE RESPONSES WITH HIDDEN FIRST-ORDER REFLECTIONS

Inmo Yeon, Jung-Woo Choi

1081: A tunable binaural audio telepresence system capable of balancing immersive and enhanced modes

Yicheng Hsu, Mingsian Bai

1097: NEURAL DIRECTIONAL FILTERING: FAR-FIELD DIRECTIVITY CONTROL WITH A SMALL MICROPHONE ARRAY

Julian Wechsler, Srikanth Raj Chetupalli, Mhd Modar Halimeh, Oliver Thiergart, Emanuël A. P. Habets

1151: Magnitude Least-Squares based Ambisonics Estimation of Head-Worn Device Microphone Measurements for Binaural Reproduction

AMY BASTINE, Lachlan Birnie, Thushara Abhayapala, Prasanga Samarasinghe, Vladimir Tourbabin