Special Session on “Deep learning-based approaches to audio telepresence”
Thursday 12/9 – 10:00-12:00
Organizers: Mingsian Bai and Boaz Rafaely
Short Description: This special session explores advances in the theory, implementation, and applications of audio and speech processing for audio telepresence (AT), covering both signal processing and deep learning approaches. The goal is to bring together the core technologies relevant to telepresence on laptops, tablets, smart speakers, VR glasses, gaming stations, and other devices.
The scope of the special session includes, but is not limited to, the following topics:
- Microphone and loudspeaker array signal processing for AT: source counting, localization, beamforming, soundfield synthesis and zone control, etc.
- Binaural AT using headphones and global AT using loudspeaker arrays
- Signal processing-based systems, deep learning-based systems, and hybrid systems of the two
- AT-specific performance metrics and evaluation
- Scalability of signal enhancement and ambience preservation
- Enhancement techniques for AT, including denoising, dereverberation, and acoustic echo cancellation
- Interpolation of array Relative Transfer Functions (RTFs)
- Online, real-time, and low-complexity implementations of telepresence systems
- Application scenarios of AT
Session Papers
1022: FEASIBILITY OF IMAGLS-BSM – ILD INFORMED BINAURAL SIGNAL MATCHING WITH ARBITRARY MICROPHONE ARRAYS
Or Berebi, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely
1036: RGI-NET: 3D ROOM GEOMETRY INFERENCE FROM ROOM IMPULSE RESPONSES WITH HIDDEN FIRST-ORDER REFLECTIONS
Inmo Yeon, Jung-Woo Choi
1081: A tunable binaural audio telepresence system capable of balancing immersive and enhanced modes |
Yicheng Hsu, Mingsian Bai |
1097: NEURAL DIRECTIONAL FILTERING: FAR-FIELD DIRECTIVITY CONTROL WITH A SMALL MICROPHONE ARRAY
Julian Wechsler, Srikanth Raj Chetupalli, Mhd Modar Halimeh, Oliver Thiergart, Emanuël A. P. Habets
1151: Magnitude Least-Squares based Ambisonics Estimation of Head-Worn Device Microphone Measurements for Binaural Reproduction |
Amy Bastine, Lachlan Birnie, Thushara Abhayapala, Prasanga Samarasinghe, Vladimir Tourbabin