Special Session on “Deep learning-based approaches to audio telepresence”

Thursday 12/9 – 10:00-12:00

Organizers: Mingsian Bai and Boaz Rafaely

Short Description: This special session will explore advances in the theory, implementation, and applications of audio and speech processing for telepresence. Signal processing and deep learning are explored in the context of audio telepresence. The goal is to bring together the core technologies relevant to telepresence using laptops or pads, smart speakers, VR eyeglasses, gaming stations, and others.

The scope of the special session shall include, but not be limited to, the following topics:

  • Microphone and loudspeaker array signal processing for AT: source counting, localization, beamforming, soundfield synthesis and zone control, etc.
  • Binaural AT using headphones and global AT using loudspeaker arrays
  • Signal processing-based systems, deep learning-based systems, and hybrid systems of the two
  • AT-specific performance metrics and evaluation
  • Scalability of signal enhancement and ambience preservation
  • Enhancement techniques, including denoising, dereverberation, acoustic echo cancellation, etc., for AT
  • Interpolation of array Relative Transfer Functions (RTFs)
  • Online, real-time, and low-complexity implementations of telepresence systems
  • Application scenarios of AT

Session Papers

1022: FEASIBILITY OF IMAGLS-BSM – ILD INFORMED BINAURAL SIGNAL MATCHING WITH ARBITRARY MICROPHONE ARRAYS
Or Berebi, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely
1036: RGI-NET: 3D ROOM GEOMETRY INFERENCE FROM ROOM IMPULSE RESPONSES WITH HIDDEN FIRST-ORDER REFLECTIONS
Inmo Yeon, Jung-Woo Choi
1081: A tunable binaural audio telepresence system capable of balancing immersive and enhanced modes
Yicheng Hsu, Mingsian Bai
1097: NEURAL DIRECTIONAL FILTERING: FAR-FIELD DIRECTIVITY CONTROL WITH A SMALL MICROPHONE ARRAY
Julian Wechsler, Srikanth Raj Chetupalli, Mhd Modar Halimeh, Oliver Thiergart, Emanuël A. P. Habets
1151: Magnitude Least-Squares based Ambisonics Estimation of Head-Worn Device Microphone Measurements for Binaural Reproduction
AMY BASTINE, Lachlan Birnie, Thushara Abhayapala, Prasanga Samarasinghe, Vladimir Tourbabin