Final intern talk: Improving Frechet Audio Distance for Generative Music Evaluation
As generative music models become more powerful and popular, there is a growing need for robust objective metrics of music quality that correlate with human perception. The Frechet Audio Distance (FAD) is a commonly used…
ICASSP 2023 Acoustic Echo Cancellation Challenge
Large-Scale Automatic Audiobook Creation
HyWay: Physical Walk (MSR India – TAB Feb 2023)
A key aspect of attending such an event in person is being able to experience the setting in its fullness — hearing the buzz of background conversations and seeing who is around. This can be…
Research Focus: Week of August 14, 2023
In this issue: HyWay enables hybrid mingling; Auto-Tables transforms non-relational tables into standard relational forms; training dense retrievers to identify high-quality in-context examples for LLMs; improving pronunciation assessment in CAPT.
Audio Retrieval with WavText5K and CLAP Training
Thinking beyond audio: Augmenting headphones for everyday digital interactions
Because headphones rank among the most popular wearables on the market, we have an exciting opportunity to expand their capabilities through integrating existing sensors with supplementary ones to enable a wide variety of experiences that…