Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Automatic Acquisition of Chinese-English Parallel Corpus from the Web Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines October 2005
Publication Regression-Based Residual Acoustic Echo Suppression Amit S. Chhetri, Muktha Ananda, Jack W. Stokes, John C. Platt International Workshop on Acoustic Echo and Noise Control IWAENC ’05, Eindhoven, Netherlands | September 2005 Project
Publication Robust access to large structured data using voice form-filling Sarangarajan Parthasarathy, Cyril Allauzen, R. Munkong Interspeech 2005 | September 2005
Publication Let’s Go Public! Taking a Spoken Dialog System to the Real World Antoine Raux, Brian Langner, Dan Bohus, Alan W Black, Maxine Eskenazi 9th European Conference on Speech Communication and Technology, Lisbon, Portugal | September 2005
Publication A Principled Approach for Rejection Threshold Optimization in Spoken Dialog Systems Dan Bohus, Alexander I. Rudnicky 9th European Conference on Speech Communication and Technology, Lisbon, Portugal | September 2005
Publication Sorry, I Didn’t Catch That! – An Investigation of Non-understanding Errors and Recovery Strategies Dan Bohus, Alexander I. Rudnicky 6th SIGdial Workshop on Discourse and Dialogue | September 2005
Publication Learning Statistically Characterized Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation and Reduction Li Deng, Dong Yu, Alex Acero Proc. of the Interspeech Conference | September 2005 Proc. of the Interspeech Conference
Publication Speech Technology and Systems in Human-Machine Communication Li Deng, Kuansan Wang, Wu Chou IEEE Signal Processing Magazine | September 2005, Vol 22(5): pp. 12-14
Publication Maximum Mutual Information SPLICE Transform for Seen and Unseen Conditions Jasha Droppo, Alex Acero Proc. Interspeech Conference | September 2005 Proc. Interspeech Conference
Publication A Graphical Model for Multi-Sensory Speech Processing in Air-and-Bone Conductive Microphones A. Subramanya, Jasha Droppo, Alex Acero, Zheng Zhang, Zicheng Liu Proc. of the Interspeech Conference | September 2005 Proc. of the Interspeech Conference