Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Annotations for Streaming Video on the Web: System Design and Usage Studies David Bargeron, Jonathan Grudin, Anoop Gupta, Elizabeth Sanocki MSR-TR-98-60 | March 1999
Publication Improved Topic-Dependent Language Modeling Using Information Retrieval Techniques Milind Mahajan, Doug Beeferman, Xuedong Huang Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing | March 1999
Publication A Robust Parser for Spoken Language Understanding Ye-Yi Wang Eurospeech | January 1999
Publication Probabilistic Modeling with Bayesian Networks for Automatic Speech Recognition Geoffrey Zweig, Stuart Russell January 1999
Publication Computational Models for Auditory Speech Processing Li Deng Computational Models of Speech Pattern Processing, (NATO ASI Series) | Published by Springer Verlag | 1999
Publication Computational Models for Speech Production Li Deng Computational Models of Speech Pattern Processing, (NATO ASI Series) | Published by Springer Verlag | 1999
Publication Automatic Generation of Synthesis Units for Trainable Text-to-Speech Systems Hsiao-Wuen Hon, Alex Acero, Xuedong Huang Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing | December 1998
Publication A Mixed-Excitation Frequency Domain Model for Time-Scale Pitch-Scale Modification of Speech Alex Acero Proc. of the Int. Conf. on Spoken Language Processing | December 1998
Publication Speaker detection in broadcast speech databases. Aaron E. Rosenberg, Ivan Magrin-Chagnolleau, Sarangarajan Parthasarathy, Qian Huang ICSLP 1998 | November 1998
Publication Can continuous speech recognizers handle isolated speech? Fil Alleva, Xuedong Huang, Mei-Yuh Hwang, Li Jiang Speech Communication | November 1998, Vol 26(3): pp. 183-189