DNATagging
An implementation of data encoding and decoding using DNA Tags and paper tickets. The api directory contains implementations for REST API endpoints to enable a DNA Tagging application. The test directory contains configurations and tests…
Final intern talk: Distilling Self-Supervised-Learning-Based Speech Quality Assessment into Compact
Speaker: Benjamin StahlHost: Hannes Gamper In this talk, we explore advancements in computational models for speech quality assessment. Self-supervised learning models have emerged as powerful front-ends, outperforming supervised-only models. However, their large size renders them…
Abstracts: July 18, 2024
Senior Researcher Arindam Mitra introduces AgentInstruct. Using raw data sources, the automated multi-agent framework can create diverse, high-quality synthetic data at scale for the post-training of small and large language models.
Research Focus: Week of July 15, 2024
Advancing time series analysis with multi-granularity guided diffusion model; An algorithm-system co-design for fast, scalable MoE inference; What makes a search metric successful in large-scale settings; learning to solve PDEs without simulated data.