HOBIT: Hardness Optimized Batch Sampling for InfoNCE Training
- Himanshu Dutta ,
- Lokesh Nagalapatti ,
- Yashoteja Prabhu
Spotlight Acceptance
BibTex 다운로드
Research
Spotlight Acceptance
BibTex 다운로드Contrastive training with InfoNCE loss and in-batch negatives is the standard approach for learning dual-encoder models. Its effectiveness, however, critically depends on the availability of hard negatives; in their absence, learning quickly saturates. Existing methods address this via explicit hard-negative mining, which is often costly or heuristic-driven. We introduce HOBIT, a principled mini-batch construction method that improves in-batch negative quality by reordering training examples at every epoch. HOBIT solves an optimization problem motivated by the InfoNCE objective to yield mini-batches such that each query in the batch is exposed to hard yet non-contradictory, informative negative examples. We show that the optimization objective is monotone and submodular which in turn leads us to a greedy algorithm that admits the standard O(1-1/e) approximation guarantee. Empirically, we show that HOBIT incurs negligible computational overhead while significantly outperforming state-of-the-art batching methods, and remains complementary to existing hard negative mining techniques.
한국마이크로소프트(유)
대표이사: 조원우
주소: (우)110-150 서울 종로구 종로1길 50 더 케이트윈타워 A동 12층
전화번호: 02-531-4500, 메일: ms-korea@microsoft.com
사업자등록번호: 120-81-05948 사업자정보확인
호스팅서비스 제공자: Microsoft Corporation
통신판매신고: 제2013-서울종로-1009호
사이버몰의 이용약관: Microsoft Store 판매 약관