MS MARCO
MS MARCO is a collection of datasets focused on deep learning in search. The first dataset was a question answering dataset featuring 100,000 real Bing questions and a human generated answer. Since then, we released…
Access Publication Publication Publication Publication Publication
Natural Language Interfaces to Web APIs Dataset
The NL2API dataset includes the web APIs call from the Microsoft Graph API suite, which are respectively used to search a user’s emails and calendar events. Each data points include the API call, its canonical…
Active Ranking with Subset-wise Preferences
Beyond Search
When we think of the experiences that search engines are designed to support, criteria such as speed and efficiency instantly come to mind. In this project, we wanted to focus on something different: how the…