Discover an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community below. These experimental technologies, available through Azure AI Foundry Labs, offer a glimpse into the future of AI innovation.

VQA Introspect
The VQA-Introspect dataset consists of 238K new perception questions that serve as sub-questions corresponding to the set of perceptual tasks needed to effectively answer the complex reasoning questions in the Reasoning split of the…
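To make the dataset's structure concrete, here is a minimal sketch of how a reasoning question might pair with its perception sub-questions. The field names below are hypothetical illustrations, not the dataset's actual schema.

```python
# Illustrative pairing of one reasoning question with its perception
# sub-questions. Field names are hypothetical, not VQA-Introspect's schema.
example = {
    "reasoning_question": "Is this an appropriate outfit for the weather?",
    "reasoning_answer": "yes",
    "sub_questions": [
        {"question": "Is it snowing?", "answer": "yes"},
        {"question": "Is the person wearing a coat?", "answer": "yes"},
    ],
}

# Each sub-question names a perceptual check that supports the reasoning answer.
for sub in example["sub_questions"]:
    print(f"{sub['question']} -> {sub['answer']}")
```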
This repository contains the source code necessary to reproduce the results presented in the paper Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks. We propose a new cross-modal pre-training method, Oscar (Object-Semantics Aligned Pre-training). It leverages object…
This repository contains the code for reproducing the quantitative experiments in our publication “Independent Subspace Analysis for Unsupervised Learning of Disentangled Representations.”
This data was collected for and used in our ACL 2020 paper, which demonstrates the potential to effectively combine explanations and demonstrations to learn web-based procedures. It consists of 520 explanations and corresponding demonstrations…
SPLASH is a dataset for the task of semantic parse correction with natural language feedback. The task and dataset, along with baseline results, are presented in: Speak to your Parser: Interactive Text-to-SQL with Natural Language Feedback. Ahmed…
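To make the task concrete, here is an illustrative example of what a parse-correction instance looks like: a question, an initial incorrect SQL parse, natural language feedback, and the corrected parse. The field names and SQL are invented for illustration, not taken from SPLASH.

```python
# Hypothetical shape of one parse-correction instance; field names and
# SQL are illustrative, not SPLASH's actual schema.
example = {
    "question": "Show the names of employees hired after 2015.",
    "predicted_parse": "SELECT name FROM employees WHERE hire_year < 2015",
    "feedback": "You should look for years greater than 2015, not less.",
    "gold_parse": "SELECT name FROM employees WHERE hire_year > 2015",
}
# A correction model takes (question, predicted_parse, feedback) as input
# and is trained to produce gold_parse.
```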
Using machine learning to detect beluga whale calls in hydrophone recordings. Of the five populations of beluga whales in Alaska, the Cook Inlet population is the smallest and has declined by about seventy-five percent since…
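As a rough illustration of this kind of pipeline, the sketch below scores sliding spectrogram windows of a hydrophone recording with a pre-fitted classifier. The band limits, window sizes, and classifier are assumptions for illustration, not the project's actual detector.

```python
# Minimal sketch of spectrogram-based call detection. Band limits,
# window sizes, and the classifier are illustrative assumptions.
import numpy as np
from scipy.signal import spectrogram

def detect_calls(audio, sr, clf, win_s=1.0, hop_s=0.5,
                 fmin=500.0, fmax=12_000.0):
    """Slide a window over `audio` and score each window with `clf`.

    `clf` is any fitted scikit-learn classifier over flattened,
    band-limited spectrogram patches (a hypothetical stand-in for a
    trained call detector). Returns (start_time_seconds, score) pairs.
    """
    win, hop = int(win_s * sr), int(hop_s * sr)
    detections = []
    for start in range(0, len(audio) - win + 1, hop):
        chunk = audio[start:start + win]
        freqs, _, sxx = spectrogram(chunk, fs=sr, nperseg=256)
        band = sxx[(freqs >= fmin) & (freqs <= fmax)]   # keep call band only
        feats = np.log1p(band).ravel()[None, :]          # one feature row
        score = clf.predict_proba(feats)[0, 1]           # P(call | window)
        detections.append((start / sr, score))
    return detections
```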
Code accompanying “Conservative Uncertainty Estimation By Fitting Prior Networks” – ICLR 2020
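The core idea, fitting trainable networks to randomly initialized prior networks and reading uncertainty off the fitting error, can be sketched as follows. The architecture and hyperparameters here are illustrative assumptions, not the paper's exact setup.

```python
# Minimal sketch of uncertainty from fitting random prior networks.
# Architectures and hyperparameters are illustrative assumptions.
import numpy as np
from sklearn.neural_network import MLPRegressor

def make_prior(d_in, seed):
    """A fixed random network: random features with a random readout."""
    r = np.random.default_rng(seed)
    w = r.normal(size=(d_in, 64))
    v = r.normal(size=64)
    return lambda x: np.tanh(x @ w) @ v

def uncertainty(x_train, x_test, n_priors=5):
    errs = []
    for s in range(n_priors):
        prior = make_prior(x_train.shape[1], s)
        # Fit a trainable network to the prior's outputs on training data only.
        fit = MLPRegressor(hidden_layer_sizes=(64,), max_iter=2000)
        fit.fit(x_train, prior(x_train))
        # Where the fit fails to match the prior, we are far from the data.
        errs.append((fit.predict(x_test) - prior(x_test)) ** 2)
    return np.mean(errs, axis=0)  # large error => high uncertainty
```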
VL-BERT is a simple yet powerful pre-trainable generic representation for visual-linguistic tasks. It is pre-trained on a massive-scale caption dataset and a text-only corpus, and can be fine-tuned for various downstream visual-linguistic tasks, such as Visual…
This repository implements Ranking-Critical Training (RaCT) for Collaborative Filtering, accepted at the International Conference on Learning Representations (ICLR) 2020. By using an actor-critic architecture to fine-tune a differentiable collaborative filtering model, we can improve the performance…
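A heavily simplified sketch of the actor-critic idea follows, assuming a hypothetical actor that maps user histories to item scores and a critic trained to predict a ranking metric such as NDCG. The shapes, losses, and training details below are illustrative, not RaCT's exact recipe.

```python
# Illustrative actor-critic fine-tuning for a differentiable recommender.
# All shapes and training details are assumptions, not RaCT's exact recipe.
import torch
import torch.nn as nn

n_items = 1000
actor = nn.Sequential(nn.Linear(n_items, 200), nn.Tanh(), nn.Linear(200, n_items))
critic = nn.Sequential(nn.Linear(2 * n_items, 200), nn.ReLU(), nn.Linear(200, 1))

def critic_input(scores, targets):
    # The critic sees the actor's scores alongside held-out interactions.
    return torch.cat([scores, targets], dim=-1)

# Phase 1 (not shown): pretrain the actor with its usual likelihood loss and
# train the critic to regress the true, non-differentiable ranking metric.
# Phase 2: fine-tune the actor to maximize the critic's predicted metric.
opt = torch.optim.Adam(actor.parameters(), lr=1e-4)
history = torch.rand(32, n_items)   # toy user click histories
targets = torch.rand(32, n_items)   # toy held-out interactions
for _ in range(100):
    scores = actor(history)
    loss = -critic(critic_input(scores, targets)).mean()  # ascend predicted metric
    opt.zero_grad()
    loss.backward()
    opt.step()
```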