Fast Approximation of Empirical Entropy via Subsampling
The promotional video for the KDD 2019 paper: Fast Approximation of Empirical Entropy via Subsampling by Chi Wang and Bailu Ding.
Gen Encoder
Gen Encoder is a deep learning model that generically represents (embeds) search queries and questions based on their user intents. It leverages large scale user interaction data in Bing to learn this state-of-the-art intent-oriented representation.…
SILK: Preventing Latency Spikes in Log-Structured Merge Key-Value Stores
LSM-based KV stores are designed to offer good write performance by capturing client writes in memory and only later flushing them to storage. Writes are later compacted into a tree-like data structure on disk to…
Trill Samples
This repository hosts samples that demonstrate how to use Trill, a high-performance one-pass in-memory streaming analytics engine from Microsoft Research. It can handle both real-time and offline data, and is based on a temporal data…
DoWhy: Causal Reasoning for Designing and Evaluating Interventions
Today’s computing systems can be thought of as interventions in people’s work and daily lives. But what are the outcomes of these interventions, and how can we tune these systems for desired outcomes? In this…
DNA Assembly for Nanopore Data Storage Readout
Argosy: Verifying Layered Storage Systems With Recovery Refinement
Storage systems make persistence guarantees even if the system crashes at any time, which they achieve using recovery procedures that run after a crash. Realistic storage systems are built using layered abstractions, built on top…