LINE: Large-scale Information Network Embedding
Embedding information networks into low-dimensional spaces is potentially useful in many applications, such as visualization, node classification, link prediction, and recommendation. In this project, we proposed a large-scale information network embedding model called “LINE”,…
Pre-training of Hidden-Unit CRFs
WikiQA Corpus
The WikiQA corpus is a new publicly available set of question and sentence pairs, collected and annotated for research on open-domain question answering. In order to reflect the true information need of general users, we…
Compact Lexicon Selection with Spectral Methods
SigmaDolphin: Automated Math Word Problem Solving
This project builds a computer system that automatically solves math word problems written in natural language. SigmaDolphin is a project initiated in early 2013 at Microsoft Research Asia, with the primary goal of building a computer intelligent…