Pareto Governors for Energy-Optimal Computing
Azure for Research Data Science Training at New York University’s CUSP
New York University’s Center for Urban Science and Progress (CUSP), in conjunction with Microsoft Research and the Northeast Big Data Innovation Hub, is offering a free hands-on data science workshop on Microsoft Azure at CUSP in…
Cross-Domain Data Fusion
Overview: Traditional data mining usually deals with data from a single domain. In the big data era, we face a diversity of datasets from different sources in different domains. These datasets consist of multiple…
Visualization for Machine Teaching
We explore ways to help people easily build machine learning models by leveraging information visualization. We aim to effectively support understanding and debugging of machine learning models.
Trust, but Verify: Optimistic Visualizations of Approximate Queries for Exploring Big Data (video figure)
Analysts need interactive speed for exploratory analysis, but big data systems are often slow. With sampling, data systems can produce approximate answers fast enough for exploratory visualization, at the cost of accuracy and trust. We…
WinMine Toolkit
The WinMine Toolkit is a set of tools for Windows 2000/NT/XP that allow you to build statistical models from data. The majority of the tools are command-line executables that can be run in scripts. Click…