Interactive Data Science
Course Information
Course Number
HCI Undergraduate:
n/a
HCI Graduate:
05-839
Course Description
This course covers techniques and technologies for creating data driven interfaces. You will learn about the entire data pipeline from sensing to cleaning data to different forms of analysis and computation.
Introduction
- identifying the questions you want to answer
- identifying the data required to answer the question
- transforming data to answers
Collecting data
- Sources to collect from: click, sensors, mobile phones, etc.
- APIs for social web & OAUTH
-
Common data formats: XML, json, csv, …
-
Sampling and Bias in data collection
Cleaning data
- Understanding your data
- Data Quality: coherence, correctness, completeness and accountability
- Common problems with data
Tools for analyzing data
- Exploratory Analysis, Distributions and their meanings
- Causality
- Transformations and Features
-
Usable Machine Learning
Visualisation
- What, why and how not to visualize
- Perceptual issues in visualization
- What makes a good visualization, narrative
- Visualizing big data
Semester Offered and Units
Semester:
Spring
Graduate:
12
units