This course covers techniques and technologies for creating data-driven interfaces. You will learn about the entire data pipeline, from sensing and cleaning data to different forms of analysis and computation.
Identifying the questions you want to answer
Identifying the data required to answer those questions
Transforming data into answers
Sources to collect from: clicks, sensors, mobile phones, etc.
APIs for the social web and OAuth
Common data formats: XML, JSON, CSV, …
Sampling and Bias in data collection
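As a minimal sketch of the common data formats topic above, the snippet below parses the same record from JSON, CSV, and XML using only the Python standard library. The sensor reading, its field names, and the helper functions are hypothetical and exist only for illustration; real feeds will differ in structure and typing.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# Hypothetical sensor reading encoded three ways (illustrative data only).
JSON_TEXT = '{"sensor": "t1", "value": 21.5}'
CSV_TEXT = "sensor,value\nt1,21.5\n"
XML_TEXT = "<reading><sensor>t1</sensor><value>21.5</value></reading>"


def from_json(text):
    """Parse a JSON object into a normalized record."""
    record = json.loads(text)
    return {"sensor": record["sensor"], "value": float(record["value"])}


def from_csv(text):
    """Parse the first data row of a CSV document with a header line."""
    row = next(csv.DictReader(io.StringIO(text)))
    # CSV carries no type information, so the number arrives as a string.
    return {"sensor": row["sensor"], "value": float(row["value"])}


def from_xml(text):
    """Parse a flat XML element whose children hold the fields."""
    root = ET.fromstring(text)
    return {
        "sensor": root.findtext("sensor"),
        "value": float(root.findtext("value")),
    }


records = [from_json(JSON_TEXT), from_csv(CSV_TEXT), from_xml(XML_TEXT)]
print(records)
```

Normalizing every source into one record shape early keeps the later cleaning and analysis steps independent of how the data was collected.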
Understanding your data
Data Quality: coherence, correctness, completeness and accountability
Common problems with data
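Two of the data quality dimensions listed above, completeness and correctness, can be checked mechanically. The sketch below is one possible approach, not a standard method: the `quality_report` helper, the sample rows, and the plausible value range are all assumptions made for illustration.

```python
def quality_report(rows, required, valid_range):
    """Count rows with missing required fields and rows with out-of-range values."""
    low, high = valid_range
    missing = 0
    out_of_range = 0
    for row in rows:
        # Completeness: every required field must be present and non-empty.
        if any(row.get(field) in (None, "") for field in required):
            missing += 1
            continue
        # Correctness (a crude proxy): the value must fall in a plausible range.
        if not low <= row["value"] <= high:
            out_of_range += 1
    return {"total": len(rows), "missing": missing, "out_of_range": out_of_range}


# Hypothetical readings with typical problems: a null value, an empty
# identifier, and a physically implausible measurement.
ROWS = [
    {"sensor": "t1", "value": 21.5},
    {"sensor": "t2", "value": None},
    {"sensor": "", "value": 19.0},
    {"sensor": "t3", "value": 999.0},
]

report = quality_report(ROWS, required=("sensor", "value"), valid_range=(-40.0, 60.0))
print(report)
```

A range check only catches gross errors; a value can sit inside the range and still be wrong, which is why coherence across sources and accountability for provenance are listed separately.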
Tools for analyzing data
Exploratory Analysis, Distributions and their meanings
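A first exploratory pass often compares simple summary statistics and looks at the shape of the distribution. The sketch below uses Python's standard `statistics` and `collections` modules on a small made-up sample; the values are illustrative only, and the text histogram is a stand-in for a proper plotting tool.

```python
import statistics
from collections import Counter

# Made-up sample with one extreme value (illustrative data only).
values = [1, 2, 2, 3, 3, 3, 4, 50]

summary = {
    "mean": statistics.mean(values),
    "median": statistics.median(values),
    "mode": statistics.mode(values),
}
print(summary)

# A crude text histogram: one line per distinct value, one '#' per occurrence.
counts = Counter(values)
for value in sorted(counts):
    print(f"{value:>3} {'#' * counts[value]}")
```

Here the mean (8.5) sits well above the median (3.0), which signals that a single large value is pulling the average up. Spotting that gap is a cue to inspect the tail of the distribution before trusting the mean as a summary.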