Lectures /
BigData
- Big Data
- What is it?
- Why is it hot?
- Big n vs big p
- How is big data different?
- use available large-scale data
- rather than hoping for annotated data
- memorization works well
- non-parametric methods
- use available large-scale data
- combine multiple data sources
- Why does it work?
- How to handle it?
- Dimensionality reduction
- Hadoop/MapReduce
- Sampling (not covered much this year!)
- How to handle it?