BigData

  • Big Data
    • What is it?
    • Why is it hot?
    • Big n vs big p
    • How is big data different?
      • use available large-scale data
        • rather than hoping for annotated data
      • memorization works well
        • non-parametric methods
  • combine multiple data sources
  • Why does it work?
  • How to handle it?
    • Dimensionality reduction
    • Hadoop/MapReduce
    • Sampling (not covered much this year!)

Back to Lectures