Lecture Notes 2021

Week 1:
  • Course Introduction
  • What is Big Data ? Where Big Data Comes from [PDF ]
  • What we can do and what should we do with Big Data ? [PDF]

Week 2:
  • Introduction to Feature Manipulation [PDF]
  • Feature Selection: Wrapper approaches and Sequential Search [PDF]
    • Reading: Data Preprocessing [PDF]

Week 3:
  • Feature Selection: Filter and Embedded approaches [PDF]
    • Extra Notes --- COMP 307 Decision Tree Learning with An Example [PDF]

Week 4:
  • Feature Manipulation for High-Dimensional Data: Feature Construction [PDF]

Week 5:

Week 6: Moving Beyond Linearity: Linear Regression and Shrinkage Methods [PDF]

Week 7:
  • Reading [PDF]
  • Clustering 1
  • Clustering 2

Week 8: Hadoop MapReduce

Week 9: Apache Spark

Week 10: Apache Spark

Week 11: Spark Machine Learning Libraries

Week 12: Tutorial on Spark Machine Learning Libraries
Topic attachments
I Attachment Action Size Date Who Comment
COMP307Related.zipzip COMP307Related.zip manage 4 MB 16 Feb 2021 - 09:15 Main.chenqi1  
Credit.csvcsv Credit.csv manage 25 K 21 Mar 2021 - 14:00 Main.chenqi1  
Week5_6_Regression2Nolinearity.pdfpdf Week5_6_Regression2Nolinearity.pdf manage 3 MB 28 Mar 2021 - 21:45 Main.chenqi1  
Week5_Regression1.pdfpdf Week5_Regression1.pdf manage 4 MB 24 Mar 2021 - 17:05 Main.chenqi1