Lecture Notes 2021
Week 1:
- Course Introduction
- What is Big Data ? Where Big Data Comes from [PDF ]
- What we can do and what should we do with Big Data ? [PDF]
Week 2:
- Introduction to Feature Manipulation [PDF]
- Feature Selection: Wrapper approaches and Sequential Search [PDF]
- Reading: Data Preprocessing [PDF]
Week 3:
- Feature Selection: Filter and Embedded approaches [PDF]
- Extra Notes --- COMP 307 Decision Tree Learning with An Example [PDF]
Week 4:
- Feature Manipulation for High-Dimensional Data: Feature Construction [PDF]
Week 5:
Week 6: Moving Beyond Linearity: Linear Regression and Shrinkage Methods
[PDF]
Week 7:
- Reading [PDF]
- Clustering 1
- Clustering 2
Week 8: Hadoop MapReduce
Week 9: Apache Spark
Week 10: Apache Spark
Week 11: Spark Machine Learning Libraries
Week 12: Tutorial on Spark Machine Learning Libraries