CSCI 307
    Data Mining

    College of the Holy Cross, Fall 2023


    Home | | Schedule | | Resources


    Schedule
    i
    Week Day Date Topic Book Chapter Assignments and Projects
    1 T 8/29/2023 Logistics and motivation
    R 8/31/2023 Introduction to Python hw1 out
    2 T 9/5/2023 Data and attributes 1.1 - 1.3
    R 9/7/2023 Probabilistic view of data 1.4
    3 T 9/12/2023 Numerical attributes 2.1 hw1 due
    R 9/14/2023 Multivariate analysis 2.2 - 2.4 hw2 out
    4 T 9/19/2023 Probability: Normal distribution 2.5
    R 9/21/2023 Categorical attributes 3.1, 3.4 hw2 due
    5 T 9/26/2023 Probability: Bernouli and Binomial distribution
    Random variables in Python
    R 9/28/2023 Dimensionality reduction, PCA 6.1, 7.1 - 7.2
    6 T 10/3/2023 Principal Component Analysis Continued 7.1, 7.2, 7.4 Project - checkpoint 1
    bonus math hw out
    R 10/5/2023 Classification, k-nearest neighbors 18.1, 18.3 hw3 out delayed
    7 T 10/10/2023 (break)
    R 10/12/2023 (break)
    8 T 10/17/2023 kNN and Decision Tree 18.3, 19
    R 10/19/2023 Linear regression and mid-term recap
    Away for conference
    hw3 due delayed
    bonus math hw due
    9 T 10/24/2023 Classification with sci-kit learn, mid-term recap Logistic regression
    R 10/26/2023 In-class mid-term
    10 T 10/31/2023 Decision trees, sklearn
    R 11/2/2023 Assessment of classifiers.
    11 T 11/7/2023 Linear regression. Project - checkpoint 2
    R 11/9/2023 Gradient descent hw4 out
    12 T 11/14/2023 Logistic regression, limitations of linear models
    R 11/16/2023 Neural networks - intuition hw4 due
    13 T 11/21/2023 Neural networks
    R 11/23/2023 (break)
    14 T 11/28/2023 Clustering - k-means and Density-based
    R 11/30/2023 NLP
    15 T 12/5/2023 NLP continued
    R 12/7/2023 Recap for final exam Projects - final checkpoint


    Schedules, particularly those for homeworks and projects are tentative. You'll get updates in class in case anything changes.