CSCI 307
    Data Mining

    College of the Holy Cross, Fall 2025




    Consider this an aspirational schedule. The schedule is subject to change.

    Week Day Date Actual
    1 T 8/26/2025 Intro, data matrix, samples, features
    R 8/28/2025 Vector and Matrix Operations
    2 T 9/2/2025 More vector and matrix operations
    R 9/4/2025 Numerical attributes, mean, median, variance
    3 T 9/9/2025 Multivariate analysis, covariance, correlation
    R 9/11/2025 Categorical attributes
    4 T 9/16/2025 Data Analysis with Python
    R 9/18/2025 Probability, conditional probability, Bayes' theorem
    5 T 9/23/2025 Probabilistic view of data, distributions, and histograms
    R 9/25/2025 Frequent Itemset Mining
    6 T 9/30/2025 In-class mid-term
    R 10/2/2025 Association Rules
    7 T 10/7/2025 Graph data
    R 10/9/2025 Graph mining
    8 T 10/14/2025 (break)
    R 10/16/2025 (break)
    9 T 10/21/2025 Clustering
    R 10/23/2025 k-means clustering
    10 T 10/28/2025 Regression and linear models
    R 10/30/2025 Gradient descent
    11 T 11/4/2025 Classification and Logistic Regression
    R 11/6/2025 Mid-term (out of class)
    12 T 11/11/2025 Neural networks, limitations of linear models
    R 11/13/2025 Neural networks 2
    13 T 11/18/2025 Text mining, text features
    R 11/20/2025 Application of text mining: sentiment analysis
    14 T 11/25/2025 (buffer)
    R 11/27/2025 (break)
    15 T 12/2/2025 Text-based embeddings
    R 12/4/2025 Large language models?