Homework 02

The second homework is posted.

The first problem is an exercise in using cross-validation to select the best-fit polynomial for some synthetic data where both the degree and coefficients are unknown. The second problem is an application of naive Bayes for multiclass text classification in which you’re asked to train a classifier that, given the text of an article from the New York Times, predicts the section to which the article belongs.
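As a rough illustration of the idea behind the first problem, here is a minimal sketch of k-fold cross-validation over polynomial degree. It is not the assignment's reference solution. The function names (`cv_mse`, `select_degree`), the choice of 5 folds, the maximum degree of 8, and the synthetic cubic data are all assumptions made for the example; the real polyfit.tsv data is not used here.

```python
import numpy as np

def cv_mse(x, y, degree, k=5, seed=0):
    """Average held-out mean squared error of a polynomial fit over k folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(x)), k)
    errors = []
    for i in range(k):
        test = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        # Fit on the training folds only, then score on the held-out fold.
        coeffs = np.polyfit(x[train], y[train], degree)
        pred = np.polyval(coeffs, x[test])
        errors.append(np.mean((pred - y[test]) ** 2))
    return float(np.mean(errors))

def select_degree(x, y, max_degree=8, k=5):
    """Return the degree with the lowest cross-validated error, plus all scores."""
    scores = {d: cv_mse(x, y, d, k) for d in range(max_degree + 1)}
    return min(scores, key=scores.get), scores

# Hypothetical stand-in data: a noisy cubic, not the homework's data file.
rng = np.random.default_rng(1)
x = rng.uniform(-1, 1, 200)
y = 1.0 - 2.0 * x + 2.0 * x**3 + rng.normal(0, 0.1, 200)

best_degree, scores = select_degree(x, y)
```

Because every degree is scored on points it was not fit to, degrees that are too low show up with large held-out error, while degrees that are too high gain little or get worse. Training error alone would keep decreasing as the degree grows.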

You’ll need the polyfit.tsv data file for the first problem and the stopwords.txt file for the second.
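For the second problem, the sketch below shows one common form of the technique: multinomial naive Bayes with add-one (Laplace) smoothing and stop-word filtering. It is an illustration under stated assumptions, not the expected solution. The whitespace tokenizer, the smoothing constant `alpha`, the tiny stop-word set, the toy documents, and the section labels are all hypothetical; the real stopwords.txt file and New York Times articles are not used.

```python
import math
from collections import Counter, defaultdict

def tokenize(text, stopwords):
    """Lowercase, split on whitespace, and drop stop words and non-alphabetic tokens."""
    return [w for w in text.lower().split() if w.isalpha() and w not in stopwords]

def train_nb(docs, labels, stopwords, alpha=1.0):
    """Estimate log priors and smoothed per-class word log likelihoods."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in zip(docs, labels):
        tokens = tokenize(text, stopwords)
        word_counts[label].update(tokens)
        vocab.update(tokens)
    log_prior = {c: math.log(n / len(labels)) for c, n in class_counts.items()}
    log_lik = {}
    for c in class_counts:
        total = sum(word_counts[c].values()) + alpha * len(vocab)
        log_lik[c] = {w: math.log((word_counts[c][w] + alpha) / total) for w in vocab}
    return log_prior, log_lik, vocab

def predict_nb(model, text, stopwords):
    """Return the class with the highest posterior log score."""
    log_prior, log_lik, vocab = model
    # Words never seen in training are ignored in this sketch.
    tokens = [w for w in tokenize(text, stopwords) if w in vocab]
    scores = {c: log_prior[c] + sum(log_lik[c][w] for w in tokens) for c in log_prior}
    return max(scores, key=scores.get)

# Hypothetical toy data standing in for articles and their sections.
stopwords = {"the", "a", "in", "of"}
docs = [
    "the team won the game in overtime",
    "a striker scored in the final match",
    "the senate passed a budget bill",
    "voters backed the governor in the election",
]
labels = ["Sports", "Sports", "Politics", "Politics"]

model = train_nb(docs, labels, stopwords)
prediction = predict_nb(model, "the striker won the match", stopwords)
```

Working in log space avoids numerical underflow when many small word probabilities are multiplied together, and the smoothing term keeps a word that appears in only one class from forcing a zero probability in the others.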