Maximum Likelihood Estimation (MLE) and Maximum a Posteriori (MAP)

Suppose you are given a dataset $\mathcal{D} = \{ (x_i, y_i) \}_{i=1}^N$, where $x_i \in \mathbb{R}$ and $y_i \in \mathbb{R}$, for any $i = 1, \dots, N$. Moreover, suppose that:

$$ y_i = f_\theta(x_i) + \epsilon_i, $$

where $\epsilon_i$ is random Gaussian noise, $\epsilon_i \sim \mathcal{N}(0, \sigma^2)$. It is known that the Maximum Likelihood Estimation (MLE) approach works by defining the conditional probability of $y$ given $x$ and $\theta$, $p(y \mid x; \theta)$, and then optimizing the parameters to maximize this probability distribution over $\theta$. Moreover, it is also known that this approach can be made equivalent to the deterministic approach to such problems (the Least Squares method) by taking the negative log of $p(y \mid x; \theta)$. Indeed, by assuming that the noise $\epsilon_i$ is Gaussian for any $i$, we have:

$$ p(y_i \mid x_i; \theta) = \frac{1}{\sqrt{2 \pi \sigma^2}} \exp\left( - \frac{(y_i - f_\theta(x_i))^2}{2 \sigma^2} \right). $$

Thus,

$$ \theta_{MLE} = \arg\max_\theta \prod_{i=1}^N p(y_i \mid x_i; \theta) = \arg\min_\theta \frac{1}{2} \sum_{i=1}^N \left( f_\theta(x_i) - y_i \right)^2. $$

Given that $f_\theta(x) = \sum_{j=0}^K \theta_j x^j$, with $\theta = (\theta_0, \dots, \theta_K)^T \in \mathbb{R}^{K+1}$, we have shown that the problem above becomes

$$ \theta_{MLE} = \arg\min_{\theta \in \mathbb{R}^{K+1}} \frac{1}{2} \| \Phi \theta - y \|_2^2, $$

where $\Phi \in \mathbb{R}^{N \times (K+1)}$ is the Vandermonde matrix associated with $x = (x_1, \dots, x_N)^T$, i.e., the matrix whose $j$-th column is $(x_1^j, \dots, x_N^j)^T$, $j = 0, \dots, K$, and $y = (y_1, \dots, y_N)^T$.

Note that the above equation is equivalent to the function you optimized in Exercise 3 of Lab 3 with GD, with $A = \Phi$ and $b = y$.
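As a concrete illustration of this equivalence (a minimal sketch, assuming the classical monomial basis $\phi_j(x) = x^j$; all function names here are illustrative), the MLE solution can be computed through the Normal Equations $\Phi^T \Phi \theta = \Phi^T y$:

```python
import numpy as np

def vandermonde(x, K):
    # N x (K+1) matrix whose j-th column is x**j, j = 0..K
    return np.vander(x, K + 1, increasing=True)

def mle_solution(x, y, K):
    # Gaussian-noise MLE = least squares, solved via the Normal Equations
    Phi = vandermonde(x, K)
    return np.linalg.solve(Phi.T @ Phi, Phi.T @ y)

# Noise-free data from y = 1 + 2x: the MLE recovers theta exactly.
x = np.linspace(0, 1, 20)
y = 1 + 2 * x
theta = mle_solution(x, y, K=1)
print(theta)  # approximately [1. 2.]
```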

Maximum A Posteriori (MAP)

When it is unclear how to set the parameter $K$ and it is impossible to use the error plot, it is required to use the Maximum A Posteriori (MAP) approach. To show how it works, suppose that we know that the parameters are normally distributed, $\theta \sim \mathcal{N}(0, \sigma_\theta^2 I)$. Then we can use Bayes' Theorem to express the a posteriori probability of $\theta$ given $x$ and $y$ as

$$ p(\theta \mid x, y) = \frac{p(y \mid x; \theta) \, p(\theta)}{p(y \mid x)} \propto p(y \mid x; \theta) \, p(\theta). $$

The MAP solution searches for a set of parameters $\theta_{MAP}$ that maximizes $p(\theta \mid x, y)$. Following the same reasoning as before (taking the negative log, which turns the product into a sum and drops the terms that do not depend on $\theta$),

$$ \theta_{MAP} = \arg\min_{\theta \in \mathbb{R}^{K+1}} \frac{1}{2} \| \Phi \theta - y \|_2^2 + \frac{\lambda}{2} \| \theta \|_2^2, $$

where $\lambda = \sigma^2 / \sigma_\theta^2 > 0$ is the regularization parameter.

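The regularized problem above is ridge regression and, as a hypothetical sketch (not the assigned implementation), it admits a closed-form solution via the regularized Normal Equations $(\Phi^T \Phi + \lambda I) \theta = \Phi^T y$:

```python
import numpy as np

def map_solution(Phi, y, lam):
    # MAP estimate under a zero-mean Gaussian prior on theta:
    # solve (Phi^T Phi + lam * I) theta = Phi^T y
    n_params = Phi.shape[1]
    return np.linalg.solve(Phi.T @ Phi + lam * np.eye(n_params), Phi.T @ y)

# Increasing lam shrinks theta toward zero (the prior mean).
x = np.linspace(0, 1, 30)
Phi = np.vander(x, 6, increasing=True)  # degree-5 model
y = 1 + 2 * x
theta_small_lam = map_solution(Phi, y, lam=1e-6)
theta_large_lam = map_solution(Phi, y, lam=10.0)
print(np.linalg.norm(theta_small_lam) > np.linalg.norm(theta_large_lam))  # True
```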
Implementation Task

Given the two optimization problems above, you are required to implement a program that compares the two solutions, which we will refer to as $\theta_{MLE}$ and $\theta_{MAP}$. To do that:

  1. Define a test problem in the following way:

    • Let the user fix a positive integer $K$, and define a true parameter vector $\theta_{true} \in \mathbb{R}^{K+1}$ (you can also consider different choices of $\theta_{true}$);
    • Define an input dataset $x = (x_1, \dots, x_N)^T$, where the $x_i$ are uniformly distributed data points in the interval $[a, b]$, where $a < b$ are values that the user can select;
    • Given a set of functions $\{ \phi_j \}_{j=0}^K$, define the Generalized Vandermonde matrix $\Phi \in \mathbb{R}^{N \times (K+1)}$, whose element in position $(i, j)$ is $\phi_j(x_i)$. In particular, write a function defining the classical Vandermonde matrix, where $\phi_j(x) = x^j$;
    • Given a variance $\sigma^2$ defined by the user, compute $y = \Phi \theta_{true} + \epsilon$, where $\epsilon \sim \mathcal{N}(0, \sigma^2 I)$ is Gaussian distributed noise with variance $\sigma^2$. Try the following experiments for different values of $\sigma^2$. Note that the test problem defined in this way is very similar to what we did to define a test problem in the first Lab.
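The construction above can be sketched as follows (function names and the default values are illustrative assumptions, not prescribed by the assignment):

```python
import numpy as np

rng = np.random.default_rng(42)

def vandermonde(x, K):
    # Classical Vandermonde matrix: element (i, j) is x_i ** j, j = 0..K
    return np.vander(x, K + 1, increasing=True)

def make_test_problem(N, K, a, b, sigma, theta_true):
    # Uniform x in [a, b]; y = Phi @ theta_true + Gaussian noise
    x = rng.uniform(a, b, size=N)
    Phi = vandermonde(x, K)
    y = Phi @ theta_true + rng.normal(0.0, sigma, size=N)
    return x, Phi, y

theta_true = np.ones(3)  # illustrative choice for K = 2
x, Phi, y = make_test_problem(N=50, K=2, a=0.0, b=1.0, sigma=0.1,
                              theta_true=theta_true)
print(Phi.shape)  # (50, 3)
```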
  2. We have now built a dataset $(x, y)$ such that $\theta_{true}$ is (up to the noise $\epsilon$) the best solution to the least squares problem $\min_{\theta} \frac{1}{2} \| \Phi \theta - y \|_2^2$.

  3. Pretend not to know the correct value of $K$. The first task is to try to guess it and use it to approximate the true solution $\theta_{true}$ by MLE and MAP. To do that:

    • Write a function that takes as input the training data $x$ and $y$ and returns the MLE solution $\theta_{MLE}$ (with Gaussian assumption) for that problem. Note that the loss function can be optimized by GD, SGD, or Normal Equations.
    • Write a function that takes as input a $(K+1)$-dimensional parameter vector $\theta$ and a test set $(x^{test}, y^{test})$ and returns the average absolute error of the polynomial regressor $f_\theta$ over the test set, computed as:

$$ Err(\theta) = \frac{1}{N_{test}} \sum_{i=1}^{N_{test}} \left| f_\theta(x_i^{test}) - y_i^{test} \right|. $$
    • For different values of $K$, plot the training data points and the test data points with different colors, and visualize (as a continuous line) the learned regression model $f_{\theta_{MLE}}$. Comment on the results.
    • For increasing values of $K$, use the functions defined above to compute the training and test error, where the test set $(x^{test}, y^{test})$ is generated by sampling new points on the same interval as the training set and generating the corresponding $y^{test}$ with the same procedure as for the training set. Plot the two errors with respect to $K$. Comment on the results.
    • Write a function that takes as input the training data $x$, $y$, and the regularization parameter $\lambda$ and returns the MAP solution $\theta_{MAP}$ (with Gaussian assumption) for that problem. Note that the loss function can be optimized by GD, SGD, or Normal Equations.
    • For $K$ lower than, equal to, and greater than the correct degree of the test polynomial, plot the training data points and the test data points with different colors, and visualize (as a continuous line) the learned regression model $f_{\theta_{MAP}}$ with different values of $\lambda$. Comment on the results.
    • For $K$ way greater than the correct degree of the polynomial, compute the MLE and MAP solutions. Compare the test error of the two, for different values of $\lambda$ (in the case of MAP).
    • For $K$ greater than the true degree of the polynomial, define $\tilde{\theta}_{true}$, where $\theta_{true}$ has been padded with zeros to match the shape of $\theta_{MLE}$ and $\theta_{MAP}$. Compute $\| \theta_{MLE} - \tilde{\theta}_{true} \|_2$ and $\| \theta_{MAP} - \tilde{\theta}_{true} \|_2$ for increasing values of $K$ and different values of $\lambda$.
    • Compare the results obtained by increasing the number of data points $N$.
    • Compare the results obtained by the three algorithms: GD, SGD, and Normal Equations.
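For instance, the average-absolute-error function of step 3 might look like this (a sketch assuming the monomial basis, where $f_\theta$ is evaluated through the Vandermonde matrix of the test points):

```python
import numpy as np

def avg_abs_error(theta, x_test, y_test):
    # Average absolute error of the polynomial regressor f_theta
    # over the test set: mean_i |f_theta(x_i) - y_i|
    Phi_test = np.vander(x_test, len(theta), increasing=True)
    return np.mean(np.abs(Phi_test @ theta - y_test))

# Sanity check: a regressor that fits the test data exactly has zero error.
theta = np.array([0.0, 1.0])        # f_theta(x) = x
x_test = np.array([0.0, 0.5, 1.0])
err = avg_abs_error(theta, x_test, x_test.copy())
print(err)  # 0.0
```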

Note: when the value of a parameter is not explicitly specified, you can set it as you want. Suggestion: repeat the experiments for different values of such parameters.

