
Machine Learning for Econometrics

Week 1 Tutorials Questions

Lecturer: Yi He

2 November, 2023

Question 1: Support Vector Machines

(a) Prove that the following optimization problems on page 8 of the lecture slides generate the same
hard-margin SVM classifier:

Optimization Problem 1
\[
\begin{aligned}
\underset{w,\,w_0,\,M}{\text{maximize}} \quad & M \\
\text{subject to} \quad & Y_i\,(w^{T} X_i + w_0) \ge M \quad \forall i, \\
& \lVert w \rVert = 1.
\end{aligned}
\]

Optimization Problem 2
\[
\begin{aligned}
\underset{\beta,\,\beta_0}{\text{minimize}} \quad & \tfrac{1}{2}\,\beta^{T}\beta \\
\text{subject to} \quad & Y_i\,(\beta^{T} X_i + \beta_0) \ge 1 \quad \forall i.
\end{aligned}
\]

Assume that optimization problem 1 has a solution and that the length of the margin is positive.

(b) Henceforth we only consider the second optimization problem. Differentiate the Lagrange function
\[
\frac{1}{2}\,\beta^{T}\beta + \sum_{i=1}^{n} \lambda_i \left(1 - Y_i(\beta^{T} X_i + \beta_0)\right), \qquad \lambda_i \ge 0,
\]
with respect to $\beta$ to get the first-order condition
\[
\beta = \sum_{i=1}^{n} \lambda_i Y_i X_i.
\]

(c) Let $S = \{i : \hat\lambda_i \ne 0\}$ be the so-called support set, where $\hat\lambda_i$ is the optimal solution of $\lambda_i$ in part (b). By the slackness condition (you do not need to prove this) we also know that
\[
S = \{i : Y_i(\hat\beta^{T} X_i + \hat\beta_0) = 1\}.
\]
Verify that $\hat\beta = \sum_{i \in S} \hat\alpha_i X_i$ for $\hat\alpha_i = \hat\lambda_i Y_i$. The observations $\{X_i : i \in S\}$ are therefore called support vectors.
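
As an illustrative numerical sketch (not a proof), the identity in part (c) can be checked with scikit-learn: for a linear kernel, SVC exposes the products $\hat\lambda_i Y_i$ over the support set through dual_coef_ and the corresponding $X_i$ through support_vectors_. The separable toy data and the very large penalty C used to approximate the hard margin are assumptions made only for this illustration.

import numpy as np
from sklearn.svm import SVC

# Two well-separated Gaussian clouds so that a hard-margin separator exists (assumption).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=3.0, size=(20, 2)),    # class +1
               rng.normal(loc=-3.0, size=(20, 2))])  # class -1
Y = np.array([1] * 20 + [-1] * 20)

clf = SVC(kernel="linear", C=1e10).fit(X, Y)  # very large C approximates the hard margin

# dual_coef_ holds lambda_hat_i * Y_i for i in the support set S; support_vectors_ holds X_i.
beta_from_duals = clf.dual_coef_ @ clf.support_vectors_
print(np.allclose(clf.coef_, beta_from_duals))  # expected: True, matching part (c)
print("support set S:", clf.support_)           # indices with nonzero multipliers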

Question 2: Hinge Loss

Consider a two-class classification problem with a target $Y \in \{-1, 1\}$ and features $X \in \mathbb{R}^d$. Define
$p(x) = P(Y = 1 \mid X = x)$ for all $x \in \mathbb{R}^d$.

(a) Calculate the conditional hinge risk
\[
E[\ell_H(f(X), Y) \mid X = x]
\]
in terms of $p(x)$ and any candidate prediction rule $f : \mathbb{R}^d \to \mathbb{R}$.

(b) Suppose that $p(x) \notin \{0, 1/2, 1\}$. Show that the Bayes classifier
\[
C^{\mathrm{Bayes}}(x) =
\begin{cases}
1 & p(x) > 1/2 \\
-1 & p(x) < 1/2
\end{cases}
\]
minimizes the conditional expected hinge loss in part (a) over all candidate prediction rules $f$.

(c) Show that $C^{\mathrm{Bayes}}(x)$ also minimizes the population classification error
\[
P(Y \ne \operatorname{sign}(f(X)) \mid X = x) = P(Y f(X) \le 0 \mid X = x)
\]
over all candidate prediction rules $f : \mathbb{R}^d \to \mathbb{R}$.
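
One way to build intuition for parts (a) and (b) is the following rough numerical sketch: it tabulates $p(x)\max(0, 1-f) + (1-p(x))\max(0, 1+f)$, a way of writing the conditional hinge risk, over a grid of candidate predictions $f$ and checks that the sign of the minimiser agrees with the Bayes classifier. The value $p(x) = 0.7$ and the grid are arbitrary assumptions.

import numpy as np

def conditional_hinge_risk(f, p):
    # E[max(0, 1 - Y f) | X = x] when P(Y = 1 | X = x) = p
    return p * np.maximum(0.0, 1.0 - f) + (1.0 - p) * np.maximum(0.0, 1.0 + f)

p_x = 0.7                                # assumed value of p(x), here above 1/2
f_grid = np.linspace(-3.0, 3.0, 601)     # candidate predictions f(x)
risks = conditional_hinge_risk(f_grid, p_x)
f_star = f_grid[np.argmin(risks)]

bayes = 1 if p_x > 0.5 else -1
print("risk-minimising f:", f_star, "sign:", int(np.sign(f_star)), "Bayes:", bayes)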

Question 3: Softmax Transformation

Consider again the two-class classification problem from Question 2. Let $\sigma$ denote the sigmoid function given by
\[
\sigma(a) = \frac{1}{1 + \exp(-a)}, \qquad a \in \mathbb{R}.
\]

(a) Prove that the bivariate regression function for the one-hot encoded target satisfies the following representation:
\[
P(Y = 1 \mid X = x) = \sigma(a(x)), \qquad P(Y = -1 \mid X = x) = \sigma(-a(x)),
\]
with
\[
a(x) = \log \frac{P(Y = 1 \mid X = x)}{P(Y = -1 \mid X = x)}.
\]

(b) Using part (a), verify that
\[
\begin{pmatrix} P(Y = 1 \mid X = x) \\ P(Y = -1 \mid X = x) \end{pmatrix}
= \sigma\!\left(\begin{pmatrix} a(x) \\ 0 \end{pmatrix}\right),
\]
where $\sigma$ denotes the softmax transformation.
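
A quick numerical check for part (b) (illustrative, not a proof): the softmax of the vector $(a, 0)$ should return $(\sigma(a), \sigma(-a))$. The sample values of $a$ and the use of scipy.special.softmax are assumptions; any softmax implementation would do.

import numpy as np
from scipy.special import expit, softmax  # expit is the sigmoid function

for a in (-2.0, 0.5, 3.0):                # a few arbitrary log-odds values
    sm = softmax([a, 0.0])                # softmax of the vector (a, 0)
    print(np.allclose(sm, [expit(a), expit(-a)]))  # expected: True each time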

(c) Prove that the Bayes classifier equals the sign of the log-odds function $a(x)$, that is,
\[
C^{\mathrm{Bayes}}(x) = \operatorname{sign}(a(x)) =
\begin{cases}
1 & a(x) > 0 \\
-1 & a(x) < 0.
\end{cases}
\]

Question 4: Cross Entropy


Let $\hat y, y^o \in P_K = \{y \in (0, 1)^K : \sum_{k=1}^{K} y_k = 1\}$ represent two distributions: $\hat y = (\hat y_1, \ldots, \hat y_K)$ is an estimate of the true distribution $y^o = (y_1^o, \ldots, y_K^o)$. Consider the cross-entropy loss given by
\[
\ell_{CE}(\hat y, y^o) = -\sum_{k=1}^{K} y_k^o \log \hat y_k.
\]
Note that $\ell_{CE}(\hat y, y^o) \ne \ell_{CE}(y^o, \hat y)$.

(a) Prove that log x ≤ x − 1 for all x > 0, where the equality holds if and only if x = 1.

(b) Use part (a) to prove that the cross-entropy loss is minimal at the true distribution $y^o$, that is,
\[
\ell_{CE}(\hat y, y^o) \ge \ell_{CE}(y^o, y^o).
\]

When does the equality hold?
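
As a rough numerical illustration of part (b) (the true distribution and the candidate estimates below are arbitrary assumptions), the excess cross-entropy $\ell_{CE}(\hat y, y^o) - \ell_{CE}(y^o, y^o)$ is nonnegative and vanishes at $\hat y = y^o$.

import numpy as np

def cross_entropy(y_hat, y_o):
    return -np.sum(y_o * np.log(y_hat))

y_o = np.array([0.5, 0.3, 0.2])                       # assumed true distribution
for y_hat in (np.array([0.6, 0.2, 0.2]),
              np.array([0.2, 0.4, 0.4]),
              y_o):
    gap = cross_entropy(y_hat, y_o) - cross_entropy(y_o, y_o)
    print(y_hat, "excess cross-entropy:", round(gap, 4))  # nonnegative, zero at y_o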

(c) Use the concavity of the logarithm function to verify that the cross-entropy loss is convex in $\hat y$, meaning that for any two estimates $\hat y, \tilde y \in P_K$ and $\lambda \in (0, 1)$,
\[
\ell_{CE}\!\left(\lambda \hat y + (1 - \lambda)\tilde y,\; y^o\right) \le \lambda\, \ell_{CE}(\hat y, y^o) + (1 - \lambda)\, \ell_{CE}(\tilde y, y^o).
\]
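
Similarly, the inequality in part (c) can be spot-checked numerically; the two candidate distributions and the grid of mixing weights below are assumptions made for illustration only.

import numpy as np

def cross_entropy(y_hat, y_o):
    return -np.sum(y_o * np.log(y_hat))

y_o = np.array([0.5, 0.3, 0.2])
y_hat = np.array([0.7, 0.2, 0.1])
y_tilde = np.array([0.1, 0.6, 0.3])

for lam in np.linspace(0.05, 0.95, 19):
    mix = lam * y_hat + (1.0 - lam) * y_tilde
    lhs = cross_entropy(mix, y_o)
    rhs = lam * cross_entropy(y_hat, y_o) + (1.0 - lam) * cross_entropy(y_tilde, y_o)
    assert lhs <= rhs + 1e-12              # the convexity inequality from part (c)
print("convexity inequality holds on the grid")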

(d) Let $Y \in \{1, \ldots, K\}$ be a target variable from the true distribution $y^o$ such that $P(Y = k) = y_k^o$. Show that the log-likelihood for the parameter vector $\hat y$ is given by
\[
\log L(\hat y \mid Y) = \sum_{k=1}^{K} \mathbf{1}[Y = k] \log \hat y_k.
\]

(e) Compare the expression of the log-likelihood function in (d) with the cross-entropy error. Can
you find a relation between the method of maximum likelihood and minimum cross-entropy?
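
A hedged sketch of the relation asked for in part (e) (the sample size, true distribution, and candidate $\hat y$ below are assumptions): for an i.i.d. sample $Y_1, \ldots, Y_n$ from $y^o$, the average negative log-likelihood of a candidate $\hat y$ equals the cross-entropy between the empirical class frequencies and $\hat y$, so maximising the likelihood amounts to minimising that empirical cross-entropy.

import numpy as np

rng = np.random.default_rng(1)
y_o = np.array([0.5, 0.3, 0.2])              # assumed true distribution
y_hat = np.array([0.4, 0.4, 0.2])            # an arbitrary candidate estimate
n = 10_000

sample = rng.choice(len(y_o), size=n, p=y_o)           # classes coded 0, ..., K-1
freq = np.bincount(sample, minlength=len(y_o)) / n     # empirical distribution of the sample

avg_neg_loglik = -np.mean(np.log(y_hat[sample]))       # -(1/n) * sum_i log y_hat[Y_i]
empirical_ce = -np.sum(freq * np.log(y_hat))           # cross-entropy of y_hat against freq
print(np.isclose(avg_neg_loglik, empirical_ce))        # expected: True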
