
Tutorial 2

CS 337 Artificial Intelligence & Machine Learning, Autumn 2019

Week 2, August 2019

Problem 1. Consider a data set in which each data point yi is associated with a
weighting factor ri, so that the sum-of-squares error function becomes

E(w) = (1/2) Σ_{i=1}^{m} ri (yi − wᵀφ(xi))²

Find an expression for the solution w∗ that minimizes this error function. The weights
ri are known beforehand. (Exercise 3.3 of Pattern Recognition and Machine Learning,
Christopher Bishop.)
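
A quick way to sanity-check whatever closed-form w∗ you derive is to compare it against a
purely numerical minimizer of the same weighted objective. The sketch below is only an
illustration: the synthetic data, the design matrix standing in for φ, the learning rate and
the iteration count are all assumptions of this example, not part of the problem.

```python
import numpy as np

# Synthetic setup -- all values here are arbitrary assumptions for illustration.
rng = np.random.default_rng(0)
m, n = 50, 3
Phi = rng.normal(size=(m, n))          # design matrix: row i is phi(x_i)
y = rng.normal(size=m)                 # targets y_i
r = rng.uniform(0.5, 2.0, size=m)      # known per-point weights r_i

def weighted_sse(w):
    """E(w) = (1/2) * sum_i r_i * (y_i - w^T phi(x_i))^2"""
    resid = y - Phi @ w
    return 0.5 * np.sum(r * resid ** 2)

# Plain gradient descent on E(w); the step size is small enough to converge here.
w = np.zeros(n)
lr = 1e-3
for _ in range(20000):
    grad = -Phi.T @ (r * (y - Phi @ w))    # dE/dw
    w -= lr * grad

print("numerical minimizer :", w)
print("E at that minimizer :", weighted_sse(w))
# Plug your derived w* into weighted_sse and check it matches up to tolerance.
```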

Problem 2. Equivalence between Ridge Regression and Bayesian Linear Regression
(with fixed σ² and λ): Consider the Bayesian Linear Regression model

y = wᵀφ(x) + ε and ε ∼ N(0, σ²)

w ∼ N(0, αI) (so λ = 1/α) and w | D ∼ N(µm, Σm)

µm = (λσ²I + φᵀφ)⁻¹φᵀy and Σm⁻¹ = λI + φᵀφ/σ²

Show that wMAP = argmax_w Pr(w | D) is the same as the solution of Regularized Ridge
Regression,

wRidge = argmin_w ||φw − y||₂² + λσ²||w||₂²

In other words, the Bayes and MAP estimates for Linear Regression coincide with the
Regularized Ridge Regression solution.
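
One way to check the claimed equivalence (and your own derivation) is to compute both
quantities numerically on random data: µm from the formula above, and wRidge by directly
minimizing the ridge objective. This is only a sketch; the data, the feature dimension and
the values of λ and σ² below are made-up assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
m, n = 40, 4
Phi = rng.normal(size=(m, n))     # feature matrix phi
y = rng.normal(size=m)            # targets
lam, sigma2 = 0.7, 0.5            # assumed values of lambda and sigma^2

# Posterior mean of the Bayesian linear regression model (formula from the problem).
mu_m = np.linalg.solve(lam * sigma2 * np.eye(n) + Phi.T @ Phi, Phi.T @ y)

# Ridge solution found by direct numerical minimization of
# ||phi w - y||_2^2 + lambda * sigma^2 * ||w||_2^2  (plain gradient descent).
w = np.zeros(n)
lr = 1e-3
for _ in range(20000):
    grad = 2 * Phi.T @ (Phi @ w - y) + 2 * lam * sigma2 * w
    w -= lr * grad

print("mu_m   :", mu_m)
print("w_ridge:", w)              # the two should agree to numerical tolerance
```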

Problem 3. Ridge Regression and Error Minimization:

1. Prove the following Claim:
The sum of squares error on training data using the weights obtained after min-
imizing ridge regression objective is greater than or equal to the sum of squares
error on training data using the weights obtained after minimizing the ordinary
least squares (OLS) objective.
More specifically, if φ and y are defined on the training set D = {(x1, y1), ..., (xm, ym)}
as

φ = [ φ1(x1)  φ2(x1)  ......  φn(x1) ]
    [    .       .               .   ]
    [    .       .               .   ]
    [ φ1(xm)  φ2(xm)  ......  φn(xm) ]            (1)

y = [ y1 ]
    [  .  ]
    [ ym ]                                        (2)

and if

wRidge = argmin_w ||φw − y||₂² + λ||w||₂²

and

wOLS = argmin_w ||φw − y||₂²

then you should prove that

||φwRidge − y||₂² ≥ ||φwOLS − y||₂²

2. If it is the case that ridge regression leads to greater error than ordinary least
squares regression, then why should one be interested in ridge regression at all?
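
As a purely numerical illustration of the claim in part 1, the sketch below fits OLS and
ridge on the same random training set and compares their training errors; the data and
the value of λ are arbitrary assumptions, and the analytical proof is still required.

```python
import numpy as np

rng = np.random.default_rng(2)
m, n = 30, 5
Phi = rng.normal(size=(m, n))                                   # training matrix (1)
y = Phi @ rng.normal(size=n) + rng.normal(scale=0.3, size=m)    # training targets (2)
lam = 2.0                                                       # assumed lambda

# Closed-form minimizers of the two objectives.
w_ols   = np.linalg.solve(Phi.T @ Phi, Phi.T @ y)
w_ridge = np.linalg.solve(Phi.T @ Phi + lam * np.eye(n), Phi.T @ y)

def train_sse(w):
    return np.sum((Phi @ w - y) ** 2)      # ||phi w - y||_2^2 on the training set

print("OLS   training error:", train_sse(w_ols))
print("Ridge training error:", train_sse(w_ridge))   # never smaller than the OLS error
```

Extending this script with a held-out test set is one simple way to start building
intuition for part 2.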

Problem 4. Gradient descent is a very helpful algorithm, but it is not always guaranteed
to converge to a global minimum. Give an example of a continuous function and an
initial point for which gradient descent converges to a value that is not a global minimum.
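
The sketch below shows the kind of construction the problem asks for; it is only one
possible example, not necessarily the intended answer. The function, the initial point
and the step size are all choices made for this illustration.

```python
# f has a global minimum near x = -1 and a shallower local minimum near x = +1.
f      = lambda x: (x ** 2 - 1) ** 2 + 0.3 * x
grad_f = lambda x: 4 * x * (x ** 2 - 1) + 0.3

x = 2.0          # initial point (a choice of this example)
lr = 0.01        # step size
for _ in range(5000):
    x -= lr * grad_f(x)

print("converged to x =", x, "with f(x) =", f(x))
print("but f(-1) =", f(-1.0), "is lower, so this is not the global minimum")
```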

Problem 5. In class, we illustrated Bayesian estimation of the parameter µ of a Normally
distributed random variable X ∼ N(µ, σ²), assuming that σ was known, by imposing a
Normal (conjugate) prior on µ. Now suppose that the parameter µ is known and we wish
to estimate σ². What will be the form of the conjugate prior for this estimation procedure?
If D = {X1, X2, X3, ..., Xn} is a set of independent samples from this distribution, then,
after imposing the conjugate prior, compute the form of the likelihood function L(θ), the
posterior density P(θ | D) and the posterior predictive density P(X | D). Again, you can
ignore normalization factors.
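
Whatever conjugate prior you propose, you can sanity-check the shape of your closed-form
posterior over σ² against a brute-force grid evaluation of likelihood × prior. In the sketch
below the prior is deliberately left as a placeholder to be replaced by your answer, and the
data, the known µ and the grid range are assumptions of the illustration.

```python
import numpy as np

rng = np.random.default_rng(3)
mu, true_sigma2 = 1.0, 2.0
X = rng.normal(mu, np.sqrt(true_sigma2), size=30)   # D = {X1, ..., Xn}, mu known

def log_likelihood(sigma2):
    """log L(sigma^2) = sum_i log N(X_i | mu, sigma^2), dropping constants."""
    return -0.5 * len(X) * np.log(sigma2) - np.sum((X - mu) ** 2) / (2 * sigma2)

def log_prior(sigma2):
    # Placeholder: substitute the (unnormalized) log of your proposed conjugate prior.
    return 0.0

# Unnormalized posterior over sigma^2 on a grid; normalization factors are ignored.
grid = np.linspace(0.1, 10.0, 500)
log_post = np.array([log_likelihood(s) + log_prior(s) for s in grid])
post = np.exp(log_post - log_post.max())

print("grid value maximizing the posterior:", grid[np.argmax(post)])
# Compare this curve (e.g. by plotting) against your closed-form P(theta | D).
```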

Problem 6. Consider a linear model of the form
y(x, w) = w0 + Σ_{i=1}^{D} wi xi

together with a sum-of-squares error function of the form


ED(w) = (1/2) Σ_{n=1}^{N} {y(xn, w) − tn}²

Now suppose that Gaussian noise εi with zero mean and variance σ² is added independently
to each of the input variables xi. By making use of E[εi] = 0 and E[εi εj] = δij σ²
(i.e. E[εi εj] = σ² when i = j and 0 otherwise), show that minimizing ED averaged over
the noise distribution is equivalent to minimizing the sum-of-squares error for noise-free
input variables with the addition of a weight-decay regularization term, in which the bias
parameter w0 is omitted from the regularizer. (Problem 3.4 from Bishop, PRML)
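
A Monte Carlo check of the identity you are asked to derive can be useful: average ED over
many draws of the input noise and compare it with the noise-free error. The data, weights,
σ² and trial count below are assumptions of this illustration; the printed difference is what
your derived weight-decay term should predict.

```python
import numpy as np

rng = np.random.default_rng(4)
N, D = 200, 3
X = rng.normal(size=(N, D))                  # noise-free inputs x_n
t = rng.normal(size=N)                       # targets t_n
w0, w = 0.5, np.array([1.0, -2.0, 0.7])      # fixed weights (assumptions)
sigma2 = 0.2                                 # input-noise variance

def E_D(Xin):
    """Sum-of-squares error (1/2) * sum_n (y(x_n, w) - t_n)^2 for inputs Xin."""
    y = w0 + Xin @ w
    return 0.5 * np.sum((y - t) ** 2)

# Average E_D over many independent draws of input noise eps ~ N(0, sigma^2 I).
trials = 20000
avg_noisy = np.mean([E_D(X + rng.normal(scale=np.sqrt(sigma2), size=X.shape))
                     for _ in range(trials)])

print("E_D averaged over input noise:", avg_noisy)
print("E_D on noise-free inputs     :", E_D(X))
print("difference                   :", avg_noisy - E_D(X))
# Compare this difference with the weight-decay term from your derivation
# (it should involve sigma^2 and w_1..w_D, but not the bias w_0).
```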
