How is Newton's method equation mapped to a disc?
How does applying the inverse of the Hessian add a quadratic term to the optimization?
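(For context, this is the standard derivation rather than anything specific to the lecture: Newton's method minimizes the second-order Taylor model of f around the current iterate x_k,

$$f(x_k + d) \approx f(x_k) + \nabla f(x_k)^\top d + \tfrac{1}{2}\, d^\top \nabla^2 f(x_k)\, d,$$

and setting the gradient of this quadratic model with respect to d to zero gives the Newton step

$$d = -\big(\nabla^2 f(x_k)\big)^{-1} \nabla f(x_k),$$

which is where both the inverse Hessian and the quadratic term come from.)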
Are methods involving higher-order momentum better suited for non-convex problems?
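(For reference, the plain heavy-ball momentum update that such "higher-order" variants would generalize is

$$v_{k+1} = \beta v_k - \alpha \nabla f(x_k), \qquad x_{k+1} = x_k + v_{k+1},$$

with step size \alpha and momentum coefficient \beta.)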
For non-convex problems, can we do gradient descent with random restarts?
Are algorithms from machine learning applied here (random starts, random walks with k walkers, etc.)?
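(A minimal sketch of the random-restart idea raised in the two questions above; the objective and all parameter values here are made up for illustration, not from the lecture:

```python
import numpy as np

def gradient_descent(f_grad, x0, lr=0.01, n_steps=500):
    """Plain gradient descent from a single starting point."""
    x = np.asarray(x0, dtype=float)
    for _ in range(n_steps):
        x = x - lr * f_grad(x)
    return x

def random_restarts(f, f_grad, n_restarts=20, dim=2, scale=2.0, seed=0):
    """Run gradient descent from several random starts; keep the best result."""
    rng = np.random.default_rng(seed)
    best_x, best_val = None, np.inf
    for _ in range(n_restarts):
        x0 = rng.uniform(-scale, scale, size=dim)  # random initial point
        x = gradient_descent(f_grad, x0)
        if f(x) < best_val:
            best_x, best_val = x, f(x)
    return best_x, best_val

# Example: a non-convex objective with several local minima.
f = lambda x: np.sum(x**4 - 3 * x**2 + x)
f_grad = lambda x: 4 * x**3 - 6 * x + 1
x_best, val_best = random_restarts(f, f_grad)
print(x_best, val_best)
```

Each restart only finds a local minimum; the restarts just raise the chance that one of them lands in the basin of a good one.)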
In the lecture, you mentioned that the sign of the Hessian flips as we move from x_k to x*. I didn't quite get why that is the case. Shouldn't it always be positive, since it is positive semi-definite?
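(A one-dimensional example may help here:

$$f(x) = x^4 - x^2, \qquad f''(x) = 12x^2 - 2,$$

so the second derivative is negative for |x| < 1/sqrt(6) and positive for |x| > 1/sqrt(6). An iterate x_k near 0 therefore sees a negative "Hessian", while at the minimizer x* = 1/sqrt(2) it equals 4 > 0. The Hessian is guaranteed positive semi-definite everywhere only for convex f; on a non-convex problem its sign can flip along the path from x_k to x*.)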
In the video, you said we use the inverse of the Hessian because it appears in the 2nd-order Taylor approximation of f. Would using derivatives of higher order than the Hessian further help create that "bowl-like" structure? And if so, what kind of tradeoffs are there? Specifically, does it help save time?
Is it practical to use third-order or higher descent methods?
How costly would it be to use third-order or higher descent methods? Is it worth it?
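(A back-of-the-envelope cost comparison, not from the lecture: for n parameters,

$$\nabla f \in \mathbb{R}^n, \qquad \nabla^2 f \in \mathbb{R}^{n \times n}, \qquad \nabla^3 f \in \mathbb{R}^{n \times n \times n},$$

so at n = 10^4 that is 10^4, 10^8, and 10^12 entries respectively, before even solving the resulting cubic subproblem. This storage and compute blow-up is the usual argument against going beyond second order at scale.)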