Computing Hessian Inverse Products for Deep Neural Networks to Speed Up Gradient Descent

By rahimiali · 4mo ago · 2 min read · en · Code

Summary

This article presents a GitHub repository that demonstrates how to multiply the inverse of a deep neural network's Hessian with a vector. Equivalently, it solves the linear system Hx = v for x, where H is the Hessian and v is a given vector, more efficiently than a naive solve. The technique builds on Pearlmutter's method for computing Hessian-vector products but targets Hessian-inverse products instead. The author hopes to use it as a preconditioner to accelerate stochastic gradient descent.
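
To make the idea of solving Hx = v without ever forming H concrete, here is a minimal sketch in plain Python. It is not the repository's method. It assumes a small hypothetical toy loss with a hand-written gradient, approximates the Hessian-vector product by central finite differences of that gradient (a stand-in for Pearlmutter's exact trick), and uses conjugate gradient as a generic solver. The names `grad`, `hvp`, and `solve_hessian_system` are invented for illustration.

```python
import math


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


def grad(w):
    """Gradient of a hypothetical toy loss (needs len(w) >= 2):
    f(w) = sum_i (w_i^4 / 4 + w_i^2 / 2) + (w_0 - w_1)^2 / 2.
    Its Hessian is symmetric positive definite everywhere."""
    g = [wi ** 3 + wi for wi in w]
    g[0] += w[0] - w[1]
    g[1] -= w[0] - w[1]
    return g


def hvp(w, v, base_eps=1e-4):
    """Approximate H(w) @ v with a central difference of the gradient.
    Assumption: this replaces an exact autodiff Hessian-vector product."""
    norm = math.sqrt(dot(v, v))
    if norm == 0.0:
        return [0.0] * len(v)
    eps = base_eps / norm  # step length is base_eps regardless of |v|
    g_plus = grad([wi + eps * vi for wi, vi in zip(w, v)])
    g_minus = grad([wi - eps * vi for wi, vi in zip(w, v)])
    return [(gp - gm) / (2.0 * eps) for gp, gm in zip(g_plus, g_minus)]


def solve_hessian_system(w, v, max_iter=50, tol=1e-9):
    """Solve H(w) x = v by conjugate gradient, touching H only through hvp.
    Assumes H(w) is symmetric positive definite."""
    x = [0.0] * len(v)
    r = list(v)  # residual v - H x, with x = 0
    p = list(r)  # current search direction
    rs = dot(r, r)
    for _ in range(max_iter):
        if math.sqrt(rs) < tol:
            break
        Hp = hvp(w, p)
        pHp = dot(p, Hp)
        if pHp <= 0.0:  # curvature not positive: stop rather than divide
            break
        alpha = rs / pHp
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * hi for ri, hi in zip(r, Hp)]
        rs_new = dot(r, r)
        p = [ri + (rs_new / rs) * pi for ri, pi in zip(r, p)]
        rs = rs_new
    return x


if __name__ == "__main__":
    w = [1.0, -2.0, 0.5]
    v = [1.0, 2.0, 3.0]
    x = solve_hessian_system(w, v)
    print("x =", x)
    print("H x =", hvp(w, x))  # should be close to v
```

Each conjugate-gradient iteration costs one Hessian-vector product, so this generic approach can be slow when the Hessian is ill-conditioned. The repository instead describes a direct way to compute the Hessian-inverse product.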

Key quotes

This package shows how to multiply the inverse of the Hessian of a deep network with a vector.
The hope is to soon use this as a preconditioner to speed up stochastic gradient descent.
Pearlmutter showed a clever way to compute the Hessian-vector-product for a deep net.
By contrast, the paper and code in this repo shows how to compute the Hessian-inverse-product, the product of the inverse of the Hessian of a deep net with a vector.
Solving this system naively requires a number o
Snippet from the RSS feed
The Hessian of tall-skinny networks is easy to invert - a-rahimi/hessian
