Gradient descent


The gradient descent algorithm is an optimization strategy that aims to minimize an objective cost function (a measure of prediction error) of a model in order to produce the model that makes the most accurate predictions. Gradient descent is one of the most widely used optimization algorithms in machine learning, and it is usually the first one people learn because of its simplicity and broad usefulness.

As the name suggests, gradient descent can be pictured as descending to the lowest point of a surface in an n-dimensional space, where the lowest point corresponds to the minimum value of the cost function. Gradient descent uses the derivative of the cost function (the gradient, i.e. the slope of the curve) and changes the parameters of the model in small steps whose size is controlled by a learning rate, each step moving towards a point with a smaller cost. Eventually, gradient descent converges to a point where the gradient is close to 0, which corresponds to a (local) minimum of the cost function. The parameters that gradient descent yields form the best model found, since they produce the smallest prediction error.
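
For illustration, a minimal sketch of this idea in Python is shown below. The function being minimized, the learning rate, and the stopping tolerance are example choices rather than part of the algorithm itself.

```python
def gradient_descent(grad, start, learning_rate=0.1, tolerance=1e-6, max_steps=1000):
    """Follow the negative gradient of a one-dimensional function until the steps become tiny."""
    x = start
    for _ in range(max_steps):
        step = learning_rate * grad(x)   # size of the move, scaled by the learning rate
        x -= step                        # move against the gradient (downhill)
        if abs(step) < tolerance:        # gradient is close to 0, so we are near a minimum
            break
    return x

# Example: f(x) = (x - 3)^2 has derivative f'(x) = 2 * (x - 3) and its minimum at x = 3.
minimum = gradient_descent(lambda x: 2 * (x - 3), start=10.0)
print(minimum)  # prints a value very close to 3.0
```

Starting from x = 10, the loop repeatedly steps against the derivative 2(x - 3) and stops once the step size becomes negligible, ending very close to the true minimum at x = 3.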

Gradient descent is an important consideration when designing the training phase of a machine learning algorithm, since training iteratively computes the gradient of the cost function and performs a parameter update in a loop.
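
As an example of such a training loop, the sketch below applies gradient descent to a simple linear regression model trained with a mean squared error cost. The data, learning rate, and number of epochs are hypothetical values chosen for illustration.

```python
import numpy as np

# Hypothetical training data for a single-feature linear model y ≈ w * x + b.
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([3.0, 5.0, 7.0, 9.0])   # underlying relationship: y = 2x + 1

w, b = 0.0, 0.0          # initial parameters
learning_rate = 0.05

for epoch in range(2000):
    predictions = w * x + b
    error = predictions - y
    # Gradients of the mean squared error cost with respect to w and b.
    grad_w = 2 * np.mean(error * x)
    grad_b = 2 * np.mean(error)
    # Parameter update: step against the gradient, scaled by the learning rate.
    w -= learning_rate * grad_w
    b -= learning_rate * grad_b

print(w, b)  # approximately 2.0 and 1.0
```

Each pass through the loop computes the gradient of the cost over the training data and nudges the parameters downhill, so after enough epochs the model's predictions closely match the data.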
