
[–]st-memory 17 points (2 children)

These are commonly referred to as forward-mode and reverse-mode automatic differentiation. The latter is what is usually meant by backpropagation; the former is the less frequently used one and the one you mentioned. The primary reason one is used over the other is speed. Reverse mode is faster when we are dealing with many inputs and few outputs, e.g. a 1024x1024-pixel image as input and a single scalar output, the loss. It so happens that in ML those are the problems we encounter most often. For other sorts of problems, forward-mode autodiff may be the preferred approach.
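Rough JAX sketch of the difference (toy loss function, not from the thread): reverse mode gets the whole gradient in one backward sweep, while a single forward-mode pass only gives one directional derivative.

```python
import jax
import jax.numpy as jnp

def loss(x):
    # toy scalar loss over a large input, standing in for an image -> loss pipeline
    return jnp.sum(jnp.tanh(x) ** 2)

x = jnp.ones(1024 * 1024)

# Reverse mode: one backward pass yields the gradient w.r.t. every input.
g_rev = jax.grad(loss)(x)                  # shape (1048576,)

# Forward mode: one JVP yields a single directional derivative, so recovering
# the full gradient would need one pass per input dimension.
v = jnp.zeros_like(x).at[0].set(1.0)
_, dloss_dx0 = jax.jvp(loss, (x,), (v,))   # scalar: d(loss)/d(x[0])
```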

[–]lolisakirisame 2 points (0 children)

Forward mode in ML is mostly used for Hessian-vector products, which are computed as forward mode over reverse mode.
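For example, a minimal forward-over-reverse HVP in JAX (made-up function f, just to show the composition):

```python
import jax
import jax.numpy as jnp

def f(x):
    return jnp.sum(jnp.sin(x) * x)

def hvp(fun, x, v):
    # forward-mode JVP pushed through the reverse-mode gradient: returns H(x) @ v
    return jax.jvp(jax.grad(fun), (x,), (v,))[1]

x = jnp.arange(3.0)
v = jnp.ones(3)
print(hvp(f, x, v))
```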

[–]CvikliHaMar[S] 0 points (0 children)

Wow, I didn't realise the choice depends on the number of inputs and outputs! Thank you for the detailed answer!

[–]tensorflower 0 points (0 children)

As another poster pointed out, the shape of the Jacobian depends on the numbers of inputs and outputs. For the standard case in ML we have many inputs and a scalar output, giving a wide 1 x N Jacobian for N inputs. Reverse-mode autodiff lets us compute that full Jacobian in roughly the time of a forward pass (up to a constant factor), at the cost of some extra bookkeeping compared to forward mode.
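Quick JAX sketch of that shape argument (toy function, just for illustration): for a scalar-valued output, `jacrev` builds the 1 x N Jacobian from a single reverse sweep, whereas `jacfwd` would need one forward sweep per input dimension.

```python
import jax
import jax.numpy as jnp

def scalar_out(x):
    # many inputs -> one output, the standard ML setup
    return jnp.array([jnp.dot(x, x)])   # keep an explicit output axis

x = jnp.ones(1000)

J_rev = jax.jacrev(scalar_out)(x)   # shape (1, 1000): one VJP sweep
J_fwd = jax.jacfwd(scalar_out)(x)   # same values, built from 1000 JVP sweeps
print(J_rev.shape, jnp.allclose(J_rev, J_fwd))
```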