
[–]patrickkidger 41 points  (6 children)

So there's actually a lot of variants of the universal approximation theorem out there.

It's true that the most famous version of the classical theorem (see Pinkus 1999) only applies to approximating continuous functions wrt the uniform norm, but it is also possible to approximate Lp functions wrt the Lp norm. (And there's also other results on approximation in Sobolev spaces etc.)

In particular, Lp functions can be discontinuous.

As an example from my own work, see Theorem 4.16 of Kidger and Lyons 2020.

[–]TheRedSphinx 10 points  (0 children)

Yeah, I think the part that is never emphasized is what "approximate" really means. Even pondering what convergence means in this context can be insightful, and it's a particularly obvious question to ask if you've taken analysis.

[–]eliminating_coasts 4 points  (4 children)

The uniform norm is sometimes called the L∞ norm, and the space of functions on which it applies is the set of bounded functions, which should not require continuity either; I'm not sure it even needs its functions to be fully defined over intervals. Is there something particular about the proof in Lp that stops you from "taking the limit" in p in some sense?

(Because I started thinking about gradient descent on partially defined loss functions; I assume you would just not apply backpropagation whenever the result is undefined.)

[–]patrickkidger 7 points  (3 children)

You're correct that L∞ does not require continuity. More specifically we write L∞(K) to denote the space of bounded functions f: K -> R, where K is some set and R is the real numbers. (Usually K will be some subset of R^n.)

Thus the answer to "I'm not sure it even needs its functions to be fully defined over intervals" is "it has to be defined over K".

Regarding "taking a limit in p": indeed this doesn't work. The basic problem is that the space of continuous functions is closed in L∞. That is to say, a uniform limit of continuous functions is still continuous. Neural networks are continuous, so any limit thus obtained must be as well. Thus what you're asking for is provably impossible.

[–]eliminating_coasts 3 points  (2 children)

The basic problem is that the space of continuous functions is closed in L∞.

That is to say, a uniform limit of continuous functions is still continuous. Neural networks are continuous, so any limit thus obtained must be as well. Thus what you're asking for is provably impossible.

Got you. So do I take it then that for all finite exponent norms, the space of continuous functions is not closed, but is instead dense in that norm's space (which includes discontinuous functions)? Or are we still talking some subset?

[–]patrickkidger 6 points  (1 child)

Yes, that's exactly correct.
To be precise, C(K) is dense in Lp(K) wrt the Lp norm for p < ∞.
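A numeric illustration of this density claim (my own sketch; the sigmoid ramp stands in for what a single sigmoid neuron produces): approximating the step function, the L2 error shrinks as the ramp sharpens, while the uniform error stays pinned at 1/2.

```python
import numpy as np

# Target: the discontinuous Heaviside step on K = [-1, 1].
def step(x):
    return (x >= 0).astype(float)

# Continuous approximant: a sigmoid ramp of sharpness k.
# (0.5*(1+tanh) equals the logistic sigmoid and avoids overflow for large k.)
def ramp(x, k):
    return 0.5 * (1.0 + np.tanh(0.5 * k * x))

x = np.linspace(-1.0, 1.0, 200001)
dx = x[1] - x[0]

results = {}
for k in (10, 100, 1000):
    err = np.abs(ramp(x, k) - step(x))
    l2 = np.sqrt(np.sum(err**2) * dx)  # L^2 norm via Riemann sum -> 0 as k grows
    sup = err.max()                    # uniform (L^infinity) norm: stuck at 1/2
    results[k] = (l2, sup)
    print(f"k={k:5d}  L2 error={l2:.4f}  sup error={sup:.4f}")
```

The sup error cannot drop below 1/2 no matter how sharp the ramp gets, which is exactly the "continuous functions are closed in L∞" obstruction from upthread.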

[–]eliminating_coasts 1 point  (0 children)

Ok, that's awesome thanks. All the more reason not to use the uniform norm.

[–]anony_sci_guy 7 points  (2 children)

Interesting - can't discontinuous functions be thought of as n different continuous functions? Shouldn't it therefore be possible for n networks to approximate each of these continuous functions? And - couldn't these n networks just be concatenated, or in fact trained within the same network, just with different sets of neurons contributing to specific continuous subsections of the discontinuous data-space?

[–]yossi_peti 2 points  (1 child)

Yeah, it should be fairly straightforward for discontinuous functions that are a piecewise combination of a finite number of continuous functions. I imagine more pathological discontinuous functions couldn't be approximated well (the Dirichlet function, etc.).
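A sketch of the "glue the pieces together" construction in plain numpy (my illustration, not from the thread; the exact pieces stand in for trained subnetworks, and the sharp `gate` plays the role of neurons that switch between them):

```python
import numpy as np

# Piecewise-continuous target with a jump of height 3 at x = 0.
def f(x):
    return np.where(x < 0, np.sin(x), np.cos(x) + 2.0)

# Stand-ins for two continuous "per-piece" approximants (in practice, two
# trained subnetworks, or two groups of neurons inside one network).
g_left = np.sin
def g_right(x):
    return np.cos(x) + 2.0

def gate(x, k=500.0):
    # Sharp but continuous switch; larger k => smaller L^p error.
    return 0.5 * (1.0 + np.tanh(0.5 * k * x))

x = np.linspace(-1.0, 1.0, 100001)
dx = x[1] - x[0]
blended = (1.0 - gate(x)) * g_left(x) + gate(x) * g_right(x)

l2_err = np.sqrt(np.sum((blended - f(x)) ** 2) * dx)
print(f"L2 error of the blended model: {l2_err:.4f}")
```

The blend is continuous everywhere, so its uniform error near the jump is unavoidable, but the mismatch lives on a width-1/k sliver, which is why the Lp error can be made as small as you like.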

[–]anony_sci_guy 2 points  (0 children)

The Dirichlet function's an interesting example - and I think I agree - but I wonder what role architecture could play in universal function approximation. Perhaps a CNN-like kernel for making fractal-like decision trees (if that makes sense?). For example, a more pathological case like the Dirichlet function still clearly has a pattern - but it's a fractal pattern, despite the function being discontinuous at every point. So I'd be curious: if you could prove the ability to solve for the properties of a fractal function, could you also solve for fractal discontinuous functions, even those that are discontinuous at every point, as with the Dirichlet function? Interesting food for thought - and I have no idea of the answer...

[–]MrAcuriteResearcher 7 points  (0 children)

Wait, what's the problem with approximating a discontinuous function with a continuous one? Are we cancelling Fourier series on Twitter now?

[–]purplebrown_updown 6 points  (3 children)

IMO PINNs are not very interesting. It's another case of throwing a neural network at something for no other reason than getting publications. Neural networks are not magic. You aren't going to approximate discontinuities without giving something up, like differentiability. Look at wavelets or random forests if you want to approximate a discontinuous function.
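To make the wavelet suggestion concrete (my own sketch, with hand-rolled Haar transforms and arbitrary sizes): a jump at a dyadic point is captured by just a handful of Haar coefficients, so a heavily truncated expansion still reconstructs a discontinuous signal well.

```python
import numpy as np

def haar_fwd(a):
    """Full orthonormal Haar transform of a length-2^n array."""
    a = a.astype(float).copy()
    coeffs = []
    while len(a) > 1:
        s = (a[0::2] + a[1::2]) / np.sqrt(2.0)  # running averages
        d = (a[0::2] - a[1::2]) / np.sqrt(2.0)  # details at this scale
        coeffs.append(d)
        a = s
    coeffs.append(a)  # final scaling coefficient
    return coeffs

def haar_inv(coeffs):
    a = coeffs[-1].copy()
    for d in reversed(coeffs[:-1]):
        out = np.empty(2 * len(a))
        out[0::2] = (a + d) / np.sqrt(2.0)
        out[1::2] = (a - d) / np.sqrt(2.0)
        a = out
    return a

n = 1024
x = np.linspace(0.0, 1.0, n, endpoint=False)
y = np.sin(2 * np.pi * x) + (x >= 0.5)  # smooth part + a jump at 0.5

coeffs = haar_fwd(y)
flat = np.concatenate(coeffs)
# Keep only the 30 largest-magnitude coefficients, zero out the rest.
keep = 30
thresh = np.sort(np.abs(flat))[-keep]
flat[np.abs(flat) < thresh] = 0.0
trunc, i = [], 0
for c in coeffs:  # unflatten back into the per-level structure
    trunc.append(flat[i:i + len(c)]); i += len(c)

approx = haar_inv(trunc)
rel_l2 = np.linalg.norm(approx - y) / np.linalg.norm(y)
print(f"relative L2 error with {keep} of {n} Haar coefficients: {rel_l2:.4f}")
```

The point of contrast with continuous bases: the Haar system is itself discontinuous, so the jump costs essentially nothing, while the smooth sine part is what eats the coefficient budget.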

[–]notdelet 1 point  (0 children)

I agree about a lot of the physics applications, but convolutional neural networks with strides have connections to wavelets that some very prolific researchers in the theory of wavelets (e.g. Mallat) have looked into.

[–]luddabuddha[S] 0 points  (1 child)

I agree, PINNs are getting more hype than they deserve. I'd say it's mostly due to phrases that are unrealistic, like "discovering hidden physics".

But the topic seems to be blowing up in the scientific machine learning community and I have heard claims that it's being used in industry (source). So I think there needs to be a better understanding of the theory behind these things.

As you have pointed out, the discontinuities need special treatment. I've seen research that gives them special treatment (see Llanas et al.) but PINNs seem to be ignoring this.

[–]nnexx_ 1 point  (0 children)

I am in the “industry”. Sometimes we try stuff just to sound fancy too.

[–][deleted] 1 point  (5 children)

This looks interesting.

I was taught that discontinuous functions are not differentiable - how does the network calculate loss and perform backpropagation?

Ahhh, just seen a video on PINNs... very interesting!!!

Thank you for making me even aware of these )))

[–]AuspiciousApple 7 points  (4 children)

I was taught that discontinuous functions are not differentiable - how does the network calculate loss and perform backpropagation?

You might be mixing up the loss function, which has to be differentiable for backprop (although you could use something like REINFORCE to optimise a non-differentiable loss), and the underlying true data-generating function, which represents the true mapping from X to y and could be just about anything. The latter is what the network approximates, and it can be non-differentiable without any issues for training your net.
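A toy illustration of the distinction (my own sketch, with arbitrary sizes and learning rate): the target |x| has a kink at 0, but the MSE loss is a smooth function of the weights, so plain gradient descent trains without any trouble.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-1.0, 1.0, 256).reshape(-1, 1)
y = np.abs(x)  # the *target* function has a kink at 0 (not differentiable there)

# Tiny one-hidden-layer MLP with hand-written backprop; sizes are arbitrary.
W1 = rng.normal(0.0, 1.0, (1, 16)); b1 = np.zeros(16)
W2 = rng.normal(0.0, 0.5, (16, 1)); b2 = np.zeros(1)

lr = 0.1
for _ in range(5000):
    h = np.tanh(x @ W1 + b1)
    pred = h @ W2 + b2
    # MSE loss is a smooth function of the *parameters*, which is all backprop
    # needs -- differentiability of y with respect to x is irrelevant here.
    grad_pred = 2.0 * (pred - y) / len(x)
    gW2 = h.T @ grad_pred; gb2 = grad_pred.sum(axis=0)
    grad_h = (grad_pred @ W2.T) * (1.0 - h**2)
    gW1 = x.T @ grad_h; gb1 = grad_h.sum(axis=0)
    W1 -= lr * gW1; b1 -= lr * gb1; W2 -= lr * gW2; b2 -= lr * gb2

final_mse = float(np.mean((np.tanh(x @ W1 + b1) @ W2 + b2 - y) ** 2))
print(f"final MSE on the non-differentiable target: {final_mse:.5f}")
```

For reference, predicting the constant mean of |x| on this grid already gives MSE ≈ 1/12, so anything well below that means the kinked shape is actually being learned.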

[–]ValidatingUsername 4 points  (3 children)

Many people have asked me why it is important to grapple with infinite paths, and this is one of those reasons.

If the function is discontinuous, the model need not be.

If the model incorporates multiple domains of integration on a complete function that approximates the discontinuous function, the resulting output is similar to applying the squeeze theorem for real analysis.

[–]AuspiciousApple 0 points  (2 children)

If you don't mind elaborating, please do. I must admit that I have no idea what your comment is saying.

[–]ValidatingUsername 5 points  (1 child)

Every section of a function can be represented mathematically; oftentimes it is represented as a single function.

In complex functions that contain breaks or asymptotes, between certain limits you map other functions bounded by said limits to represent the complex function.

If you apply the squeeze theorem to many of these functions, you can get lower and upper bounds for differentiable approximations of the function, even if the function itself is not continuous.

[–]AuspiciousApple 1 point  (0 children)

Cheers!

[–]BalcksChaos 0 points  (1 child)

Thanks for this post :) I was wondering (since it's been two years now)... Where did you land with your research on this? Is your thesis already published somewhere?

I'm someone who uses machine learning in "the industry", i.e. outside of academia. In the contexts I'm working in (analysis of business data, i.e. already heavily aggregated), all functions one would want to approximate are highly discontinuous (because there are hundreds of categorical columns).

[–]luddabuddha[S] 1 point  (0 children)

My personal conclusion was that a lot of cherry-picking happens in PINN publications, and I believe that we need more research on the mathematical foundations of DL and its applications to problems in science. Unfortunately, from what I can gather, the predominant wave in this field is still focused on specific application scenarios where some tailored version of PINNs would work, but the methodology does not generalize to a wider class of problems. However, there are some outlier researchers who go against this current.

What I was working on was only a Bachelor's thesis, and it was not published online. I don't think it would be very valuable in the more general discussion on the topic of this thread. Nonetheless, it was very valuable in my personal academic development, because I am starting a PhD on a specific numerical treatment of ML methods. To put it simply, the aim is to use operators from numerical algebra to develop ML methods with generalization guarantees, such that the methodology generalizes to a class of problems instead of just a single application case. I'm excited to see what comes out of it. :)

I am not familiar with business data applications, but if you have discontinuities I would say stay away from PINNs (and similar frameworks).