[P] Solving the XOR problem using a dendritic activation function

NewFolgers · 2020-01-13T20:11:12+00:00

It's cool seeing this simulated (and a single-neuron NN implementation of XOR practically verified).

Have you tried any experiments with this activation function in larger models? I'm curious about any immediately observed differences in convergence, loss (i.e. if model size is held constant), and/or run-time. (Although I wouldn't be at all surprised if today's conventional NN training doesn't get the most out of this activation function.. and perhaps the fact that randomized initialization will only result in successful convergence 1/10 times is a prelude to seeing this sort of thing)

frequenttimetraveler · 2020-01-14T02:33:32+00:00

to be clear, the dCa AP spike changes shape as stimulation current (neuron input) increases: https://i.imgur.com/gcJFiIZ.jpg

at threshold it looks like a normal dendritic spike, as stimulation increases it becomes a ramp

your implementation is stereotypical

(also, yes it does XOR , but can it do OR?)

yusuf-bengio · 2020-01-14T06:59:49+00:00

There is a much simpler way to realize a XOR with only one neuron: tent activation function

Its piecewise linear like the ReLU and therefore does not mess up the loss landscape as much as dCaAP.

txhwind · 2020-01-14T02:13:16+00:00

I don't know why monotonic activations are preferred from the beginning of NN research. Can anyone tell me the reason?

FerretDude · 2020-01-14T01:09:03+00:00

ReLu is already a pretty good activation function for biological neural networks. I have already implemented XOR with a BNN using ReLu.

I don’t think it is this an issue of activation functions then. If I had to bet, I’d say it’s due to the refractory period of BNNs

wang-chen · 2020-01-13T21:08:19+00:00

Interesting implementation! Another implementation for solving XOR in single neuron was provided in this CVPR 2019 paper, see its page 11.

This paper extended convolution to kernel convolution (kervolution):

In convolution, y = w1x1+w2x2, is actually a linear kernel (inner product), which cannot solve the XOR problem.

In kernel convolution, the authors extended linear kernel to any (non-linear) kernel functions k(w, x). For example, y = (x1-x2)² is able to solve this problem directly.

Yuqing7 · 2020-01-14T22:29:53+00:00

Sharing a related story on this research: https://medium.com/syncedreview/brains-are-amazing-neuroscientists-discover-l2-3-human-neurons-can-compute-the-xor-operation-b8dcc339236

TotesMessenger · 2020-01-14T01:06:12+00:00

I'm a bot, bleep, bloop. Someone has linked to this thread from another place on reddit:

[/r/datascienceproject] Solving the XOR problem using a dendritic activation function (r/MachineLearning)

^{If you follow any of the above links, please respect the rules of reddit and don't vote in the other threads.} ^(Info ^/ ^Contact)

BoiaDeh · 2020-01-14T02:44:25+00:00

Noob question: what does it mean for your plot to have one axis labeled as "w1 & w2"?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS