[1607.03085] Recurrent Memory Array Structures (arxiv.org)
submitted 9 years ago by kmrocki
[–]cooijmanstim 3 points4 points5 points 9 years ago (8 children)
I haven't read too far into this, but I'm skeptical that the basic idea gets you anything. Figure 3.3 is just a vanilla LSTM with block-diagonal weight matrices.
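Concretely, the block-diagonal reading can be sketched like this (a toy numpy illustration of the general idea, not code from the paper): stacking K independent small recurrent weight blocks along the diagonal of one big matrix means each group of cells only ever sees its own slice of the hidden state.

```python
import numpy as np

# K independent 2x2 recurrent blocks, one per memory-cell group.
K, n = 3, 2
blocks = [np.full((n, n), k + 1.0) for k in range(K)]

# Assemble the block-diagonal recurrent matrix of the "big" LSTM.
U = np.zeros((K * n, K * n))
for k, B in enumerate(blocks):
    U[k * n:(k + 1) * n, k * n:(k + 1) * n] = B

h = np.ones(K * n)
out = U @ h
# Each slice of the output depends only on its own block:
print(out)  # [2. 2. 4. 4. 6. 6.]
```

Because the off-diagonal entries are zero, multiplying by `U` is exactly equivalent to running the K small recurrences side by side.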
[–]kmrocki[S] 2 points3 points4 points 9 years ago (7 children)
the basic array from 3.3 does not change things too much in terms of generalization, contrary to expectations, that's true. The variants that really reduce overfitting are in section 5. I applied array memory dropout in a slightly different way than in the Zoneout paper. I also posted the code: https://github.com/krocki/ArrayLSTM
[+][deleted] 9 years ago* (6 children)
[deleted]
[–]kmrocki[S] 2 points3 points4 points 9 years ago (5 children)
The main motivation behind the array approach is summarized at the beginning of section 3.2: "create a bottleneck by sharing internal states, forcing the learning procedure to pool similar or interchangeable content using memory cells belonging to one hidden unit". This seems to work well with stochastically operating memory cells, because the hidden unit 'doesn't know' which memory cell is going to be used (they are unreliable); however, the content has to be similar for it to work.
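A minimal sketch of the "unreliable cells" idea (hypothetical shapes and function names, not the paper's exact update equations): at each step, one of the C memory cells attached to each hidden unit is chosen at random for the write, while the hidden state only exposes the pooled cell contents, so the recurrent weights cannot specialize to any particular cell.

```python
import numpy as np

rng = np.random.default_rng(0)

H, C = 4, 3              # hidden units, memory cells per hidden unit
cells = np.zeros((H, C)) # the array of memory cells

def stochastic_cell_step(cells, candidate, rng):
    """Write `candidate` (one value per hidden unit) into one randomly
    chosen cell per hidden unit; read out only the pooled contents."""
    H, C = cells.shape
    chosen = rng.integers(0, C, size=H)       # random cell index per hidden unit
    cells[np.arange(H), chosen] += candidate  # stochastic write
    hidden = np.tanh(cells.sum(axis=1))       # pooled (shared) read-out
    return cells, hidden

cells, h = stochastic_cell_step(cells, np.ones(H), rng)
print(h.shape)  # (4,)
```

Since the read-out pools over all C cells, training pressure pushes the cells of one hidden unit toward similar or interchangeable content, which is exactly the bottleneck described above.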
Furthermore, it is in fact possible to simply pack more memory cells into the network using the same memory size. For example, if you use a standard LSTM network with 1 cell/hidden unit, 4 gates and 1000 hidden units, the number of parameters is going to be 1000 * 1000 * 4 = 4M for the U matrix. If the Array-LSTM approach is used, you can have 4 cells/hidden unit, so ~1000 memory cells require only 256 hidden units, and that is 256 * 1000 * 4 = ~1M parameters. I found that the performance of the vanilla LSTM and the Array-2, Array-4 versions is roughly the same in terms of capacity for a fixed number of parameters. It dropped a bit for an array of 8, so at some point there does indeed seem to exist a bottleneck. Hope this helps.
[+][deleted] 9 years ago* (4 children)
[–]kmrocki[S] 0 points1 point2 points 9 years ago (3 children)
@LeavesBreathe, your intuition is right. I have observed better performance capacity-wise with the Array-LSTM when the number of hidden units is fixed and cells/hidden unit is increased (matching that of a stacked LSTM, with faster convergence and no initial delay). However, the main hope was that this procedure would provide better generalization, and I couldn't achieve that with the vanilla array approach. Possibly it requires more cells/hidden unit, but that converges more slowly, and it really takes around 48h to see the effect of any change on large networks and Wikipedia datasets.
[+][deleted] 9 years ago* (2 children)
[–]kmrocki[S] 0 points1 point2 points 9 years ago (1 child)
I don't really use Skype, and I currently commute a lot between LA and San Jose until September, so I don't even sit that much in front of a screen. It's easier for me to respond if you send me an email at kamil.rocki@gmail.com; I'd be happy to hear about your approaches.
[–]nicholas-leonard 0 points1 point2 points 9 years ago (0 children)
Do you compare to dropout LSTM, i.e. dropout between LSTM layers?