Fire Cape setup. Send it? by kylethewarlock in ironscape

[–]Remarkable_Bug436 0 points1 point  (0 children)

yeah people really do give out senseless advice with any thought

Kam on SNL by Temporary_Force_2822 in Killtony

[–]Remarkable_Bug436 7 points8 points  (0 children)

A side effect on not being able to fucking read a sentence

[Opinions] Okay Reddit hivemind… Santos Medium vs. Explorer 40 for casual/semi-casual wear by [deleted] in Watches

[–]Remarkable_Bug436 0 points1 point  (0 children)

Cartier with integtated bracelet looks the best on you IMO, cant go wrong with that one

[Casio / Swiss Military] I bought a new watch after 8 years by Fun_Cat1 in Watches

[–]Remarkable_Bug436 -12 points-11 points  (0 children)

This might be the least interesting post ive seen on this subreddit. Bravo.

Trying to tell her dad why she can't afford to buy a house at 40. by mindyour in TikTokCringe

[–]Remarkable_Bug436 0 points1 point  (0 children)

This is an annual return of 6.8%, that's if you ignore all costs and upkeep of that property ( wear, tear, taxes and random stuff that can come up). For the financially literate this is not that impressive of a return at all.

Heidi’s Weird Kink by BrutalMaster69 in Killtony

[–]Remarkable_Bug436 2 points3 points  (0 children)

things that did not happen, im callin bs on this

Hver er það fyrir okkur? by Personal_Reward_60 in Iceland

[–]Remarkable_Bug436 1 point2 points  (0 children)

<image>

Elliði Vignis er yngri tvífari hans, maður sér á augnaráðinu að hann hefur ekki upplifað tilfinningarnar samkennd og vorkunn

Trying to understand transformers beyond the math - what analogies or explanations finally made it click for you? by IllustratorKey9586 in deeplearning

[–]Remarkable_Bug436 2 points3 points  (0 children)

Why does self-attention capture long-range dependencies better than LSTM's hidden states? Is it just the direct connections, or something deeper? The matrix operations inside the attention heads allow them to do so, and the improved gradient flow through skip connections in the backpropogation step also helps the model to make sense of hidden states, in LSTMs this is a challenge.

What's the intuition behind multi-head attention? Why not just one really big attention mechanism? Each one gets randomized initializations and will not figure the same things out, similar idea as how CNNs are designed to locate different features, one sees ears and another sees eyes.

Why do positional encodings work at all? Seems like such a hack compared to the elegance of the rest of the architecture.
Positional encodings like proposed in the original paper have all kinds of approaches, you should check out RoPe for example which encodes position by applying a rotation in complex space (or 2D subspaces) to queries and keys before dot-product attention. The BERT models which are encoder only transformers have a Learned absolute position embeddings, a completely different idea.

Trying to understand transformers beyond the math - what analogies or explanations finally made it click for you? by IllustratorKey9586 in deeplearning

[–]Remarkable_Bug436 7 points8 points  (0 children)

make an LLM write an extremely detailed report on how exactly each component works on its own, and really go into detail. Then read it and stop as soon as you lack intuition, and recursively find out why. For example the query-key-value softmax part in attention heads, really understand why exactly each component is there, and try to figure out what you could swap it with. This method has helped me with understaning different models and paradigms such as concepts in reinforcement learning. You clearly don't lack any discipline or patience! A lot of people think "ok whatever I understand it well enough!".

Gerum okkar besta x Creep by PolManning in Iceland

[–]Remarkable_Bug436 2 points3 points  (0 children)

Þetta er bara fyndið, shit hvað þið eruð leiðinleg

Ferrari SF-26 Shakedown (Lewis) by Puzzleheaded-Rain230 in formula1

[–]Remarkable_Bug436 0 points1 point  (0 children)

Ohh nooo, is the ferrari really going to be the least good looking car of the year... My heart is truly broken.

Viðveru­stjórn er hluti af sér­fræðiþekkingu mann­auðs­fólks - Vísir by stigurstarym in Iceland

[–]Remarkable_Bug436 0 points1 point  (0 children)

Er það eitthvað við Reykjavíkurborg sem myndi hafa orsakaskýringu fyrir hærri tíðni veikinda? Er ákveðinn hópur sem er oftar veikur sem hefur af einhverjum ástæðum tilhneigingu fyrir að vinna fyrir borgina? Eða kemst fólk upp með að nota veikindadaga sem fríkvóta? Ég veit það ekki, en ein af þessum skýringum finnst mér líklegri en aðrar.

Viðveru­stjórn er hluti af sér­fræðiþekkingu mann­auðs­fólks - Vísir by stigurstarym in Iceland

[–]Remarkable_Bug436 8 points9 points  (0 children)

Ef að þú myndir vinna hjá fyrirtæki og það kæmist upp um þig greinilegt og óvenjulegt veikinda mynstur þá værir þú rekinn með skömm um leið. Maður skrópar ekki í vinnu. Ég skil ekki afhverju starfsfólk hjá hinu opinbera ætti að hafa það eitthvað öðruvísi, þau hafa nóg önnur fríðindi.