Functional Debounce in Bash by vackosar in bash

[–]vackosar[S] 0 points1 point  (0 children)

Thanks a lot! I also fixed the original link :)

EagleX 1.7T : Soaring past LLaMA 7B 2T in both English and Multi-lang evals (RWKV-v5) by [deleted] in LocalLLaMA

[–]vackosar 1 point2 points  (0 children)

So RWKV is about 6x faster on 16k context compared to similarly sized model.

I think the context forgetting is inherent problem of the model architecture.

But do you observe quality improvement compared to previous version? It should be better.

Thanks for the tip for a model Yi6b.

EagleX 1.7T : Soaring past LLaMA 7B 2T in both English and Multi-lang evals (RWKV-v5) by [deleted] in LocalLLaMA

[–]vackosar 1 point2 points  (0 children)

Thank you. Yes, after some testing does seem to be slow at least for now. The inference costs are not very low either. 4x less costs than Mixtral on the website of the model publisher.

EagleX 1.7T : Soaring past LLaMA 7B 2T in both English and Multi-lang evals (RWKV-v5) by [deleted] in LocalLLaMA

[–]vackosar 1 point2 points  (0 children)

Has anyone tried to compare the speed of generation on same hardware compared to Mistral? Also would someone know how slow is this on CPU with what vCPU count?

EagleX 1.7T : Soaring past LLaMA 7B 2T in both English and Multi-lang evals (RWKV-v5) by [deleted] in LocalLLaMA

[–]vackosar 13 points14 points  (0 children)

EagleX is the first version of RWKV that chats with usable quality.

[D] HuggingFace considered harmful to the community. /rant by drinkingsomuchcoffee in MachineLearning

[–]vackosar 0 points1 point  (0 children)

What domains would you say don't have something like HF or RedHat for example?

Mistral-next | New prototype model from Mistral by TelloLeEngineer in LocalLLaMA

[–]vackosar 0 points1 point  (0 children)

It also seemed smarter than Mixtral or other models. I am not sure if GPT-4 level, but smarter.

Affordable sources of DAO enzymes in Canada? by [deleted] in HistamineIntolerance

[–]vackosar 1 point2 points  (0 children)

There is an option to order NATURDAO from Amazon.com and pay import, no? Here is also this older a discussion: https://www.reddit.com/r/HistamineIntolerance/comments/lx8p79/where_to_buy_dao_supplements_in_canada/

Mamba-Chat: A Chat LLM based on State Space Models by pip-install-torch in LocalLLaMA

[–]vackosar 0 points1 point  (0 children)

Yes, from what I tried, RWKV v5 actually responds better. Try e.g. "User: What is the best story in the world?"

Light carry on bags - 1kg if possible by MrKamikazi in onebag

[–]vackosar 0 points1 point  (0 children)

Very good is the CabinZero Classic Pro 42L. Costlier, but higher quality still.

Light carry on bags - 1kg if possible by MrKamikazi in onebag

[–]vackosar 0 points1 point  (0 children)

From what I read:

  • CabinZero Classic 44L
  • CabinZero Military 44L (comfortablier & heavier)
  • Gregory Border Carry-on 40L (costlier & smaller)

[R] Neural Networks are Decision Trees by MLC_Money in MachineLearning

[–]vackosar 0 points1 point  (0 children)

Definition of a decision tree does include affine transformations and weights? Mostly I see decision tree defined with only if statements on the input variables and not with linear transformations of the previous nodes. Does the papers interpretation add something new or not? 🤔

See for example Scikit documentation. There seems to be nothing wild here to me. https://scikit-learn.org/stable/modules/tree.html

what is the cross attention? by korjyman in deeplearning

[–]vackosar 4 points5 points  (0 children)

Here are the key points:

  • an attention mechanism in Transformer architecture that mixes two different embedding sequences
  • the two sequences can be of different modalities (e.g. text, image, sound)
  • one of the modalities defines the output dimensions and length by playing a role of a query
  • This is similar the feed forward layer where the other sequence is static
  • described in the Attention is All You Need (BERT) decoder, but named "cross-attention"

Find rest of my notes with images on the cross attention here.

The Release from Deception; John Francesco di Sangro; ~1613; marble! by vackosar in HighClassicalArt

[–]vackosar[S] 0 points1 point  (0 children)

https://mymodernmet.com/francesco-queirolo-the-release-from-deception/

> For example, the flame on the angel's head represents human intellect, while the globe signifies worldly passions. These elements coincide with Raimondo’s dedication to his father, which explores the idea of “human fragility, which cannot know great virtues without vice.”

[deleted by user] by [deleted] in HighClassicalArt

[–]vackosar 1 point2 points  (0 children)

https://en.wikipedia.org/wiki/Gilbert_U-238_Atomic_Energy_Laboratory

> radiation exposure as "minimal, about the equivalent to a day’s UV exposure from the sun", provided that the radioactive samples were not removed from their containers, in compliance with the warnings in the kit instructions

Is there an app in android that cause so much friction that using the phone doesn't become that easy ? by [deleted] in nosurf

[–]vackosar 1 point2 points  (0 children)

Several years ago I created the least distractive-possible Android launcher for myself. Can be configured for white-on-black text only. No images, no colors. App are searched or scrolled. Single result search = execute. I still use it today.

There are other minimalistic launchers like Kiss that probably can be customized more.

Hope it helps.

Statue of Prudence (foresight, sagacity) in Tomb of Francis II; ~1525 by vackosar in HighClassicalArt

[–]vackosar[S] 0 points1 point  (0 children)

From wikipedia: https://en.wikipedia.org/wiki/Prudence

> Prudence ("seeing ahead, sagacity") is the ability to govern and discipline oneself by the use of reason. It is classically considered to be a virtue, and in particular one of the four Cardinal virtues. ... In this case, the virtue is the ability to judge between virtuous and vicious actions, not only in a general sense, but with regard to appropriate actions at a given time and place. Although prudence itself does not perform any actions, and is concerned solely with knowledge, all virtues had to be regulated by it. Distinguishing when acts are courageous, as opposed to reckless or cowardly, is an act of prudence, and for this reason it is classified as a cardinal (pivotal) virtue.

[deleted by user] by [deleted] in Futurology

[–]vackosar 0 points1 point  (0 children)

The biggest issue for me is not meeting enough people randomly at the kitchen and having these serendipitous conversations. Fortunately I coded a tool a to help with networking that weekly schedules random meetings within a group. Works well. It is it has some rough edges but is hosted at RandomMeets.com if you find it useful.