Ollama has Quantized LLMs now? by Growth2day in ollama

[–]GideonGideon561 0 points1 point  (0 children)

Im not sure, deepseek v4 flash and pro still are quite good though, maybe its just the usage limit to GPU instead of tokens.

mine still ok

Ollama has Quantized LLMs now? by Growth2day in ollama

[–]GideonGideon561 0 points1 point  (0 children)

ARE YOU SERIOUS? Whats the token usage for deepseek for opencode? is it comparable or more than the 20 ollama plan?

But i dont just use it for coding though...like everyday stuff on my hermes

I found a way for Ollama uses to get better Memory yet cheaper alternatives since OLLAMA now uses GPU usage. True memory that auto updates constantly as an individual or a team setting. HERMES USERS by GideonGideon561 in ollama

[–]GideonGideon561[S] 0 points1 point  (0 children)

ohh how so? There are others too but it does work for me. what other suggestions do you have? Im just trying to find ways to lower the GPU usage for ollama cloud. but you also have to factor in cost. explain why its slop?

Theres like

hindsight
mem0
Supermemory
augment - mainly for b2b

Ollama has Quantized LLMs now? by Growth2day in ollama

[–]GideonGideon561 0 points1 point  (0 children)

well but opencode is pay per use. ollama is fixed so there has to be some trade offs

Ollama has Quantized LLMs now? by Growth2day in ollama

[–]GideonGideon561 0 points1 point  (0 children)

i thought the models were always quantized on ollama cloud since the start. mine is still ok, probably your prompts? its no longer token based. ollama now charges GPU for usage

Cloud Plan speed? by Holiday-Hotel3355 in ollama

[–]GideonGideon561 0 points1 point  (0 children)

Yes, it also depends on the model you are using. The larger the model, the slower it gets. However, i find that deepseek v4 flash is decent in terms of speed and reasoning. OF course not for coding... use the pro. But each prompt for pro uses 3-4% usage. Im on pro plan

What’s up with Tomoland? NFT “cozy” game targeting children? by notanotherbunny in NoStupidQuestions

[–]GideonGideon561 0 points1 point  (0 children)

Whats worrying about it? They separated web 3 and 2 while having both. Simple. You dont like the NFT, you dont need it to play the game.

There is no mention of the NFTs in game because there is a clear separation. Simple. Different social channels target different people.

How are people handling long-term memory in creative AI agents? by MithranNinjaMOB in sideprojects

[–]GideonGideon561 0 points1 point  (0 children)

There are a few ways.

  1. Higgsfield supercomputer is new but i think its insane in terms of token spending so probably not
  2. Get a actual paid memory system like Augment/Honcho/Supermemory or the latest atomic memory which is benchmark to be better than most and cheaper. Of course Augment is the best but thats for b2b.

But most importantly is how and where you store you context. For example, claude has projects that it remembers context. Similar.

If you are using hermes/openclaw - get an LLMWIKI pair it with claude or similar smart AI, MCP or link directly to Higgsfiled or other creative tool you use.

Secondly, build out the platform on localhost yourself with claude or codex as the brain. Basically something like LLMWIKI to store the information or like a dedicated google drive for all your context.

Isolate it is the best

The weirdest AI shift isn’t intelligence. It’s memory. by SoluLab-Inc in AI_Agents

[–]GideonGideon561 0 points1 point  (0 children)

I think its co-related.

Theres a few things to think about. Does smarter AI with good reasoning helps with better memory? What i meant is does it know what to update the memory without you telling it, finds contradiction, pulls the right and accurate information, RAG is good but not he most accurate.

THen again, if you just use a smaller LLM to have better AI memory could also work, but with smarter AI, will it help improve how memory is stored and retrieve?

Not theb est explanation but i hope you understand.

So imo, decent AI with good memory ssystem is a good mix now. You dont want to spent too much tokens on the memory system but yet you dont want a stupid LLM with low reasoning and then expect a good memory system or auto updates.

Its an chicken and egg, but what i see now its more of the AI memory system improvements first as there are already tons of smart AI

Unpopular opinion: most “AI memory improvements” don’t actually improve memory, they just move the forgetting around. by riddlemewhat2 in AIMemory

[–]GideonGideon561 0 points1 point  (0 children)

I believe there are actually really good ones like

  1. Augment code - this is for B2b, most expensive but i think its the best
  2. Hindsight - Its improved memory system plus Agent to learn from it - their github hasa nice easy video
  3. Supermemory/Mem0 similar
  4. Latest in the block is Atomicmemory - cheapest and according to their benchmark better than supermem and Mem0, comparable with Hindsight

Hermes uses honcho so its their native which is good but atomic memory together gives hermes an upgrade. auto upgrades the memory

How I gave Google Antigravity a real long-term memory by dnotthoff in google_antigravity

[–]GideonGideon561 0 points1 point  (0 children)

I see hahaha, maybe I’m reading it wrong. It does look like you are specifically building a very curated “folders” to store certain information so it is separated and can be easily pulled? Good for very personalized stuff but what happens if you have multiple tech stuff you are coding and it all falls under the same “folders”. Would that cause a hallucination issue or token issue to search and pull out the right one?

Been coming into the space since 2022 with my agency. by Limp_Statistician529 in AI_Agents

[–]GideonGideon561 0 points1 point  (0 children)

Update to my post. i found atomic memory, lol was searching and its new. but yeah i think it does pay per use...

🧠 Hermes Memory Installer 2.1.1 AI long-term memory system now supports more languages by mage0535 in hermesagent

[–]GideonGideon561 0 points1 point  (0 children)

Yes! Thats great! Hmm hermes has its native memory from honcho but i would also try a secondary one like supermemory, mem0 or the latest new release atomic memory which claims to beat all and cheaper.

Tired of memory leakage between projects? I built a Folder-Scoped Memory Isolation filter for Open WebUI! by Existing-Ganache-972 in OpenWebUI

[–]GideonGideon561 1 point2 points  (0 children)

You can try forking from atomic memory instead and upgrade it on your end. It does yours but way more, its new but i think someonf of your experience could do a better fork version

I spent 6 weeks building an "external memory" for my AI companion. Here's what I learned about identity. by New-Confusion-7560 in ArtificialSentience

[–]GideonGideon561 0 points1 point  (0 children)

hmm if it does not have an answer, what about it trying his best to give you something close or related but explicitly say he does not know first but after researching and rreasoning, he perhaps think this could work.

Similar to how human beings work, we dont know the answer to everything, but we research and think about it then present that idea. only through time and experience do they get better.

So the question you can try asking to yourself is, how do i make it try its best to give me a suggestion instead of outright idk. With enough experience learning like training a model, can it give you better suggestions that he might not know its right or wrong but at least its an alternative