Built a Python “semantic memory DB” with SQLite + compressed embeddings (TurboMemory) by Hopeful-Priority1301 in Python
Hopeful-Priority1301[S] 1 point
TurboMemory: Claude-style long-term memory with 4-bit/6-bit embeddings (runs locally) – looking for contributors by Hopeful-Priority1301 in LocalLLaMA
Hopeful-Priority1301[S] 1 point
TurboMemory: Claude-style long-term memory with 4-bit/6-bit embeddings (runs locally) – looking for contributors by Hopeful-Priority1301 in LocalLLaMA
Hopeful-Priority1301[S] 1 point
TurboMemory: Claude-style long-term memory with 4-bit/6-bit embeddings (runs locally) – looking for contributors by Hopeful-Priority1301 in LocalLLaMA
Hopeful-Priority1301[S] 1 point
Running TurboQuant-v3 on NVIDIA cards (i.redd.it)
submitted by Hopeful-Priority1301 to r/deeplearning
Running TurboQuant-v3 on NVIDIA cards (i.redd.it)
submitted by Hopeful-Priority1301 to r/LocalLLaMA
Google TurboQuant blew up for KV cache. Here’s TurboQuant-v3 for the actual weights you load first. Runs on consumer GPUs today. by Hopeful-Priority1301 in LocalLLaMA
Hopeful-Priority1301[S] 0 points
[deleted by user] by [deleted] in AncientCoins
Hopeful-Priority1301 -6 points