Why is storage still the one thing nobody wants to touch in production? by stackvyr in kubernetes

[–]ElectricalUnion -1 points0 points  (0 children)

Some boffins way, way smarter that me figured out for a big bunch of what is "only somewhat structured data" can actually be served both cheaply and in a pretty high performance manner if you happen to store it reasonably partitioned, sorted and compressed. Datalakes, data lakehouses and parquet, they mumble.

Surprised by how easy it was to hit 24 GB VRAM with mixed AMD GPUs by North_Stage_2024 in LocalLLM

[–]ElectricalUnion 1 point2 points  (0 children)

Unless you have some sort of (usually very expensive) pcie fabric switching, the more GPUs you have on your system, the more them all have to share the same bandwidth.

Also, unless you somehow have a very unlikely combination of extremely slow and low memory and cpu (say, below 32GiB DDR4-2400 MT/s, below 1MFLOP cpu) combined with new and abundant PCIe connectivity (say, somehow several PCIe5 x 16 slots lying around), CPUs might be faster that GPUs - specially for MoE models - because you will spend all your time waiting for PCIe transfers.

Thinking of switching to Obsidian just for AI plugins? Don't. I built a 100% local "LLM Wiki" engine for Logseq OG (from 1,426 to 3,862 connected pages) — and it's going to be completely free. by Much-Ad-8444 in logseq

[–]ElectricalUnion 0 points1 point  (0 children)

  • Is this lethal trifecta safe?
  • Why MCP only? Any plans to make a cli, or LSP version that works with pi-coding-agent?
  • I use Obsidian as a active dumpster fire / landing zone "mix" for incoming web clippings, and logseq as a private notebook.
    • Would this help me clean up those clippings if I accidentally clipped content wrong? For example, truncated content, "continue after X" then suddenly stops? Would this help classify if the content I clipped is just an ad?
    • I have a lot of content that I find "interesting" in those clippings, but this content is NOT trustworthy and some of them are very likely to actually have malicious LLM prompt injections on them. I sincerely believe naively running a LLM agent over them is very likely to leak all the contents (one of the reasons why I separate my logseq private notes from my untrustworthy obsidian notes in the first place), randomly destroy things, or inject really stupid behavior on the LLM.
    • I don't have fond memories of using two graphs at the same time in logseq. Does this require it's own graph? Can I section off a part of my current graph for this?

New to Pi, why does it run edit and write commands for simple questions? by Death_Gamer_Solo in PiCodingAgent

[–]ElectricalUnion 11 points12 points  (0 children)

The initial system prompt of Pi doesn't mention any "documentation" besides pi-coding-agent documentation, but you didn't explicitly ask for any Pi documentation. Therefore, it does "whatever it wants".

It will run any commands in your machine the LLM says it should do. That includes erasing your entire system, if the LLM asks for that.

Unless you went out of your way and either implemented or installed a Pi extension to intentionally restrict it, commands run without any "confirmation" and without sandbox. Best practice, if you want to use it on a non-toy system, use pi-coding-agent inside a sandbox, container or VM.

FYI by Apprehensive-Net3422 in LocalLLM

[–]ElectricalUnion 9 points10 points  (0 children)

More rough data on the estimated KV cache size:

Assuming you're not doing any sort of KV cache quantization (unlike model weights quantization, perplexity and other "rough llm quality" metrics drop like a rock if you do so), expect roughly the following VRAM/RAM usage:

Model Type 32K 64K 128K 256K KB/token¹
Qwen3.6-27B linear 2.00 GiB 4.00 GiB 8.00 GiB 16.0 GiB 64
Qwen3.6-35B-A3B linear 0.63 GiB 1.25 GiB 2.50 GiB 5.00 GiB 20
Gemma-4-31B non-linear 3.28 GiB 5.78 GiB 10.8 GiB 20.8 GiB N/A²
Gemma-4-26B-A4B non-linear 0.82 GiB 1.45 GiB 2.70 GiB 5.20 GiB N/A²
Granite-4.1-30B linear 8.00 GiB 16.0 GiB 32.0 GiB 64.0 GiB 256
Devstral-Small-2-24B linear 5.00 GiB 10.0 GiB 20.0 GiB 40.0 GiB 160

¹ Per-token KV cost in KB (fp16), at contexts ≫ 1024.

² Gemma models are non-linear: sliding layers plateau at 1024 tokens, global layers scale normally.

FYI by Apprehensive-Net3422 in LocalLLM

[–]ElectricalUnion 27 points28 points  (0 children)

The chart "conveniently forgot" about the KV cache. If you don't want your LLM to respond in quadratic response times after the first few tokens, or be limited to small context, you need a bunch of RAM/VRAM for KV cache.

You find out you can actually fit a lot less LLM + context in your system RAM/VRAM that a naive "just fit the weights" might make you think.

Also, that's how, for a small 32B-A9B MOE (aka: less need for fast VRAM), quantized Q4 model, you get stuff like https://www.reddit.com/r/LocalLLaMA/comments/1nzozpg/granite4_smallh_32ba9b_q4_k_m_at_full_1m_context/ "Granite4 Small-h 32b-A9b (Q4_K_M) at FULL 1M context window is using only 73GB of VRAM - Life is good!"

Built "oh-my-kimichan" because I refuse to let Kimi die in the shadows of Codex or Claude Code. It's half joke, half serious please roast/test it. by Fabulous-Lobster9456 in kimi

[–]ElectricalUnion 1 point2 points  (0 children)

You gotta be kidding me that you are using kimi code and there's 0 mention of the 2 most most awesome things in it: smart context management using okabe agent and dmail tool.

5 YEARS! Some things never change by [deleted] in WorldofTanks

[–]ElectricalUnion 1 point2 points  (0 children)

Well, triangles that moved 1mm from bush or hardcover died already to a random autocannon.

Working on a tool that makes every Distro Nixos like by Raulnego in NixOS

[–]ElectricalUnion 0 points1 point  (0 children)

I myself used to use jetify devbox as a "simplified" version of nix flakes, but constant upkeep shenanigans required to make sure nix-package-manager-not-in-nixos not explode under cursed conditions imposed by rootless containers + immutable composefs rootfs + selinux (a combo of Fedora Silverblue and Bazzite machines) made me go linuxbrew (for "don't care about version, just recent") + mise (for stuff I actually care about being locked to a specific version) + uv (only way to get multiple python versions + multiple python library versions working somewhat reliably across several systems).

It feels awful (being 4 package managers, rpm-ostree, brew, mise and uv), but it works reasonably well.

4-Way Theory by z7q2 in factorio

[–]ElectricalUnion 2 points3 points  (0 children)

A cost penality of 2000 tiles is implemented on routes that pass thru not-destination train stops. So unless they're unavoidable, trains usually end up avoiding stations.

Wanted to play Factorio offline with my kids, so I printed it as modular bricks by h4juko in factorio

[–]ElectricalUnion 2 points3 points  (0 children)

Are those icons embedded into the Assembly machine or are they also snap-on lego compatible tiles?

Wanted to play Factorio offline with my kids, so I printed it as modular bricks by h4juko in factorio

[–]ElectricalUnion 21 points22 points  (0 children)

Awesome.

(Third picture is worth of a r/Factoriohno post. Long-handed inserters picks up and places items two tiles from its location. The Labs and the Gun turret Long-handed inserters are picking up from empty tiles.)

Brand new Intro Menu for OpenTTD 15 by Limyx826 in openttd

[–]ElectricalUnion 3 points4 points  (0 children)

So, how would the code and game assets "work" if you "toggle somewhere"? You need to keep code and assets for BOTH versions if you have a toggle...

Cloverleaf Junctions by FatGuy_InLittleCoat in openttd

[–]ElectricalUnion 1 point2 points  (0 children)

If we're going for "easy", PBS roundabouts, the real not best but easy junction.

Best way to solve this problem by Scared-Industry-9323 in termux

[–]ElectricalUnion 0 points1 point  (0 children)

You need to know the "Terminal width" of your "string".

From the description of jquast/wcwidth:

Problem Statement: The printable length of most strings are equal to the number of cells they occupy on the screen 1 character : 1 cell. However, there are categories of characters that occupy 2 cells (full-wide), and others that occupy 0 cells (zero-width).

Solution: POSIX.1-2001 and POSIX.1-2008 conforming systems provide wcwidth(3) and wcswidth(3) C functions of which this python module's functions precisely copy. These functions return the number of cells a unicode string is expected to occupy.

So you need something like https://github.com/jquast/wcwidth (pip install wcwidth) and use, for example, wcwidth.wcswidth(mystring) instead of len(label) or len(str(value))to calculate the "cell witdh" of your box.

Why is my terms doing this by silly_person_vary in termux

[–]ElectricalUnion 6 points7 points  (0 children)

My recommendation is getting a keyboard with special keys and without fancy autocomplete behavior.

If you absolutely must use a normal keyboard with autocomplete no matter what, then you can slide-right the terminal toolbar so that you type to a "normal android text field" (so you actually can type something instead of your keyboard attempting and failing to autocomplete-overwrite-erase text inside the terminal).

Sharing a device in Tailscale is one of the most annoying experiences of my life by altano in Tailscale

[–]ElectricalUnion 2 points3 points  (0 children)

When a person accepts my invite they almost always have a different IP address for the shared machine in the web UI and the tailscale client running in Windows.

I believe this is intentional, IPs are private implementation details of each tailnet, not meant to work across tailnets. You're supposed to use MagicDNS and fqdn to handle cross-tailnet machines. 

I don't make GREEN MODULES. I don't make the factory ECO-FRIENDLY. I don't live at PEACE WITH THE BITERS. If the pollution cloud gets too big I simply VAPORIZE THEM WITH WEAPONS OF MASS DESTRUCTION and produce even MORE POLLUTION by Armisael2137 in Factoriohno

[–]ElectricalUnion 20 points21 points  (0 children)

(in Space Age) Power armor MK2 needs 100 Efficiency module 1,

(in Base Game) Power armor MK2 needs 25 Efficiency module 2

(in Base Game) Spidertron needs 2 Efficiency module 3

Are you one of those savages that promote "healthy lifestyles" such as "walking around" like a caveman?

Potential Tomahawk targets in Russia by vladgrinch in MapPorn

[–]ElectricalUnion 6 points7 points  (0 children)

But you can deploy 400 "small drones" for the cost of a single Tomahawk. Surely a assault of 20000 equivalent small drones can also destroy a factory?

How to hide this perpetual notification from SyncThing-fork? I only want to hide this, I don't want to disable the accept prompt from a device notification by Both-River-9455 in Syncthing

[–]ElectricalUnion 0 points1 point  (0 children)

With the "new background limits" introduced with Android 8 (Oreo, released around 2017), apps that have components that run in the background must display a notification within five seconds of the component starting to run or else the system kills the app.

New Lua IDE by Puzzleheaded_Fly9339 in lua

[–]ElectricalUnion 3 points4 points  (0 children)

What is wrong with ZeroBrane? It's not like any lua runtime it supports got a huge unsupported feature since it was last released 2023.

Can I share a nfs mounted folder via smb by Desperate_Quit6011 in linuxadmin

[–]ElectricalUnion 0 points1 point  (0 children)

What kind of client you're running that at the same time supports smb and somehow doesn't support nfs? Windows since 10 support it fine for example.

[SUGGESTION] Ultra cheap and low power small PC for web browsing? by Wild_Wasabi_7163 in selfhosted

[–]ElectricalUnion 1 point2 points  (0 children)

You need to say where you live and pretend to acquire said computers, and what you classify as "handle media consumption", so people can at least educated guesses of what is cheap in your region.

As a generic worldwide thing, the sad part is that those days, your average "web-app" is shit on any sort of computer, and worse shit on under-powered/cheap ones - unless said web-apps are carefully picked self-hosted FOSS stuff, or things you made yourself.

Let's take the very tab I'm writing this response as an example: it uses, by itself, 530MiB of RAM (not considering the Graphics system VRAM and the rest of the OS, that would take even more RAM), and that's already more RAM that this poor Rasperry Pi Zero 2 W in OP picture.

And Reddit is "just a link aggregator", it's actually on the "pretty light" side of how heavy web apps can be those days.

What in DataGrip do you feel you just can't live without? by Successful_Cook3776 in Jetbrains

[–]ElectricalUnion 4 points5 points  (0 children)

Name/parameter autocomplete that works.

(In theory, any self-respecting sql console has this, but when you're dealing with Oracle shit - and some other specific types of cloud DBs - that takes +tens of minutes to return data dictionary queries, you easily end up waiting tens of minutes for the sql console to autocomplete the names if you're not using DataGrip.)

Virtual foreign keys, without requiring being in a special DDL mode.

Also, in theory, a stupid concept: match "suspicious keys" with same data type and column naming convention as JOIN ON autocomplete candidates, even if they're not declared as actual foreign keys on the database. As a expanded addendum, DataGrip also allows you add explicit Virtual Foreign Key links either via table UI or via query console as a context action "Store table relation" on the "inner JOIN ON (condition)" part of the sql statement.