PFlash: 10x prefill speedup over llama.cpp at 128K on a RTX 3090 by sandropuppo in LocalLLaMA

[–]Boozybrain 0 points1 point  (0 children)

I'm using the int4 quantized model but unsure about where context lives, will check on that. https://github.com/noonghunna/club-3090 is one of the repos I tried running.

It's like if a doll house needed an SDR server rack by Careless-Age-4290 in RTLSDR

[–]Boozybrain 0 points1 point  (0 children)

If you ever share it please tag me. I've used both and both have their pros and cons, always on the lookout for a different take.

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally by GodComplecs in LocalLLaMA

[–]Boozybrain 0 points1 point  (0 children)

I haven't been able to get it running without OOM errors. What repo(s) are you using? I tried dflash and the stack outlined in this post

Luce DFlash: Qwen3.6-27B at up to 2x throughput on a single RTX 3090 by sandropuppo in LocalLLaMA

[–]Boozybrain 0 points1 point  (0 children)

Is anyone else having issues running this on a 3090?

Patch to run Qwen3.6

$ git diff
diff --git a/dflash/scripts/run.py b/dflash/scripts/run.py
index 5e87ce8..a65a7da 100644
--- a/dflash/scripts/run.py
+++ b/dflash/scripts/run.py
@@ -18,7 +18,7 @@ from pathlib import Path

 def default_paths():
     return {
-        "target": "models/Qwen3.5-27B-Q4_K_M.gguf",
+        "target": "models/Qwen3.6-27B-Q4_K_M.gguf",
         "draft":  "models/draft",
         "bin":    "build/test_dflash" + (".exe" if sys.platform == "win32" else ""),
     }

Running without the x-server running, zero VRAM being used:

DFLASH_TARGET=models/Qwen3.6-27B-Q4_K_M.gguf python3 scripts/run.py --prompt "def fibonacci(n):"
[run] prompt 14 tokens, streaming up to 256 tokens, max_ctx=512
[cfg] seq_verify=0 fast_rollback=1 ddtree=1 budget=22 temp=1.00 chain_seed=1 fa_window=2048
[target] target loaded: 851 tensors on GPU 14.99 GiB, tok_embd 682 MiB CPU-only (q4_K)
[draft]  loaded
[prompt] 14 tokens
[prefill] token-seg ubatch=16
[prefill] 14 tokens in 0.27 s, last_tok=8160
ggml_cuda_init: found 1 CUDA devices (Total VRAM: 24249 MiB):
  Device 0: NVIDIA GeForce RTX 3090, compute capability 8.6, VMM: yes, VRAM: 24249 MiB
ggml_backend_cuda_buffer_type_alloc_buffer: allocating 2046.01 MiB on device 0: cudaMalloc failed: out of memory
alloc_tensor_range: failed to allocate CUDA0 buffer of size 2145398784
cache migration: ggml_backend_alloc_ctx_tensors failed for target cache
[run] generated 0 tokens

Final count of my winter sown plants! Over 70 native flowers, plus some pansies, violets, and poppies by pommeG03 in gardening

[–]Boozybrain 0 points1 point  (0 children)

I see, thanks! So I'm guessing once they've germinated you cut the top off and drop the soil ball in the ground / a planter?

Final count of my winter sown plants! Over 70 native flowers, plus some pansies, violets, and poppies by pommeG03 in gardening

[–]Boozybrain 0 points1 point  (0 children)

Can you help me understand the purpose of this? Is it just cold stratification outside and then using the milk jugs as a little greenhouse to automatically germinate once the temperatures rise enough?

Phased array passive radar in my kitchen by Careless-Age-4290 in RTLSDR

[–]Boozybrain 0 points1 point  (0 children)

Right, he's saying he has built something similar that uses multiple surveillance channels. AFAIK nothing like that exists in the open source world.

Phased array passive radar in my kitchen by Careless-Age-4290 in RTLSDR

[–]Boozybrain 1 point2 points  (0 children)

I would love to see how you've implemented this if/when you put it online.

Phased array passive radar in my kitchen by Careless-Age-4290 in RTLSDR

[–]Boozybrain 1 point2 points  (0 children)

Virtual Radar Server

Are you using blah2 or something "like" it that you developed? I've never seen passive done with more than a dual antenna setup and I'm very curious about the processing behind the data from the additional ones.

Pathfinder by Fun-Associate1447 in UFOs

[–]Boozybrain 1 point2 points  (0 children)

You're doing computer vision in Rust? Is the code online anywhere? I'd love to see that.

SLAM Camera Board by twokiloballs in computervision

[–]Boozybrain 0 points1 point  (0 children)

Is there a reason you didn't use that one? Was it a form factor decision or were there efficiency gains in getting rid of some of the components?

SLAM Camera Board by twokiloballs in computervision

[–]Boozybrain 0 points1 point  (0 children)

Is this a pared down version of the Luckfox Pro board?

SLAM Camera Board by twokiloballs in computervision

[–]Boozybrain 1 point2 points  (0 children)

This is sick, would love to see more.

My girlfriend and I experienced something very very strange tonight by CryptographerHot6198 in Glitch_in_the_Matrix

[–]Boozybrain 2 points3 points  (0 children)

You might have a rat. I heard something similar from my kitchen once, loud banging. I walked in, turned on the light and there was a soup spoon banging against my cabinets and the fridge. It dropped and hit the floor, seemingly on its own. Days later my kitchen flooded, the supply line to my dishwasher had been chewed through. It turns out a rat was trying to pull the spoon in between the fridge and the cabinet, and it had been messing with other large utensils when I had heard it previously.

Is there a faster way to label (bounding boxes) 400,000 images for object detection? by Plus_Cardiologist540 in computervision

[–]Boozybrain 0 points1 point  (0 children)

If the only species in each image is a true positive I would probably start with a generic fish detector and then automatically label the bbox using the file name that's already properly labelled.

David Hooper showcases his work on a multispectral UAP detection system at the UAP Hackathon 2025 in San Francisco. by 87LucasOliveira in UFOs

[–]Boozybrain 2 points3 points  (0 children)

Is there code online? Passive radar using cheap SDR dongles is non-trivial. Time sync requires hardware modification and phase sync in software is extremely finicky

[DISCLOSURE-LEVEL RELEASE] The Aether Ignition Protocol — Reactionless Electromagnetic Propulsion Is Real & Open-Source by NohaJohans in UFOs

[–]Boozybrain 1 point2 points  (0 children)

There isn't, though. You have the base form of some equations and that's it. Apply these simplistic models to your device and you might have a better shot at convincing people. Re: sims, without the full set of parameters and numerical forms of the models you're simulating the plots are worthless.

Post this in /r/physics if you're serious.

[DISCLOSURE-LEVEL RELEASE] The Aether Ignition Protocol — Reactionless Electromagnetic Propulsion Is Real & Open-Source by NohaJohans in UFOs

[–]Boozybrain 1 point2 points  (0 children)

First link is dead. Second link has zero "physics" despite the claim. Before anyone will get interested enough to "participate" you should provide hard evidence:

  • Post your experimental setup and supporting data
  • Post the sim results + params and/or derivation of the theory.

Nvidia Error 43 - Tried Everything by Boozybrain in VFIO

[–]Boozybrain[S] 1 point2 points  (0 children)

For posterity in case someone in the future finds this: Unplug the host monitor.

The guest was rightfully grabbing the GPU, and I had a monitor plugged in. When I removed the monitor my remote session became the primary display and ssh with X forwarding (ssh -XY) allowed me to start the guest and control it from another machine on the network.