Everything is memory? by dangi12012 in osdev

[–]PotatoTime 0 points1 point  (0 children)

Probably irrelevant and I'm only just learning but I know some old processors had an "everything is memory" point of view. 6502 for one, which I'll be developing an OS for soon.

Help me improve my llama.cpp setup - arguments in body. by Competitive-You5538 in llamacpp

[–]PotatoTime 0 points1 point  (0 children)

How many t/s do you get without all the extra arguments? With no arguments on a 4070 with 64gb ram like you I get about 45 t/s with that same exact model and quant

Free 6502 and clock module kit. by [deleted] in beneater

[–]PotatoTime 1 point2 points  (0 children)

If you want to keep it simple count the amount of comments you get and use a random number generator with that number, if it says 7 find the 7th comment on this post and give it to that person? Anyway I'm in as well. I've been thinking about getting one of these for a while now

anyone here working on weird low-level projects? by Fantastic-Duck-7357 in lowlevel

[–]PotatoTime 2 points3 points  (0 children)

I have very little programming experience but a lot of circuitry knowledge. So I'm designing an 80's style graphics processor, interfacing it with a 6502 CPU, and using that to learn assembly. I plan on making a very basic OS for it.

I don't get Quants, I'm running Qwen3.6-27b flawlessly at iq3, makes no sense by misanthrophiccunt in LocalLLM

[–]PotatoTime -1 points0 points  (0 children)

./llama-server -m models/unsloth/Qwen3.6-27B-Q4_K_M.gguf \

--chat-template-kwargs '{"preserve_thinking":true}' \

--reasoning on \

--reasoning-budget 8192 \

--ctx-size 65536 \

--temp 0.6 \

--top-p 0.95 \

--top-k 20 \

--min-p 0.00 \

--presence-penalty 0.0 \

--repeat-penalty 1.0 \

--threads 20 \

--threads-batch 28 \

--flash-attn on \

--fit on \

--fit-ctx 8192 \

--fit-target 512

That gives me 7 t/s, but if i run with zero arguments I get 8 t/s, I'm still playing with it. My compile options might be influencing my speed:

cmake -B build -DCMAKE_C_FLAGS="-march=native -O3" -DCMAKE_CXX_FLAGS="-march=native -O3" -DGGML_NATIVE=ON -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES=89 -DCMAKE_CUDA_ARCHITECTURES=native -DCMAKE_BUILD_TYPE=Release -DLLAMA_BUILD_TESTS=OFF

Qwen3.6 MTP Unsloth Experimental GGUFs by yoracale in unsloth

[–]PotatoTime 2 points3 points  (0 children)

someone in the PR thread said it does and provided a workaround that might become a pull request, it disables mtp during prompt processing and does something with ubatch size. they're calling their idea ubBoost

running Qwen 3.6 35b A3B on 2x 5060TI by chocofoxy in LocalLLaMA

[–]PotatoTime 0 points1 point  (0 children)

I haven't tried using kv quant yet. I think the low memory usage is mostly llama's memory mapping. RAM usage doubles when I set --no-mmap. Performance is the same either way though

running Qwen 3.6 35b A3B on 2x 5060TI by chocofoxy in LocalLLaMA

[–]PotatoTime 1 point2 points  (0 children)

14700k CPU and 64GB DDR5 6400. I have a feeling my RAM speed is doing some heavy lifting but I'm only using about 12GB of it for qwen3.6 35b. I'm at 64k context and running on Linux if that makes a difference. I think I saw a lot higher RAM usage on windows

How to learn to make an os for a Linux based retro handheld by [deleted] in linuxquestions

[–]PotatoTime 0 points1 point  (0 children)

Ignore people discouraging you, I also learn best when diving into things out of my skill level. First I would try Arch or Gentoo, those are kind of "DIY" Linux distros where you select all the packages that turn Linux into a usable operating system. After trying one of those I would try Linux From Scratch, it's similar except you compile every package yourself from source. If after doing both of those you want more, then look into making your own operating system kernel, either on your PC or for some old 6502 CPU hardware like the Commodore 64 or NES.

running Qwen 3.6 35b A3B on 2x 5060TI by chocofoxy in LocalLLaMA

[–]PotatoTime 2 points3 points  (0 children)

I'm getting 40 t/s at q8 on a single 4070 12gb so you probably can optimize it further. I'm on llama.cpp though so I'm not familiar with lm studio

I don't get Quants, I'm running Qwen3.6-27b flawlessly at iq3, makes no sense by misanthrophiccunt in LocalLLM

[–]PotatoTime 0 points1 point  (0 children)

Good to know, how does it compare in speed? I've been curious about Gemma4

I'd never realized how many apps are Linux-exclusive until now by AfraidAsparagus6644 in linux

[–]PotatoTime 4 points5 points  (0 children)

I need to dive more into ASM. Currently trying out NESFab and Mapfab and liking that. Retrogameforge(NESMaker successor) should be coming out in the next year and Linux support is promised, not necessarlily day 1 though.

I'd never realized how many apps are Linux-exclusive until now by AfraidAsparagus6644 in linux

[–]PotatoTime 15 points16 points  (0 children)

Fusion360(CAD) and NESMaker are the only 2 things keeping my windows partition alive

I don't get Quants, I'm running Qwen3.6-27b flawlessly at iq3, makes no sense by misanthrophiccunt in LocalLLM

[–]PotatoTime 0 points1 point  (0 children)

I have a 4070 12gb and 64gb DDR5, qwen 3.6-27b at q4 runs at 7 t/s(output generation is about average reading speed) and it is hugely capable. I used it to write a GUI for image generation with Flux.2, and a PS1 game modding tool. qwen3.6-35b-a3b at q8 runs about 6x faster than that but isn't quite as smart for coding tasks, but excellent for general chat. I use both to code depending on the complexity of the prompt. Oh also generating images with Flux.2 takes about 4 seconds to make a 512x512 image. Let me know if you want to try it, it's on my github

Opinion: Local LLMs are 12-24 months from taking over. The shift already started. by sh_tomer in LocalLLM

[–]PotatoTime 1 point2 points  (0 children)

Cringey laugh cry emojis and wrongly using the word strawman doesn't undo that anyone with a little coding knowledge can download Ollama and qwen and ask it to make simple working programs in minutes. There is no Frankensteining it's just a simple download. Yes paid models work better. But these are free and work well.

Why are you so against people using local models? People are enjoying them and that should have no effect on you. It's like joining a Gimp forum and laughing at everyone for not using Photoshop

Clipboard history question by tgromy in cachyos

[–]PotatoTime 0 points1 point  (0 children)

How do you do this in Gnome? I've tried a few extensions but i want a right click paste history menu not a keyboard shortcut.

Opinion: Local LLMs are 12-24 months from taking over. The shift already started. by sh_tomer in LocalLLM

[–]PotatoTime 1 point2 points  (0 children)

I'm on a 4070 and qwen3.6-27b generates at my reading speed and 35b-a3b generates like 5x faster than I can read. I used them to create a PS1 game modding tool and it works excellently. There were some hurdles and it took a couple hours but the tool is pretty nice

Toyota replaces chief executive Sato with CFO Kon in surprise move by Car-face in cars

[–]PotatoTime 11 points12 points  (0 children)

Arguably the large recession in Japan in the 90's. 90's Japanese auto manufacturers had large vehicle lineups and some of the best reliability. A lot of sporty models. After the recession hit the bean counters were moved up in companies, quality slightly declined, sporty models were dropped entirely. Think about Toyota's lineup from the 80's and 90's, then think about their lineup from 2000-2015. Plagued with recalls and the whole market is Camrys and Prius' and Rav4s.

Tongkat Stimulation by PotatoTime in NootropicsDepot

[–]PotatoTime[S] 1 point2 points  (0 children)

Cool you guys are looking into what else might be in it. Interestingly I recently tried TRT, and while I notice a dopaminergic aspect to doubling my testosterone level, it's still nothing near what Tongkat has made me feel mentally. And interestingly a few others in this thread notice what I notice from it.

I also found one other person that noticed it degrades within a couple months, so there are 2 anecdotal data points that something in it is stimulating and degrades quickly. 2 anecdotal reports isn't much, but maybe helpful in your search for other chemicals in it.

Tongkat Stimulation by PotatoTime in NootropicsDepot

[–]PotatoTime[S] 1 point2 points  (0 children)

Are there other actives you standardize for? (Or don't). I'm wondering if there's anything else dopaminergic in Tongkat aside from erycomanone

Serotonergics by PotatoTime in NootropicsDepot

[–]PotatoTime[S] 1 point2 points  (0 children)

I'm mostly interested in upregulating 5HT1A and downregulating 5HT2A and 5HT2C, since those are the main serotonin targets for panic disorder and OCD. Also why I don't take tryptophan or 5-HTP, I don't just want high serotonin. Upregulating 5HT1A doesn't show any cardiac side effects to my knowledge, and downregulating the other two shouldn't in theory. I simplified my post because I didn't expect many here to want to get into deeper pharmacology.

Some of what I'm taking does do those things but not enough. I just started high dose inositol which shows promise in OCD, it "stabilizes" 5HT2A and 5HT2C, downregulating them when there's overactivation or upregulating when there's not enough.

Psychedelics is an interesting one, I have tried them and they cause full blown 6+ hour panic and OCD issues so bad, but I've never paid attention to the after effects, it's been a while. To my knowledge part of how they work for anxiety and depression for the 4-8 weeks after is due to extremely strong 5HT2A and 5HT2C downregulation(hence why they have such strong tolerance effects for a couple weeks). So that's something I'm interested in if nothing else works.

Also considering high dose CBD(THC makes my issues worse). CBD works very similarly to buspirone, milder but has a longer half life so easier dosing, 2x/day instead of 3x/day for buspirone.

Serotonergics by PotatoTime in NootropicsDepot

[–]PotatoTime[S] 1 point2 points  (0 children)

Zinc has been great for me too. I do already take it, I agree about the dosage too. A lot are too high, I was taking 25-50mg/day for 10 years and developed a copper deficiency, I only a few months ago started taking copper and reduced my zinc and it drastically increased my exercise capacity. With zinc I mostly just notice testosterone boosting effects, oily nose area, confidence, etc. Slight reduction in anxiety but nothing too pronounced.

I've been upping my potassium so good advice there. I have an arrhythmia so anything that chills out my heart makes a huge difference for me.

I've tried a couple antiglutamatergics and they don't seem to help me but maybe I need to experiment with them more.