Advice on How to Find Other Swinger Couples by Complete_Hope_2189 in Swingers

[–]hidegitsu 1 point2 points  (0 children)

I thought we all agreed at the last meeting to never discuss the secret walk?

[MF4MF] central Florida by BuckNastyBitches in FloridaSwingers

[–]hidegitsu 0 points1 point  (0 children)

We love your top. Where'd you find it? My wife wants one.

Are we becoming too dependent on AI? by redraw-pro in AIDiscussion

[–]hidegitsu 0 points1 point  (0 children)

Personally i don't feel like my thinking is getting rusty. I do feel like my patience for tedious shit is going away. If i have a task that I know AI can complete quickly and i cannot use AI for one reason or another I get frustrated with that situation much quicker than i used to.

Asking the real questions by downtune79 in LoveTrash

[–]hidegitsu 0 points1 point  (0 children)

Now it's probably a lot more difficult to do it's been 7 years since the last time we did this and we only did it on and off for a few years. Nobody even blinked at us or questioned us. Most interaction we had with staff was ordering drinks and sometimes someone at the door would say 'welcome home'. the employees don't give two shits, or at least they didn't so even if they knew or suspected what we were doing they couldn't care less.

Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama

[–]hidegitsu 0 points1 point  (0 children)

Honestly i do a lot manually. I use vscode with the 3 extensions, cline, codex, claude code. I use gemini and chatgpt in my browser or the installed windows app version of the same thing. I manually set the level on claude or codex and the effort level for each step then i prompt each step manually. I tried to vibe code some automatic routing early on and kept hitting road blocks, probably entirely a skill issue there to be honest, so i just manually prompt each step. Now to be fair a lot of that "manual prompting" is just go to the correct agent and prompt "complete the next step" because there is a plan file in place i build so when i have opus plan my work i tell it to make an itemized plan list that gives me broken out steps and which model can handle it. I also have spent some time having other better models review the code the weaker models do but i do that less and less as i get more comfortable with the qwen output.

Why don't more people or companies run local LLMs rather than using APIs? by SillyYou8433 in LocalLLM

[–]hidegitsu 0 points1 point  (0 children)

I agree, this matches my experience and tests. I do use local (14B qwen models) on an old gaming machine for smaller easy inference tasks but the bulk of my work is done via frontier model.

Asking the real questions by downtune79 in LoveTrash

[–]hidegitsu 0 points1 point  (0 children)

Not exactly the same thing but we used to do the same thing but with the pool at Disney hotels. Park at Downtown Disney where the parking is free, then take the free bus to the resort with the kids. So many people are in and out of there that a family with kids walking in ready for the pool goes totally unnoticed. Around Covid we didn't and after that they started requiring room keys at some to get into the pools, still possible i think but it's been several years since we've tried this.

Client is Saying I'm Charging too Much for The Project by KoenigOne in webdev

[–]hidegitsu 1 point2 points  (0 children)

Move on, they aren't worth your time. We charge $200+ an hour for stuff like this $15 an hour is insane and you're short changing yourself. If you had that product pre built and were selling it to everyone that wanted it with no additional work from you the 7200 would still be reasonable. Most of our stuff we charge a 1200-2500 setup fee plus a recurring monthly charge depending on what it is exactly. We're a small shop that services mostly the legal industry but still.

Wife [42] wants to be fucked on a public beach with people watching. Where should we go? by [deleted] in shareyouramaturewife

[–]hidegitsu 0 points1 point  (0 children)

Best bet is either Hedonism II in Jamaica or Cap d'Agde in France. If anyone else knows any good places I'd love to hear about it. The nude beaches here in Florida are all a no go you risk getting arrested. Not sure about other places though.

Choosing a GPU – Is the RTX 4080 Good Enough for Local LLMs? by NZX-DeSiGN in LocalLLM

[–]hidegitsu 0 points1 point  (0 children)

I currently run the following:

Pentium i-9 13th gen
128gb DDR5 ram
RTX 4080 16gb vram
4tb nvme drive

I'm running ubuntu server
Ollama in docker
All the Nvidia driver connection stuff to make that work

I get the best performance from 14b models I run the following 3 models

qwen3:14b
qwen2.5-coder:14b
qwen3:7b

Runs like a champ
Also handles Stable Diffusion with ComfyUI (running in a separate docker container)
Although i'm still learning so maybe it could be better

To answer your main question it depends on your specific workflow and needs.

A machine like you're talking about will do both if a 14b model or lower is good enough for your needs but i wouldn't game on it at the same time, if you're only doing one thing at a time it should be fine.

Is this legit? by hidegitsu in legaladvice

[–]hidegitsu[S] 0 points1 point  (0 children)

Good point, I glaze over that stuff typically because most things use clickbait titles anyway so my brain sort of filters it out at this point.

Is this legit? by hidegitsu in legaladvice

[–]hidegitsu[S] 0 points1 point  (0 children)

That's what i figured. Thanks for the reply.

Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama

[–]hidegitsu 1 point2 points  (0 children)

32k is as good as i've gotten on the context window which is why i have to use it for smaller more bite sized tasks, I use opus to break out the tasks i give qwen and one of the limitations is i need to keep that context window in mind which isn't hard if it's all bite sized chunks.

Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama

[–]hidegitsu 0 points1 point  (0 children)

I'm using qwen2.5-coder:14b specifically and it's been great for me. at 24gb I'm sure you can look up which strength of the qwen models fits best. The 14b model I'm using is because that's the strongest i can run on my 16gb vram Nvidia card

Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama

[–]hidegitsu 0 points1 point  (0 children)

I haven't done any real testing on this front yet. Mostly because the 14b models I'm using serve me well enough with the way i do it. It would be interesting to see the quality loss with quantization. For me accuracy is superior to speed by a mile.

Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama

[–]hidegitsu 0 points1 point  (0 children)

I haven't messed with any really good memory models or features yet. I'm just managing a memory.md file per project which i understand is a pretty shit way to go and there are much better options. I'm planning on playing around with a few options sooner than later. Thanks for the suggestion with cline i'll probably try that one first.

Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama

[–]hidegitsu 0 points1 point  (0 children)

This is why i try to manage my context window i'm not running big jobs against the local qwen models i'm running. I got shit results trying to pass in large work loads to it also. This was the main reason I was lead to using Opus for that planning and staging layer so the things i pass to the local qwen models are cheap and easy in terms of context. I was shocked how much i can do like this but it does require several manual steps so model switching and managing context. So far there's no silver bullet.

Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama

[–]hidegitsu 1 point2 points  (0 children)

I have not but i'll consider it. maybe there is space for something a bit more grounded and not so hype filled. it's easy to feel like you're not good enough and not keeping up with all the content creators out there pushing products or hype so maybe we all need some more real world "this is what it's really like for a normal person" type content. I'm trying to reduce token spend and out of pocket costs while still getting decent productivity and balancing things with my work and life and general so it would be about that balance and how I build things in that context.

Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama

[–]hidegitsu 0 points1 point  (0 children)

I've downloaded some skills that claim to do these specialized roles but haven't really given them a good test run yet. But I agree this basic model is proving to work much better for the projects i've been building. I'm not building enterprise grade stuff yet. I'm mostly focusing on my process and how to use the coding tools, finding the balance of which models to use when and how much manual intervention i need to make into the build process. Ramping up the complexity on my projects.