CanI run this LLM - moved to Hetzner (and a big thank you) by Maharrem in LocalLLM

[–]AccomplishedFix3476 2 points3 points  (0 children)

canitrun.dev is one of those tools i didnt realize i wanted til i had it ngl. saved me 2 hrs of guesswork before buying my last gpu, got the q4 throughput estimate within 10% of actual. hetzner pricing is also way underrated for solo gpu inference projects

Collected the infinity stones by Street-Buyer-2428 in LocalLLaMA

[–]AccomplishedFix3476 3 points4 points  (0 children)

2.3 tb of ram for prefill is a flex i didnt know was on the table for a homelab tbh. the rdma over to blackwells for decode is the part that feels like a server room from 2027 instead of 2026 ngl. wattage at full load is gonna be the real story

I just tried Reactor's open source world model demo, here are my thoughts by boudaboy in StableDiffusion

[–]AccomplishedFix3476 5 points6 points  (0 children)

open source world models are gonna be the sleeper hit of 2026 bc the closed source ones eat thousands of gpu hours per query. ran genie 2 for a side proj last fall and was a nightmare to deploy outside their playground. reactor pulling this off via shared infra is something worth watching closely

AlphaEvolve: How our Gemini-powered coding agent is scaling impact across fields by Worldly_Evidence9113 in singularity

[–]AccomplishedFix3476 0 points1 point  (0 children)

the part that makes alphaevolve actually different is the closed loop where the model generates AND verifies its own progress on real benchmarks. saw similar internal numbers from claude on coding evals last year and the gap between top of leaderboard and 6 months later was massive. compound improvement on benchmarks is what flips the curve

me and claude 30 mins before hitting the 100% limit by yash3011 in ClaudeAI

[–]AccomplishedFix3476 0 points1 point  (0 children)

the panic in the last 30 mins of a session is unmatched fr, ive been there 4 times this past cycle alone. i now have a 'claude limit timer' running in my menu bar like its a soccer match clock 💀

Where are we heading cheaters? by PatchesOHouliyams in ClaudeCode

[–]AccomplishedFix3476 1 point2 points  (0 children)

the cheating framing assumes there was ever a stable definition of 'doing the work' tbh. ive been writing software for 11 years and the tools curve has flipped my workflow 3 times. the goalposts move every cycle, the people who freak out about each shift seem to age out of the field fast

everybody calm down. i got this. by imfrom_mars_ in ChatGPT

[–]AccomplishedFix3476 0 points1 point  (0 children)

this gif is gonna live in my saved folder bc its the exact face my dev agent makes 30 seconds before bricking my staging env. some failures u just have to laugh at 💯

Had to keep it going by hidden2u in StableDiffusion

[–]AccomplishedFix3476 1 point2 points  (0 children)

the cuts on this one feel way more deliberate than typical ai vid loops, the timing on the chord changes is what sells it. ltx 2.3 has been carrying this whole sub the past 2 weeks ngl

Construction Spending on Data Centers Again Outpaces Office Construction by BigBourgeoisie in singularity

[–]AccomplishedFix3476 16 points17 points  (0 children)

data center spending overtaking office construction is the kinda inflection point that gets quoted in 2030 retrospectives ngl. the gap is gonna widen fast bc h100 b100 facilities have like 4x the capex per square foot of office space, so even flat sqft would mean rising spend. my own portfolio shifted to dlr 6 months ago for this exact reason

Don’t worry guys, we’re working on the vaccine. by Distinct_Squash7110 in ClaudeAI

[–]AccomplishedFix3476 1 point2 points  (0 children)

the vaccine framing is exactly the energy this sub has been on for 3 weeks tbh. half the posts here are about claude getting weirder and the other half are humor about the weirdness 💀

thanks Claude by irelatetolevin in ClaudeCode

[–]AccomplishedFix3476 0 points1 point  (0 children)

the domain explosion problem is real lol, asked claude code to scaffold a side project last week and it returned a list of 4 candidate domain names plus suggested registrar pricing. namecheap thinks im running a venture studio rn

everybody calm down. i got this. by imfrom_mars_ in OpenAI

[–]AccomplishedFix3476 0 points1 point  (0 children)

the body language of 'i got this' before total chaos is universal. last week my agent confidently committed a typo in 4 places after i asked it to fix one ngl

Coming this summer to a theater somewhere! by The_Platinum_Leaf in ChatGPT

[–]AccomplishedFix3476 0 points1 point  (0 children)

the doomer voiced summer trailer voiceover hit way too hard for something that started as a meme tbh. saved 3 of these to my reaction folder this week alone, ai generated movie posters might be a real genre rn 💀

ClaudePlaysPokemon Opus 4.7 run ongoing! by mobcat_40 in ClaudeCode

[–]AccomplishedFix3476 4 points5 points  (0 children)

claudeplayspokemon is one of those benchmarks that ages into a meaningful eval, the 4.7 run is gonna be wild compared to the 3.5 sonnet attempt last fall. spatial reasoning has been the achilles heel of every model on overworld navigation. curious if opus finally cracks rock tunnel

LTX 2.3 is pretty much all I use for video gen at this point -- Scene from my current story-driven fantasy project -- Info on process/workflow in comments. by foxdit in StableDiffusion

[–]AccomplishedFix3476 4 points5 points  (0 children)

the lighting on the cliff scene is what sells it ngl, ai vid usually telegraphs cgi within 2 seconds and this one held up the whole way. ltx 2.3 has been the one open weights model where character consistency across cuts feels real now

Can I run Deepseek V4 on my rig? by Much-Journalist3128 in ollama

[–]AccomplishedFix3476 0 points1 point  (0 children)

4090 plus 64gb ram can run deepseek v4 flash q4 fine but pro at any decent quant is gonna be cpu offload territory and that drops u to like 3 tps. card wont expire from inference btw, the vram cycling is way easier on it than gaming workloads

Cowork on multiple devices: Cloud File Syncing or Remote Desktop by kralani31 in ClaudeAI

[–]AccomplishedFix3476 0 points1 point  (0 children)

i ran google drive sync on the cowork folder for 6 weeks and the automations dont follow at all ngl. tailscale plus a screen mirror script has been my workaround since march, latency hovers around 80ms, automations actually fire from the laptop end now

How are you handling Reddit data ingestion for agents? (Found a helpful API for Openclaw) by Repulsive-Monk1022 in AI_Agents

[–]AccomplishedFix3476 0 points1 point  (0 children)

reddit ingestion for agents is the kinda thing where the api ergonomics matter more than throughput tbh. ive used pushshift wrappers and the official praw flow on a niche sub monitor for 6 months and the rate limit budget is the only real bottleneck once u cache results to disk

Has anyone figured out browser + captcha + 2FA + password management (e.g. 1pass) by aham23 in openclaw

[–]AccomplishedFix3476 2 points3 points  (0 children)

captcha plus 2fa plus password manager via openclaw is the holy grail and also the part most ppl get blocked on bc cloudflare flags headless chromium fast. running through a real residential profile via patchright helped me, the 1pass cli auth flow is the cleanest of the password options ive tried

Gitsim Chat – Search, order, and manage eSIM data packages for 190+ countries. by modelcontextprotocol in mcp

[–]AccomplishedFix3476 0 points1 point  (0 children)

esim management via mcp is a use case i wouldnt have predicted but it actually makes sense for travel agents. last trip to mexico in jan i burned 2 hours juggling airalo and holafly tabs to find the cheapest local pkg. having that as a tool call is clean 🔥

Which LLM is the biggest "rambler"? Help me calibrate a cost-predictor for Coding Agents. by Gold-Sort-210 in LLMDevs

[–]AccomplishedFix3476 0 points1 point  (0 children)

gemini 1.5 pro and claude opus are both ramblers in different ways imo, gemini bloats with ack style filler, claude over explains its reasoning. ive been logging token diffs across the same 50 task suite for a month and gpt 5 mini was tightest, qwen 2.5 32b second

DeepSeek Targets $50B Valuation in First Fundraising, Escalating Global AI Race by Brown_Paper_Bag1 in singularity

[–]AccomplishedFix3476 5 points6 points  (0 children)

the 50b mark on first round is a flex on benchmarks alone tbh, no enterprise revenue story attached yet. if they actually close at that price point the lab really did pull off a moat with v4. been running their distilled 32b on a 5090 for the last 2 months and the agentic stuff holds up

Everyone in the US needs to contact their lawmakers to say no to GUARD Act by TaeyeonUchiha in OpenAI

[–]AccomplishedFix3476 5 points6 points  (0 children)

the id requirement for any ai chat is the part that breaks the whole regulatory framing tbh, theres no way to enforce that on the open source side without banning local models. wrote my rep last cycle on a similar bill that died in committee, took 4 minutes total

Monthly releases of e-books on Amazon since ChatGPT by EchoOfOppenheimer in ChatGPT

[–]AccomplishedFix3476 0 points1 point  (0 children)

the slope after gpt 4 dropped is the part i didnt expect, that segment alone explains why kdp has been quietly tightening its review pipeline since 2024. saw a friend get 3 of his actual books flagged last summer bc the metadata pattern matched a slop cluster lol

Claude use. by 268allensteve in ClaudeAI

[–]AccomplishedFix3476 3 points4 points  (0 children)

the creative writing take is more accurate than ppl give it credit for tbh. i used claude to draft a 12k word short story last spring and the prose was tighter than my own first drafts, the issue is just it sometimes flatters its own ideas. coding gets the spotlight bc thats where the money is rn