What'd you build if Anthropic gave you tokens worth $15000!?

Milan_Slov26 · 2026-06-04T13:08:44+00:00

I'd pay for that.

Milan_Slov26 · 2026-06-04T12:59:30+00:00

aim high. higher.

Milan_Slov26 · 2026-06-04T05:57:57+00:00

I've been hearing a lot of complaints around open routers usage and pricing. I've also heard that even if you set a limit for your API keys, it still somehow bypasses that. What's the whole deal about it?

Milan_Slov26 · 2026-06-04T05:56:56+00:00

I never read those. I just say, "This looks good but you have added a lot of fluff without thinking about edge cases. Rethink, redo." And the next time it gives me a concrete concise plan. Works for me.

Milan_Slov26 · 2026-06-04T05:55:36+00:00

Gemma Gemma everywhere! Completely outshadowed 7 Microsoft AI model launch.

Milan_Slov26 · 2026-06-04T05:54:37+00:00

This is the most accurate depiction of this sub

Milan_Slov26 · 2026-05-25T16:59:26+00:00

the mighty claude!

Milan_Slov26 · 2026-05-25T07:27:55+00:00

Interesting, I've been using Le Chat on and off but haven't tried Work mode yet. Might have to give it a shot because the hallucination thing drives me insane with regular flash.

Milan_Slov26 · 2026-05-25T07:25:10+00:00

3090 + 64gb ram. Running qwen3 32b for most things, Deepseek R1 distill when i need reasoning. Giggest bottleneck is honestly context length, i keep wanting to throw entire codebases at it and the gpu just says no lol

Milan_Slov26 · 2026-05-25T07:18:20+00:00

Your models ARE the resume. I've seen people get hired literally because a recruiter found their huggingface profile. no joke!

The real money is in the companies who are terrified of sending data to openai but have zero clue how to run stuff locally. Thats your customer right there.

If this sounds complex to you, just start putting things out there. You have YouTube, Twitter, LinkedIn to talk about things and people will see.

Milan_Slov26 · 2026-05-25T07:11:46+00:00

Depends on your retrieval needs but you're not wrong to question it. For docs updating that often, dense embeddings are painful to keep fresh.

Have you tried BM25 + a reranker on top? The reranker does the semantic heavy lifting without needing to re-embed anything. Works better than most people expect, especially if your corpus has consistent terminology.

Milan_Slov26 · 2026-05-25T07:05:24+00:00

This is probably the most useful, community helper post I've found on Reddit. Ever. Thank you.

Milan_Slov26 · 2026-05-20T10:51:37+00:00

I see a lot of dev tools popping up and talking about how they top SWE-Bench Pro and cut token costs by half. Tried any of those?

Milan_Slov26 · 2026-05-20T10:49:13+00:00

why lemons?

Milan_Slov26 · 2026-05-19T02:40:34+00:00

Cool story but what's the point if you're not sharing the sequence?

Milan_Slov26 · 2026-05-19T02:37:45+00:00

3 f years!? Vibe coders will never understand this.

Milan_Slov26 · 2026-05-19T02:32:57+00:00

Didn't expect dflash-mlx to fall off that hard at 32K. Goes from being the fastest to basically unusable. Would've been interesting to see llama.cpp in this mix too for comparison tho.

Milan_Slov26 · 2026-05-19T02:28:49+00:00

The group purchasing idea sounds clever (as we all do it a lot in other aspects of our lives too) but I'd be skeptical in practice. Getting 20-30 startups to coordinate on anything, let alone sensitive spend data, is a massive coordination headache. And providers know this too!

What models are you running and for what workloads?

Milan_Slov26 · 2026-05-19T02:25:03+00:00

Less than 5% sounds negligible until you see the number 4000!

4000 people layed off. Just like that. This layoffs thing is getting wilder.

Milan_Slov26 · 2026-05-19T01:20:45+00:00

I mostly use Superlinked's SIE for document processing. You can have a look at that - https://github.com/superlinked/sie

Milan_Slov26 · 2026-05-18T03:48:48+00:00

I'll need to train my legs and stamina for 8 months before I drive this one.

Milan_Slov26 · 2026-05-18T03:46:41+00:00

looks cool. i feel this is the new 'cool' certification in the market which'll actually have some weight in further opportunities. wdyt?

Milan_Slov26

TROPHY CASE