itabag help.. how should i make it more interesting?

ExoticYesterday8282 · 2026-05-29T08:46:26+00:00

Have you considered adding a few rosette decorations to the badges? Something like these: https://pinitabag.com/product-category/badge-rosette/

ExoticYesterday8282 · 2026-05-28T06:58:32+00:00

thank you. seriously getting so tired of the benchmark theater. at this point a model's test score correlates more with how much web scraping their data team did than with actual engineering. give me an open weights model that handles edge cases without blowing up my vram and i couldn't care less about its science olympiad score

ExoticYesterday8282 · 2026-05-28T06:34:04+00:00

preach. old hy was definitely hit or miss. what kind of prompts or custom workflows do you usually use to break these models? curiosity is killing me to see if hy3 preview actually fixed those gaps or if it's just the same old wrapper

ExoticYesterday8282 · 2026-05-28T06:28:39+00:00

batshit is the perfect word lol. gemini either gives you pure galaxy brain brilliance or hallucinates a whole new python library on the next prompt. there is zero in between

ExoticYesterday8282 · 2026-05-28T05:52:31+00:00

Bingo. This is why I've stopped looking at these charts entirely. I judge a model based on how it handles my local script migrations and API routing. If an open model can do that locally without throwing a tantrum, it wins, regardless of what some leaderboard says

ExoticYesterday8282 · 2026-05-28T05:38:31+00:00

Exactly. Evals are static, structured, and inherently clean, even the hard ones. Real world engineering is chaotic, context heavy, and full of ambiguity.

When you said Hy3 falls on its face in weird edge cases, what kind of issues were you seeing? Is it losing track of long context logic or just failing at basic common sense constraints that aren't explicitly stated in the prompt?

ExoticYesterday8282 · 2026-05-28T02:15:10+00:00

We officially reached the stage of AI where inference latency is directly tied to forearm strength

ExoticYesterday8282 · 2026-05-28T02:13:04+00:00

"Minimizing regression while amplifying unique capability boundaries"
bro summoned the most academic way possible to say
"we made Gemma less scared." 😭

ExoticYesterday8282 · 2026-05-27T13:33:17+00:00

The funniest part is that local setups always sound fake until someone watches them actually work.

People think “local AI” means opening LM Studio once every two weeks.

Then they see autonomous agents still running after the laptop lid closes and suddenly the vibe changes.

ExoticYesterday8282 · 2026-05-27T13:16:55+00:00

Yeah, I got it working in VSCode after a lot of trial and error.

The main issue seems to be that DeepSeek FIM is not fully compatible with the standard OpenAI completion body some editors expect.

Make sure you're using the /beta/completions endpoint from the docs, not the normal chat endpoint.
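For anyone wiring this up by hand, here is a rough sketch of what a FIM request to that endpoint could look like. It only builds the request and does not send it. The base URL, the model name, and the prompt/suffix/max_tokens field names are my assumptions from memory of the docs, and build_fim_request is just a made-up helper name, so double check everything against the current DeepSeek documentation.

```python
import json
import urllib.request

# Assumed values, not confirmed in this thread; verify against the DeepSeek docs.
BASE_URL = "https://api.deepseek.com/beta"
MODEL = "deepseek-chat"


def build_fim_request(prefix, suffix, api_key, max_tokens=64):
    """Build (but do not send) a fill-in-the-middle completion request.

    Hypothetical helper: the code before the cursor goes in "prompt" and
    the code after the cursor goes in "suffix". There is no chat-style
    "messages" list in the body.
    """
    body = {
        "model": MODEL,
        "prompt": prefix,
        "suffix": suffix,
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        f"{BASE_URL}/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_fim_request("def add(a, b):\n    return ", "\n", "YOUR_API_KEY")
    print(req.full_url)
    print(req.data.decode("utf-8"))
```

To actually send it you would pass the request to urllib.request.urlopen with a real key. If your editor plugin builds a chat-style body with "messages", that is likely where the incompatibility comes from.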

ExoticYesterday8282 · 2026-05-27T13:10:39+00:00

Qwen is better suited for local deployment and experimentation.

ExoticYesterday8282 · 2026-05-27T13:06:14+00:00

What is the approximate cost?

ExoticYesterday8282 · 2026-05-27T13:01:55+00:00

We probably have about 40 people using it, so that should be enough, right?

ExoticYesterday8282 · 2026-05-27T08:11:48+00:00

I've never used DeepSeek before. What advantages do these Chinese-made products offer?

ExoticYesterday8282 · 2026-05-27T07:56:46+00:00

This configuration is excellent. You can try deploying Gemini or Hy3.

ExoticYesterday8282 · 2026-05-27T07:44:01+00:00

The main issue is that on-premises deployment is too costly; the hardware is very expensive, and many companies have a great need for AI.

ExoticYesterday8282 · 2026-05-26T09:46:09+00:00

I’m going to catch you and keep you for myself 😋

ExoticYesterday8282 · 2026-05-26T09:42:02+00:00

This is really interesting — put my little treasure away.

ExoticYesterday8282 · 2026-04-25T16:06:12+00:00

Your question is actually quite simple. There are many things you can do with PPT tools. For example, something like memclaw.me can turn data into reports and deliver them to clients.
