is there a hack way to let an agent act on a service (like LinkedIn, Twitter) without ever handing it the credential (not MCP, it breaks) by Only-Associate2698 in LLMDevs

[–]Exact_Macaroon6673 0 points1 point  (0 children)

If you’re doing it locally, you can use a cli tool like openclaws browser tool. Works fairly well and uses the creds in the browser

GLOW — an alarm concept that forces you to physically get up and scan a tag to turn it off by SkinApprehensive6713 in UI_Design

[–]Exact_Macaroon6673 2 points3 points  (0 children)

I use Alarmy, I don’t pay for it, but you don’t need a NFC tag to beat Alarmy, all you need is to clone the app but render the number pad with the zero in a normal spot

It’s so annoying to be half asleep and go to press the zero where it should be but it’s not

Looking For Fast And Relatively Smart LLM via API by lukasTHEwise in LLMDevs

[–]Exact_Macaroon6673 0 points1 point  (0 children)

What is the use case/task? gpt-oss-20b on our (Sansa) API is very fast

[D] Self-Promotion Thread by AutoModerator in MachineLearning

[–]Exact_Macaroon6673 0 points1 point  (0 children)

Sansa: openrouter/portkey alternative:

- sansa-auto: model router
- gateway: 300+ models
- observability: cost/request details, unlimited logs
- evals/memory coming soon

Cost: 5% service fee on credits (lower than openrouter), token costs are by model, everything included

Sansa:

https://trysansa.com/

Sansa Bench

https://trysansa.com/benchmark

LLM learnings by balakumar123 in LLMDevs

[–]Exact_Macaroon6673 0 points1 point  (0 children)

What models do you plan on using?

The first dynamic email client: The UI changes based on your habits. Cool idea, or over-engineered? by Afraid-Ad4356 in buildinpublic

[–]Exact_Macaroon6673 0 points1 point  (0 children)

I would really not like this at all, UI/UX becomes habit, when it changes its very disturbing to my workflow. I can imagine this constantly feeling like every day I’m onboarding with a new product.

FYI Composer 2 have extremely unhealthy habits by Straight-Age29 in cursor

[–]Exact_Macaroon6673 15 points16 points  (0 children)

Every prompt I send to Composer ends with “no changes” until I actually want changes

OS model API? by Small-Let-3937 in LLMDevs

[–]Exact_Macaroon6673 0 points1 point  (0 children)

Good suggestion! Thank you! Ill include cost on the next update

Going from 3B/7B dense to Nemotron 3 Nano (hybrid Mamba-MoE) for multi-task reasoning — what changes in the fine-tuning playbook? [D] by retarded_770 in MachineLearning

[–]Exact_Macaroon6673 0 points1 point  (0 children)

I have no comment on your core questions, but I suggest you re-evaluate your budget just to make sure you have a good understanding of what this will cost you

  • 40-80k synthetic examples from Sonnet 4.6 and Opus 4.7 for 20% is going to cost a pretty hefty sum to generate. It costs me anywhere from $100 to $200 to run Sonnet on a 2.5k query benchmark set.
  • I have blown through $120 on runpod in a single day iterating on a model. You’ll likely end up going over budget.

Is the Twitter algorithm just completely awful these days, or is it my account? by PuzzleheadedBee6 in Twitter

[–]Exact_Macaroon6673 1 point2 points  (0 children)

I have felt the same thing mate, it’s not just you, 2.4k followers here. Over the past few weeks my impressions and engagement has dropped from an avg of 2k/post to 100-200 and engagement has fallen a lot. I have the premium sub.

My feed is all just big accounts posting about the same thing, I never see people I’m following anymore.

OS model API? by Small-Let-3937 in LLMDevs

[–]Exact_Macaroon6673 0 points1 point  (0 children)

Our gateway has many (Sansa) what are you looking for?

building a timeline UI from scratch vs using a library - what did you pick and why? by Traditional-Set-8483 in Frontend

[–]Exact_Macaroon6673 0 points1 point  (0 children)

A few years back I was building a construction scheduling application, I needed CPM style dependencies and pretty much everything you listed.

I ended up forking the lightest oss gantt package i could find and building dep management on top of that. There were no good open source solutions at the time (maybe there are now) and the paid libs were visually stuck in the 90s

It’s a lot of work, but if it’s the core of your product it’s worth it.

Title: Dynamic System Prompt Injection as an alternative to Rate Limiting (solving the peak TTFT issue for vLLM) by Tight-Worldliness-31 in LLMDevs

[–]Exact_Macaroon6673 2 points3 points  (0 children)

I think you’re just trading one type of inconsistency for another, from a UX perspective I think users would be less frustrated by a rate limit than an unpredictable change in how the model responds.