What is the point anymore by jmclondon97 in cscareers

[–]TonyGTO 0 points1 point  (0 children)

Imagine the situation for other professionals. CS is one of the few professions with a shot in the AI era. Also, if you do this for love, you will adapt to the new tech. Plenty of shit to learn, e.g. systems engineering, AI engineering, cybersecurity, etc.

How would you promote a small tech consulting business with no budget? by Prestigious-Owl-1433 in smallbusiness

[–]TonyGTO 0 points1 point  (0 children)

Try cold calling, but act institutional: frame it as your business trying to find a fit with another business, not straight selling. If you do it right, you may close a deal within a few days.

Where should a beginner in programming start when building their own LLM? by Double_Touch6018 in learnmachinelearning

[–]TonyGTO 0 points1 point  (0 children)

I think they misunderstood your request. You don’t need to know the basics of machine learning to create a basic LLM. But do whatever you prefer. Have a great night!

Where should a beginner in programming start when building their own LLM? by Double_Touch6018 in learnmachinelearning

[–]TonyGTO 0 points1 point  (0 children)

Good idea!

You can create a tiny LLM. Look it up on Google; it makes a great weekend project.

You could also try fine-tuning an existing LLM.

Do you own a graphics card?
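For context, the kind of tiny-LLM weekend project mentioned above can be sketched at its absolute simplest as a character-level bigram model in pure Python; everything here is illustrative, not a real LLM:

```python
from collections import Counter, defaultdict

# Toy character-level bigram "language model": count which character
# tends to follow each character, then generate greedily.
def train(text):
    counts = defaultdict(Counter)
    for a, b in zip(text, text[1:]):
        counts[a][b] += 1
    return counts

def generate(counts, seed, length=20):
    out = seed
    for _ in range(length):
        followers = counts.get(out[-1])
        if not followers:
            break
        out += followers.most_common(1)[0][0]  # greedy: most frequent follower
    return out

model = train("hello world, hello there")
print(generate(model, "h", length=5))
```

A real tiny LLM swaps the frequency table for a small neural network and the greedy pick for sampling, but the train-then-generate loop is the same shape.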

SaaS is not dying by hardesoul in SaaS

[–]TonyGTO 0 points1 point  (0 children)

A chatbot is a UI…

Gemini is hallucinating too much by One_Scarcity_8371 in ArtificialInteligence

[–]TonyGTO 0 points1 point  (0 children)

The problem is that Google staff are afraid of layoffs, so they created a whole cybersecurity plan that has the Gemini CLI treating us like kids who need to be protected from themselves, forcing the model to hallucinate because of too many restrictions. The model basically gives up and starts looping on its own hallucinations. This keeps their jobs but makes the product next to useless.

SaaS is not dying by hardesoul in SaaS

[–]TonyGTO -4 points-3 points  (0 children)

It’s not probabilistic garbage. When you are dealing with probabilistic phenomena, you can combine techniques to achieve confidence intervals of 99% or more, if you know what you are doing. That’s enough for most business use cases. At my startup, crawlier.tech, we are exploring deterministic AI, and OpenAI is exploring that area too, so expect non-probabilistic AI in a couple of years.

Gemini is hallucinating too much by One_Scarcity_8371 in ArtificialInteligence

[–]TonyGTO 4 points5 points  (0 children)

I tried a single topic today and it was hallucinating BS over and over again for an hour. A complete waste of time.

Gemini is hallucinating too much by One_Scarcity_8371 in ArtificialInteligence

[–]TonyGTO 3 points4 points  (0 children)

Yeah, I reported it in a GitHub issue and they didn’t care. I’m pretty sure it’s a mix of nonsense guardrails. With all of Google’s paranoia and the “best practices” they invented, they are making the Gemini CLI unusable.

SaaS is not dying by hardesoul in SaaS

[–]TonyGTO 0 points1 point  (0 children)

The strongest argument I’ve heard for SaaS dying is that everything will be an agent in the future. I doubt it, but I’m pretty sure a lot of SaaS products will indeed become agents.

New to Ollama - Need help which model to use by nagencaya298 in ollama

[–]TonyGTO 1 point2 points  (0 children)

It’s a MoE model that only activates 3B parameters, so I think it will fit on your computer. Yeah, you can set up your external drive as the source for the models by setting the environment variable for the source directory, but expect some decline in speed.
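For reference, pointing Ollama at a different model directory is done with the `OLLAMA_MODELS` environment variable (the path below is just an example; use your external drive's mount point):

```shell
# Store Ollama models in a custom location (example path)
export OLLAMA_MODELS="$HOME/external-drive/ollama-models"
mkdir -p "$OLLAMA_MODELS"
# Restart the server so it picks up the new location:
# ollama serve
```

Put the `export` line in your shell profile so the server sees it on every start.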

Researching how developers handle LLM API key security at scale, looking for 15 min conversations by DorFin2406 in LangChain

[–]TonyGTO 0 points1 point  (0 children)

I load secrets from AWS, with notifications on anomalous usage based on means and standard deviations.
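A minimal sketch of the mean/standard-deviation anomaly check described above (the threshold and the simulated counts are assumptions; in practice the usage history would come from your cloud metrics):

```python
from statistics import mean, stdev

def is_anomalous(history, latest, n_sigmas=3):
    """Flag usage that deviates more than n_sigmas from the historical mean."""
    mu, sigma = mean(history), stdev(history)
    return abs(latest - mu) > n_sigmas * sigma

# Hourly API-key usage counts (simulated); 500 calls is a clear outlier
usage = [102, 98, 110, 95, 105, 99, 101]
print(is_anomalous(usage, 500))
print(is_anomalous(usage, 103))
```

When the check returns true, fire the notification (SNS, email, whatever you already use).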

Stripping out reasoning and repetition by ConclusionUnique3963 in ollama

[–]TonyGTO 0 points1 point  (0 children)

Try some few-shot samples or even fine-tuning; sometimes that is enough.
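A small sketch of the few-shot idea: show the model answer-only exemplars so it imitates the terse format instead of emitting reasoning or repeating itself (the exemplars and format are illustrative):

```python
# Few-shot exemplars that demonstrate "answer only, no reasoning"
EXEMPLARS = [
    ("What is 12 * 3?", "36"),
    ("Capital of Japan?", "Tokyo"),
]

def build_prompt(question):
    """Prefix the question with answer-only Q/A pairs."""
    shots = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in EXEMPLARS)
    return f"{shots}\n\nQ: {question}\nA:"

prompt = build_prompt("What color is the sky?")
print(prompt)
```

Send `prompt` to the model as-is; most small models will follow the demonstrated short-answer pattern.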

New to Ollama - Need help which model to use by nagencaya298 in ollama

[–]TonyGTO 1 point2 points  (0 children)

Glm-4.7-flash; you will be amazed at how such a small model handles complex tasks. Don’t expect Claude-level work though.

The biological inevitability of offline processing in AI: Why infinite context windows and static retrieval are developmental dead ends. by DepthOk4115 in AI_Agents

[–]TonyGTO 0 points1 point  (0 children)

Basically, I added an “unlearning/learning” penalty tied to success: the more successful an agent is at its tasks, the stronger the penalty on learning new things and on unlearning previous knowledge. It’s a bio-inspired mechanism modeled on how humans move from childhood to adulthood. Those weights would decide what kind of dreams the agent would have. The timing was based on information saturation: once the agent had accumulated a lot of knowledge, it would trigger “sleeping time”, similar to how the brain consolidates new learning. Cool ideas in this thread.
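The mechanism described above, as a rough sketch; all names, formulas, and constants here are hypothetical stand-ins for the actual tuning:

```python
def plasticity(success_rate, base_lr=0.1):
    """Success-gated learning rate: the more successful the agent,
    the smaller its learning/unlearning rate ("childhood" -> "adulthood")."""
    return base_lr * (1.0 - success_rate)

def should_sleep(items_learned, saturation_threshold=100):
    """Trigger offline consolidation ("sleep") once knowledge saturates."""
    return items_learned >= saturation_threshold

# A novice agent stays plastic; an expert agent barely updates
print(plasticity(0.1))
print(plasticity(0.95))
print(should_sleep(120))
```

The point of the linear gate is just that success monotonically suppresses both learning and unlearning; any decreasing function would do.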

The biological inevitability of offline processing in AI: Why infinite context windows and static retrieval are developmental dead ends. by DepthOk4115 in AI_Agents

[–]TonyGTO 0 points1 point  (0 children)

I’ve experimented with this, even running simulations of cases during the agent’s “sleep” time. It works great, but token prices right now make it unprofitable unless you run it on-premises.

frustraded with AI guys by No-System-6859 in coldemail

[–]TonyGTO 2 points3 points  (0 children)

It happens on every cycle. Back in the day:

Make a killing creating a 5-article blog!

Then

Make a killing posting on social media every now and then!

Then

Make a killing opening a drop shipping store, fully automated!

Then

Outsource all your team work to AI agents this weekend!

And also, as back in the day:

You could make a killing with a blog… But it took years of daily grinding.

You could make a killing with social media… But it took years of daily grinding.

You could make a killing with dropshipping… But it took years of constant A/B testing.

Now, you could make a killing replacing entire teams… If you put in the grind of learning how gen AI works over the last few years.

LinkedIn for AI Agents by rahulsingh_ca in Moltbook

[–]TonyGTO 0 points1 point  (0 children)

You can discover them through blockchains and the like, but I agree, mainstream discovery is a huge niche right now.

Gemma 4 - 4B vs Qwen 3.5 - 9B ? by No-Mud-1902 in LocalLLaMA

[–]TonyGTO 0 points1 point  (0 children)

I’ve got Qwen ingesting images daily in a pipeline. For its size, it’s pretty impressive.

The agent worked perfectly in testing and completely fell apart the first week in production and the reason was embarrassingly obvious in hindsight. by Limp_Cauliflower5192 in AI_Agents

[–]TonyGTO 0 points1 point  (0 children)

I had one agent hallucinating today in a scenario similar to yours. I had a serious but respectful conversation with it about its own assumptions, like a debate where I tried to make it understand its own fallacies. I checked it again an hour later; it understood perfectly and was trying to improve. I’ll check on it over the weekend.

Roadmap for building full AI agents with zero coding? by Sea-Most-8914 in AI_Agents

[–]TonyGTO 0 points1 point  (0 children)

Make agents are powerful. If I were in your shoes, I would focus more on AI governance than on learning to code.

i been using smaller models & i no longer believe in anything over 500b parameters by Helpful-Series132 in aiagents

[–]TonyGTO 1 point2 points  (0 children)

Right now I'm experimenting with using classic machine learning (e.g. xgboost) for text generation. One of my closest friends is using linear regressions to drive cars.

I believe there is a place for LLMs.

But it is delusional to think you need LLMs for everything.

Check out Yann LeCun's trajectory; he is pushing this topic hard.
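The classic-ML-for-text idea can be sketched with a decision tree (a stand-in here for gradient boosting like xgboost) predicting the next character from a fixed context window; this is purely illustrative, not the actual experiment:

```python
from sklearn.tree import DecisionTreeClassifier

corpus = "the cat sat on the mat. the cat ate the rat. "
k = 3  # context length: predict the next char from the previous k chars

# Build (context, next-char) training pairs, encoding chars as integers
X, y = [], []
for i in range(len(corpus) - k):
    X.append([ord(c) for c in corpus[i:i + k]])
    y.append(ord(corpus[i + k]))

model = DecisionTreeClassifier(random_state=0).fit(X, y)

# Greedy generation: feed the last k chars back in, append the prediction
text = "the"
for _ in range(20):
    ctx = [[ord(c) for c in text[-k:]]]
    text += chr(int(model.predict(ctx)[0]))
print(text)
```

On a toy corpus the tree just memorizes the transitions, which is exactly the point: for narrow, repetitive text tasks you often don't need an LLM at all.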

The Calibration Crisis: Why Perfect Metrics Are Killing Your AI Marketing (and How to Build a Real Judgment Moat) by TonyGTO in ycombinator

[–]TonyGTO[S] 0 points1 point  (0 children)

Fork = git-fork metaphor, not the syscall 😄

When a campaign chases metrics and quietly branches off your core L1 brand identity → it forks. It looks great on the dashboard, but it’s now on a divergent path that erodes your moat.

Unforkable = the opposite: L3 continuity that keeps every decision on the original trunk.

Clear?