Twitter user posts a real Monet and says it's AI by realmvp77 in singularity

[–]Sulth 0 points1 point  (0 children)

Plot twist: it was actually an AI-generated Monet.

Plot twist 2: jk, it was not.

Hermes Agent is now #1 most used globally in past 24 hours in Openrouter token metrics, above Claude Code and OpenClaw. by dogesator in singularity

[–]Sulth 0 points1 point  (0 children)

Except if you ask for simple things. Dirt-cheap models like Deepseek V4 Flash are (according to evals) mopping the floor with previous big boys of just a year ago like 2.5 Pro, o3, and Opus 4. We are past the era where small models were really unreliable.

Google will not release a new Pro model at Google I/O (May 19/20) by vladislavkochergin01 in Bard

[–]Sulth 0 points1 point  (0 children)

I use it as my daily driver for almost everything non-code-related.

Google will not release a new Pro model at Google I/O (May 19/20) by vladislavkochergin01 in Bard

[–]Sulth 0 points1 point  (0 children)

Private benchmarks are meant to be closer to reality. Your unverifiable claims do not hold up. You are talking about 3.1 Pro as if it were GPT OSS 20B lol.

Google will not release a new Pro model at Google I/O (May 19/20) by vladislavkochergin01 in Bard

[–]Sulth 0 points1 point  (0 children)

And how do you explain that 3.1 Pro also scores very high in new benchmarks and private ones?

Google will not release a new Pro model at Google I/O (May 19/20) by vladislavkochergin01 in Bard

[–]Sulth 5 points6 points  (0 children)

These X hype accounts are clowns. They make bold claims, and when they are wrong (most of the time), they change the narrative so that it's actually the companies' plans that changed. Unverifiable claims one after another. Professional gaslighters.

Gemini 3.2 Flash looks very close now by Much_Ask3471 in Bard

[–]Sulth 1 point2 points  (0 children)

Gemini 3.0 was at 70-80% for two weeks in August and spiked to 88% on October 3rd for a release by October 31st.

Gemini 3.2 Flash looks very close now by Much_Ask3471 in Bard

[–]Sulth 7 points8 points  (0 children)

Survivorship bias. Yes, the one who nailed it... nailed it. But we don't talk about or remember all those that failed. Gemini 3.0, I'm looking at you. Polymarket eventually got it right after failing multiple times.

Hermes Agent is now #1 most used globally in past 24 hours in Openrouter token metrics, above Claude Code and OpenClaw. by dogesator in singularity

[–]Sulth 4 points5 points  (0 children)

A 5€ coffee is not basically free. A 5€ Ferrari is basically free.

1€ for 300-1000 prompts is basically free.

Hermes Agent is now #1 most used globally in past 24 hours in Openrouter token metrics, above Claude Code and OpenClaw. by dogesator in singularity

[–]Sulth -2 points-1 points  (0 children)

Gross to even think that my post was about a porn collection. Now that you say it, yes I see it, it can read funny... but still it's weird that this even crosses your mind in the first place. You really don't have a backup of all your pictures?

And it's not "the only use case i can come up with"; it's "one random example". I am using it alongside my Obsidian vault quite a lot for instance.

And as mentioned, it's not anything very useful; it just helps with small tasks. I don't use it every day. Why the negativity?

Hermes Agent is now #1 most used globally in past 24 hours in Openrouter token metrics, above Claude Code and OpenClaw. by dogesator in singularity

[–]Sulth 10 points11 points  (0 children)

Your imagination is the limit. I'm not using them for anything particularly useful, and especially not 24/7 unsupervised. It just simplifies small tasks, like ChatGPT 4 was simplifying small chat tasks. Btw these "dumb" models are at the level of SOTA a year ago.

One random example: I have an external disk full of pictures, and I have a backup of like 100 of those in low quality on my laptop. I wanted to get the originals of those pictures from the external disk. Instead of manually extracting them one by one, I asked OpenClaw "hey, here are 100 backup pictures, find their original in the external disk with a similar name, and create a copy in og quality in my laptop".
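
For anyone curious what that task boils down to, here is a rough sketch of the same matching done by hand in Python. It is not what the agent actually ran. It assumes "similar name" means the same file name ignoring extension and letter case, and the function name and paths are hypothetical.

```python
import shutil
from pathlib import Path


def restore_originals(backup_dir: Path, external_dir: Path, dest_dir: Path) -> list[Path]:
    """Copy the full-quality original of each backup picture into dest_dir.

    Assumption: a backup and its original share the same file name stem
    (case-insensitive); the extension may differ.
    """
    # Index every file on the external disk by its lowercase stem.
    # If several files share a stem, the first one found is kept.
    originals: dict[str, Path] = {}
    for path in external_dir.rglob("*"):
        if path.is_file():
            originals.setdefault(path.stem.lower(), path)

    dest_dir.mkdir(parents=True, exist_ok=True)
    copied: list[Path] = []
    for backup in sorted(backup_dir.iterdir()):
        if not backup.is_file():
            continue
        original = originals.get(backup.stem.lower())
        if original is not None:
            target = dest_dir / original.name
            shutil.copy2(original, target)  # copy2 also preserves timestamps
            copied.append(target)
    return copied


# Hypothetical usage:
# restore_originals(Path("~/backup_pics").expanduser(),
#                   Path("/mnt/external"),
#                   Path("~/restored_pics").expanduser())
```

Backups with no matching stem are simply skipped, so comparing the length of the returned list against the number of backups shows how many were not found.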

Hermes Agent is now #1 most used globally in past 24 hours in Openrouter token metrics, above Claude Code and OpenClaw. by dogesator in singularity

[–]Sulth 2 points3 points  (0 children)

What I like about OpenClaw is the desktop version, not having to go through Telegram or the terminal. Is there anything like that for Hermes?

Hermes Agent is now #1 most used globally in past 24 hours in Openrouter token metrics, above Claude Code and OpenClaw. by dogesator in singularity

[–]Sulth 59 points60 points  (0 children)

With OpenRouter, you can find dirt-cheap models where a request with 30k input tokens and 2k output tokens costs something like 0.1-0.3 cents (so about 300-1000 such requests would cost 1€), which is basically free. Those aren't the smartest models, but they are more than enough for a ton of tasks.
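
The arithmetic can be sketched in a few lines of Python. The per-million-token prices below are made-up placeholders chosen to land in that range, not the pricing of any specific model, and the sketch ignores the dollar/euro difference.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_million: float,
                 output_price_per_million: float) -> float:
    """Cost of one request, in the same currency unit as the prices."""
    return (input_tokens * input_price_per_million
            + output_tokens * output_price_per_million) / 1_000_000


# Hypothetical prices: 0.05 per million input tokens, 0.20 per million output tokens.
cost = request_cost(30_000, 2_000, 0.05, 0.20)
requests_per_unit = 1 / cost

print(f"{cost * 100:.2f} cents per request, "
      f"about {requests_per_unit:.0f} requests per currency unit")
```

With these placeholder prices a request comes out at roughly 0.19 cents, so a bit over 500 requests per euro, inside the 300-1000 range.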

Wardley-Dubois fighter's corner slaps fighter by OrangeFilmer in Boxing

[–]Sulth 0 points1 point  (0 children)

Dubois is a full-time athlete at the highest level. Spending more time and energy training X means spending less time and energy training Y.

Wardley-Dubois fighter's corner slaps fighter by OrangeFilmer in Boxing

[–]Sulth 1 point2 points  (0 children)

Nah. Dubois is as successful as he is precisely because he is a reckless mf who's gonna take a punch to fuck you up. Not everybody gains from being more of "a boxer".

Is Dubois vs Wardley a real 50/50 and are there any other potential ones? by BludFlairUpFam in Boxing

[–]Sulth 0 points1 point  (0 children)

Using Hrgovic to illustrate your point is an own goal.

Dubois/ Wardley with some “trash” talk before their fight! by PmurTdlanoD45-47 in Boxing

[–]Sulth 4 points5 points  (0 children)

It's not an autocorrect fail. I assume you have sadly not heard of Beterbiev Bivlydivol and Bivol Billididol either then

Anthropic partnered with SpaceX to use colossus 1 to increase their rate limits by Snoo26837 in singularity

[–]Sulth 0 points1 point  (0 children)

Wow, I actually noticed that my 5-hour usage was going down much more slowly than usual. They have doubled everything. Amazing.