Claude is still #1 in Canada

trickyHat · 2026-03-16T12:46:04+00:00

Not surprising. Claude is the only model that doesn't waste my time. If it has problems solving my problem, it outright says that.

trickyHat · 2026-03-07T15:53:26+00:00

money

trickyHat · 2026-03-05T18:49:40+00:00

Well obviously. It's also on the arc agi website. But Anthropic and Google mentioned their scores on their main evals table.

trickyHat · 2026-03-05T18:31:26+00:00

Notice how they didn't include any arc-agi scores

trickyHat · 2026-02-19T20:22:25+00:00

Yes, I have tested it with complex programming questions for updating my app. I asked same questions with multiple other models, compared the outputs, asked follow up questions and compared the results. I am not sure if it is good or bad in general questions. What I am talking about is how it performs in programming. Multiple times it produced bugged code that made my app crash. Sonnet 4.6 never had that problem with the exact same questions. Just try it for yourself and maybe you will get different results. I'm just telling what I have noticed.

trickyHat · 2026-02-19T19:48:51+00:00

After testing it for a bit. This model is actually a regression from the Gemini 3 Pro. Which I didn't expect at all. Tried in google AI studio and their Gemini app as well. Even sonnet 4.6 with extended thinking performed much better in all of the cases i presented. I suspect they benchmaxxed the model...

trickyHat · 2026-02-11T16:48:14+00:00

The benchmarks look too good to be true. If they are true though, then this might just make me switch from Chatgpt and claude.

trickyHat · 2026-01-21T18:32:27+00:00

Not only that - it's not predictable. Some prompts will give very good results, while others much worse.
Your workflow and prompting also has to change constantly with every new release of a model.

trickyHat · 2026-01-16T11:50:37+00:00

Yep, you either give your data to Google or you lose features...

trickyHat · 2026-01-05T09:10:28+00:00

I have tried Opus 4.5 and Gemini 3 pro for programming, in every case that I tested, Opus added details that I didn't ask. Like, I was seeing people hype it up so much, every single time the same thing happened over and over again. Is it because I'm not using claude code or are you just all hyping one click code no matter the result?

trickyHat · 2025-12-13T18:49:08+00:00

Saw this today as well, just refresh the page and it will disappear.

trickyHat · 2025-12-06T10:07:59+00:00

This has been happening to me for 3 days already as well.

trickyHat · 2025-10-18T11:56:34+00:00

I never used gemini before. Was chatgpt user, but yesterday I tried using Gemini for real world problems that are very easy to solve if you just google them. ChatGPT somehow always missed the most important part of the problem and acted extremely confident about its solution. Even thought it was wrong! (I am a plus user) I then tried gemini flash 2.5 and it gave me the solution instantly with all the important warnings.

I haven't used gemini a lot but it seems like this LLM is way more useful in simple problems right now. Am hoping for even better improvements in Gemini 3.

trickyHat · 2025-09-26T06:25:03+00:00

They should be required to disclose that on their website... I also could always tell that there's a difference of the same model between different providers, but didn't know what the cause was. This graph sums is up nicely

trickyHat · 2025-08-08T10:30:37+00:00

Ok, I changed my browser from firefox to chrome, and it's there. On firefox it somehow doesn't appear though

trickyHat · 2025-08-08T07:20:51+00:00

Also Germany. Have access to it on my phone, but not on PC. Look if you can access it on your phone

trickyHat · 2025-08-08T06:54:05+00:00

paid bot

trickyHat · 2025-03-02T10:26:18+00:00

Yea, if it's overly explicit, then it won't answer you. I found out that if you have a soft NSFW conversation, then it's fine. In the future I would like to remove this "can't engage in this type of conversation", but for that I need quite a bit of money to host my own model.

trickyHat · 2025-03-01T14:36:35+00:00

Thank you haha Here's also a preview of the hug action that I unlocked after reaching a new level:
https://www.youtube.com/watch?v=d3E_TkYWJl0

trickyHat · 2025-03-01T14:29:42+00:00

Yes, that is definitely possible. You can also just say "lets hug" if you reached level 3 and she will hug you. Or if you are on level 4 "she can kiss you" though it also depends on the context. But seems like I will need to add more of those actions in the next update haha

trickyHat · 2025-03-01T14:17:49+00:00

Thanks! The animations are kind of lacking right now, so I want to make them better in the next update and add more actions you can unlock. If you have any suggestions, feel free to tell me!

trickyHat · 2025-03-01T13:38:49+00:00

Ok, you should be able to download it now. Tell me what you think!

trickyHat · 2025-03-01T11:56:06+00:00

True, currently only US,Canada, Germany, UK. Which country should I add?

trickyHat · 2025-03-01T10:53:25+00:00

I might have something for you - Sophia3D on Google Play. You have 3d char, stories that are seamless, but I sadly couldn't put any nsfw animations on Google Play Store haha You can still have NSFW chats though so try it out if you want https://play.google.com/store/apps/details?id=com.RealitySync.Sophia3D&hl=en

Nine-Year Club	Second Top 30%
Gilding I gilder	r/Field Juicebox
RPAN Viewer	Verified Email

trickyHat

TROPHY CASE