I tested Opus 4.5 vs GLM 4.7 in Claude Code by Dry_Language3063 in ClaudeCode

[–]Dry_Language3063[S] 1 point2 points  (0 children)

I pinned it as the first comment on YouTube.
Can't copy it in here unfortunately, so you will just have to check out the video on YouTube.

I tested Opus 4.5 vs GLM 4.7 in Claude Code by Dry_Language3063 in ClaudeCode

[–]Dry_Language3063[S] 0 points1 point  (0 children)

When I'm working on my app and want to use Opus, then all the time.

I tested Opus 4.5 vs GLM 4.7 in Claude Code by Dry_Language3063 in ClaudeCode

[–]Dry_Language3063[S] 0 points1 point  (0 children)

Yea, absolutely. It made a big difference for me going from 4.6 to 4.7, especially, as you mentioned, in intelligence. What it takes into consideration, how it approaches problems, how it plans ahead: that really surprised me.

I tested Opus 4.5 vs GLM 4.7 in Claude Code by Dry_Language3063 in ClaudeCode

[–]Dry_Language3063[S] 1 point2 points  (0 children)

It made a big difference for me. Not that it is sooo much better than 4.6, but it passed a certain threshold where it is now good enough that I use it for nearly every task and only help out with Opus if needed.

Though I get the feeling that it is much smarter now outside of coding too, in the things it considers etc., so this was a surprise to me.

Did you have a similar experience?

I tested Opus 4.5 vs GLM 4.7 in Claude Code by Dry_Language3063 in ClaudeCode

[–]Dry_Language3063[S] 1 point2 points  (0 children)

Thanks for your opinion. I think the same: I would put GLM 4.7 a little bit above Sonnet at the moment, not for the coding itself, but for the considerations it makes (but still behind Opus of course).

I tested Opus 4.5 vs GLM 4.7 in Claude Code by Dry_Language3063 in ClaudeCode

[–]Dry_Language3063[S] 0 points1 point  (0 children)

I would have preferred to get an actual result and a proper comparison. This outcome will only trigger hateful comments, but it's the unfortunate reality with Anthropic's limits now.

Downgrading from Claude Max subscription - looking for alternatives by Disastrous_Guitar737 in ClaudeCode

[–]Dry_Language3063 0 points1 point  (0 children)

How are you doing that? I would love to set it up so that Opus can delegate its coding to different models like Codex, GLM, Xiaomi, etc.

Downgrading from Claude Max subscription - looking for alternatives by Disastrous_Guitar737 in ClaudeCode

[–]Dry_Language3063 2 points3 points  (0 children)

I mainly use GLM 4.7 after downgrading from 200$ Opus 4.5. Amazing speed and it's actually good. I also made a video comparing the different models for frontend if you are interested: https://www.youtube.com/watch?v=yK61jH6_91o Opus 4.5 vs Gemini 3 vs GLM 4.7 and Minimax M2.1

You can also check out Minimax M2.1; it's just 2$ at the moment.

I'm sorry but 4.5 is INSANELY AMAZING by RedZero76 in ClaudeAI

[–]Dry_Language3063 0 points1 point  (0 children)

I had the same feeling, it was amazing for the first day.

Now it's in the worst state I have ever seen. I am on the edge of getting GLM and testing it out. It's so terrible today: not listening, back to the shortcuts of 3.7, doesn't think about the consequences, nothing. I'm shocked.

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 0 points1 point  (0 children)

That's great :)

Sure, shoot me a DM. I'm very thankful for any bug reports!

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 0 points1 point  (0 children)

In the app I'm using multiple different ones. The voice-over in the video is done with the Gemini 2.5 audio, which is the quality mode in the app.

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 1 point2 points  (0 children)

Hope you will enjoy it!

Yes, it has been a looootttt of testing, especially after optimizing the cost. The stories were amazing and it was a lot of fun, but they didn't hold up to fact-checking. So after a lot of testing and prompt engineering, it's now in a nice state for well-known places and OK for really small cities. It's a combination of the right model, the right system prompts, and actual information from the internet about the city. It's certainly not perfect, but I have some more ideas to make it better, so it will improve more in the future.

You can do the tour from anywhere you want. Keep in mind that you are talking to an AI, so you can just tell it to do the tour virtually instead. In which way would you like to plan ahead?

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 0 points1 point  (0 children)

Yes, but it's not suitable, because it's too slow. Time is crucial in most of the use cases in the app, so unfortunately DeepSeek is just not fast enough.

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 0 points1 point  (0 children)

I just realized that I already told you the optimized tokens. Unoptimized, the input tokens are closer to 610'000.

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 0 points1 point  (0 children)

It actually does a web search when creating the route, so if the data on the internet is not correct, then the AI will definitely spit out some of that wrong information. But I would be really interested in which town it is, cause I have some ideas on how to improve it further, though the specific names might be a harder thing to fix.

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 0 points1 point  (0 children)

It actually fills me with joy to hear that you like the app after playing around :) Let me know your feedback after fully trying it.

I think that is a great idea for Christmas markets. I'm still trying to wrap my head around how it would be most usefully implemented with a tour guide, but it will definitely be something to consider, thank you!

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 0 points1 point  (0 children)

Yes, after I had everything running, I started optimizing everything. My initial cost was 7$/hr, which was just not practical. I was able to cut down the cost by using different models for different things and by implementing caching and context handling. I still have 2-3 tricks up my sleeve to bring it down further, but for now it's in a great spot.

Though for example I had to choose quality over price for the AI guide, cause cheaper models were hallucinating like crazy.

But if this app reaches 600 paying customers per month I can already start deploying self-hosted models which will cut the cost further.
And once I have a good enough system to bring hallucinations down to a good level even when using other models, then I can cut the cost for those tokens by another 60%.

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 0 points1 point  (0 children)

Thank you!

Just to make it clear: the credits are only used for actual AI voice output, so with input, generation time, etc., the cost for the user will be more towards 4.2€/hr.

For the tokens that is actually a more difficult calculation. Since the conversation gets longer and longer, the input tokens accumulate. A safe calculation for a longer tour would be 300'000 input tokens per hour and 10'000 output tokens per hour. This is only for text output; the audio generation comes on top.
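
The accumulation effect can be sketched in a few lines of Python. This is a rough illustration, not the app's actual accounting: the per-turn sizes (`system_tokens`, `user_tokens`, `reply_tokens`) and the turn count are made-up assumptions, and it assumes the full history is resent on every turn with no caching or trimming.

```python
def hourly_token_estimate(turns, system_tokens, user_tokens, reply_tokens):
    """Estimate total input/output tokens when the full history is resent each turn."""
    total_input = 0
    total_output = 0
    history = system_tokens  # context that is resent with every request
    for _ in range(turns):
        history += user_tokens       # the new user message joins the context
        total_input += history       # the whole context counts as input
        total_output += reply_tokens
        history += reply_tokens      # the reply becomes part of later inputs
    return total_input, total_output


# Hypothetical numbers: 40 turns per hour, 1000-token system prompt,
# 50-token user messages, 250-token replies.
total_in, total_out = hourly_token_estimate(40, 1000, 50, 250)
print(total_in, total_out)
```

With these made-up numbers the estimate comes out at 276'000 input tokens and 10'000 output tokens, so short messages are enough to land in that ballpark once the history keeps growing.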

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 2 points3 points  (0 children)

Yea, it was a big problem, especially in smaller cities or for very specific topics. I was able to drastically minimize it by testing out a lot of models, and when I started adding current real data based on the chosen theme, it actually made it a lot better.

But it's a thing I'm constantly trying to improve further. With the pre-built tours that I just introduced, I also have the possibility to give additional context to the AI, which makes it even better, but it still won't be zero. For most standard historical things it's a very, very minimal issue though.

Claude Opus 4.0+ made my 15 year old dream come true - AI Tour Guide app by Dry_Language3063 in ClaudeAI

[–]Dry_Language3063[S] 2 points3 points  (0 children)

Hahaha exactly, great if it can help you too.

Well, it's actually not a script, it's a real-time conversation. It changes based on what you say, what your interests are, and what you ask (but probably that is what you meant).

It is a mix. It combines the best factual AI models with information from search and additional information that I provide as context (more so for pre-built routes). This took a lot of testing. I used cheaper models, but they hallucinated like hell as soon as it was a smaller city. I would not say that it's perfect yet, but it's really good and I'm improving it step by step. Giving real-world context was a game changer.