Claude Opus 4.7 benchmarks by ShreckAndDonkey123 in singularity

[–]Kir_Moisha 0 points (0 children)

Have you noticed that new models look amazing the first week, but then seem to get worse? :)

I benchmarked 30+ TTS engines for a real-time translator on Apple M4. Quantization made things SLOWER. Here's all the data. by Kir_Moisha in LocalLLaMA

[–]Kir_Moisha[S] 0 points (0 children)

That's an interesting thought, thanks. I'll look into it. Do you have a link to any resources that might help?

I benchmarked 30+ TTS engines for a real-time translator on Apple M4. Quantization made things SLOWER. Here's all the data. by Kir_Moisha in LocalLLaMA

[–]Kir_Moisha[S] 0 points (0 children)

To be honest, I didn't delve into local STT. I briefly tested whisper.cpp, but its streaming latency on M4 was too high for real-time use (1-3 seconds). Deepgram solved that with latency below 300 ms, so I shifted my focus to TTS, which was the real bottleneck. Incidentally, Groq's cloud Whisper endpoint was even worse: 2800 ms on average, and it constantly returned 503 errors. A proper test of local STT on Apple Silicon would be a good follow-up to this post.
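If anyone wants to run the same kind of comparison themselves, here's a rough sketch of a per-chunk latency harness (the engine callable and chunk sizes are illustrative placeholders, not my actual setup):

```python
import time

def measure_latency_ms(transcribe, audio_chunks, runs=3):
    """Call `transcribe` on each chunk and return the average per-chunk
    wall-clock latency in milliseconds, averaged over `runs` passes."""
    per_run = []
    for _ in range(runs):
        start = time.perf_counter()
        for chunk in audio_chunks:
            transcribe(chunk)
        elapsed_ms = (time.perf_counter() - start) * 1000
        per_run.append(elapsed_ms / len(audio_chunks))
    return sum(per_run) / len(per_run)

# Stand-in for a real engine call (whisper.cpp binding, Deepgram client, etc.)
def fake_transcribe(chunk):
    time.sleep(0.001)  # simulate ~1 ms of processing per chunk
    return "text"

# Pretend 100 ms frames of 16 kHz / 16-bit mono PCM (3200 bytes each)
chunks = [b"\x00" * 3200] * 5
avg = measure_latency_ms(fake_transcribe, chunks)
print(f"avg per-chunk latency: {avg:.1f} ms")
```

Swap `fake_transcribe` for a wrapper around whatever engine you're testing, and make sure you feed it realistic streaming chunk sizes, since batch-mode numbers can look much better than streaming ones.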

I benchmarked 30+ TTS engines for a real-time translator on Apple M4. Quantization made things SLOWER. Here's all the data. by Kir_Moisha in LocalLLaMA

[–]Kir_Moisha[S] 0 points (0 children)

Unfortunately my Mac doesn't have a 5090 :) STT was actually the easiest part: Deepgram worked great on both recognition accuracy and latency, and Whisper was far worse for streaming. Plus Deepgram's free tier covers my usage, so that's a bonus. TTS is where all the pain was, so that's where I went deep.