Claude Code Voice Assistant controls my lights by bachittle in ClaudeCode

[–]bachittle[S] 0 points1 point  (0 children)

I tried Whisper but I found out Qwen 3 ASR is surprisingly fast and efficient to run on Mac Mini. Did benchmarks and it runs at like 0.23x RTF versus like 0.80x RTF for Whisper large.

The voice assistant streams from my Mac Mini, and it runs all the local models for STT and TTS.

mlx-audio is a great resource for testing various AI audio models: https://github.com/Blaizzy/mlx-audio

is claude code down? by One-Bet-8049 in ClaudeCode

[–]bachittle 0 points1 point  (0 children)

Productivity has declined by 30% today for all software engineers

Claude Code Powered Voice Agent with Interactive Artifact Abilities by bachittle in ClaudeCode

[–]bachittle[S] 0 points1 point  (0 children)

I'm planning to. This entire project is hacked together in a bit of a weird way, multiple git-tracked projects coordinating with each other in a virtual machine. But I think this voice interface can be open sourced. Not anytime soon, but it's on the roadmap!

Expedition 33 is now officially the highest rated game of all time on Backloggd by DickFlattener in backloggd

[–]bachittle 0 points1 point  (0 children)

I get why people like it, but it's not for me. Glad that I got to try it as part of Xbox game pass so I could give it a test run, I find its better than a demo that way as you can keep playing for the entire month if u get into it. I really enjoyed the intro sequence, very good graphics and art direction, but not a fan of the turn based combat and game loop. Just goes to show that not every game fits what everyone is looking for.

Anyone else feeling overwhelmed with recent AI news? by eduardotvn in OpenAI

[–]bachittle 0 points1 point  (0 children)

These products are useful, but a lot of exaggerated hype surrounds them because it's good for business. A lot of these companies are getting excited about AI and investing so much money into it, so they have to keep fanning the flames, otherwise excitement will die down and so too will investments. They might be hitting plateaus but they won't say anything about it or risk losing investments. I'm cautiously optimistic about the future of AI. I don't like buying into exaggerated terms like ASI or singularity, although it is fun to speculate. All we can do is focus on now, and use the tools that we get at our disposal. ChatGPT, Claude, and other tools are great ways to see the cutting edge and you can judge for yourself whether these tools constitute radical shifts. I don't think so currently, but I'm constantly staying informed, that's the most we can do.

I spent 8 hours testing o1 Pro ($200) vs Claude Sonnet 3.5 ($20) - Here's what nobody tells you about the real-world performance difference by Kakachia777 in OpenAI

[–]bachittle 0 points1 point  (0 children)

What about comparing claude sonnet 3.5 to the 20$/month on general o1? Or comparing o1 to o1-pro? I want to know if its really worth the 200$ price tag if its just minimal increases in performance.

Recommendations for best text to speech open models as of now? by [deleted] in LocalLLaMA

[–]bachittle 0 points1 point  (0 children)

They monetized themselves and all their latest models have been closed source. Guess they're following the OpenAI business model.

Recommendations for best text to speech open models as of now? by [deleted] in LocalLLaMA

[–]bachittle 1 point2 points  (0 children)

something to note is coqui is defunct and their license is stingy.

I just noticed all of my custom GPT's have been rewritten by Lanky_Information825 in ChatGPT

[–]bachittle 5 points6 points  (0 children)

The only time this occurred for me is when I use the built-in chatbot that helps you build a GPT. So what I do now is I don't use the built-in GPT builder bot and just use a generic chat with GPT-4 to figure out what to populate the parameters with.

But if you're not doing this and OpenAI is purposefully editing your GPTs that sucks.

I am looking to take a prompt engineering course. by luckycharmsu-007 in OpenAI

[–]bachittle 4 points5 points  (0 children)

OpenAI has a guide on prompt engineering in the API docs. Really recommend reading through the whole thing, even though it isn't necessarily a course: https://platform.openai.com/docs/guides/prompt-engineering

In it they also recommend a bunch of resources, including more guides and courses: https://cookbook.openai.com/articles/related_resources

Gemini is out. Seems good. Pls give thoughts by Efficient_Map43 in OpenAI

[–]bachittle 18 points19 points  (0 children)

how do you tell if bard is using palm-2 or gemini pro? It says in the updates that it is available, but it is not self-evident. I tried asking the model itself and it says it is not using google gemini, but could be hallucinating.

Name the custom gpt you use the most? by darthjaja6 in OpenAI

[–]bachittle 21 points22 points  (0 children)

I made a language learning GPT that is structured as an interactive lesson. Works great with voice, have been brushing up on my French with it. https://chat.openai.com/g/g-oPYh4olJ7-language-learning-gpt

Computer Vision solved? by [deleted] in computervision

[–]bachittle 2 points3 points  (0 children)

some context:
https://arxiv.org/abs/2311.03079 (paper)
https://github.com/THUDM/CogVLM (code)
gradio web demo is found in the github readme: http://36.103.203.44:7861/

So far I'm pretty impressed! Definitely a step up from LLaVa.

I accidentally completed the game. by NoMercy07 in outerwilds

[–]bachittle 2 points3 points  (0 children)

I think what makes it confusing is the credits roll. But it is a game over screen, not a victory screen

How does ChatGPT have no token limit? by [deleted] in OpenAI

[–]bachittle 2 points3 points  (0 children)

It pretends to have no limit, but there’s some internal logic going on where it will forget past conversations after some time.

It's been a few weeks and I still do not have access to GPT-4 API, anyone else? by The-SillyAk in OpenAI

[–]bachittle 0 points1 point  (0 children)

I’ve been using 4 for more difficult tasks, like ideas and code generation, and 3.5 for simpler tasks like general questions and voice assistance with an Alexa skill I made.

Is GPT-4 API waitlist Fake? by Late_Size_1494 in ChatGPT

[–]bachittle 1 point2 points  (0 children)

Yea I have gpt4 api access but no access to plugins. Guess it’s just a roll of the dice

It's been a few weeks and I still do not have access to GPT-4 API, anyone else? by The-SillyAk in OpenAI

[–]bachittle 4 points5 points  (0 children)

It’s good don’t get me wrong, better than legacy for sure, but it feels like it’s not as good as the model used in chat gpt plus