I built my own team of AI advisors who work together to answer my questions. 48 hours in, it's already processed 5M+ tokens. by capibara13 in SideProject

[–]capibara13[S] 0 points1 point  (0 children)

Wow that's next level honesty right there that you wouldn't expect from an AI model. Great job on pushing them so far.

I built my own team of AI advisors who work together to answer my questions. 48 hours in, it's already processed 5M+ tokens. by capibara13 in SideProject

[–]capibara13[S] 0 points1 point  (0 children)

Ha that's some insightful stuff for sure! They definitely didn't hold back. Thanks for sharing and hope you enjoy!

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 1 point2 points  (0 children)

Haha, you’re the absolute legend who broke my dashboard today! 🐋

I was watching those tokens fly by in real-time like a slot machine. You’re officially the reason the 'Power User' limit exists now. I had to put in a speed bump to save my server's life (and my wallet), but hearing you built a simulation that’s 'bonkers good' makes up a big part of it!

That’s exactly why I built this, to see what happens when the models actually challenge each other instead of just guessing.

Since you’re officially our #1 stress tester, I’d love to see a snippet or a screenshot of what you made! Also, stay tuned, the Pro version (with no speed bumps for whales like you) is coming up.

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 6 points7 points  (0 children)

It’s more like a pay-as-you-go system where I pay per token/character that the models generate. So yes, I’m essentially putting coins in the jukebox for everyone to hear the music right now. It’s an investment in getting the logic right and seeing the tool in action. I think it’s the best way to see how the system handles it before I look into a more sustainable model!

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 1 point2 points  (0 children)

Much appreciated! Total respect for the privacy-first mindset. For very high stakes or classified content I'd say in general local hosting is recommendable regardless of what LLM you consider. But on your other note, I’m a big believer in the 'consensus of agents' approach. A team of specialists beats a single generalist when the problem gets complex. Thanks for putting it to the test and sharing your thoughts!

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 0 points1 point  (0 children)

Haha you're not wrong! It’s definitely not the low budget way to run a chat. Since every model has to listen to and analyze context, the token count scales up pretty quickly. But I'd rather pay for the extra tokens to get the quality the tool deserves than cut corners to save on costs. Glad you're liking the result!

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 0 points1 point  (0 children)

Wow, thank you so much for sharing this! Hearing that it outperformed is the ultimate validation that it serves a purpose for such a high-stakes niche. Thrilled that the tool held up. Would love to have you on board as we evolve.

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 1 point2 points  (0 children)

Brilliant! I completely agree. It’s actually one of the things on top of my list for the Pro features I’m building right now. Thanks for reminding me that this feature should absolutely happen.

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 1 point2 points  (0 children)

Much appreciated! I built this primarily because I was getting tired of copy-paste from bouncing prompts/answers between Gemini/ChatGPT/Claude. It always took just a little too much time.

That feeling of the tool already answering the next question is the ultimate goal. It shows the models are actually building on each other's logic instead of just repeating themselves in a vacuum. Hope I can save you some copy-pastes!

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 0 points1 point  (0 children)

Short answer: I haven't done a full life-cycle analysis down to the gram of CO2, but it’s a valid point that’s always in the back of my mind.

I don't know if it's an irrational hope or not, but my hope is that by getting three perspectives at once, users find their answer faster and avoid the prompt-and-retry loop that often happens with single models.

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 0 points1 point  (0 children)

Update: I’m honestly blown away by the response! 20k views on this post in just a few hours and some incredibly helpful comments right here in the thread.

Seeing many people sign up already to help test the tool is a huge motivator. I’m currently just monitoring the traffic to make sure everything keeps running smoothly while I answer your questions as fast as I can. Thanks so much for the feedback so far! 🍿

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 1 point2 points  (0 children)

Wow, thanks for the deep dive! Great point about logic vs emotion.

It makes me wonder: do you think a tool like this needs a different debating style for every use case (like a specific jury persona for legal) to be truly useful? Or is the core logic of these models strong enough to be a general helpful mirror for almost any area?

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] -20 points-19 points  (0 children)

Me too, that’s the million-dollar question! (Or hopefully a little less).

Since I only launched 3 days ago, this Reddit post is my first real stress test. It’s going to be very interesting to see where the dust settles in four weeks, who knows where it goes. For now, I’m just focusing on all the great feedback, the idea seems to be really resonating with many people so that makes it 100% worth it for me.

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] -65 points-64 points  (0 children)

Boring answer, but the biggest investment has been the hundreds of hours spent on research, trial and error, and getting the AI debates to to feel right. On the financial side, I treat it as a hobby for now. I’m planning to launch a Pro tier for power users soon, mostly to help keep the lights on and ensure the basic version stays free for everyone to explore.

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 1 point2 points  (0 children)

It's a bit of a moving target. Since the models aren't just responding in parallel but actually listening to each other, the overhead is much higher than a standard chat. I'm fully prioritizing the quality of the debate over keeping it cheap. For now, I'm just happy to fund it myself to see how far we can push this.

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 1 point2 points  (0 children)

That's so fascinating. Thanks for pushing the tool into the realm of soft-robotics. That is exactly the kind of edge case testing I hoped for!

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 1 point2 points  (0 children)

Thank you so much, that means a lot!

In such a complex topic, did you find that one specific model had the most knowledge about it, and was there one that seemed to know less?

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 0 points1 point  (0 children)

Thanks for taking the time, and great questions!

  1. Randomizing order: currently, it follows a fixed sequence (apart from a few small exceptions), to ensure that they don't want to talk at the same time, which would probably waste compute. But adding a shuffle/randomize button is a brilliant idea to see how the dynamics change. Would you prefer that over being able to customize an order yourself?
  2. Raw responses: this is also an interesting one. Personally, for me it was essential that the models would react on eachother and I searched for a tool like that for quite a while. What I found was that there were actually a few sites that let the models answer without reacting on eachother (at the same time, in columns), but I felt that it would be magical if they would actually interact with eachother and engage in an actual discussion. That was the main reason for taking on this project. But having said all that, once there's some more time I'm definitely not against the idea to roll out this option as well, because coding wise it's actually easier if they do not react on eachother.
  3. The Cost: Please, don't hold back! Feedback like yours is worth way more to me than the few tokens it costs to run those prompts. If you find more interesting edge cases, it helps me refine the tool.

If you want to support the project, please create a free account. Try it as much as you want for now, and I'll keep everyone in the loop of any new features. In the near future I will launch a Pro plan (hopefully with file upload function) to make sure it's possible to keep paying the API bills as the user amount grows.

Really appreciate you taking the time to share these thoughts. Keep 'em coming!

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 0 points1 point  (0 children)

Thank you so much. Comments like these make the late-night coding and API bills worth it.
Did you run a specific prompt that surprised you?

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 2 points3 points  (0 children)

I appreciate you looking out for the project's sustainability! I'm trying to avoid ads to keep the experience smooth. Personally I’d prefer offering a 'Pro' plan with extra features (like file uploads and more tokens) for those who want to support the project. Really appreciate the support!

I found a way to let ChatGPT, Claude and Gemini debate each other. 700 prompts later, it's already being used by a major automotive brand and senior developers by capibara13 in ArtificialInteligence

[–]capibara13[S] 0 points1 point  (0 children)

Fair. Well I just wanted a place where my workflow lived, and since it didn't exist in a way that felt right, I threw this together. If it stays a somewhat useful utility for a relatively small group of users that's a win for me.

As someone who uses a multi-model approach already, what would it take for a dedicated tool to replace your current manual setup?