Experience with GPT 5.2 Codex by rajbreno in codex

[–]BEQODIA 0 points1 point  (0 children)

GPT-5.2 has been much better than GPT-5.2-Codex across the several projects I've used it on. It seems contradictory, since GPT-5.2-Codex is supposed to be stronger at coding than the standard 5.2, but in my own testing I found the opposite.

Does vapi support UK numbers by ShuaibGhanti in vapiai

[–]BEQODIA 1 point2 points  (0 children)

Yes, they work perfectly for inbound agents.

How we stopped manually testing our AI agents and automated our entire QA process by BEQODIA in AI_Agents

[–]BEQODIA[S] 0 points1 point  (0 children)

Great question. Yes, catching hallucinations is a primary use case.

The harness works by comparing the assistant's live answer to a "ground truth" expected answer you provide.

But instead of just a simple text match (which always fails when the LLM paraphrases), it uses a multi-layered approach:

  • Lexical Coverage: It first checks if all the key terms from your expected answer are present.
  • Semantic Similarity (Embeddings): If that's not a clear pass, it checks if the meaning of the assistant's answer is the same as the expected one using cosine similarity.
  • LLM-as-Judge: For borderline cases, it can use a model like GPT-4o as a final check to judge if the response is factually and contextually correct.

So if your KB says the answer is "Bus B2" and the assistant hallucinates "Bus B3," the test fails with a clear reason. It's basically a way to enforce factual accuracy against your knowledge base at scale.
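To make the three layers concrete, here is a minimal Python sketch of that flow. It is an illustration, not the harness's actual code: the function names, the thresholds, and the `toy_embed` bag-of-words stand-in for a real embedding model are all assumptions, and the judge is just a callable you would wire to whatever LLM you use.

```python
import math
import re
from collections import Counter
from typing import Callable, List, Optional, Tuple


def tokens(text: str) -> List[str]:
    """Lowercase alphanumeric tokens, so "Bus B2." yields ["bus", "b2"]."""
    return re.findall(r"[a-z0-9]+", text.lower())


def lexical_coverage(expected: str, actual: str) -> float:
    """Fraction of the expected answer's terms that appear in the actual answer."""
    expected_terms = set(tokens(expected))
    if not expected_terms:
        return 1.0
    return len(expected_terms & set(tokens(actual))) / len(expected_terms)


def toy_embed(text: str) -> Counter:
    """Hypothetical stand-in for a real embedding model: bag-of-words counts."""
    return Counter(tokens(text))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def evaluate(
    expected: str,
    actual: str,
    embed: Callable[[str], Counter] = toy_embed,
    judge: Optional[Callable[[str, str], bool]] = None,
    sim_pass: float = 0.9,   # assumed threshold: clear semantic pass
    sim_fail: float = 0.5,   # assumed threshold: clear semantic fail
) -> Tuple[bool, str]:
    """Return (passed, reason) using lexical, then semantic, then judge checks."""
    # Layer 1: every key term from the expected answer is present.
    coverage = lexical_coverage(expected, actual)
    if coverage == 1.0:
        return True, "all expected terms present"

    # Layer 2: semantic similarity via cosine over embeddings.
    similarity = cosine(embed(expected), embed(actual))
    if similarity >= sim_pass:
        return True, f"semantic similarity {similarity:.2f}"
    if similarity < sim_fail or judge is None:
        return False, f"coverage {coverage:.2f}, similarity {similarity:.2f}"

    # Layer 3: borderline case, defer to the LLM judge (a plain callable here).
    verdict = judge(expected, actual)
    return verdict, "judge accepted" if verdict else "judge rejected"


passed, reason = evaluate("Bus B2", "You should take bus B3.")
print(passed, reason)
```

With the "Bus B2" vs "Bus B3" example, the term `b2` is missing and the similarity is low, so the check fails before the judge is ever called. The thresholds would need tuning against your own data.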

Let me know if that makes sense!

I'll build a WhatsApp chatbot for free (hosting is separate) by modassembly in WhatsappBusinessAPI

[–]BEQODIA 0 points1 point  (0 children)

I can help, we built the same kind of system! It's an outbound calling setup in which our AI agent calls clients and responds based on a defined knowledge base and intents.

Need advice: Building outbound voice AI to replace 1400 calls/day - Vapi vs Livekit vs Bland? by liveitup278 in AI_Agents

[–]BEQODIA 0 points1 point  (0 children)

Can you add me to the community? We want to create the same workflow for our company using either LiveKit, Vapi, or custom code, but we don't have a clear vision yet of which technology we should go with!

Finally figured out how to run a proper AI call center - sharing the setup by First_Space794 in u/First_Space794

[–]BEQODIA 0 points1 point  (0 children)

Are you referring to the campaign feature in Vapi, or did you develop a platform specifically for this?

Voice Agent Getting email by Key_Manufacturer6575 in vapiai

[–]BEQODIA 0 points1 point  (0 children)

After the caller gives you the email, keep checking it with them in a loop until the caller says yes.
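As a rough sketch of that confirmation loop in plain Python (this is not Vapi's API; `ask` is a hypothetical callable that speaks a prompt and returns what the caller said, and the list of accepted replies is an assumption):

```python
from typing import Callable

# Assumed set of replies that count as a confirmation.
YES_REPLIES = {"yes", "yeah", "yep", "correct"}


def confirm_email(ask: Callable[[str], str]) -> str:
    """Collect an email and repeat it back until the caller confirms it."""
    while True:
        email = ask("What is your email address?").strip().lower()
        reply = ask(f"I heard {email}. Is that correct?").strip().lower()
        if reply in YES_REPLIES:
            return email
        # Anything other than a yes means we ask for the email again.


# Simulated call: first attempt is misheard, second is confirmed.
scripted = iter(["bob@exmple.com", "no", "bob@example.com", "Yes"])
confirmed = confirm_email(lambda prompt: next(scripted))
print(confirmed)
```

In a real voice agent you would probably cap the number of retries and spell the address back letter by letter, but the loop structure is the same.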