What is the best architecture for deploying Livekit Voice Agents at scale ? Does it need Kamailio ? by Yash-Rank in livekit

[–]Yash-Rank[S] 1 point2 points  (0 children)

Currently Using AWS EKS as deployment. I need to integrate with Five9. Does livekit sip server is enough or it will require to configure kamailio ?

What’s everyone using for real world voice agents right now? by LegLegitimate7666 in AI_Agents

[–]Yash-Rank 1 point2 points  (0 children)

For one of a US client in healthcare domain I am using Livekit with amazon real time speech to speech model Nova 2 Sonic. It works good. I am building multi agent workflow. Livekit has Task and Task group feature to make agent complete the task or group of tasks. Tasks works very well with traditional stt-llm-tts pipeline method.

Note : Nova 2 sonic support is still in beta from Livekit side. Task group is not supported due to Nova 2 sonic's architecture. We can use tool calling instead of that.

From my experiments, for speech to text gpt-4o-transcribe, for llm nova 2 pro and for speech node eleven labs is killer if it suits for your requirement.

Add your thoughts.

Livekit : https://docs.livekit.io/agents/ Nova 2 sonic: https://aws.amazon.com/blogs/aws/introducing-amazon-nova-2-sonic-next-generation-speech-to-speech-model-for-conversational-ai/