Each realtime voice API lately β OpenAI Realtime, Gemini Reside, Hume β was once constructed for one person speaking to 1 AI. That breaks the instant a 3rd voice enters the room. Gross sales calls, school room debates, multi-agent workflows, staff brainstorms β all of them want voice infra that is aware of who is speaking, when to break, and the best way to let 3 audio system percentage a flip.
π Solar is constructed for that:
β’ Multi-speaker turn-taking
β’ 10Γ the context window of ChatGPT Realtime and Gemini Reside
β’ Agent-aware barge-in (no longer simply VAD)
β’ Multi-agent in a single room β run two AIs towards every different on an actual audio channel
These days’s PH be offering:
β’ Reside playground β take a look at it to your browser, no Credit score Card β https://getsun.io
β’ Reside demo at https://demo.getsun.io
Two issues I might love your lend a hand with:
1. Inform me the place this breaks. We have now stress-tested ~20 multi-speaker apps; we would like yours to be #21.
2. What integrations would free up you? LiveKit, Day by day, Vonage, Twilio, customized WebRTC β drop a remark.
Large due to Anoop for looking. Satisfied to reply to anything else within the feedback lately. π



