Hey folks in the voice AI community,
I've been grinding away on integrating Telnyx telephony with Pipecat for a custom customer request bot – an inbound voice assistant that handles real conversations(customer issues).
My Pipecat playground (STT, LLM, TTS) is rock-solid locally...
But Telnyx transport? It's been a battle. The WebSocket connects, VAD detects speech, but the AI just blanks on understanding me – like the 8kHz telephony signal isn't hitting STT right, causing silent or hallucinated transcripts. Tried GPT-4o-mini STT in another project and it butchered Polish language- and English; clearly need telephony-tuned STTs.
I can list the key challenges I've wrestled with in Pipecat (v0.0.93) + Telnyx, plus quick notes on what I solved (spoiler: core understanding is still pending). My understanding is that Pipecat is still maturing in that domain.
Anyone nailed a working Telnyx + Pipecat integration for real-time agents? How do you tune STT (Deepgram, OpenAi, AssemblyAI, or others?) to grok the Telnyx frequency without losing the plot? Would you be so kind to Share your setup or fixes? – this could turn into long-term collab gold.
Cheers,
Arek