Built an MCP for outbound phone calls that pauses to ask you for info mid-call
I built an mcp that gives your agent a phone (your phone). If it hits a question it can't answer mid-call, it pauses and pings you back with the specific question instead of guessing or hanging up.
You provide an objective along with phone number and identity of the recipient to initiate the call. Internally, it uses full-duplex system with speech-to-speech model rather than cascade of stt, llm and tts. The voice agent has tools to gracefully send questions to you mid-call while continuing the conversation, to navigate ivr and to hand-off the call back to you if needed.
I had been working with real-estate and manufacturing firms where phone calls are the most common forms of communication. A lot of them are follow-ups, arranging of meetings to showcase property/inventory, chasing deliveries etc. Too contextual yet too repetitive.
While there are voice agents and frameworks in the market like VAPI, Retell, Bland, they all cater to inbound workflows primarily geared for support and marketing. Outbound calls are much less structured and require an on-demand experience.
Phone number verification is required before making calls. This allows showing your number as the caller. The web app allows listening to calls live, downloading recordings and viewing transcripts.
Site: https://cocall.ai
Add as a connector using these instructions: https://cocall.ai/docs/claude
Would love feedback, and happy to answer anything about the implementation.