Responsibilities
* Design and implement real-time streaming voice pipelines, including audio ingestion, VAD / endpointing, Speech-to-Text (STT), LLM orchestration, and streaming Text-to-Speech (TTS) .
* Own and optimize turn-taking mechanisms, including barge-in, interruption handling, silence detection, endpoint tuning, and latency vs accuracy trade-offs.
* Integrate voice solutions with telephony and application channels such as SIP, WebRTC, CPaaS, PSTN, and IVR .
* Implement reliability and resilience mechanisms, including retries, backpressure handling, rate limiting, and fallback strategies between open-source and vendor components.
* Collaborate with Data Scientists and AI Engineers to design production-ready AI solutions aligned with business and technical constraints.
* Build, automate, and maintain ML pipelines for development, testing, deployment, and monitoring.
* Contribute to CI/CD workflows, including Docker image creation, automated testing, and deployment pipelines.
* Support production teams with environment setup, monitoring, retraining workflows, and performance optimization .
* Monitor solution quality through WER analysis, latency p95 measurement, conversation KPIs, and business performance metrics .
* Ensure continuous improvement of AI services through model monitoring, retraining, and production support .
Required Skills
* Strong expertise in Python
* Good knowledge of one systems language: Go / Rust / C++
* Experience with streaming audio architectures
* Strong hands-on experience with WebSockets and gRPC
* Expertise in ASR / speech AI frameworks such as:
* Whisper
* NVIDIA NeMo
* wav2vec2
* Experience with Neural TTS stacks
* Hands-on experience in:
Real-time STT
Real-time TTS
Barge-in / turn-taking
Latency optimization
* Telephony and voice integration experience:
SIP
WebRTC
CPaaS
PSTN / IVR
* Audio codecs ( u-law / A-law )
* Experience with Docker / Containerization / Virtualization
* Strong knowledge of GitLab CI / CI-CD
* Experience with code, model, and data versioning
* Dependency and package management experience
* Experience with PostgreSQL
* Knowledge of Agile methodology
Preferred Skills
* Speaker diarization
* Echo cancellation
* Semantic VAD / endpointing
* Experience in banking, insurance, or healthcare environments
* Experience with distributed systems / mainframe integrations
Soft Skills
* Strong communication skills
* Problem-solving and innovation mindset
* Attention to detail
* Results-driven attitude
* Continuous learning mindset
* Openness to feedback and change