About Omniloy
We’re Omniloy, a healthtech startup on a mission to transform healthcare with AI-driven automation solutions. Founded by two Spaniards, Mar Pujadas and Enrique Alcázar, and backed by top-tier funds like JME Ventures, Wayra Telefónica, and Restive Ventures, we're building a world-class team — and we need people like you!
We believe great work happens when people work in the way that suits them best, whether that’s from an office, from home, or somewhere in between. We keep things flexible and trust you to manage your time, knowing we’ll do our best work when we stay connected and support each other.
Your Mission
As a Senior Software Engineer at Omniloy, you will be a key player in building, optimizing, and scaling our cutting-edge voice-to-voice AI pipeline. This system combines Automatic Speech Recognition (ASR), Large Language Models (LLMs), and Text-to-Speech (TTS) to enable natural, real-time conversations between healthcare professionals and our AI. Your mission will be to lead the development and deployment of this pipeline, ensuring it performs reliably at scale — even with tens of thousands of concurrent users.
As a Senior AI Software Engineer at Omniloy, you will:
* Architect, implement, and maintain a robust voice-to-voice AI pipeline using state-of-the-art ASR, LLM, and TTS technologies.
* Optimize models and infrastructure to support scalability, reliability, and low latency.
* Collaborate closely with machine learning engineers, data scientists, and product teams to refine conversational AI features and enhance user experience.
* Evaluate and integrate open-source and third-party models and technologies into the pipeline.
* Develop automated testing, monitoring, and logging to ensure system stability and performance.
* Identify bottlenecks and proactively implement performance improvements to scale the platform effectively.
* Document system architecture, deployment processes, and best practices.
Ideally, you’ll have
Must-haves:
* 5+ years of professional software development experience, preferably with a focus on real-time systems and scalable backend services.
* Demonstrated expertise in deploying ASR, LLM, or TTS models in production environments.
* Proficiency in Python and familiarity with frameworks and libraries such as PyTorch, TensorFlow, or Hugging Face.
* Experience deploying and scaling services using cloud platforms (e.g., AWS, GCP, Azure), Kubernetes, and Docker.
* Strong understanding of microservices architecture, REST APIs, and streaming data technologies.
* Excellent problem-solving skills, strong attention to detail, and a commitment to quality and performance optimization.
* Strong communication skills and the ability to collaborate effectively in cross-functional teams.
Nice-to-haves:
* Experience specifically with voice-to-voice AI pipelines and real-time conversational agents.
* Hands-on experience working with LLMs, including LangChain or similar LLM-related libraries.
* Knowledge of WebRTC, LiveKit, gRPC, or related real-time communication technologies.
* Experience with fine-tuning and optimizing LLMs for specific tasks.
* Familiarity with model optimization techniques (quantization, distillation) and hardware acceleration (GPUs, TPUs).
* Previous experience scaling services to tens of thousands or millions of users.
Wondering what’s in it for you?
* Competitive salary
* Remote-first culture with flexible hours and time off — including your birthday off
* Work on cutting-edge AI tech that’s transforming healthcare
* Two annual team offsites to connect with your coworkers and have fun
* A supportive, fast-moving team where you can grow your skills and your impact
If you're ready to join a passionate team that’s redefining how the healthcare system works, we’d love to hear from you 😊