Job Description
Leo Technologies is a pioneering company dedicated to developing innovative solutions that leverage advanced audio and speech processing technologies. Committed to transforming how organizations extract meaningful insights from audio data, Leo Technologies specializes in building scalable, high-precision transcription and speech recognition systems. Their focus on cutting-edge machine learning models, cloud infrastructure, and real-time processing enables them to serve diverse industries including security, media, healthcare, and enterprise analytics. With a culture rooted in innovation, collaboration, and continuous improvement, Leo Technologies strives to push the boundaries of what is possible in speech and audio intelligence, ensuring their clients stay ahead in a rapidly evolving digital landscape.
About The Role
We are seeking a highly skilled Transcription Engineer to join our Platform Team in a remote, work-from-home capacity. This role is central to our mission of extracting actionable intelligence from complex audio environments. As a Transcription Engineer, you will be responsible for enhancing transcription quality across various workflows by experimenting with Automatic Speech Recognition (ASR) models, integrating third-party services, and developing tooling that guarantees accuracy, reliability, and scalability. The ideal candidate possesses a robust software engineering background, with expertise in Python, audio processing, and applied machine learning techniques tailored for speech recognition. This position combines hands-on engineering, data science, and DevOps skills, involving experimentation with ASR models and deploying services into production at scale. It offers a challenging yet rewarding opportunity for professionals passionate about audio, language, and building high-quality systems that power real-world intelligence use cases.
Qualifications
The ideal candidate should have a strong foundation in software engineering, with a minimum of five years of professional experience in speech processing, NLP, or transcription systems. Proficiency in Python is essential, along with familiarity with system-level programming when necessary. Experience with ASR frameworks such as Whisper, Kaldi, Vosk, or NVIDIA NeMo is highly desirable. Candidates should have a solid understanding of audio engineering tools like ffmpeg and Sox, and techniques for denoising and voice enhancement. Knowledge of speaker diarization, speaker recognition, and multi-language ASR challenges is important. Experience with data analysis tools such as Pandas, NumPy, and Jupyter for evaluating model performance is required. A good understanding of cloud deployment and DevOps practices, including Docker, Kubernetes, and serverless architectures, is also necessary. The ability to work independently in a fast-paced environment, make tradeoffs, and deliver results with minimal supervision is crucial. Bonus points are awarded for experience in fine-tuning ASR models on domain-specific datasets, real-time streaming pipelines, search and retrieval systems like Elasticsearch, and prior work in audio forensics or noisy-channel speech analysis.
Responsibilities
The core responsibilities of this role include leading efforts to improve transcription quality by evaluating, testing, and fine-tuning various ASR models, whether commercial APIs or open-source solutions. You will build and optimize pipelines capable of speaker identification, diarization, multi-language support, and noise-robust transcription in challenging audio conditions. Developing and maintaining resilient services that integrate multiple ASR providers to ensure flexible and reliable workflows is essential. Collaborating with platform engineers to facilitate seamless ingestion and storage of transcription outputs within data pipelines is a key aspect of the role. You will analyze transcription data to identify error patterns and explore audio engineering techniques such as denoising, voice isolation, and signal processing to enhance speech clarity. Deploying and maintaining transcription-related services with basic DevOps practices to ensure scalability and reliability is also part of your responsibilities. Additionally, participating in all stages of the development lifecycle—from ideation and design through prototyping, implementation, deployment, and iteration—is expected to ensure continuous improvement and innovation.
Benefits
Leo Technologies offers a comprehensive benefits package designed to support your well-being and professional growth. This includes a competitive salary range of $140,000 to $170,000 annually, commensurate with experience, work location, and role level. The company provides a generous three-week paid vacation from the outset, along with sick leave and paid holidays to promote work-life balance. Employees enjoy a modern, flexible work environment that emphasizes collaboration and continuous learning. The organization supports remote work, allowing you to work from the comfort of your home while engaging with a talented team of professionals. Benefits also include comprehensive medical, dental, and vision plans to ensure your health and wellness are prioritized. The company fosters a culture of innovation, offering opportunities to work with cutting-edge technologies and contribute to impactful projects. Regular feedback and growth opportunities help employees develop their skills and advance their careers within a supportive environment.
Equal Opportunity
Leo Technologies is an equal opportunity employer. They are committed to fostering an inclusive workplace that values diversity and equal opportunity for all employees and applicants. The company does not discriminate on the basis of race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, or any other legally protected status. All employment decisions are made based on qualifications, merit, and business needs. Leo Technologies strives to create a welcoming environment where everyone can thrive and contribute to the company’s success.
Desired Skills and Experience
Speech Recognition, Audio Processing, Machine Learning, Python, DevOps, NLP