AI/ML Developer (Speech AI)

Brak informacji o wynagrodzeniu

NieokreśloneFull-time·B2B·Umowa o pracę

#358094·Dodano 20 dni temu·2

Źródło: Coherent Solutions

Aplikuj teraz

Tech Stack / Keywords

AIVoice Activity DetectionPyTorchTensorFlowLLMPythonCloud

Firma i stanowisko

Our client is a technology startup building advanced voice automation solutions for the quick-service restaurant industry. The company develops a privacy-conscious, high-performance drive-thru voice assistant that automates real-time customer interactions, improves order accuracy, and helps restaurant chains increase revenue and operational efficiency. Its product is designed for fast deployment in noisy, high-volume drive-thru environments and is already gaining market traction through cooperation with a major restaurant chain.

The project is a next-generation voice automation engine for drive-thru order-taking in quick-service restaurants. It enables fully automated, real-time conversations between customers and the restaurant ordering system using modern speech recognition, natural language processing, and text-to-speech technologies.

Wymagania

Advanced Python development skills
Deep hands-on expertise with Speech-to-Text and Text-to-Speech systems
Proven experience improving speech recognition quality in noisy or otherwise challenging acoustic environments
Strong expertise in noise suppression, echo cancellation, voice activity detection, and speech enhancement
Strong understanding of real-time and streaming audio architectures, including conversational voice pipelines and real-time inference
Experience building low-latency, production-grade AI systems
Experience with modern speech AI frameworks, models, and APIs
Experience deploying and scaling AI services in cloud environments
Ability to troubleshoot complex audio quality, latency, and reliability issues
Product-oriented mindset with a focus on real-world performance, customer experience, and high ownership
Ability to collaborate effectively with engineering, LLM, and conversational AI teams
English level: B2 or higher

Obowiązki

Optimize low-latency, real-time Speech-to-Text pipelines for production drive-thru environments
Improve Text-to-Speech naturalness, responsiveness, and overall conversational quality
Design, tune, and improve noise suppression, echo cancellation, and speech enhancement systems
Improve speech recognition accuracy and robustness under challenging acoustic conditions, including engine noise, weather, overlapping speech, poor microphone quality, and outdoor environments
Build and scale audio processing infrastructure for production deployments
Evaluate, benchmark, and compare speech models using real-world audio data and production scenarios
Experiment with modern Speech AI technologies, models, and architectures to improve system performance
Collaborate with LLM and conversational AI teams to improve end-to-end voice interaction quality

Oferta

Technical and non-technical training for professional and personal growth
Internal conferences and meetups to learn from industry experts
Support and mentorship from an experienced employee to help you professional grow and development
Health insurance
English courses
Sports activities to promote a healthy lifestyle
Flexible work options, including remote and hybrid opportunities
Referral program for bringing in new talent
Work anniversary program and additional vacation days

Opieka zdrowotna

Kursy językowe

Karta sportowa

Elastyczne godziny

Dofinansowanie szkoleń

Budżet konferencyjny

Coherent Solutions Sp z o.o.

18 aktywnych ofert

Zobacz wszystkie oferty

Aplikuj teraz