AI/ML Developer (Speech AI)

No salary information
Unspecified · Full-time · B2B · Employment contract
#358094 · Added 20 days ago
Source: Coherent Solutions

Tech Stack / Keywords

AI · Voice Activity Detection · PyTorch · TensorFlow · LLM · Python · Cloud

Company and position

Our client is a technology startup building advanced voice automation solutions for the quick-service restaurant industry. The company develops a privacy-conscious, high-performance drive-thru voice assistant that automates real-time customer interactions, improves order accuracy, and helps restaurant chains increase revenue and operational efficiency. Its product is designed for fast deployment in noisy, high-volume drive-thru environments and is already gaining market traction through cooperation with a major restaurant chain.

The project is a next-generation voice automation engine for drive-thru order-taking in quick-service restaurants. It enables fully automated, real-time conversations between customers and the restaurant ordering system using modern speech recognition, natural language processing, and text-to-speech technologies.


Requirements

  • Advanced Python development skills
  • Deep hands-on expertise with Speech-to-Text and Text-to-Speech systems
  • Proven experience improving speech recognition quality in noisy or otherwise challenging acoustic environments
  • Strong expertise in noise suppression, echo cancellation, voice activity detection, and speech enhancement
  • Strong understanding of real-time and streaming audio architectures, including conversational voice pipelines and real-time inference
  • Experience building low-latency, production-grade AI systems
  • Experience with modern speech AI frameworks, models, and APIs
  • Experience deploying and scaling AI services in cloud environments
  • Ability to troubleshoot complex audio quality, latency, and reliability issues
  • Product-oriented mindset with a focus on real-world performance, customer experience, and high ownership
  • Ability to collaborate effectively with engineering, LLM, and conversational AI teams
  • English level: B2 or higher

Responsibilities

  • Optimize low-latency, real-time Speech-to-Text pipelines for production drive-thru environments
  • Improve Text-to-Speech naturalness, responsiveness, and overall conversational quality
  • Design, tune, and improve noise suppression, echo cancellation, and speech enhancement systems
  • Improve speech recognition accuracy and robustness under challenging acoustic conditions, including engine noise, weather, overlapping speech, poor microphone quality, and outdoor environments
  • Build and scale audio processing infrastructure for production deployments
  • Evaluate, benchmark, and compare speech models using real-world audio data and production scenarios
  • Experiment with modern Speech AI technologies, models, and architectures to improve system performance
  • Collaborate with LLM and conversational AI teams to improve end-to-end voice interaction quality

Offer

  • Technical and non-technical training for professional and personal growth
  • Internal conferences and meetups to learn from industry experts
  • Support and mentorship from an experienced employee to help you grow and develop professionally
  • Health insurance
  • English courses
  • Sports activities to promote a healthy lifestyle
  • Flexible work options, including remote and hybrid opportunities
  • Referral program for bringing in new talent
  • Work anniversary program and additional vacation days
Health care
Language courses
Sports card
Flexible hours
Training subsidies
Conference budget
Coherent Solutions Sp z o.o.


18 active offers
