RL Environments Engineer

35,000 - 74,000 PLN/month · B2B (net)
Senior · Full-time · B2B
#333319 · Posted 20 days ago · 25
Source: SOLID.Jobs

Tech Stack / Keywords

Python · Docker · LLM · ML · Reinforcement Learning · AI

Company and position

US-based AI startup focused on building the next generation of training data for LLMs. The team partners with top AI labs to create realistic RL environments where models encounter research and engineering challenges, iterate, and learn from feedback, pushing AI closer to its full potential.


Requirements

  • Strong Python (engineering-quality)
  • Docker and production mindset
  • Understanding of LLMs and their limitations
  • Ability to meet throughput expectations
  • Advanced English (C1/C2) and at least 4 hours of overlap with US time zones

Nice to have:

  • Deep knowledge of transformer internals and LLM training/inference
  • Experience with inference libraries (vLLM, SGLang, etc.)
  • CUDA or Pallas kernel development experience
  • Publications or open-source contributions in active DL/ML research
  • Experience building interactive RL environments and RL-based learning systems
  • 5 years of experience in a similar position

Responsibilities

  • Build and maintain RL/ML environments for LLM training
  • Implement robust, production-quality Python code (not just notebooks)
  • Deploy and run environments in Docker with a focus on reliability and iteration speed
  • Analyze model performance and respond to feedback efficiently
  • Collaborate with research teams to translate papers and ideas into RL problems

Offer

  • Fully remote, flexible work schedule with some overlap with US time zones
  • Direct impact on how LLMs learn
  • Collaboration with top AI researchers and labs
  • Exposure to cutting-edge RL and ML projects
  • 35.0k–74.0k PLN net/month (B2B)

Other information

  • Fully remote work
  • Recruitment process: 2 meetings with hiring managers, followed by a phone screen with a recruiter and a technical test
Verita HR