RL Environments Engineer

35,000 - 74,000 PLN/month · B2B (net)

Senior · Full-time · B2B

#333319 · Posted two months ago · 107

Source: SOLID.Jobs


Tech Stack / Keywords

Python, Docker, LLM, ML, Reinforcement Learning, AI

Company and position

US-based AI startup focused on building the next generation of training data for LLMs. The team partners with top AI labs to create realistic RL environments where models encounter research and engineering challenges, iterate, and learn from feedback, pushing AI closer to its full potential.

Requirements

Strong Python (engineering-quality)
Docker and production mindset
Understanding of LLMs and their limitations
Ability to meet throughput expectations
Advanced English (C1/C2) and ≥4 hours overlap with US time zones

Nice to have:

Deep knowledge of transformer internals and LLM training/inference
Experience with inference libraries (vLLM, SGLang, etc.)
CUDA or Pallas kernel development experience
Publications or open-source contributions in active DL/ML research
Experience building interactive RL environments and RL-based learning systems
5 years of experience in a similar position

Responsibilities

Build and maintain RL/ML environments for LLM training
Implement robust, production-quality Python code (not just notebooks)
Deploy and run environments in Docker with focus on reliability and iteration speed
Analyze model performance and respond to feedback efficiently
Collaborate with research teams to translate papers and ideas into RL problems

Benefits

Fully remote, flexible work schedule with some overlap with US time zones
Direct impact on how LLMs learn
Collaboration with top AI researchers and labs
Exposure to cutting-edge RL and ML projects
35,000–74,000 PLN net/month (B2B)

Other information

Fully remote work
Recruitment process: 2 meetings with hiring managers, followed by a phone screen with a recruiter and a technical test

Verita HR
