RL Environments Engineer
35,000–74,000 PLN/month, B2B (net)
Senior · Full-time · B2B
#333319 · Posted 20 days ago
Source: SOLID.Jobs
Tech Stack / Keywords
Python, Docker, LLM, ML, Reinforcement Learning, AI
Company and position
US-based AI startup focused on building the next generation of training data for LLMs. The team partners with top AI labs to create realistic RL environments where models encounter research and engineering challenges, iterate, and learn from feedback, pushing AI closer to its full potential.
Requirements
- Strong Python (engineering-quality)
- Docker and production mindset
- Understanding of LLMs and their limitations
- Ability to meet throughput expectations
- Advanced English (C1/C2) and ≥4 hours overlap with US time zones
Nice to have:
- Deep knowledge of transformer internals and LLM training/inference
- Experience with inference libraries (vLLM, SGLang, etc.)
- CUDA or Pallas kernel development experience
- Publications or open-source contributions in active DL/ML research
- Experience building interactive RL environments and RL-based learning systems
- 5 years of experience in a similar position
Responsibilities
- Build and maintain RL/ML environments for LLM training
- Implement robust, production-quality Python code (not just notebooks)
- Deploy and run environments in Docker with focus on reliability and iteration speed
- Analyze model performance and respond to feedback efficiently
- Collaborate with research teams to translate papers and ideas into RL problems
Offer
- Fully remote, flexible work schedule with some overlap with US time zones
- Direct impact on how LLMs learn
- Collaboration with top AI researchers and labs
- Exposure to cutting-edge RL and ML projects
- 35,000–74,000 PLN net/month (B2B)
Additional information
- Fully remote work
- Recruitment process: 2 meetings with hiring managers, followed by a phone screen with a recruiter and a technical test
Verita HR