JetBrains

Senior Research Engineer (Code World Models)

Salary not specified
Senior · Full-time
#370873 · Posted yesterday
Source: JetBrains

Tech Stack / Keywords

AI, Python, NLP

Company and role

JetBrains is a global software company that creates intelligent tools for software developers and teams. The JetBrains Research team focuses on developing models that learn how software systems behave, change, execute, and interact with developer tools.

Requirements

  • Hands-on experience with model pre-training, continued training, or mid-training.
  • Strong engineering skills in Python and experience with modern ML frameworks.
  • Understanding of large-scale ML training workflows, including data processing, distributed training, checkpointing, evaluation, experiment tracking, and debugging.
  • Experience working with large datasets and attention to data quality, contamination, sampling, and reproducibility.
  • Background in NLP, ML for software engineering, or a similar domain.
  • Enthusiasm for research problems with high uncertainty and for turning ideas into working experiments.

Nice to have:

  • Experience training or adapting models for code generation, code understanding, software agents, program repair, test generation, or repository-level reasoning.
  • Experience with execution-based data such as unit tests, traces, logs, compiler feedback, runtime states, or sandboxed code execution.
  • Experience with large-scale distributed training of models with 70B+ parameters.
  • Understanding of evaluation challenges for code models, including benchmark contamination, flaky tests, execution-based scoring, and long-horizon task evaluation.
  • Contributions to ML infrastructure, open-source projects, or research systems.

Responsibilities

  • Design and run pre-training, continued pre-training, and mid-training experiments for code models.
  • Build and improve data pipelines for large-scale model training, including filtering, deduplication, mixture design, and dataset quality checks.
  • Work with code corpora, repositories, tests, execution traces, and synthetic data.
  • Develop evaluations for complex repository-level code reasoning tasks.
  • Collaborate with researchers and engineers working on ML for code and AI developer tools.

Additional information

We are an equal opportunity employer. We process the data provided in your job application in accordance with the Recruitment Privacy Policy.
