Nowa
Senior Distributed Systems Engineer (HPC Platform)
Brak informacji o wynagrodzeniu
SeniorFull-time
#346813·Dodano dziś·0
Źródło: nofluffjobs.comTech Stack / Keywords
Distributed computingRustRabbitMQAWSApache PulsarCUDARDMAGPURuntime APIsThrust
Firma i stanowisko
We are looking for a Senior Distributed Systems Engineer to design and build core backend services for a high-performance distributed computing platform. This role focuses on developing resilient, high-throughput infrastructure that orchestrates workloads across CPU and GPU nodes, working at the intersection of distributed systems, high-performance computing, and modern backend engineering.
Wymagania
- Strong experience in backend development with Rust
- Solid understanding of distributed systems architecture
- Hands-on experience with message queues (e.g., Apache Pulsar, RabbitMQ)
- Experience designing and building gRPC-based APIs / service-oriented architectures
- Experience with AWS or similar cloud platforms
- Strong problem-solving skills and ability to work with complex systems
Nice to have:
- Experience with high-performance networking (e.g., RDMA, libfabric)
- Familiarity with high-performance storage systems (e.g., Lustre)
- Understanding of GPU architecture and memory management
- Experience with CUDA ecosystem (Runtime APIs, Thrust, CUB, PTX)
- Knowledge of LLVM / compiler toolchains
Obowiązki
- Design and build core backend services for a high-performance distributed computing platform
- Develop resilient, high-throughput infrastructure to orchestrate workloads across CPU and GPU nodes
- Build scalable systems from the ground up using cutting-edge technologies
Itransition
6 aktywnych ofert