Senior Site Reliability Engineer
No salary information
Senior·Full-time·Employment contract
#370869·Added yesterday
Source: XTB
Tech Stack / Keywords
AI
Company and position
XTB is a global financial-industry company focused on online trading of financial instruments. It is the largest FinTech in Poland and a leader in Central and Eastern Europe, operating in several countries, including in Asia and South America. XTB offers training and development programs and operates in an international business environment.
Requirements
- At least 5 years of professional experience in SRE, Infrastructure, or DevOps roles managing high-scale, distributed environments.
- Advanced programming skills in Python focused on scalable automation, internal tooling, and robust scripts.
- Hands-on expertise with production-grade Kubernetes environments, configuration management tools like Ansible, and resilient infrastructure architectures in Azure Kubernetes Service and on-prem environments.
- Proficiency in building standardized telemetry ecosystems using self-hosted open-source tools such as Prometheus, Grafana, ELK Stack, Tempo, Thanos, and Jaeger.
- Ability to drive incident management, conduct post-incident analysis, and foster a culture of reliability and shared ownership.
- Ability to leverage AI/ML techniques for SRE tasks including AIOps, automated anomaly detection, and log analysis.
- Experience with commercial observability and APM solutions (e.g., Datadog, Splunk, New Relic) or chaos engineering frameworks is highly valued.
Responsibilities
- Develop a standardized observability ecosystem with a conscious telemetry model focusing on structured events, distributed tracing, and intelligent sampling.
- Act as a strategic partner to product engineering teams by providing the platforms, standards, and data they need to own service reliability through error budgets and alerting.
- Enhance detection capabilities to identify issues before customer impact using early-warning systems and AI/ML for automated anomaly detection.
- Build internal automation and tooling to streamline SRE workflows and automate routine operational tasks.
- Participate in on-call rotation for incident management, ensuring rapid resolution, effective communication, and post-incident analysis.
Benefits
- Real influence on company and product development.
- Work in an experienced team with knowledge sharing.
- Clear development vision with regular feedback and career paths.
- Regular team-building meetings.
- Training budget for courses and conferences.
- An extra day off on your birthday and for parents.
- Equipment tailored to individual needs.
- Private medical care and group insurance.
- Access to an English e-learning platform and a benefits platform.
- Access to a wellbeing platform with workshops and private therapy sessions.
- Flexible work options: remote, office in Warsaw, or coworking space in your city.
Training subsidies
Paid leave
Healthcare
Insurance