Senior Site Reliability Engineer, GCP
Company and position
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Requirements
- 3+ years of experience in an SRE, DevOps or system administration role
- Knowledge of Google Cloud Platform (GCP)
- Expertise in Linux operating system internals, with the ability to diagnose and resolve complex system-level problems
- Understanding of containerization concepts and tools
- Experience with incident management and response using ServiceNow or similar tools
- Strong problem-solving skills and experience with debugging complex technical issues
- Understanding of monitoring, logging and alerting systems, preferably Cloud Monitoring
- Familiarity with version control using GitHub
- Experience with infrastructure-as-code
- Excellent communication and collaboration skills
- English proficiency at B2 level or higher
Nice to have:
- Experience with Kubernetes and containerization technologies
- Experience with Terraform for infrastructure-as-code
- Strong understanding of SDLC and CI/CD pipelines and experience with CI/CD tools
Responsibilities
- Participation in on-call rotations to cover 24/7 support for critical systems
- Response to alerts from running services and applications, including root cause analysis (RCA)
- Deployment of microservices according to release cadence
- Design, implementation and maintenance of scalable and reliable systems and applications on Google Cloud Platform (GCP)
- Development and maintenance of infrastructure as code using Terraform
- Collaboration with engineering teams to identify and prioritize reliability and performance improvements, as well as rightsizing of dedicated cloud resources
- Involvement in incident management and response using ServiceNow
- Management and resolution of technical issues and tickets using Jira
- Development of a knowledge base for maintaining existing infrastructure and monitoring services
What we offer
- Engineering community of industry professionals
- Friendly team and enjoyable working environment
- Flexible schedule and opportunity to work remotely within Poland
- Chance to work abroad for up to 60 days annually
- Business-driven relocation opportunities
- Outstanding career roadmap
- Leadership development, career advising, soft skills, and well-being programs
- Certification (GCP, Azure, AWS)
- Unlimited access to LinkedIn Learning, getAbstract, and A Cloud Guru
- English classes
- Stable income (Employment Contract or B2B)
- Participation in the Employee Stock Purchase Plan
- Benefits package (health insurance, multisport, shopping vouchers)
- Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
- Referral bonuses
- Corporate, social and well-being events
Additional information
Please note that working hours are standard for candidates based in Poland, with the flexibility to join evening calls a few times per week, running until 6-7 pm.
EPAM Systems