Senior Site Reliability Engineer (Remote)
Tech Stack / Keywords
Firma i stanowisko
Oxylabs is a company managing large-scale infrastructure with over 60PB monthly data traffic, including 6PB+ Ceph storage and processing 300k+ service requests per second. The role involves working on Webshare's production infrastructure and migrating from Docker Swarm to Kubernetes.
Wymagania
- Experience building and operating highly available infrastructure at scale, including hundreds of servers and dozens of services under real production load.
- Hands-on experience with Kubernetes in self-hosted or bare-metal environments.
- Proficiency with Infrastructure as Code.
- Ownership of CI/CD pipelines end-to-end, preferably with GitLab CI or equivalent.
- Experience participating in on-call rotations in production environments.
- Proactive communication and problem surfacing without prompting.
- Scripting and development skills.
Nice to have:
- Led at least one major infrastructure migration, including planning, execution, and stabilization.
- Familiarity with Python and/or Go (backend is Python, edge services are Go).
- Exposure to proxy and networking-heavy infrastructure.
- Experience in small teams with shared infrastructure responsibility.
- Familiarity with edge clusters or split compute/edge architectures.
Obowiązki
- Own and evolve Webshare's production infrastructure, leading migration from Docker Swarm to Kubernetes or hybrid K8s + Ansible.
- Maintain high availability across hundreds of servers and approximately 50 services.
- Drive observability in cooperation with the development team.
- Establish and enforce Infrastructure as Code practices, CI/CD pipeline reliability, and change management processes.
- Participate in on-call rotation alongside backend developers.
- Respond to and lead incident resolution, run post-mortems, and drive systematic remediation.
- Contribute platform tooling to improve developer experience and reduce infrastructure toil.
- Keep backend engineers informed and capable, promoting shared infrastructure ownership without silos.
Inne informacje
Please be informed that the data controller is Oxylabs, UAB (hereinafter "controller"). Controller may collect and process personal data including identification, contact details, employment history, education, qualifications, interview records, assessment results, references, and background check information. Data processing is based on legitimate interest, contractual necessity, legal obligations, or consent. Data retention is for 3 years after recruitment unless otherwise required or consented. Data may be shared with IT service providers, regulatory authorities, and background check providers. Data subject rights include access, rectification, erasure, restriction, objection, portability, and complaint lodging. Contact: [email protected].
Oxylabs
2 aktywne oferty