Senior Database Reliability Engineer (DBRE) (worldwide remote)

Brak informacji o wynagrodzeniu
SeniorFull-time
#351229·Dodano wczoraj·0
Źródło: Cloudlinux
Aplikuj teraz

Tech Stack / Keywords

SecurityPostgreSQLMongoDBRedisLinuxAnsibleTerraformGitLab

Firma i stanowisko

CloudLinux / TuxCare is a remote-first infrastructure and security company with more than 300 engineers building and operating products used by hosting providers, enterprises, and internal service teams worldwide. The Infrastructure Department runs platforms behind CloudLinux OS, Imunify, KernelCare, TuxCare ELS, and engineering systems.


Wymagania

  • Deep hands-on PostgreSQL experience in business-critical production environments, typically 5+ years or equivalent depth.
  • Strong understanding of PostgreSQL internals and operations including MVCC, WAL, transactions, locks, indexes, query planning, replication, autovacuum, bloat, major upgrades, backups, PITR, and restore testing.
  • Proven experience with highly available databases and ability to reason about quorum, split-brain risk, failover, rollback, and recovery.
  • Strong Linux and infrastructure fundamentals including systemd, networking, storage, filesystems, CPU/memory/disk bottlenecks, TLS, DNS, firewalls, and root-cause troubleshooting.
  • Automation skills with Ansible and scripting; Terraform/OpenTofu, GitLab CI/CD, and merge-request based delivery are strong advantages.
  • Ability to support more than one database engine; readiness to learn ClickHouse quickly and take responsibility.
  • Practical use of AI engineering assistants such as Claude and Codex to improve speed and quality while personally verifying generated SQL, commands, scripts, and operational conclusions.
  • Clear written English for asynchronous work in Jira, Slack, GitLab, Slite, and runbooks.

Nice to Have:

  • ClickHouse operations including replication, Keeper/ZooKeeper, MergeTree engines, distributed DDL, grants, row policies, backups, query troubleshooting, and cluster recovery.
  • MongoDB replica sets and Percona Backup for MongoDB.
  • Redis/Sentinel and broker/cache failure modes.
  • Database observability, SLOs, golden signals, alert tuning, and executable incident runbooks.
  • Building internal platforms, self-service portals, or DBaaS workflows for engineering teams.

Obowiązki

  • Own production PostgreSQL reliability including HA design, Patroni, PgBouncer, replication, failover, upgrades, vacuum/bloat control, query tuning, locks, indexes, capacity, backups, PITR, and restore validation.
  • Improve disaster recovery and operational evidence with tested restores, documented recovery paths, measurable RTO/RPO targets, runbooks, and safe maintenance plans.
  • Support ClickHouse, MongoDB, and Redis by troubleshooting incidents, reviewing access and data-safety changes, improving monitoring, and learning production ClickHouse patterns.
  • Automate DBA workflows using Ansible, Terraform/OpenTofu, GitLab CI/CD, scripts, and reproducible runbooks for provisioning, grants, backups, restores, health checks, and ownership metadata.
  • Build DBaaS-style self-service capabilities for engineering teams to request databases, access, credentials, and operational checks with less manual DBA intervention.
  • Improve observability and incident response through Grafana, metrics, logs, SLOs, alert rules, Opsgenie routing, and clear communication during production issues.

Oferta

  • Focus on professional development.
  • Interesting and challenging projects.
  • Fully remote work with flexible working hours.
  • Paid 24 days of vacation per year, 10 days of national holidays, and unlimited sick leaves.
  • Compensation for private medical insurance.
  • Co-working and gym/sports reimbursement.
  • Budget for education.
  • Opportunity to receive a reward for the most innovative idea that the company can patent.
Elastyczne godziny
Płatny urlop
Opieka zdrowotna
Karta sportowa
Dofinansowanie szkoleń

Inne informacje

By applying for this position, you agree with CloudLinux Privacy Policy and give consent to maintain and process personal data accordingly.

CloudLinux

CloudLinux

Pracodawca

Aplikuj teraz