Sr. Site Reliability Engineer - CPO

Brak informacji o wynagrodzeniu
SeniorFull-time
#344541·Dodano 8 dni temu·0
Źródło: Addepar
Aplikuj teraz

Tech Stack / Keywords

AINetworkAWSCloudLinuxUnixNetworkingScripting

Firma i stanowisko

Addepar is a global data and AI platform empowering investment professionals to turn complex financial information into actionable intelligence. It unifies portfolio, market, and client data in a total portfolio view and delivers AI-powered insights within investment and client workflows. More than 1,400 firms in nearly 60 countries use Addepar to manage and advise on nearly $9 trillion in assets. The platform integrates with nearly 650 software, data, and consulting partners to power end-to-end investment operations across firms of all sizes and complexity. Addepar supports clients worldwide with offices in New York City, Salt Lake City, London, Edinburgh, Pune, Dubai, Geneva, and São Paulo.


Wymagania

  • Extensive progressive experience in SRE/DevOps/Systems Engineering with increasing responsibility
  • Expert-level understanding of Cloud Infrastructure fundamentals, preferably AWS, including advanced networking, security, and managed services
  • Exceptional programming and scripting skills in Python, Bash, and general Linux tools; Java is a strong plus
  • Broad expertise with UNIX/BSD/Linux internals, including performance tuning, kernel-level debugging, and advanced system administration; Ubuntu preferred
  • Extensive containerization experience with Kubernetes (KOPS, EKS, ECS preferred), including cluster management, custom resource definitions, and advanced deployment strategies
  • Proficient with monitoring, logging, and alerting tools such as Prometheus, Grafana, Sentry, Sumologic, or advanced AWS cloud-native tools, focusing on observability strategy
  • Excellent interpersonal and communication skills for collaboration and articulating complex technical concepts
  • Demonstrable experience contributing to significant systems automation tooling or open-source projects is a strong plus
  • Exposure to industry practices in financial services is a plus

Must-have Skills:

  • Deep expertise with Terraform and Infrastructure as Code in large-scale, complex cloud environments, including best practices for modularity, reusability, and state management
  • Experience designing, building, and operating highly reliable, fault-tolerant distributed systems in cloud environments, preferably AWS, including resilience patterns and disaster recovery
  • Strong understanding of system design principles and ability to influence architectural decisions for large-scale, highly available systems
  • Passion for technology, pragmatic thinking, and ability to independently navigate ambiguous areas and solve complex cross-functional problems

Obowiązki

  • Lead the design, implementation, and operationalization of container infrastructure using Kubernetes, ensuring high availability, performance, and security
  • Build and maintain advanced, automated CI/CD pipelines using Jenkins, ArgoCD, AWS CodeBuild/Pipeline, GitHub Actions, or similar, establishing best practices for deployment strategies such as blue/green and canary
  • Drive the adoption and evangelism of Infrastructure as Code (IaC) principles using Terraform, focusing on scaling the Addepar Platform across regions with cost optimization and operational efficiency
  • Develop deep application-level knowledge to inform and influence infrastructure requirements and constraints for Developers, QA, and Management, including implementing dashboards for cost and inventory management, performance analysis, and capacity planning
  • Perform advanced monitoring and troubleshooting of infrastructure and application stack using various logging and monitoring tools, driving root cause analysis and implementing preventative measures
  • Initiate and lead collaborations with cross-functional teams to identify and resolve complex application or infrastructure issues, serving as a technical subject matter expert
  • Serve as a primary on-call responder for critical incidents, demonstrating strong problem-solving skills under pressure and contributing to post-incident reviews to improve system resilience

Inne informacje

Applicants must have legal authorization to work in the country where this role is based on the first day of employment. Visa sponsorship is not available for this position.

Addepar is an equal opportunity employer committed to inclusion and providing reasonable accommodation for individuals with disabilities.

PHISHING SCAM WARNING: Addepar warns about phishing scams involving imposters posing as hiring managers. No job offers will be made without a formal interview process, and Addepar will not ask to purchase equipment or supplies as part of onboarding.

Addepar

Addepar

14 aktywnych ofert

Zobacz wszystkie oferty
Aplikuj teraz