Senior/Lead Data Software Engineer (Python, Spark, Azure)

Brak informacji o wynagrodzeniu
SeniorFull-time·Umowa o pracę·B2B
#336826·Dodano dziś·0
Źródło: theprotocol.it
⚠️Uwaga: ta oferta może już nie być aktualna. Sprawdź na stronie pracodawcy, czy rekrutacja jest nadal otwarta.
Aplikuj teraz

Tech Stack / Keywords

PytestSparkMicrosoft AzurePySparkDockerKubernetesTerraform CloudGtilab

Firma i stanowisko

EPAM is a leading global provider of digital platform engineering and development services. The team works on a scalable, ML-ready platform that enhances portfolio model development and deployment with advanced data governance and AI capabilities. The role involves migrating from an IaaS Big Data platform to Azure-native Databricks, optimizing data workflows, and improving data quality to boost client services and regulatory compliance.


Wymagania

  • Proficiency in Python and Spark with at least 3 years in data engineering roles
  • Strong experience with Azure Databricks and PySpark
  • Proven expertise in designing and implementing ETL/ELT solutions
  • Experience migrating big data platforms to Azure-native services
  • Proficiency with Delta tables for model tuning
  • Knowledge of data governance and regulatory compliance frameworks
  • Familiarity with Docker, Kubernetes (AKS), and Terraform for infrastructure automation
  • Ability to manage large data volumes with high efficiency
  • Excellent problem-solving and analytical skills
  • Strong communication and collaboration abilities
  • English proficiency at B2 level or higher

Obowiązki

  • Migrate and optimize over 500 data jobs using Azure Databricks optimization techniques
  • Manage and process 12 TB of data efficiently across platforms
  • Tune machine learning models for Azure environments using Java Spark and Delta tables
  • Update and maintain libraries to address security vulnerabilities
  • Develop and maintain ETL/ELT pipelines using PySpark and related technologies
  • Collaborate with cross-functional teams to integrate GenAI capabilities into data workflows
  • Monitor data quality and implement improvements to ensure accuracy and reliability
  • Automate deployment and operational tasks using Terraform and GitLab CI/CD
  • Support data governance initiatives to comply with regulatory standards
  • Troubleshoot and resolve performance issues in data processing systems
  • Document system processes and provide technical guidance to junior engineers
  • Implement best practices for code quality and data security
  • Participate in code reviews and knowledge sharing sessions
  • Optimize costs associated with data storage and processing

Oferta

  • Engineering community of industry professionals
  • Friendly team and enjoyable working environment
  • Flexible schedule and opportunity to work remotely within Poland
  • Chance to work abroad for up to 60 days annually
  • Business-driven relocation opportunities
  • Outstanding career roadmap
  • Leadership development, career advising, soft skills, and well-being programs
  • Certification (GCP, Azure, AWS)
  • Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
  • English language classes
  • Stable income (Employment Contract or B2B)
  • Participation in the Employee Stock Purchase Plan
  • Benefits package (health insurance, multisport, shopping vouchers)
  • Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
  • Referral bonuses
  • Corporate, social and well-being events
Elastyczne godziny
Pakiet relokacyjny
Budżet konferencyjny
Dofinansowanie szkoleń
Imprezy teamowe
Kursy językowe
Karta sportowa
Opieka zdrowotna
Bonusy
Opcje na akcje
Darmowe przekąski
EPAM Systems (Poland) sp. z o.o.

EPAM Systems (Poland) sp. z o.o.

46 aktywnych ofert

Zobacz wszystkie oferty
Aplikuj teraz