Data Engineer (Databricks)
Company and Position
Xebia is a global tech company with a presence in Central and Eastern Europe, originating from two Polish companies: PGS Software and GetInData. The company has over 1,000 experts working on cloud, data, and software projects across industries including fintech, e-commerce, aviation, logistics, media, and fashion. Xebia partners with major clients such as McLaren, Aviva, Deloitte, Spotify, Disney, ING, UPS, Tesco, Truecaller, AllSaints, Volotea, Schmitz Cargobull, Allegro, and InPost. The company uses modern open-source technology stacks, is a trusted partner of Databricks, dbt, Snowflake, Azure, GCP, and AWS, and is the first AWS Premier Partner in Poland.
Requirements
- 2–4+ years of professional experience in Data Engineering, Software Engineering, or Operational Engineering
- Experience with Databricks and PySpark for large-scale data processing
- Strong proficiency in Python, including building and debugging data pipelines and automation scripts
- Hands-on experience with Apache Airflow (DAG development, operators, troubleshooting)
- Very good knowledge of SQL, including complex joins, window functions, and JSON-based data
- Experience working with cloud platforms (AWS and/or GCP)
- Upper-intermediate English
- Readiness to work in a hybrid setup (in the Warsaw office once per week)
Nice to have:
- Experience with Unity Catalog
- Experience with database migrations and schema/version management
- Comfort working in environments with frequent production support and delivery deadlines
- Experience building agentic or AI-driven automation workflows
Responsibilities
- Designing, building, and maintaining end-to-end data pipelines for client-facing measurement reports and licensed datasets
- Operating and troubleshooting Apache Airflow DAGs supporting scheduled and on-demand data deliveries
- Managing push-based delivery workflows including cloud storage, file transfers, and delivery verification
- Investigating and resolving production incidents across distributed systems such as Airflow, databases, and cloud storage
- Implementing automation and AI-driven agents to streamline operational processes and data validation
- Supporting custom delivery requests including matching files, cross-referencing datasets, and bespoke client configurations
- Developing data quality and validation tooling to ensure accuracy before client delivery
- Writing and maintaining database migrations for delivery configurations and client setups
- Collaborating with product, engineering, measurement science, and client-facing teams
- Documenting operational processes, runbooks, and delivery workflows
Offer
- Training budget
- Private healthcare
- Multisport subscription
- Integration events
- International projects
- Mental health support
- Referral program
- Modern office
- Canteen
- Free snacks
- Free beverages
- Free tea and coffee
- No dress code
- Playroom
- In-house trainings
- In-house hack days
- Normal atmosphere
Other Information
Candidates must work from within the European Union and hold a valid work permit.
Xebia sp. z o.o.