Data Engineer (DBX, dbt)

No salary information
Senior · Full-time · Umowa o pracę (employment contract) · B2B
#337009 · Added today
Source: theprotocol.it

Tech Stack / Keywords

Python, SQL, PySpark, Pydantic, PydanticAI, GitHub, Windows

Company and position

Webellian is a well-established Digital Transformation and IT consulting company committed to creating a positive impact for clients in sectors such as insurance, banking, healthcare, retail, and manufacturing. The project involves building an advanced solution leveraging Large Language Models (LLMs) to scan documents and support automated decision-making processes for a key client in the insurance industry. The role includes collaboration with global stakeholders and business users in a hybrid work model based in Poland.


Requirements

  • Strong experience with Databricks (DBX)
  • Advanced knowledge of Python
  • Solid experience in building and optimizing ETL/ELT pipelines
  • Very good knowledge of SQL and relational databases
  • Experience with PySpark
  • Knowledge of CI/CD practices and tools (e.g. GitHub)
  • General understanding of infrastructure, orchestration, and IT security principles
  • Proven experience in Data Engineering (Senior level)
  • Bachelor’s or Master’s degree in a technical field (e.g. Computer Science, Engineering) or equivalent experience
  • Fluent English (written and spoken)
  • DevOps mindset (“you build it, you run it”)
  • Ability to understand complex requirements and translate them into actionable solutions
  • Strong communication skills and ability to work with cross-functional teams
  • Attention to detail, especially regarding data quality and business logic
  • Proactive attitude, ownership, and willingness to learn new technologies

Nice to have:

  • Experience with libraries such as Pydantic; PydanticAI is a strong advantage
  • Familiarity with dbt
  • Experience in the insurance domain

Responsibilities

  • Contribute by building a training dataset for document processing and decision support
  • Design and implement end-to-end data pipelines (ingestion, transformation, storage, and consumption)
  • Prepare and maintain high-quality training datasets for machine learning models
  • Work with large-scale data on a modern cloud data platform (Databricks)
  • Apply best practices in data engineering, testing, and deployment
  • Collaborate closely with data scientists, engineers, and business stakeholders
  • Continuously improve performance, reliability, and automation of data workflows

Offer

  • Contract under Polish law: B2B or Umowa o Pracę
  • Benefits such as private medical care, group insurance, Multisport card
  • English classes available
  • Hybrid work (at least 1 day/week on-site) in Warsaw (Mokotów)
  • Opportunity to work with excellent professionals
  • High standards of work and focus on the quality of code
  • New technologies in use
  • Continuous learning and growth
  • International team
  • On-site amenities including Pinball, PlayStation & more
  • Sharing the costs of sports activities
  • Remote work opportunities
  • Fruit, coffee/tea, drinks
  • Parking space for employees
  • Leisure zone
Healthcare
Insurance
Sports card
Language courses
Flexible hours
Paid leave
Paid public holidays
Relocation package
Team events
Conference budget
Training subsidies
Bonuses
Company phone
Canteen
Free drinks
Free snacks
Car parking
Bicycle parking
Shower
Chill room
Leisure package
Concierge
Stock options
Profit sharing