Data Engineer (DBX,dbt)

Brak informacji o wynagrodzeniu
MidFull-time·Umowa o pracę·B2B
#337010·Dodano dziś·0
Źródło: theprotocol.it
Aplikuj teraz

Tech Stack / Keywords

PythonSQLPysparkPydanticPydanticAIGitHubWindows

Firma i stanowisko

Webellian is a well-established Digital Transformation and IT consulting company committed to creating a positive impact for clients in sectors such as insurance, banking, healthcare, retail, and manufacturing. The project involves building an advanced solution leveraging Large Language Models (LLMs) to scan documents and support automated decision-making processes, working in a hybrid model with teammates in Poland and global stakeholders including business users.


Wymagania

  • Strong experience with Databricks (DBX)
  • Advanced knowledge of Python
  • Solid experience in building and optimizing ETL/ELT pipelines
  • Very good knowledge of SQL and relational databases
  • Experience with PySpark
  • Experience with Azure Data Services is an advantage
  • Knowledge of CI/CD practices and tools (e.g. GitHub)
  • General understanding of infrastructure, orchestration, and IT security principles
  • Proven experience in Data Engineering (Regular level)
  • Bachelor’s or Master’s degree in a technical field (e.g. Computer Science, Engineering) or equivalent experience
  • Fluent English (written and spoken)
  • DevOps mindset (“you build it, you run it”)
  • Ability to understand complex requirements and translate them into actionable solutions
  • Strong communication skills and ability to work with cross-functional teams
  • Attention to detail, especially regarding data quality and business logic
  • Proactive attitude, ownership, and willingness to learn new technologies

Nice to have:

  • Experience with libraries such as Pydantic; PydanticAI is a strong advantage
  • Familiarity with dbt
  • Experience with Azure Data Services
  • Experience in the insurance domain

Obowiązki

  • Contribute by building a training dataset for document processing and decision support
  • Design and implement end-to-end data pipelines (ingestion, transformation, storage, and consumption)
  • Prepare and maintain high-quality training datasets for machine learning models
  • Work with large-scale data on a modern cloud data platform (Databricks)
  • Apply best practices in data engineering, testing, and deployment
  • Collaborate closely with data scientists, engineers, and business stakeholders
  • Continuously improve performance, reliability, and automation of data workflow

Oferta

  • Contract under Polish law: B2B or Umowa o Pracę
  • Benefits such as private medical care, group insurance, Multisport card
  • English classes available
  • Hybrid work (at least 1 day/week on-site) in Warsaw (Mokotów)
  • Opportunity to work with excellent professionals
  • High standards of work and focus on the quality of code
  • New technologies in use
  • Continuous learning and growth
  • International team
  • Pinball, PlayStation & much more (on-site)
  • Sharing the costs of sports activities
  • Private medical care
  • Life insurance
  • Remote work opportunities
  • Fruits
  • Video games at work
  • Coffee / tea
  • Drinks
  • Parking space for employees
  • Leisure zone
  • English classes
Opieka zdrowotna
Ubezpieczenie
Karta sportowa
Kursy językowe
Elastyczne godziny
Płatny urlop
Darmowe napoje
Darmowe przekąski
Parking dla aut
Pakiet wypoczynkowy
Imprezy teamowe
Webellian

Webellian

44 aktywne oferty

Zobacz wszystkie oferty
Aplikuj teraz