Nowa
Data Engineer
150 - 185 PLN/ godz.B2B (netto)
SeniorFull-time·B2B
#345144·Dodano dziś·0
Źródło: 7NTech Stack / Keywords
databricksreal-world dataOMOPCDISC SDTM
Firma i stanowisko
7N is an IT services company with 30 years of experience creating IT solutions for over 200 organizations. They focus on providing stable and rewarding collaborations for IT experts, supported by dedicated agents who ensure professional comfort and continuous development initiatives.
Wymagania
- Proven experience designing and implementing ETL pipelines in Databricks/Spark and Delta Lake.
- Strong knowledge of OMOP CDM and experience mapping datasets to OMOP; familiarity with CDISC SDTM is a plus.
- Expertise in data modelling, partitioning, performance tuning, and best practices for large clinical/Real-World Data datasets.
- Experience with vocabulary services and terminology mapping (OHDSI/Athena, UMLS, or similar).
- Experience integrating AI/NLP components into data pipelines (entity extraction, mapping suggestions) is desirable.
- Familiarity with testing frameworks for data (Great Expectations, Deequ), CI/CD, infrastructure as code, and orchestration tools (Databricks Jobs, Airflow).
- Good communication skills and experience working with domain experts to capture requirements.
- Fluent English.
Nice to have:
- Prior experience in pharma or clinical research environments.
- Knowledge of data governance, privacy regulations and secure handling of patient data.
- Experience with Unity Catalog, Databricks Delta Sharing, and cloud infrastructure (Azure/AWS).
Obowiązki
- Design, build and maintain production ETL pipelines in Databricks/Delta Lake to ingest Real-World Data (registries, claims, EHR extracts) and transform into standard models.
- Implement harmonisation workflows to map incoming Real-World Data to OMOP and to the internal CDISC SDTM canonical model; handle vocabulary mapping, units normalization and provenance.
- Extend the medallion architecture (bronze/silver/gold) patterns with robust validation, lineage, partitioning and performance tuning.
- Develop configurable, input-driven transformation frameworks so clinical experts can drive mapping rules via config files and catalogs.
- Integrate AI/automation components (e.g., model-assisted mapping, NLP for free text) with human-in-the-loop review and confidence scoring.
- Establish testing, CI/CD, monitoring and alerting for ETL jobs and automations; ensure reproducibility, versioning and governance.
- Collaborate with clinical data scientists, data stewards and stakeholders to define requirements, data contracts and success metrics.
Oferta
- Ongoing support from a dedicated agent managing project continuity, client contact, formalities, work comfort and development.
- Consultant Development Program offering advice on growth planning with consultations from agents and growth mentors.
- Access to 7N Learning & Development platform with webinars, articles, industry reports, and development events.
- Spectacular integration events including annual Kick-Off trip, Christmas parties, Summer Olympics sports events, family picnics, and movie premieres.
- Opportunities for professional development including knowledge transfer within 7N Services.
- Relationships and access to experienced IT experts with average tenure over 10 years.
- Complete benefits package including funding for medical care, life insurance, sports cards for employees and loved ones, and discounts in stores in Poland and abroad.
Dofinansowanie szkoleń
Spotkania integracyjne
Opieka zdrowotna
Ubezpieczenie
Karta sportowa
7N
134 aktywne oferty