Senior Data Engineer with AI
220 PLN/ godz.B2B
SeniorPart-time·B2B
#370478·Dodano dziś·0
Źródło: emagineTech Stack / Keywords
AIPythonNLPSparkMachine LearningPandasScalaLangGraph
Firma i stanowisko
International project focused on building and improving AI-driven data solutions for large-scale web content processing, attribute extraction, and market expansion in the technology industry.
Wymagania
- Strong Python skills, especially with Polars and/or Pandas.
- Experience with NLP and fine-tuning lightweight ML models.
- Practical experience in designing, building and evaluating data pipelines.
- Experience with Spark and, ideally, Scala.
- Familiarity with agent frameworks, especially LangGraph.
- Understanding of data quality, model evaluation and performance measurement.
- Ability to adapt ML/data solutions to different countries, languages and data domains.
- Experience with pipeline orchestration and optimisation for large-scale data ingestion.
- A hands-on, problem-solving mindset and ability to work in a fast-moving environment.
Obowiązki
- Building and optimising Spark pipelines for large-scale web content ingestion and processing.
- Using Python, including Polars and/or Pandas, for data processing, analysis and pipeline development.
- Fine-tuning lightweight ML models for task-specific attribute extraction.
- Preparing training data, managing data quality and evaluating model performance end-to-end.
- Working with NLP techniques to extract, classify and reason over information from web content.
- Expanding an internal AI research agent to new geographic markets and adapting logic to local data conditions.
- Supporting evidence collection and reasoning logic for new place-related attributes.
- Evaluating ML systems across different locales, domains and data sources.
- Working with pipeline orchestration, optimisation and multi-source ingestion processes.
- Potentially using Scala and Spark in data engineering tracks.
Benefity
- Onboarding: 2 weeks in Malmö fully covered by the client.
- Notice period: ideally around 1 week maximum.
- Recruitment process: 1 technical and 1 non-technical interview, approximately 60 minutes each.
emagine
233 aktywne oferty