Senior AI/Data Engineer
125 - 175 PLN/ godz.B2B (netto)
SeniorFull-time·B2B
#313114·Dodano około miesiąc temu·30
Źródło: Jit Team🚫Oferta wygasła. Ta oferta pracy nie jest już aktywna i rekrutacja została zakończona.
Tech Stack / Keywords
AISOLIDNLPAPICI/CDPythonAWSLambda
Firma i stanowisko
Our client is a Scandinavian supplier of software and services for purchasing and tendering processes in highly formalized conditions, such as orders in accordance with public procurement law at the level of the European Union or individual member states. The delivery center is located in Utrecht. Client’s platforms allow to announce tenders and conduct the entire procedure of collecting and evaluating offers at every stage of the purchasing process. They are used both by public administration bodies at various levels in European countries and by purchasing departments of large companies and corporations.
Wymagania
- 5+ years in Python
- 2+ years of AWS experience (Lambda, Step Functions, ETL pipelines)
- Experience with infrastructure as code (e.g. Terraform, AWS CDK)
- Solid understanding of core NLP techniques: TF-IDF, tokenisation, K-NN, document embeddings, and textual similarity
- Hands-on experience developing solutions powered by cloud-hosted LLMs (OpenAI, Claude, etc.)
- Experience working with enterprise data platforms (EDP) and/or data lake architectures (e.g. AWS Lake Formation, S3-based data lakes, Delta Lake, or similar)
- Experience building and maintaining CI/CD pipelines
Preferred Experience:
- Apache Spark for large-scale data processing
- AWS CDK (TypeScript)
- Deep learning experience: transformers, dimensionality reduction techniques
- Working with embeddings, fine-tuning, and evaluating LLMs
- Implementing web crawlers and agentic AI systems
- Experience with Docker and container orchestration (Kubernetes, ECS, or similar)
- API development (FastAPI, Flask, or similar)
- Java knowledge is an asset, particularly for integrating with or contributing to existing JVM-based backend services
Obowiązki
- AI-powered document intelligence: Using web scraping and LLMs to extract insight from unstructured public data sources, surfacing signals that help customers stay ahead of the market
- Enterprise data platform (EDP) integration: Ingesting, normalising, and joining data from multiple external sources into our EDP, making it accessible and useful for downstream services across the organisation
- External API data ingestion: Building and maintaining integrations with third-party APIs to continuously gather relevant data at scale
- CI/CD pipeline ownership: Setting up and maintaining robust CI/CD pipelines for the services the team develops and operates
Jit Team
223 aktywne oferty