Senior Python Data Scraping Engineer (Freelance)

40 USD/ godz.
SeniorPart-time
#360977·Dodano wczoraj·0
Źródło: Mindrift
Aplikuj teraz

Tech Stack / Keywords

PythonAIGenerative AIJavaScriptSoftware DevelopmentScriptingSeleniumHTML

Firma i stanowisko

Mindrift connects specialists with AI projects from major tech innovators. The company focuses on unlocking the potential of Generative AI by integrating real-world expertise globally. This role is part of the Tendem project, which combines AI and human collaboration for specialized data scraping workflows.


Wymagania

  • At least 5+ years of relevant experience in data engineering, web scraping, automation, or software development.
  • Bachelor’s or Master’s Degree in Engineering, Applied Mathematics, Computer Science, or related technical fields is a plus.
  • Strong technical foundation and practical experience with scripting, automation, and AI-assisted workflows.
  • Ability to work confidently with LLMs and systematically collect, structure, and validate data from diverse sources.
  • Experience in Python web scraping (BeautifulSoup, Selenium or similar), including dynamic content (JS, AJAX, infinite scroll) and APIs via proxies.
  • Proven ability to extract data from complex structures (hierarchies, archived pages, inconsistent HTML).
  • Solid background in data cleaning, normalization, and validation, delivering structured datasets (CSV, JSON, Google Sheets).
  • Experience handling anti-bot mechanisms and dynamic site structures at scale.
  • Experience with cloud infrastructure (AWS or equivalent) and containerization (Docker).
  • Hands-on experience with LLM frameworks (LangChain, OpenRouter, or similar) applied to automation tasks.
  • Strong attention to detail and commitment to data accuracy.
  • Self-directed work ethic with ability to troubleshoot independently.
  • English proficiency: Upper-intermediate (B2) or above.

Obowiązki

  • Own end-to-end data extraction workflows across complex websites, ensuring complete coverage, accuracy, and reliable delivery of structured datasets.
  • Leverage internal tools (Apify, OpenRouter) alongside custom workflows to accelerate data collection, validation, and task execution while meeting defined requirements.
  • Ensure reliable extraction from dynamic and interactive web sources, adapting approaches as needed to handle JavaScript-rendered content and changing site behavior.
  • Enforce data quality standards through validation checks, cross-source consistency controls, adherence to formatting specifications, and systematic verification prior to delivery.
  • Scale scraping operations for large datasets using efficient batching or parallelization, monitor failures, and maintain stability against minor site structure changes.

Oferta

  • Part-time remote freelance role.
  • Compensation up to $40 per hour equivalent depending on level and pace of contribution.

Inne informacje

This is a freelance role with an estimated workload of 10–20 hours per week during active project phases. The workload is not guaranteed and applies only while the project is active.

Mindrift

Mindrift

41 aktywnych ofert

Zobacz wszystkie oferty
Aplikuj teraz