Senior Python Data Scraping Engineer (Freelance)

30 USD/ godz.
SeniorPart-time
#337224·Dodano dziś·0
Źródło: Mindrift
Aplikuj teraz

Tech Stack / Keywords

PythonAIGenerative AIJavaScriptSoftware DevelopmentSeleniumHTMLSOLID

Firma i stanowisko

Mindrift is a platform connecting specialists with AI projects from major tech innovators. Their mission is to unlock the potential of Generative AI by tapping into real-world expertise globally. This role is part of the Tendem project, focusing on specialized data scraping workflows within a hybrid AI and human system.


Wymagania

  • At least 5 years of relevant experience in data engineering, web scraping, automation, or software development.
  • Bachelor's or Master’s Degree in Engineering, Applied Mathematics, Computer Science, or related technical fields is a plus.
  • Strong experience in Python web scraping (BeautifulSoup, Selenium or similar), including dynamic content (JS, AJAX, infinite scroll) and APIs via proxies.
  • Proven ability to extract data from complex structures (hierarchies, archived pages, inconsistent HTML).
  • Solid background in data cleaning, normalization, and validation, delivering structured datasets (CSV, JSON, Google Sheets).
  • Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scale.
  • Experience with cloud infrastructure (AWS or equivalent) and containerization (Docker) as part of real workflows.
  • Hands-on experience with LLM frameworks (LangChain, OpenRouter, or similar) applied to automation tasks.
  • Strong attention to detail and commitment to data accuracy.
  • Self-directed work ethic with ability to troubleshoot independently.
  • A link to GitHub is a plus.
  • English proficiency: Upper-intermediate (B2) or above (required).

Obowiązki

  • Own end-to-end data extraction workflows across complex websites, ensuring complete coverage, accuracy, and reliable delivery of structured datasets.
  • Leverage internal tools (Apify, OpenRouter) alongside custom workflows to accelerate data collection, validation, and task execution while meeting defined requirements.
  • Ensure reliable extraction from dynamic and interactive web sources, adapting approaches as needed to handle JavaScript-rendered content and changing site behavior.
  • Enforce data quality standards through validation checks, cross-source consistency controls, adherence to formatting specifications, and systematic verification prior to delivery.
  • Scale scraping operations for large datasets using efficient batching or parallelization, monitor failures, and maintain stability against minor site structure changes.

Oferta

  • Work fully remote on your own schedule with just a laptop and stable internet connection.
  • Gain hands-on experience in a unique hybrid environment where human expertise and AI agents collaborate seamlessly.
  • Participate in performance-based bonus programs that reward high-quality work and consistent delivery.

Inne informacje

This is a freelance, part-time remote role. English proficiency at upper-intermediate (B2) level or above is required.

Mindrift

Mindrift

32 aktywne oferty

Zobacz wszystkie oferty
Aplikuj teraz