Senior AI QA Engineer (Automation & Manual, AI-based Applications Testing)

Brak informacji o wynagrodzeniu
SeniorFull-time
#375900·Dodano wczoraj·0
Źródło: EPAM Systems
Aplikuj teraz

Tech Stack / Keywords

TestingAIGenAIPythonSQLRESTAPI Testing

Wymagania

  • 5+ years of experience in software QA, with at least 1 year focused on testing AI agents, agentic solutions or LLM-based systems
  • Hands-on experience with both manual and automated testing of AI agents, including prompt/instruction testing and evaluation of agentic workflows
  • Strong programming skills in Python test automation — pytest or equivalent, scripting and AI/ML library integration
  • Expertise in AI agent frameworks, prompt engineering and evaluation metrics for LLM-based systems
  • Demonstrated experience testing and evaluating Gen AI / LLM applications — grounding, answer accuracy and hallucination/determinism checks
  • Applied knowledge of Gen AI / LLM evaluation frameworks and metrics — precision, recall, criteria recall and efficiency
  • Familiarity with issue and test management tools such as Jira, QMetry and TestRail
  • Experience with version control systems and integrating tests into CI/CD pipelines
  • Flexibility to use AI-powered tools for QA such as GitHub Copilot and LLM-based test generation
  • Understanding of cloud environments, particularly AWS
  • Excellent communication, collaboration and leadership skills

Nice to have:

  • Experience with agentic AI platforms such as LangChain, OpenAI Function Calling or similar
  • Experience with AI safety, bias and reliability testing
  • Experience with test data generation for AI/ML systems

Obowiązki

  • Research and evolve automation frameworks in line with Gen AI tooling and best practices
  • Design and automate evaluation of Gen AI features — grounding, answer accuracy, determinism/reproducibility, precision, recall, and criteria recall
  • Build automated LLM test harnesses that scale evaluation beyond human-in-the-loop
  • Select and apply Gen AI evaluation frameworks, measuring answer quality and pipeline efficiency
  • Perform manual testing as needed to validate new features, integrations, and user stories
  • Build and maintain test cases from requirements and user stories
  • Test applications that may include AI agents, APIs, databases, and other integrations
  • Collaborate with product, engineering, and operations teams to understand requirements and deployment environments
  • Track and report test results, defects, and quality metrics
  • Assist with troubleshooting production issues and escalate risks as needed
  • Guide and support team members, including onshore and offshore consultants

Benefity

  • Flexible schedule and opportunity to work remotely within Poland
  • Chance to work abroad for up to 60 days annually
  • Business-driven relocation opportunities
  • Outstanding career roadmap
  • Leadership development, career advising, soft skills, and well-being programs
  • Certification (GCP, Azure, AWS)
  • Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
  • English classes
  • Stable income (Employment Contract or B2B)
  • Participation in the Employee Stock Purchase Plan
  • Benefits package (health insurance, multisport, shopping vouchers)
  • Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
  • Referral bonuses
  • Corporate, social and well-being events
Elastyczne godziny
Płatny urlop
Dofinansowanie szkoleń
Budżet konferencyjny
Kursy językowe
Opieka zdrowotna
Karta sportowa
Premie
Udziały pracownicze
Spotkania integracyjne
Darmowe przekąski

Inne informacje

This is a fully remote position with a requirement to work from 13:00 to 21:00 Polish time, due to the client team's location.

EPAM Systems

EPAM Systems

243 aktywne oferty

Zobacz wszystkie oferty
Aplikuj teraz