Senior AI QA Engineer (Automation & Manual, AI-based Applications Testing)
Salary not disclosed
Senior · Full-time
#375900 · Added yesterday
Source: EPAM Systems
Tech Stack / Keywords
Testing, AI, GenAI, Python, SQL, REST, API Testing
Requirements
- 5+ years of experience in software QA, with at least 1 year focused on testing AI agents, agentic solutions or LLM-based systems
- Hands-on experience with both manual and automated testing of AI agents, including prompt/instruction testing and evaluation of agentic workflows
- Strong programming skills in Python test automation — pytest or equivalent, scripting and AI/ML library integration
- Expertise in AI agent frameworks, prompt engineering and evaluation metrics for LLM-based systems
- Demonstrated experience testing and evaluating Gen AI / LLM applications — grounding, answer accuracy and hallucination/determinism checks
- Applied knowledge of Gen AI / LLM evaluation frameworks and metrics — precision, recall, criteria recall and efficiency
- Familiarity with issue and test management tools such as Jira, QMetry and TestRail
- Experience with version control systems and integrating tests into CI/CD pipelines
- Flexibility to use AI-powered tools for QA such as GitHub Copilot and LLM-based test generation
- Understanding of cloud environments, particularly AWS
- Excellent communication, collaboration and leadership skills
Nice to have:
- Experience with agentic AI platforms such as LangChain, OpenAI Function Calling or similar
- Experience with AI safety, bias and reliability testing
- Experience with test data generation for AI/ML systems
Responsibilities
- Research and evolve automation frameworks in line with Gen AI tooling and best practices
- Design and automate evaluation of Gen AI features — grounding, answer accuracy, determinism/reproducibility, precision, recall, and criteria recall
- Build automated LLM test harnesses that scale evaluation beyond human-in-the-loop
- Select and apply Gen AI evaluation frameworks, measuring answer quality and pipeline efficiency
- Perform manual testing as needed to validate new features, integrations, and user stories
- Build and maintain test cases from requirements and user stories
- Test applications that may include AI agents, APIs, databases, and other integrations
- Collaborate with product, engineering, and operations teams to understand requirements and deployment environments
- Track and report test results, defects, and quality metrics
- Assist with troubleshooting production issues and escalate risks as needed
- Guide and support team members, including onshore and offshore consultants
Benefits
- Flexible schedule and opportunity to work remotely within Poland
- Chance to work abroad for up to 60 days annually
- Business-driven relocation opportunities
- Outstanding career roadmap
- Leadership development, career advising, soft skills, and well-being programs
- Certification (GCP, Azure, AWS)
- Unlimited access to LinkedIn Learning, getAbstract, A Cloud Guru
- English classes
- Stable income (Employment Contract or B2B)
- Participation in the Employee Stock Purchase Plan
- Benefits package (health insurance, Multisport card, shopping vouchers)
- Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
- Referral bonuses
- Corporate, social and well-being events
Flexible hours
Paid leave
Training subsidies
Conference budget
Language courses
Healthcare
Sports card
Bonuses
Employee shares
Team-building events
Free snacks
Other information
This is a fully remote position with a requirement to work from 13:00 to 21:00 Polish time, due to the client team's location.
EPAM Systems
243 active job offers