Type: Hourly contract (Full-time or Part-time)
Compensation: $28.74/hour
Location: Remote
Role Responsibilities
* Red team conversational AI models and agents (jailbreaks, prompt injections, misuse cases, bias exploitation, multi-turn manipulation).
* Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
* Apply structured taxonomies, benchmarks, and playbooks to ensure consistent adversarial testing.
* Produce reproducible reports, datasets, and attack cases customers can act on.
* Identify vulnerabilities missed by automated evaluation systems.
Requirements
* Native-level fluency in English and Brazilian Portuguese (required).
* Prior experience in AI red teaming, adversarial testing, cybersecurity, or socio-technical risk analysis.
* Strong adversarial mindset with structured, methodical testing approaches.
* Clear written communication for technical and non-technical stakeholders.
* Comfortable reviewing sensitive AI-generated content (guidelines and wellness support provided).
#J-18808-Ljbffr