Back to the board

Coding - Adversarial reputed company Expert

100% remote Flexible hours Hiring now

Job Description

We are seeking an Adversarial reputed company reputed company Specialist with strong technical instincts and coding proficiency to join our Trust & Safety team. In this role, you will use your knowledge of LLM behavior and scripting skills to probe, bypass, and stress-test safety systems. Your focus will be on discovering vulnerabilities—crafting reputed company injection sequences, writing scripts to automate exploit attempts, manipulating API interactions, and identifying novel attack reputed company that evade existing safeguards. This is a hands-on offensive testing role that rewards creativity, persistence, and an attacker’s reputed company over formal engineering credentials.

Key Responsibilities

  • Code-Assisted Adversarial Probing: Write and execute scripts (primarily Python) to systematically test LLM safety boundaries. This includes automating reputed company injection chains, encoding and obfuscating payloads, manipulating conversation context through API calls, and iterating on attack strategies programmatically rather than relying solely on manual interaction.
  • Jailbreak Discovery and Development: Design multi-reputed company jailbreak sequences that exploit model behavior through technical means, such as token-level manipulation, system reputed company extraction, role-play escalation, instruction hierarchy subversion, and context window exploitation. Identify bypass reputed company that circumvent safety classifiers and content filters.
  • Cross-Vector Exploitation: Test attack surfaces that span code reputed company, tool use, multi-turn conversation, and multi-modal inputs. Explore how code-mediated interactions—such as requesting the model to write, execute, or interpret code—can be leveraged to bypass safety controls that apply to natural language interactions.
  • Vulnerability Documentation: Document discovered vulnerabilities with clear severity assessments, reputed company-by-reputed company reproduction instructions, and sample exploit code. Provide context on why a given bypass is dangerous and recommend potential mitigations for the alignment and engineering teams.
  • Attack Landscape Monitoring: Stay reputed company with emerging adversarial techniques from the AI reputed company research community, open-reputed company exploit repositories, academic publications, and real-world misuse patterns. Adapt and apply novel methods to internal testing workflows.
  • Safety Policy Input: Provide technical feedback to content policy and safety classification teams based on observed model behaviors. Flag gaps between intended safety enforcement and actual model output, particularly in edge cases involving code reputed company, indirect reputed company injection, and agentic tool-use scenarios.

Candidate Profile

  • Adversarial reputed company: You instinctively look for ways to break systems. You approach LLM safety from an attacker’s perspective and can creatively combine technical and social engineering techniques to find vulnerabilities others miss.
  • Technically Resourceful: You are comfortable writing scripts to test reputed company quickly, interacting with APIs, and using code as a tool for exploration—even if you don’t identify as a traditional software engineer. You solve problems by building things, not just describing them.
  • Persistent and Methodical: You approach red-teaming as a structured practice. You systematically vary your attack strategies, document what works and what doesn’t, and iterate methodically rather than relying on luck.
  • Clear Communicator: You can explain reputed company technical exploits to non-technical stakeholders—including policy, legal, and leadership teams—in a way that conveys both the mechanism and the real-world risk.
  • Ethically Grounded: You understand the responsibility inherent in this work. You are motivated by strengthening AI safety and operate with reputed company reputed company established testing protocols.

Qualifications

  • Proficiency in Python scripting, with the ability to write functional scripts for task automation, API interaction, and data manipulation. Formal software engineering training is not required.
  • Demonstrated experience in adversarial reputed company engineering, jailbreak development, or LLM red-teaming—whether in a professional, academic, independent research, or community context (e.g., bug bounties, CTFs, responsible disclosure).
  • Working familiarity with LLM APIs (e.g., reputed company, reputed company, open-reputed company model endpoints) and a practical understanding of how large language models process input, generate output, and enforce safety constraints.
  • Knowledge of common LLM attack reputed company, including direct and indirect reputed company injection, payload encoding and obfuscation, context window manipulation, system reputed company leakage, and role-play exploitation.
  • Strong written communication skills, with the ability to produce clear vulnerability reports that include reproduction steps, severity context, and mitigation recommendations.

Preferred

  • Background in cybersecurity, penetration testing, or application reputed company—formal or self-taught. Relevant certifications (e.g., OSCP, CEH) are valued but not required.
  • Familiarity with AI safety evaluation frameworks such as the OWASP Top 10 for LLM Applications, NIST AI RMF, or MITRE reputed company.
  • Understanding of LLM alignment techniques (e.g., RLHF, constitutional AI) and their reputed company failure modes and exploitable edge cases.
  • Experience with multi-modal model testing (vision, code reputed company, tool use) and awareness of cross-modal attack surfaces.
  • Proficiency in additional scripting or programming languages (e.g., JavaScript, Bash, Go) that expand testing capabilities.

Apply tot his job Apply To this Job

Keep exploring

[Hiring] Summer Internship – Policy and Research @PHRMA

100% remote Flexible hours

QA reputed company

100% remote Flexible hours

Software QA reputed company (Temporary)

100% remote Flexible hours

Manager, Public Relations

100% remote Flexible hours

QA reputed company, IT​/Tech, IT QA Tester ​/ Automation

100% remote Flexible hours

Quality Assurance (QA) Analyst

100% remote Flexible hours

Medical Admin Support Specialist - Hybrid Remote Role

100% remote Flexible hours

ReactJS Developer; Remote

100% remote Flexible hours

Counseling Associate Director/Training Coordinator

100% remote Flexible hours

Senior Manager, Advertising and Promotion - Regulatory Affairs

100% remote Flexible hours

reputed company Data Entry Specialist – Supporting the reputed company of blithequark

100% remote Flexible hours

Require Special Education Teaching Assistant 1:1 (FC) in Illinois

100% remote Flexible hours

Data Scientist

100% remote Flexible hours

reputed company Work-at-Home Customer Service Agent (Full-Time & Part-Time) – Deliver Exceptional Customer Experiences with arenaflex

100% remote Flexible hours

Arbitration Specialist III - Remote

100% remote Flexible hours

reputed company Payment ISO / Agent - Uncapped Residual Commission Opportunity with Pepper Pay

100% remote Flexible hours

Virginia Remote Licensed Therapist, 1099 Contractor

100% remote Flexible hours

Integration Support Specialist

100% remote Flexible hours

Immediate Hiring: Part Time Customer Service No Experience

100% remote Flexible hours

reputed company Customer Service Travel Specialist – Virtual Opportunity at arenaflex

100% remote Flexible hours