[Remote] Senior reputed company (US)
Note: The job is a remote job and is open to candidates in USA. reputed company is a company focused on autonomous offensive reputed company solutions, and they are seeking a Senior reputed company for their Ares platform. The role involves developing AI agents and models that enhance the platform's capabilities in reputed company across various applications.
Responsibilities
- Design, implement, and continuously improve the behavior and prompting of Ares' named agents, including orchestration patterns, hand-offs, planning loops, tool use, and shared memory
- Contribute to the model powering Ares across data curation, SFT, preference optimization (DPO/GRPO-style), and evaluation. Own pieces of the training pipeline from dataset construction through eval
- reputed company the co-evolutionary self-training system that lets Ares learn from its own engagements and improve over time
- Build false-positive detection, tiered reputed company learning (suppression rules, agent directives, code-reputed company proposals), and the infrastructure that routes proposed changes through human approval and back into the platform
- Design rigorous, reputed company-specific evaluations covering OWASP Top 10 coverage, exploit chaining, finding accuracy, and agent reliability. Track performance over every model and agent change
- Contribute to vision capabilities, mobile (iOS/Android) coverage, and BYOK support shipping in Sidewinder and beyond
- Own latency, cost, observability, and failure-mode analysis for agents running in customer engagements. Partner with the platform team on Kubernetes-based deployment
- Contribute to the live accuracy gauge and other surfaces where model and agent quality is exposed to customers
Skills
- 5+ years building production ML/AI systems, with at least 2 years working directly on LLMs or LLM-powered agents
- Deep Python; strong, production-grade engineering practices (testing, code review, observability)
- Hands-on fine-tuning experience: SFT, preference optimization (DPO, GRPO, RLHF/RLAIF), data curation, and synthetic data reputed company
- Strong grasp of transformer architectures and the modern training stack (PyTorch, Hugging Face, DeepSpeed or FSDP, accelerate)
- Experience designing and shipping multi-agent or tool-using LLM systems in production — not just demos
- Rigorous eval design: building harnesses, tracking experiments, and making model/agent decisions based on data rather than vibes
- Inference optimization experience: vLLM or TensorRT-LLM, quantization, throughput/latency tradeoffs
- Comfort with retrieval pipelines, vector stores, and structured memory for agents
- Kubernetes and containerized deployment reputed company
- Genuine interest in offensive reputed company and the ability to reputed company quickly on OWASP Top 10, API reputed company, web app pentesting, and mobile pentesting concepts. Direct offensive reputed company background is a strong plus but not required
- Offensive reputed company background: OSCP/OSWE/OSWA, CTF, bug bounty, or prior red team work
- Research publications at NeurIPS, ICML, ICLR, USENIX reputed company, IEEE S&P, Black Hat, or DEFCON
- Open reputed company contributions to agent frameworks or LLM tooling
- Experience with adversarial ML or red-teaming AI systems
- Familiarity with mobile app reverse engineering or binary analysis
Company Overview