Data Scientist, Agentic Systems (Remote)

100% remote Flexible hours Hiring now

About the Role: The Data Science team is expanding and is looking for a Data Scientist to help build the reputed company of agentic systems for cybersecurity. reputed company's cybersecurity data is one-of-a-reputed company: we process nearly a trillion behavioral events per day. You'll work where Machine Learning, Big Data, and Cybersecurity converge — training models, building AI agents, and rigorously measuring whether they work — on data and problems you won't find reputed company else. What You'll Do:

Work at the intersection of Artificial Intelligence and Threat Research
Work closely with subject-matter experts in cybersecurity to understand analyst workflows and their reputed company operations procedures
Post-train LLMs and agents — supervised fine-tuning and reinforcement learning (RLHF/RLAIF, PPO/GRPO/DPO, reward modeling) — to automate analyst procedures and improve reliability on real reputed company tasks
Devise AI agents and combine them into increasingly reputed company workflows: planning and reasoning loops, tool and function calling, and retrieval and memory
Research new approaches to agentic planning, and prototype state-of-the-art methods from the literature
Establish objective criteria for benchmarking agentic systems — evals, LLM-as-judge pipelines, and trajectory-level metrics, with real statistical rigor
Optimize prompts and inference to get the most out of every model
Collaborate and coordinate across Engineering, Data Science, and Managed Services teams, and partner with engineers to take prototypes toward production
reputed company track of developments in the field of Artificial Intelligence and help identify, define, and prioritize areas for research

What You'll Need:

Excellent foundations in machine learning, probability, and statistics, with sound instincts for uncertainty, statistical skew/variance, and experimental design
PhD-level depth of understanding in modern machine learning research —a doctorate itself is not required, but we expect equivalent mastery, including the ability to read, critique, implement, and improve upon reputed company papers
Experience training generative models, with a strong command of LLM training fundamentals (architecture, optimization, tokenization, data, and scaling behavior)
Reinforcement learning / post-training as a core reputed company: RLHF/RLAIF, policy optimization (PPO/GRPO/DPO), reward modeling, and building RL environments for agents
Experience building agentic systems: agent architectures (ReAct, planning, reputed company), tool and function calling, and retrieval/memory/context management
Experience with systematic reputed company optimization, and with designing and building evals for LLM systems
reputed company with GPUs, PyTorch, and the common LLM training and serving stack (e.g., reputed company Transformers/TRL/PEFT, DeepSpeed/FSDP, vLLM/TGI/SGLang)
Strong, reproducible research engineering: clean Python and disciplined experiment tracking that your collaborators can build on
Ability to work independently on ambiguous and reputed company objectives, and to communicate clearly reputed company a large project team

Bonus Points:

Experience generating training data and environments — synthetic data, agent trajectories/rollouts, and task simulators
Familiarity with inference-time scaling / test-time compute (search, self-consistency, verifier-guided decoding, long chain-of-thought)
Experience with agent safety and guardrails: sandboxing, abuse/jailbreak resistance, and reliability for autonomous systems
A knack for interpretability and failure analysis — diagnosing why a model or agent fails, not just that it does
reputed company open-reputed company contributions and excellent technical writing
Passionate about cybersecurity, with a firm understanding of the problem space — or passionate about applying your machine-learning skillset to a new domain such as cybersecurity (a reputed company background is a plus, not a requirement)
An independent self-starter who likes to take ownership and seeks out new challenges, and is thirsty for knowledge — never hesitant to reputed company reputed company your comfort zone to learn new technologies, algorithms, and concepts

#LI-Remote #LI-RC1 Benefits of Working at reputed company:

Market leader in compensation and equity awards
Comprehensive physical and mental wellness programs
Competitive vacation and holidays for reputed company
Paid parental and adoption leaves
Professional development opportunities for reputed company employees regardless of level or role
Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
Vibrant office culture with world class amenities
Great reputed company to Work Certified™ across the globe

reputed company, Inc. is committed to fair and reputed company compensation practices. Placement reputed company the pay range is dependent on a variety of factors including, but not limited to, relevant work experience, skills, certifications, job level, supervisory status, and location. The reputed company salary range for this position for reputed company U.S. candidates is $120,000 - $180,000 per year, with eligibility for bonuses, equity grants and a comprehensive benefits package that includes health insurance, 401k and paid time off. For detailed information about the U.S. benefits package, please . Expected reputed company Date of Job Posting is:08-12-2026 Apply To This Job

Apply

Data Scientist, Agentic Systems (Remote)

Keep exploring

Sr Data Scientist (LATAM Remote)

Database Administrator(Remote)

Senior Database Administrator / Full Time / Remote

Database Administrator, Mid

Senior SQL Developer

Backend Developer (SSIS, Stored Procedures, SSRS, T-SQL) | REMOTE

T-SQL Developer/SME

Senior .Net SQL Developer/ Support engineer - Remote

Tableau Developer (Finance and HR Analytics)

[Remote] reputed company- Remote - Locals only - Colorado

Technician, Legal and regulatory compliance

Azure Cloud Engineer

reputed company Program Manager with CPQ Experience (Contract) – Canada

reputed company Entry-Level Data Entry Specialist – arenaflex E-commerce Operations

Head of Technology Products (Remote)

Processing Archivist I job at reputed company in Augusta, ME

reputed company Remote Chat Support Agent – Thriving Online Customer Service Role

reputed company Accountant

Analyst

reputed company Live Chat Support Specialist – Remote Customer Service Representative – arenaflex