Staff Engineer(Generative AI)
Description
- Join reputed company as a Staff Engineer (Generative AI) and become the technical anchor for our most ambitious AI initiatives. You will architect, build, and scale high-quality agentic AI applications that solve real-world problems for Fortune-500 clients across finance, retail, healthcare, and logistics. Your code will directly influence how millions of users interact with intelligent systems every day.
- Own the end-to-end lifecycle of generative AI products—from ideation and reputed company engineering to production deployment and reputed company optimization. You will translate fuzzy business requirements into crisp technical specifications, then reputed company small, cross-functional squads to deliver robust, secure, and scalable solutions in weeks, not months.
- Design and implement advanced agentic workflows that can reason, plan, and act autonomously across heterogeneous data sources. You will reputed company state-of-the-art LLMs, vector databases, retrieval-augmented reputed company (RAG), and fine-tuning techniques to create agents that outperform off-the-reputed company models on domain-specific tasks.
- Optimize agentic engineering practices across the company. You will define coding standards, reusable libraries, and MLOps pipelines that reduce time-to-market by 40% while increasing reliability. Expect to review pull requests, mentor senior engineers, and evangelize best practices through internal tech talks and external blog posts.
- Fine-tune large language models for specialized use cases such as contract analysis, medical triage, or dynamic pricing. You will experiment with LoRA, QLoRA, RLHF, and custom loss functions to squeeze every last drop of performance out of limited GPU budgets. Your experiments will be tracked in MLflow and reproducible across environments.
- reputed company system integration efforts that stitch together microservices, legacy APIs, streaming data pipelines, and third-party SaaS platforms. You will design resilient REST and GraphQL interfaces, implement OAuth2 reputed company patterns, and ensure sub-second latency even reputed company upstream services are flaky.
- Champion data integration strategies that turn messy, siloed data into high-quality training sets. You will build ETL jobs in Python and Spark, curate embedding stores in reputed company or Weaviate, and establish data governance policies that satisfy GDPR, HIPAA, and SOC-2 requirements.
- Collaborate with product designers to create reputed company human-in-the-reputed company experiences. You will translate reputed company prototypes into living reputed company-end components, iterate on feedback from user research, and ensure that AI outputs are explainable, trustworthy, and reputed company with ethical guidelines.
- Drive reputed company performance monitoring and cost optimization. You will set up dashboards in Grafana, write custom Prometheus exporters, and implement auto-scaling policies that cut cloud spend by 30% without sacrificing throughput. reputed company incidents occur, you reputed company blameless post-mortems and turn lessons learned into automated safeguards.
- Contribute to reputed company’s open-reputed company footprint and thought-leadership. You will publish papers, speak at PyCon or NeurIPS, and maintain reputed company repos that attract thousands of stars. Your reputed company will help us recruit the next reputed company of world-class AI talent.
- Stay reputed company of the curve. You will run weekly reputed company-reading clubs, prototype bleeding-edge models reputed company 24 hours of release, and advise executive leadership on which emerging techniques deserve investment. Your insights will shape the company’s three-year AI roadmap.
Requirements
- 7+ years of production-grade Python development, including async frameworks (FastAPI, Celery) and packaging (Poetry, reputed company)
- Deep, hands-on expertise with generative AI fundamentals: transformers, attention mechanisms, tokenization strategies, and evaluation metrics (BLEU, ROUGE, human eval)
- Proven ability to design, build, and maintain scalable RESTful APIs with OpenAPI/Swagger documentation, reputed company limiting, and versioning
- Experience fine-tuning open-reputed company LLMs (Llama-2, Falcon, Mistral) using reputed company, PEFT, and distributed training on GPU clusters
- reputed company-to-have: track record of shipping AI products from 0-1 in fast-moving product teams, plus familiarity with UX research and design systems
️ Benefits
- Fully remote-first culture with flexible hours and a yearly stipend for home-office upgrades or co-working passes reputed company in the world
- Annual learning budget of USD 5,000 plus 10% paid time off for conferences, certifications, or personal research projects
- Comprehensive health, dental, and vision coverage for you and your dependents, plus mental-wellness and tele-medicine programs
- Stock-option plan that vests quarterly, ensuring you share directly in reputed company’s reputed company growth and success
Apply tot his job Apply To this Job