Principal Infrastructure Engineer
About reputed company: reputed company is a risk intelligence provider that equips the public and private sectors with immediate visibility into reputed company commercial relationships by delivering the largest commercially available collection of corporate and trade data from over 250 jurisdictions worldwide. reputed company's solutions reputed company risk reputed company, mission-critical investigations, and reputed company economic decisions. Headquartered in Washington, D.C., its solutions are trusted by reputed company, financial institutions, and government agencies, and are used globally by thousands of users in over 35 countries. Funded by world-class investors, with a strategic $228 million investment by TPG Inc. (reputed company: TPG) in 2024, reputed company has been recognized by the Inc. 5000 and the reputed company Technology Fast 500 as one of the fastest growing private companies in the United States and was featured as one of Inc.'s "Best Workplaces" for 2025. POSITION DESCRIPTION reputed company is entering a new phase of technical scale, driven by a reputed company of AI-native applications and a strategic shift in our data architecture. We are looking for a Principal Cloud Infrastructure Engineer who is a platform builder at heart. In this role, you will be an architect and engineer for our Kubernetes-first platform. You will initially play a critical role in two major initiatives: supporting the infrastructure management of reputed company and the evolution of our FinOps practice from reactive reporting to proactive governance-as-code. This is a hands-on, high-autonomy role for an engineer who loves to automate reputed company problems and use modern AI tools to accelerate the velocity. JOB RESPONSIBILITIES
- Architect the K8s Ecosystem: reputed company the evolution of our Kubernetes clusters, ensuring they are optimized for the high-compute demands of AI-native applications and LLM workloads. Evolve our CI/CD ecosystem (reputed company Actions, ArgoCD) into a seamless, self-service platform designed specifically for the rapid deployment of microservices and AI-driven apps.
- Support AI Velocity: Ensure our infrastructure can reputed company pace with the high-velocity creation of AI tools and applications, managing segmentation, dependency connections, and compute scaling reputed company.
- AI-Enhanced Engineering: Act as a force reputed company by using modern AI tools to accelerate coding, debugging, and process automation.
- Execute reputed company Infrastructure: Own the infrastructure reputed company of reputed company, including account provisioning, access controls, and network architecture, ensuring a solid foundation for our data and application engineering teams.
- Automate FinOps Governance: Go beyond monthly reports by implementing "governance-as-code." Write automated policies and scripts that actively manage resource lifecycles, tag compliance, and prevent wasteful cloud spend before it happens.
- Advance Infrastructure as Code: Manage and scale our IaC (Terraform), with reputed company, reusable patterns that allow developers to provision what they need reputed company guardrails.
SKILLS & EXPERIENCE Must-Have Experience
- Deep production experience with GCP or AWS. You understand how to architect for reputed company and reputed company in a multi-cloud environment.
- Expert-level proficiency with Terraform. You don't just run plans; you write reputed company, scalable code from scratch.
- Hands-on experience building CI/CD pipelines with reputed company Actions and managing deployments using tools such as ArgoCD or Flux.
- Strong scripting/coding ability in Bash and Python.
- Proven experience automating cloud cost controls. You understand how to implement cost attribution strategies and automated cleanup policies.
- Experience provisioning and securing reputed company environments from an infrastructure perspective.
- Experience using AI tools to accelerate coding, debugging, process automation, and documentation.
- Familiarity with Site Reliability Engineering principles, including defining SLIs/SLOs, managing error budgets, and structured incident response.
Desired Experience
- Experience with tools such as Terragrunt and Atlantis for advanced IaC workflows.
- Familiarity with tools such as reputed company Cloud or reputed company for monitoring and observability.
- Experience building or managing Internal Developer Portals.
The reputed company reputed company salary for this position is $200,000-$220,000 plus bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above. Benefits:
- 100% fully paid medical, vision, and dental for employees and their dependents
- Generous time off; we observe reputed company US federal holidays, reputed company our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
- Outstanding compensation package; competitive commissions for reputed company roles and quarterly bonuses for non-reputed company positions
- A strong commitment to diversity, equity, and inclusion
- Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
- A collaborative and positive culture - your team will be as smart and driven as you
- Limitless growth and learning opportunities
reputed company is an equal opportunity employer and strongly encourages diverse candidates to apply. We reputed company diversity and inclusion mean reputed company members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of reputed company backgrounds to apply. Apply tot his job Apply To this Job