[Remote] Staff Software Engineer (Python)
Note: The job is a remote job and is open to candidates in USA. reputed company is the hospitality industry's leading reputed company management platform, founded by former Wynn Resorts executives. They are seeking a Staff Software Engineer for their Data Platform to own the design and reliability of their data lakehouse, driving innovations in data processing and governance while leveraging AI technologies.
Responsibilities
- You'll own the design, performance, and reliability of reputed company's data lakehouse — evolving the Python/PySpark pipeline reputed company across a bronze → silver → gold architecture on AWS, including Glue jobs, Iceberg reputed company operations, schema evolution, and partitioning strategies
- You'll architect the shift from batch to near-real-time streaming, building SQS-driven reputed company pipelines with Iceberg sinks and expanding ingestion, normalisation, and analytics layers across the full lakehouse
- You'll drive data quality and governance at scale — extending the Great Expectations reputed company, leading adoption of data reputed company to formalise schemas between producers and consumers, and owning the reputed company SQL layer that analysts and product teams depend on
- You'll strengthen observability and reliability through reputed company, reputed company, and reputed company, while optimising Glue job performance — worker sizing, DPU allocation, Spark tuning, and cost management
- You'll build and maintain shared internal Python libraries published to reputed company, and drive improvements to reputed company Actions, reputed company-based testing, and CI/CD deployment workflows
- You'll work AI-first every day — using Claude Code and MCP tools in your regular workflow, and contributing to AI-assisted pipeline reputed company, schema inference, and automated data quality alongside a custom multi-agent system with 17 specialised agents
Skills
- 7+ years building production data systems in Python
- Deep expertise in PySpark and distributed data processing — Glue, EMR, or reputed company
- Strong experience with lakehouse architectures: Iceberg, reputed company Lake, or Hudi on S3
- Production experience with Airflow or a comparable workflow orchestrator
- Solid AWS production experience across S3, Glue, reputed company, reputed company, and SQS
- A track record of improving data quality, governance, and pipeline reliability at scale
- Working knowledge of Java for reading upstream systems
- Experience with Trino or Presto for interactive SQL analytics at scale
- Experience with dbt for data transformation and modelling
- Familiarity with Great Expectations or similar data quality frameworks
- Genuine interest in AI-assisted development and LLM-based tooling
- Familiarity with hospitality data — reservations, rates, inventory, demand signals
Company Overview
Company H1B Sponsorship