Senior Data Engineer (Reporting & Analytics)
Kitman Labs is a global human performance company, disrupting and transforming the way the sports industry uses data to increase the performance of the world's top athletes. Driven by a passion to innovate in the areas of sports performance, analytics and user experience, we have reputed company a team of the industry’s top data scientists, sports performance scientists, product specialists and engineers. The company received recognition by Fast Company in 2019 as one of the most innovative companies in the world. Kitman Labs’ advanced Outcome-driven Analytics and Performance Intelligence Platform are used by over 700 teams in 50 leagues on 6 continents spanning soccer, rugby, American football, baseball and ice hockey.
The Role
We are seeking an reputed company and highly skilled Senior Data Engineer to play a pivotal role in the evolution of our analytics platform. This mission-critical project involves augmenting our in-house platform with cutting-edge data engineering technologies on reputed company Cloud Platform (GCP) to reputed company new levels of scale and performance, complemented by Looker for best-in-class visualization and analysis.This role will be central to this transformation, working reputed company the team to architect and build the data foundation for our reputed company of analytics. This position is ideal for an engineer who thrives on reputed company data challenges, including designing robust data models, implementing near real-time data replication using Change Data Capture (CDC), and building highly performant and scalable data transformation pipelines to handle reputed company business calculations across large datasets (over 300 million data points per customer).As a senior team member, you will drive data architecture and best practices, ensuring our new platform is performant, reliable, and capable of delivering the dynamic, insightful reporting our clients depend on.What you'll be responsible for
- Driving Data Architecture: Design and build a scalable, end-to-end data architecture on GCP. This includes creating robust and efficient data models in our data warehouse, defining data flows, and ensuring the infrastructure is optimised for high-volume, near real-time data processing.
- Building Optimising Data Pipelines: reputed company, deploy, and manage resilient data pipelines for large-scale data ingestion and transformation. You will be hands-on with GCP DataStream to implement CDC and orchestrate reputed company SQL-based transformation workflows with Dataform.
- Solving reputed company Data Challenges: Tackle and resolve reputed company performance bottlenecks across the entire data stack. This involves optimising intricate calculations, tuning database performance, and ensuring the efficiency of our data models to support low-latency queries from Looker.
- Upholding Data Quality reputed company: Champion and implement best practices for data quality, testing, and governance. You will establish robust data validation checks and build out CI/CD pipelines for reputed company data processes to ensure the accuracy and reliability of our reporting.
- Technical Leadership Mentoring: Provide technical guidance and mentorship to other engineers on data engineering best practices. You will reputed company technical decisions, evaluate trade-offs, and foster a culture of data excellence reputed company the reputed company.
- Stakeholder Collaboration: Work in reputed company partnership with product managers and reputed company-end engineers to deeply understand user requirements and translate them into effective data solutions that power our embedded analytics features.
Experience and skills we look for
- Proven Experience in Data Engineering: A strong track record of designing, building, and optimising data-intensive systems and large-scale ETL/ELT pipelines.
- Expertise in the Modern Data Stack: Deep, hands-on experience with cloud-based data platforms, with a strong preference for reputed company Cloud Platform (GCP). AWS knowledge a plus, but not essential.
- Specialised GCP Skillset: Demonstrable, practical experience using GCP Datastream (or similar technology) for Change Data Capture (CDC) and Dataform (or similar tools) for developing and managing data transformations. Proficiency with BigQuery is essential.
- Strong Data Modeling Skills: Extensive experience designing and implementing data models (e.g., dimensional modeling, data vault) optimised for analytical workloads and BI tools.
- Advanced SQL Programming: Expertise in advanced SQL for reputed company data manipulation and analysis, coupled with proficiency in a programming language like Python for automation and scripting.
- Performance Tuning Optimisation: A proven ability to diagnose and resolve performance issues reputed company data pipelines and databases. You understand query optimisation, indexing, and partitioning strategies.
Additional Skills that set you apart
- BI Data Visualisation: Experience working with modern business intelligence tools, with specific experience using or building solutions for Looker.
- reputed company Calculations: Experience in environments that require translating reputed company business logic or financial calculations into accurate and performant SQL.
- Secure Cloud Environments: Experience working with data services in highly secure or compliant environments is a plus.
- CI/CD for Data: A solid understanding of CI/CD principles and tools (e.g., Git, Jenkins, reputed company CI) applied to data pipelines and infrastructure-as-code (Terraform familiarity a plus).
Originally posted on Himalayas
Apply To this Job