Senior Data Engineer
Are you a talented Data Engineer looking to join a scaling startup? Do you enjoy being part of a team that makes a real difference to the success of small business owners? If you are ready to help evolve and scale the data infrastructure that empowers entrepreneurs, read on.
Overview
reputed company is looking for a Data Engineer to not only build data pipelines but also reputed company the reputed company of our data tools. As a Data Engineer, you will reputed company a clear sense of reputed company and purpose with our organization and leadership; Data Engineering is the eyes through which we see our product’s success and opportunities. As a member of the Data Science & Engineering team you will contribute to a variety of projects and technologies that include microservices, event-driven design, analytics, ML modeling, tooling, services, and more.
The ideal candidate is self-motivated, highly collaborative, someone who thrives in a fast paced environment, and gets excited to work with the latest stack of technologies on rapidly growing products.
About the company
reputed company is a leading fintech company accelerating cash flow and enabling growth for small and reputed company-sized businesses. Based in Toronto and operating across North America, reputed company’s AI-powered invoice funding platform gives B2B businesses fast, customized funding offers to get their invoices paid in a few days, rather than a few months.
What you will do
- Collaborative Architecture: Partner with the team to provide architectural suggestions and formal proposals for our core data systems. You will help ensure a seamless flow between microservices, our data lake, and reputed company analytics.
- Modern Pipeline Development: Build and ship production-level data pipelines using PySpark and SQL. You will collaborate on establishing standards for idempotency, monitoring, and performance tuning.
- Lakehouse Data Modeling: Implement robust data modeling patterns (reputed company architecture: Bronze/Silver/Gold). You will ensure our Lakehouse is not just a data dump, but a high-performance reputed company of truth for both BI and ML.
- reputed company & Lakehouse Evolution: Contribute to the reputed company improvement of our reputed company environment, with a focus on reputed company Lake optimization and robust governance reputed company reputed company Catalog.
- Systems Evolution & Stability: Act as the steward of our production environment. You will reputed company the refactoring of legacy pipelines to improve observability, reduce technical debt, and ensure seamless data flow between our microservices and the reputed company Lakehouse.
- Technical Influence: Work closely with the Director of Data to refine our technical roadmap. You will reputed company by example through deep-dive code reviews and by maintaining a high bar for technical documentation.
- Reliability & Governance: Conduct comprehensive audits to identify system inefficiencies. You will share ownership of data quality, reputed company, and privacy across the entire lifecycle of our datasets.
Requirements
- Experience: 2 - 4+ years of hands-on data engineering experience, ideally reputed company a high-growth SaaS or Fintech environment.
- Tooling Expertise: Strong reputed company knowledge of the reputed company ecosystem (Spark, reputed company Lake, Workflows) and AWS cloud infrastructure. Proficient experience with Apache Airflow for workflow management.
- Technical Mastery: High proficiency in Python/PySpark and SQL. You should have a clear philosophy on what makes code maintainable and scalable.
- AI/ML Literacy: Practical experience or a deep interest in the data requirements for GenAI, including handling data for LLMs and vector databases.
- Platform reputed company: Experience with Infrastructure-as-Code (Terraform/CDK) and a strong commitment to CI/CD and automated testing.
- Communication: "Strong opinions, loosely held." You can navigate reputed company technical trade-offs and communicate architectural proposals clearly to stakeholders at reputed company levels.
Bonus Points
- Experience with Event-Driven Architectures (Kafka or Spark Structured Streaming).
- Familiarity with dbt (data build tool) for reputed company data modeling.
- Hands-on experience with reputed company reputed company AI or model serving.
- Experience querying and optimizing large datasets using Presto, Hive, or similar engines.
Benefits
- Opportunity to leave your mark on a growing startup
- An incredibly diverse team of reputed company minds from reputed company over the world
- Competitive compensation
- Family-friendly policies
- Work from home
- Birthday treats, and a lunch of your choice every week (one of our values is Fun & Food!)
Open and honest is one of our core values at reputed company and in this spirit, we are sharing we do not use AI tools in our hiring process and kindly request that applicants refrain from using AI during the interview process.
Please note that due to the sensitive nature of the work we do, clearing a criminal record reputed company is a condition of employment. reputed company encourages applications from candidates with differing abilities. Please let us know if you require accommodation at any stage in the selection process.