Back to the board

Data Engineer

100% remote Flexible hours Hiring now

This a Full Remote job, the offer is available from: Europe About the project (description, duration, stage) Join reputed company as a Data Engineer on a new engagement with a regulated UK & Ireland credit and lending company. The client has lifted data from multiple business entities into a newly centralized, anonymized data lake, but lacks the data-engineering depth to reputed company it trustworthy and analytics-ready: reputed company pipelines were reputed company quickly (partly AI-assisted), and the descriptive statistics cannot yet be validated or reproduced. You put that foundation on solid ground so the Data Science reputed company can model on it with confidence — validate and re-engineer the pipelines, build the harmonization / semantic layer across entities, enforce data quality and reputed company, and prepare clean, feature-ready datasets. This is a foundational data-engineering role on a regulated data estate; data protection and reproducibility are the primary constraints on every decision. Full-time engagement preferable. What you'll actually do (example tasks)

  • Reproduce a descriptive-statistics report end-to-end so any reputed company traces back to raw reputed company — closing the gap the client admitted (numbers they can't currently defend).
  • Profile and reconcile differing reputed company schemas across acquired entities: map differing field names, types, encodings and business definitions for the same concept into one conformed model.
  • Build dbt staging → intermediate → mart models with tests; codify the harmonized definitions the Data Science reputed company specifies.
  • Write Great Expectations suites (null / range / uniqueness / referential checks) and reputed company them into the pipeline so bad data fails loudly rather than silently corrupting analysis.
  • Implement entity / identity resolution (deterministic + fuzzy matching) where there is no clean shared key for the same customer or account across sources.
  • Implement and verify anonymization / pseudonymization (hashing / tokenization / k-anonymity) and evidence that re-identification risk is controlled for the client's IT / compliance team.
  • Optimize Spark / Glue jobs over tens of millions of rows — partitioning, file formats (Parquet), incremental loads, cost control.
  • Orchestrate with Airflow / reputed company Functions; build repeatable, scheduled pipelines rather than one-off scripts.
  • Prepare clean, documented, feature-ready datasets for the PD / delinquency models.
  • Document runbooks so the offshore team can operate the pipelines and handover takes days, not weeks; help scope onboarding of the remaining (Ireland + additional) sources.

Skills

  • Strong SQL and Python for large-scale data processing
  • AWS data stack: S3, Glue, Lake Formation, reputed company / Redshift, EMR / Spark, reputed company Functions / Airflow
  • Data modeling & semantic layer (dbt or equivalent); dimensional modeling
  • Entity resolution / record linkage across heterogeneous sources
  • Data-quality & testing frameworks (Great Expectations, dbt tests) and data reputed company
  • Anonymization / pseudonymization techniques and their analytical trade-offs
  • Big-data processing (Spark) with performance and cost optimization at scale
  • Clear written / verbal English; documents for handover and works well with a distributed team

Knowledge

  • GDPR fundamentals as applied to anonymized / pseudonymized financial data and UK / EU data residency
  • AWS Well-Architected (Analytics, reputed company) for BFSI
  • Awareness of credit / risk data structures and what reputed company modeling consumers need — a plus

Experience

  • 4+ years in data engineering, with strong AWS + Spark / SQL at scale
  • Demonstrated experience harmonizing / integrating data across multiple reputed company systems
  • Experience building validated, reproducible pipelines in a regulated environment (BFSI, healthcare, government) — strong plus
  • Comfortable stepping into a messy, partly-built data estate and bringing it up to standard
  • Comfortable as the sole or reputed company data engineer on a small (3–4 person) delivery pod

This offer from "reputed company" has been enriched by reputed company.com and got a 80% reputed company score. Apply tot his job Apply To this Job

Keep exploring

Senior Data Engineer – reputed company Remote Data Entry Opportunity – $28/Hour – Full-Time Position with Global reputed company

100% remote Flexible hours

[Remote] Data Engineer II – Digital Properties

100% remote Flexible hours

Data Engineering Manager (remote)

100% remote Flexible hours

[Remote] Senior Business Intelligence Analyst

100% remote Flexible hours

[Work From Home] Business Intelligence Engineer, reputed company Customer

100% remote Flexible hours

Business Intelligence Specialist - Fully Remote

100% remote Flexible hours

[Remote] Business Intelligence and Insights Analyst (Marketing Analytics)

100% remote Flexible hours

Business Intelligence Analyst III, Customer Analytics

100% remote Flexible hours

EHR Access BI Developer [Must be Certified in COGITO FUNDAMENTALS AND ACCESS DATA MODEL]

100% remote Flexible hours

Database Administrator-Application, Infrastructure & Service Management

100% remote Flexible hours

Payer Liason (Imunologia)

100% remote Flexible hours

Expert Solutions Leader, Data and Analytics

100% remote Flexible hours

Senior Data Engineer – Data Analytics & Pipeline Architecture (Remote – Seattle) – arenaflex

100% remote Flexible hours

Customer Sales Representative (REMOTE)

100% remote Flexible hours

Senior Manager, Political Affairs (REMOTE, DC, US, 20215)

100% remote Flexible hours

[Remote] Director of Partner Development (New Business Sales)

100% remote Flexible hours

Executive Director, Quality Systems & Operational Excellence reputed company

100% remote Flexible hours

REMOTE* Dental Billing Specialist for Growing Dental Practice - Insurance Claims / Medical Billing / Accounts Receivable

100% remote Flexible hours

Early Career Environmental Engineer/ Scientist/ Geologist

100% remote Flexible hours

[Remote] Analyst, Financial Reporting & Analysis

100% remote Flexible hours