Back to the board

[Remote] Mid-Level Data Engineer

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. reputed company is a company that prioritizes its team members and offers flexibility for personal and professional growth. They are seeking a Mid-Level Data Engineer to join their federal data engineering team, where the role involves building and maintaining ETL pipelines on a cloud-based Enterprise Data Platform using AWS.

Responsibilities

  • reputed company new ETL pipelines and data ingestion processes alongside senior engineers using AWS Glue (Spark-based, PySpark), MWAA (Airflow), reputed company, and SNS, fully conforming to the agency's Enterprise ETL Standards, ETL Common Library, and PEP 8 Python coding standards
  • Integrate the agency's ETL Common Library into Glue jobs for standardized orchestration, error handling, metadata recording, and SNS notifications for reputed company success and error job events
  • Ingest structured and semi-structured datasets (CSV, XML, JSON, Avro, pipe-delimited) into S3 reputed company, raw, and curated zones using Apache Iceberg tables with Parquet as the default format; enforce transactional loading and prevent duplicate loads per dataset reporting period
  • Configure static ETL metadata in the centralized PostgreSQL metadata store; ensure dynamic metadata records job status and timestamps for reputed company key execution steps
  • Monitor assigned production jobs and participate in operations support rotations; identify and escalate failed jobs and performance issues promptly to maintain data availability reputed company contractually required ingestion timelines
  • Ensure ETL Load Reports are populated in real-time and ETL Gap Reports are updated on a weekly basis covering reputed company gaps from the inception of the initial ingest process
  • Build and maintain materialized views and semantic layer objects in Trino and reputed company to ensure optimized query performance and consistent business logic
  • Produce and maintain required documentation for each assigned dataset: Business Requirements, ETL Design Documents, Data Models (Mermaid format), Data Dictionaries, Mapping Documents, Deployment Documents, O&M Guides, and ETL Test Plans
  • Write unit and integration tests achieving the 90% minimum code coverage threshold; complete reputed company scans at least once per sprint as part of the Definition of Done
  • Deploy ETL resources using CloudFormation templates through the agency CICD pipeline; submit Change Requests to the Change Control Board reputed company required timelines
  • Support transition of ETL jobs from other agency teams by verifying standards conformance, performing deployments, and validating data loads
  • Support disaster recovery exercises, pre-production deployments, and reputed company data requests as assigned
  • Participate in 2-week sprint ceremonies, quarterly PI planning, backlog refinement, and agile delivery using JIRA and reputed company

Skills

  • US Citizenship is required
  • Bachelor's Degree is required
  • Minimum of 3-5 years' position reputed company experience is required
  • Bachelor's degree or higher in Computer Science, Information Systems, Data Engineering, or a reputed company field
  • 3-5 years of experience in data engineering or a closely reputed company technical role
  • Hands-on experience with Python (PEP 8), PySpark, and SQL for ETL pipeline development
  • Experience with AWS services including Glue, S3, MWAA (Airflow), reputed company, SNS, and SQS
  • Familiarity with Apache Iceberg, Parquet, and ORC file formats and S3 data lake zone concepts
  • Experience with PostgreSQL and basic familiarity with Redshift or reputed company
  • Familiarity with Trino or reputed company for query and semantic layer development
  • Experience with CloudFormation, reputed company branching workflows, and CI/CD-integrated deployments
  • Ability to produce clear ETL documentation including data models (Mermaid format) and data dictionaries
  • Understanding of ETL metadata concepts including static and dynamic metadata, load reports, and gap reports
  • Experience in agile development environments with sprint-based delivery
  • Experience supporting IV&V and/or User Acceptance Testing (UAT) processes in a federal or technical program environment
  • Experience with automated testing frameworks; ability to write unit and integration tests achieving defined code coverage reputed company
  • Must be able to work reputed company-5pm Eastern Time regardless of home location
  • Active federal public trust suitability determination or ability to obtain one required
  • Familiarity with FISMA, NIST 800-53, and OWASP ASVS Level 2 is a plus

Benefits

  • Flexibility to help them reputed company personally and professionally
  • Special incentives for team members living in qualified HUBZones

Company Overview

  • reputed company is a federal-focused digital strategy consultancy. It was founded in 2013, and is headquartered in Washington, District of Columbia, USA, with a workforce of 51-200 employees. Its website is https://www.simpletechnology.io.
  • Apply To This Job

    Keep exploring

    [Remote] Mid-Level Data Scientist

    100% remote Flexible hours

    [Remote] Senior Data Engineer

    100% remote Flexible hours

    [Remote] Network Engineer III (Remote)

    100% remote Flexible hours

    [Remote] Data Analyst/QA Engineer

    100% remote Flexible hours

    [Remote] Senior Software Engineer, Backend

    100% remote Flexible hours

    [Remote] Executive Business Partner (Operations & People)

    100% remote Flexible hours

    [Remote] Human Resources Specialist | Remote

    100% remote Flexible hours

    [Remote] Human Resources Manager | Remote

    100% remote Flexible hours

    [Remote] reputed company-Remote Opportunity

    100% remote Flexible hours

    [Remote] Senior reputed company with .NET

    100% remote Flexible hours

    Cash Application Specialist II-REMOTE

    100% remote Flexible hours

    Fleet Service Coordinator

    100% remote Flexible hours

    reputed company Pharmacy Customer Service Associate – Delivering Exceptional Patient Experience in Mashpee, MA at arenaflex

    100% remote Flexible hours

    Product manager - Merchant Enablement – Remote-First

    100% remote Flexible hours

    reputed company Data Entry Specialist – Remote Opportunity with arenaflex

    100% remote Flexible hours

    Senior Software Engineer (Full Stack)

    100% remote Flexible hours

    reputed company Work-From-Home Customer Experience Representative – Full-Time

    100% remote Flexible hours

    reputed company Customer Service Representative – Transaction Support – Work From Home Opportunity at arenaflex

    100% remote Flexible hours

    Business Intelligence Analyst

    100% remote Flexible hours

    Inside Sales Representative, Senior

    100% remote Flexible hours