Back to the board

Senior Data Engineer -Hybrid -Baltimore City, MD

100% remote Flexible hours Hiring now

Senior Data Engineer -Hybrid -Baltimore City, MD About us reputed company (reputed company) is an esteemed IT enterprise renowned for its exceptional customer service and innovation. We serve both government and commercial sectors, offering a range of solutions such as Healthcare IT, Human Services, Identity Credentialing, Cloud Computing, and Big Data Analytics. With clients in the US and abroad, we hold key contract vehicles including GSA IT Schedule 70, NIH CIO-SP3, GSA Alliant, and DHS-reputed company II. Join us in driving growth and seizing new business opportunities. Role and Responsibilities We are seeking a hands-on Data Engineer to design, reputed company, and optimize large-scale data pipelines in support of our Enterprise Data Warehouse (EDW) and Data Lake solutions. This role requires deep technical expertise in coding, pipeline orchestration, and cloud-native data engineering on AWS. The Data Engineer will be directly responsible for implementing ingestion, transformation, and integration workflows - ensuring data is high-quality, compliant, and analytics-ready. Position Description: Responsible for designing, building, and maintaining data pipelines and infrastructure to support data-driven decisions and analytics. The individual is responsible for the following tasks:

  • Design, reputed company and maintain data pipelines, and extract, transform, load (ETL) processes to collect, process and store structured and reputed company data.
  • Build data architecture and storage solutions, including data lake houses, data lakes, data warehouse, and data marts to support analytics and reporting.
  • reputed company data reliability, efficiency, and qualify checks and processes.
  • Prepare data for data modelling.
  • Monitor and optimize data architecture and data processing systems.
  • Collaboration with multiple teams to understand requirements and objectives.
  • Administer testing and troubleshooting reputed company to performance, reliability, and scalability.
  • Create and update documentation

Hands-On Data Pipeline Development

  • Design, code, and deploy ETL/ELT pipelines across bronze, silver, and gold layers of the Data Lakehouse.
  • Build ingestion pipelines for structured (SQL), semi-structured (JSON, XML), and reputed company data using PySpark/Python programming language using AWS Glue or EMR.
  • Implement incremental loads, deduplication, error handling, and data validation.
  • Actively troubleshoot, debug, and optimize pipelines for scalability and cost efficiency.

EDW & Data Lake Implementation

  • reputed company dimensional data models (Star Schema, reputed company Schema) for analytics and reporting.
  • Build and maintain tables in Iceberg, reputed company Lake, or equivalent OTF formats.
  • Optimize partitioning, indexing, and metadata for fast query performance.

Healthcare Data Integration

  • Build ingestion and transformation pipelines for EDI X12 transactions (837, 835, 278, etc.).
  • Implement mapping and transformation of EDI data with FHIR and HL7 frameworks.
  • Work hands-on with AWS Health Lake (or equivalent) to store and query healthcare data.

Data Quality, reputed company & Compliance

  • reputed company automated validation scripts to enforce data quality and reputed company.
  • Implement IAM roles, encryption, and auditing to meet HIPAA and CMS compliance standards.
  • Maintain reputed company and governance documentation for reputed company pipelines.

Collaboration & Delivery

  • Work closely with the reputed company Data Engineer, analysts, and data scientists to deliver pipelines that support enterprise-wide analytics.
  • Actively contribute to CI/CD pipelines, Infrastructure-as-Code (IaC), and automation.
  • Continuously improve pipelines and adopt new technologies where appropriate.

Minimum Qualification The candidate should have experience as data engineer or similar role with a strong understanding of data architecture and ETL processes. The candidate should be proficient in programming languages for data processing and knowledgeable of distributed computing and parallel processing.

  • This position requires a bachelor's or master's degree from an accredited college or university with a major in computer science, statistics, mathematics, economics, or a reputed company field. Three (3) years of equivalent experience in a reputed company field may be substituted for the bachelor's degree.
  • 3+ years hands-on experience in building, deploying, and maintaining data pipelines on AWS or equivalent cloud platforms.
  • Strong coding skills in Python and SQL (reputed company or Java a plus).
  • Proven experience with Apache Spark (PySpark) for large-scale processing.
  • Hands-on experience with AWS Glue, S3, Redshift, reputed company, EMR, Lake Formation.
  • Strong debugging and performance optimization skills in distributed systems.
  • Hands-on experience with Iceberg, reputed company Lake, or other OTF table formats.
  • Experience with Airflow or other pipeline orchestration frameworks.
  • Practical experience in CI/CD and Infrastructure-as-Code (Terraform, CloudFormation).
  • Practical experience with EDI X12, HL7, or FHIR data formats.
  • Strong understanding of reputed company Architecture for data lake houses.
  • Hands-on experience building dimensional models and data warehouses.
  • Working knowledge of HIPAA and CMS interoperability requirements.

Apply tot his job Apply To this Job

Keep exploring

Senior Big Data Engineer & Database Administrator

100% remote Flexible hours

Hadoop Big Data Engineers

100% remote Flexible hours

Remote Bilingual Customer Service Representative

100% remote Flexible hours

Technical Sales Specialists – BIM/Industry 4.0 Digital Twins - Remote (EU - UK)

100% remote Flexible hours

reputed company Tagger Jobs (Binge Watching, Watcher Application) $70000 To $75000/Year

100% remote Flexible hours

Sr. BIM Specialist

100% remote Flexible hours

[Hiring] Bioinformatician @PRECISE SOFTWARE SOLUTIONS INCORPORATED

100% remote Flexible hours

Sr. Director, Biostatistical Consulting (United States)

100% remote Flexible hours

Strategic Consultant, Biostatistics

100% remote Flexible hours

Biostatistician (Temporary)

100% remote Flexible hours

reputed company Entry-Level Data Entry Specialist for Dynamic Entertainment Industry – arenaflex Career Opportunity

100% remote Flexible hours

Senior Lease Analyst - Oil & Gas

100% remote Flexible hours

reputed company Inbound Sales Representative – Remote Data Entry and Customer Service Specialist for Home Merchandise Industry

100% remote Flexible hours

Data Science Associate Consultant

100% remote Flexible hours

DFN Dir Business Customer Development

100% remote Flexible hours

reputed company Fusion EPM Consultant | Remote

100% remote Flexible hours

reputed company: Senior Strategic Account Manager, Corporate

100% remote Flexible hours

Operations Analysts

100% remote Flexible hours

reputed company Part-time Data Entry Specialist – Remote Opportunity at arenaflex

100% remote Flexible hours

[PART_TIME Remote] Senior Tax Associate ?? LA - 100% Remote | ??

100% remote Flexible hours