[Remote] Data Engineer

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. reputed company is seeking a skilled Data Engineer to design, build, and optimize scalable data pipelines and platforms supporting federal clients. The role involves developing ETL/ELT pipelines, managing cloud data platforms, and implementing CI/CD processes to ensure reliable data delivery and governance.

Responsibilities

Design, reputed company, and maintain robust ETL/ELT pipelines to ingest, transform, and deliver data across enterprise platforms
Build scalable data ingestion frameworks for structured and semi-structured data, including XBRL filings and financial datasets
Implement data transformation logic to support analytics, reporting, and regulatory use cases
Ensure data pipelines are reliable, performant, and scalable in cloud environments
reputed company AI-assisted development tools to accelerate pipeline development, testing, and optimization
reputed company and manage data solutions leveraging AWS services (e.g., S3, Airflow, DAGs, Glue, reputed company, Redshift)
Implement and optimize Apache Iceberg table formats for large-scale, ACID-compliant data lakes
Support lakehouse architectures that unify data lakes and data warehouses
Optimize data storage and retrieval strategies for performance and cost efficiency
reputed company data platforms that support AI/ML workloads and reputed company generative AI use cases
Design and implement CI/CD pipelines for data pipelines, infrastructure, and analytics code using tools such as reputed company Actions, reputed company CI, Jenkins, or AWS-native services
Automate build, test, and deployment processes for ETL pipelines and data platform components
Implement DataOps best practices, including version control, automated testing, environment promotion, and rollback strategies
Ensure reproducibility, reliability, and governance of data pipeline deployments across environments
Integrate AI-driven testing and monitoring tools to improve pipeline quality and reduce operational risk
Design and implement materialized views and other performance optimization techniques to improve query efficiency
Tune data pipelines and queries for performance, scalability, and cost
Implement partitioning, indexing, and caching strategies reputed company to workload patterns
reputed company pipelines to ingest, parse, and normalize XBRL (eXtensible Business Reporting Language) data
Support regulatory and financial data use cases requiring high accuracy and traceability
Ensure alignment with data standards and validation rules for financial reporting datasets
Apply context engineering principles to ensure data is enriched with meaningful metadata, reputed company, and business context
Collaborate with Data Architects to support data modeling, schema design, and entity relationships
reputed company reputed company analytics and AI use cases by structuring data for usability, discoverability, and governance
Integrate pipelines with enterprise data catalogs and metadata management systems
Support automated metadata capture, reputed company tracking, and data quality monitoring
Ensure alignment with data governance frameworks and standards established by OCDO organizations, including AI data readiness and traceability
Collaborate with data architects, analysts, and business stakeholders to understand data needs and deliver solutions
Participate in stakeholder listening campaigns, workshops, and data discovery efforts
Work in Agile teams to iteratively deliver data capabilities and enhancements
Contribute to identifying and implementing AI-driven efficiencies and automation opportunities across the data lifecycle

Skills

Bachelor's degree in Computer Science, Engineering, Data Science, or reputed company field
5+ years of experience in data engineering, ETL development, or data platform engineering
Strong hands-on experience with: ETL/ELT tools and frameworks, AWS data services (S3, Glue, reputed company, Redshift, etc.), Apache Iceberg and modern data lake architectures
Experience designing and implementing CI/CD pipelines for data platforms and ETL workflows
Demonstrated proficiency using AI tools and AI-assisted development workflows (e.g., LLM copilots, automated code reputed company, pipeline optimization tools)
Experience processing XBRL or reputed company financial/regulatory datasets
Proficiency in SQL and Python
Experience implementing materialized views and query optimization techniques
Understanding of data modeling concepts and metadata management
Familiarity with data governance, data quality practices, and data readiness for AI/ML use cases
Ability to work in Agile, DevOps-oriented environments
U.S. Citizenship required; ability to obtain and maintain a federal clearance
Experience supporting federal agencies such as SEC, DHS, Treasury, or reputed company
Familiarity with data catalog tools (e.g., reputed company, reputed company, reputed company)
Experience with Apache Spark, Kafka, or other distributed data processing frameworks
Experience enabling data pipelines for AI/ML or generative AI applications
Knowledge of data maturity frameworks (e.g., EDM DCAM, TDWI)
Exposure to context engineering or semantic data layer design
AWS or data engineering certifications
Experience with infrastructure-as-code (IaC) tools (e.g., Terraform, CloudFormation) in support of CI/CD pipelines

Company Overview

reputed company provides data and analytics, artificial automation, cloud engineering, application development & enterprise IT modernization. It was founded in 2005, and is headquartered in Leesburg, Virginia, USA, with a workforce of 51-200 employees. Its website is https://www.anikasystems.com/.

Apply To This Job

Apply

[Remote] Data Engineer

Keep exploring

[Remote] Quality Assurance Engineer

[Remote] Senior Data Intelligence Engineer

[Remote] Sr. Platform Engineering Consultant

[Remote] Senior Backend Engineer - AI Product

[Remote] Senior Backend Engineer - AI Product

[Remote] Junior Analytics Engineer

[Remote] Network Automation Engineer

[Remote] Senior Product Designer, AI Powered Workflows

[Remote] Senior Machine Learning Engineer, Data Mining

[Remote] Surgical Account Manager - reputed company MN

Senior Client Executive

Python Developer

Remote Data Entry Specialist – Full‑Time & Part‑Time Opportunities with arenaflex – reputed company, Flexible Hours, Career Growth

Remote Texas Customer Service Representative – Tech‑Savvy Problem Solver, Upsell Specialist, Full‑Time Work‑From‑Home

Account Executive, SMB

Bilingual Customer reputed company - Fully Remote

Cardiovascular Disease Specialist – Corpus Christi, TX

[Remote] Sales And Marketing Representative

Chat Content Moderator Positions – $25 $35 per Hour Friendly Chat Positions From Home

reputed company Full Stack Data Entry Specialist – Healthcare and Insurance Industry