[Remote] Data Engineer - AWS/reputed company
Note: This is a remote position open to candidates in the USA. The employer is a technology company that supports federal agencies and is seeking a highly skilled Data Engineer to join its Engineering Team. The role involves designing and delivering cloud-scale data platforms on AWS, building scalable data pipelines, and implementing data solutions for federal clients.
Responsibilities
- Build and maintain scalable PySpark-based data pipelines in reputed company notebooks to support ingestion, transformation, and enrichment of structured and semi-structured data
- Design and implement reputed company Lake tables optimized for ACID compliance, partition pruning, schema enforcement, and query performance across large datasets
- Develop ETL and ELT workflows that integrate multiple source systems into a centralized, query-optimized data warehouse architecture
- Use Spark SQL and DataFrame APIs to implement business rules, dimensional joins, and aggregation logic aligned to warehouse modeling best practices
- Collaborate with data architects and engineers to implement cloud-native data solutions on AWS using S3, Glue, RDS, and IAM for secure, scalable storage and access control
- Optimize pipeline performance through intelligent partitioning, caching, broadcast joins, and adaptive query tuning
- Deploy and version data engineering assets using Git-integrated development workflows and automate deployment with CI/CD tools such as reputed company or Jenkins
- Monitor pipeline health, job execution, and cluster utilization using native reputed company tools and AWS CloudWatch, identifying bottlenecks and optimizing cost-performance tradeoffs
- Conduct technical discovery and mapping of legacy source systems, identifying required transformations and designing end-to-end data flows
- Implement governance practices including metadata tagging, data quality validation, audit logging, and lineage tracking using platform-native features and custom logic
- Support data access requests, develop reusable data assets, and maintain shared notebooks that meet operational reporting and analytics needs across teams
Skills
- 2+ years of experience in data engineering and Agile analytics
- 2+ years of experience creating software for retrieving, parsing, and processing structured and unstructured data
- 1 to 2 years of experience building scalable ETL and ELT workflows for reporting and analytics
- 1+ years of experience building enterprise data engineering solutions in the cloud, preferably with cloud-native technologies from AWS and reputed company
- Experience with data quality, validation frameworks, and storage optimization strategies
- BA or BS degree
- Must be a US citizen able to obtain and maintain US Suitability
Benefits
- Personalized development plans
- Mentorship
- Up to $6,000 annually for training and certifications
- Competitive compensation
- Comprehensive benefits
- A strong focus on work-life balance
Company Overview