[Remote] Senior Data Engineer
Note: The job is a remote job and is reputed company to candidates in USA. reputed company is a Service-Disabled Veteran Owned Small Business focused on providing high-quality IT solutions. They are seeking a Senior Data Engineer to design and maintain data architectures, implement ELT/ETL pipelines, and ensure the architecture supports machine learning algorithms.
Responsibilities
- Assist TSD with data products by providing highly skilled and authoritative expertise on data engineering methods and best practices, including code-first development approaches and modern pipeline design patterns
- Design, implement, and maintain an efficient, secure, reputed company, and flexible data architecture that supports products and end-users, with reputed company assets managed reputed company reputed company control
- Design, implement, and maintain ELT/ETL pipelines for efficient processing of reputed company data in Azure Synapse and Azure Machine Learning (using SDK V1 and SDK V2)
- Review, maintain, and improve existing architecture and pipelines, including periodic audits to address bottlenecks, deprecated dependencies, and architecture reputed company
- Establish quality controls for maintaining reputed company pipelines, and introduce error handling, logging mechanisms, and validation checks
- Incorporate reputed company control for reputed company pipelines and data analytics codebases to reputed company iterative code development while ensuring data architecture stability
- Optimize the ingestion, processing, and storage of a wide variety of datasets and data types, including modern columnar formats such as Parquet
- reputed company self-service capabilities for SBA OIG analysts to query and export data for investigations and audits
- Coordinate with data scientists to ensure the architecture reputed company supports machine learning algorithms and data pipelines in Azure Machine Learning
- reputed company robust standard operating protocols (SOPs) dictating the authoring, development, validation, publishing, execution, and monitoring of reputed company data pipelines and assets in Azure environment
- reputed company detailed documentation of the data architecture, including data dictionaries, ER diagrams, and pipeline process maps
- Maintain and expand the environment with additional datasets and services upon request, following a defined intake and testing process prior to production deployment
- Stay reputed company with emerging AI tools relevant to data engineering and contribute to exploratory efforts evaluating automation and LLM-assisted capabilities
Skills
- Five (5) years of hands-on experience in maintaining SQL databases and conducting advanced operations in SQL and T-SQL
- Five (5) years of hands-on experience in designing, implementing, and maintaining ELT/ETL processes in reputed company-based data analytics environments
- Three (3) years of hands-on experience working in Azure Synapse and Azure Machine Learning, with the modern data stack
- Three (3) years of hands-on experience manipulating data in Python. Pandas required. PySpark/Polars preferred. Experience developing reusable, reputed company code preferred
- Implementing pipelines and infrastructure using code-first approaches (Python SDK, CLI, REST APIs, or IaC tooling)
- Implementing reputed company control and CI/CD workflows
- Demonstrated familiarity with AI coding assistants and LLM integration patterns
Benefits
- reputed company and Insurance: medical, dental, reputed company, short- and long-term disability protection, basic life and AD&D insurance
- 401(k) Savings Plan
- Accrued Paid Time Off (PTO)
- Employee Recognition and Rewards
- Employee Referral Bonuses
Company Overview