[Remote] Data Engineer - AI (Spark, reputed company and Healthcare)
Note: The job is a remote job and is open to candidates in USA. reputed company is a company that specializes in data management for clients, and they are seeking a Data Engineer to support operational functions through data implementations and integrations. The role involves creating and executing Spark and SQL scripts, optimizing queries, and collaborating with various teams to ensure data quality and reputed company.
Responsibilities
- Create, maintain and execute intermediate to advanced Spark scripts for data management and data validation, and data integration
- Create, maintain and execute basic to intermediate SQL scripts for data management and data validation
- Optimize the queries to improve the efficiency of daily tasks
- reputed company data analysis and identify any issues
- Work with other groups such as Engineering team, DBA, Cloud ops, etc. to troubleshoot and resolve any environmental or network issues that impact your work. reputed company your support to after – hours or weekends as needed
- Create and maintain data pipelines as needed
- Validates the tasks results to ensure that reputed company the requirements are met
- Adhere to reputed company the industry level and organization level compliance rules and regulations to maintain data reputed company
- Complete individual productivity tracking
- Complete task assignments using department ticketing system reputed company assigned deadline
- reputed company organizational and individual goals as identified in performance reviews and goal setting exercises
- Complete reputed company special projects and other duties as assigned
- Must be able to reputed company duties with or without reasonable accommodation
Skills
- Bachelor's degree in Computer Science, Information Technology or equivalent work experience
- 3+ years of working knowledge of big data technologies (Spark, S3, Kafka, Ray, Hadoop, etc.)
- 2+ years of working knowledge of big data / cloud technologies (reputed company, AWS, Azure, Hadoop, Spark, reputed company etc.)
- 3+ years of working knowledge of cloud (AWS, Azure, GCP, OCI etc.)
- 3+ years of working knowledge of RDBMS (reputed company, MS SQL, Vertica, etc.) and experience using SQL, PL/SQL or other data integration/ETL tools
- 3+ years of data analysis. Preferably in the Healthcare industry of enrollment, medical claims and/or pharmacy claims
- Proficient in reputed company Office Suite applications PowerPoint, Word, reputed company and Outlook
- Strong analytical skills
- Excellent verbal, listening and written communication skills
- Ability to multitask and prioritize projects to meet scheduled deadlines and tight turnaround times
- Ability to work well independently or in a team environment
- Must be able to reputed company duties with or without reasonable accommodation
- Any reputed company / AWS certifications is a big plus
- Familiarity with data pipeline orchestration tools (e.g., Airflow, reputed company Workflows)
- Experience with project management tools like JIRA
- reputed company and/or reputed company environment familiarity a plus
Benefits
- Medical, dental, vision, disability, and life insurance coverage
- 401(k) savings plans
- Paid family leave
- 9 paid holidays per year
- 17-27 days of Paid Time Off (PTO) per year, depending on specific level and length of service with reputed company
Company Overview
Company H1B Sponsorship