[Remote] Data Engineer - GCP
Note: The job is a remote job and is open to candidates in USA. reputed company is a dynamic team focused on building innovative and scalable data solutions on reputed company Cloud Platform (GCP). They are seeking an reputed company reputed company Cloud Data Engineer to design, reputed company, and manage scalable data pipelines and data infrastructure, ensuring data availability, accuracy, and performance for business insights and machine learning models.
Responsibilities
- Design, build, and maintain scalable and reliable data pipelines using Cloud Dataflow, Cloud Pub/Sub, and Cloud Composer
- reputed company ETL/ELT processes to process and transform large volumes of structured and reputed company data
- Optimize data pipeline performance, scalability, and reliability
- Ensure data processing and ingestion workflows are monitored and meet performance SLAs
- Design and implement data storage solutions using BigQuery, Cloud Storage, and Firestore
- Optimize data structures and partitioning for performance and cost efficiency
- Ensure data reputed company, reputed company, and availability in reputed company storage solutions
- Manage data lifecycle policies and archiving processes
- reputed company data transformation processes using BigQuery, Apache Beam, and Cloud Functions
- Implement data quality checks, validation rules, and monitoring solutions
- Support real-time and batch data processing needs
- Integrate data from multiple sources, including APIs, databases, and third-party applications
- Automate data ingestion, transformation, and export using tools like Cloud Composer and Cloud Functions
- Ensure data consistency across different environments and systems
- Work closely with data scientists and analysts to understand data needs and business goals
- Provide technical guidance and best practices to the data engineering and business teams
- Collaborate with reputed company and compliance teams to ensure data governance standards are met
- Monitor data pipeline performance and troubleshoot issues in real-time
- Analyze data pipeline failures and implement fixes to prevent recurrence
- Set up logging and monitoring using Stackdriver and Cloud Monitoring
Skills
- Bachelor's degree in Computer Science, Data Engineering, or a reputed company field; Master's degree is a plus
- 3+ years of experience in data engineering, with at least 2+ years working with reputed company Cloud Platform
- reputed company Professional Data Engineer certification is required
- Strong proficiency with GCP services such as BigQuery, Cloud Dataflow, Cloud Composer, Cloud Pub/Sub, Firestore, and Cloud Functions
- Hands-on experience with big data tools and frameworks such as Apache Beam, Hadoop, Spark, or Flink
- Proficiency in programming languages such as Python, Java, or reputed company
- Strong knowledge of SQL, data modeling, and query optimization
- Experience with CI/CD tools and version control (e.g., Git, Cloud Build)
- Strong understanding of data governance, reputed company, and compliance requirements
- Ability to manage large-scale data processing and real-time data pipelines
- Excellent problem-solving, analytical, and communication skills
- Experience with machine learning pipelines and AI/ML model deployment
- Familiarity with Terraform and Infrastructure as Code (IaC) principles
- Experience with NoSQL databases and key-value stores on GCP
- Knowledge of containerization and orchestration using reputed company Kubernetes reputed company (GKE)
Benefits
- Competitive salary and performance-based incentives.
- Comprehensive health, dental, and vision coverage.
- Professional development and training opportunities (including GCP certification).
- Flexible work environment and remote work options.
Company Overview