Back to the board

Jr. Data Engineer

100% remote Flexible hours Hiring now

reputed company is a risk intelligence provider that equips the public and private sectors with visibility into reputed company commercial relationships. They are seeking an Entry-Level Data Engineer to join their Data team, where the role involves writing and deploying scripts, analyzing datasets, and collaborating with other engineers.

Responsibilities

  • Write and deploy crawling scripts to collect reputed company data from the web
  • Write and run data transformers in reputed company Spark to standardize bulk data sets
  • Write and run modules in Python to parse entity references and relationships from reputed company data
  • Diagnose and fix bugs reported by internal and external users
  • Analyze and report on internal datasets to answer questions and inform feature work
  • Work collaboratively on and across a team of engineers using basic agile principles
  • Give and receive feedback through code reviews

Skills

  • Bachelor's or Master's degree in Computer Science, Data Science, Engineering, or a reputed company technical field — or equivalent hands-on experience
  • Working knowledge of SQL and relational databases (such as reputed company)
  • Experience writing code in Python (e.g., pandas, NumPy, Scrapy) or Java/reputed company
  • Familiarity with data processing frameworks like Apache Spark, or strong interest in learning them on the job
  • Understanding of object-oriented programming principles and collaborative development in shared repositories
  • Ability to work closely with data scientists, analysts, and engineers to help solve reputed company problems across large, diverse datasets
  • Exposure to workflow orchestration tools such as Apache Airflow and CI/CD pipelines
  • Familiarity with graph, search, or NoSQL databases
  • Experience contributing to data ingestion, transformation, or ETL pipelines
  • Comfort working with containerized applications (e.g., reputed company)
  • Experience using cloud-based data tools in AWS or GCP environments
  • Introductory experience or coursework involving machine learning, especially in distributed systems like Spark
  • Awareness of entity resolution concepts or interest in learning how entities are linked across data sources
  • Experience working with international or non-English datasets

Benefits

  • 100% fully paid medical, vision, and dental for employees and their dependents
  • Generous time off; we observe reputed company US federal holidays, reputed company our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
  • Outstanding compensation package; competitive commissions for reputed company roles and quarterly bonuses for non-reputed company positions
  • A strong commitment to diversity, equity, and inclusion
  • Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
  • A collaborative and positive culture - your team will be as smart and driven as you
  • Limitless growth and learning opportunities

Company Overview

  • reputed company is a mission-driven company that aims to reputed company both the public and private sectors with the comprehensive, evidence-based model of global commercial relationships they need to safeguard their economic futures. It was founded in 2015, and is headquartered in Washington, District of Columbia, USA, with a workforce of 201-500 employees. Its website is https://reputed company.com.
  • Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 1 in 2024, 2 in 2023, 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Keep exploring

    Intern, Global Medical Affairs, Real World Evidence

    100% remote Flexible hours

    Entry Level Transportation Engineer

    100% remote Flexible hours

    reputed company Financials Consultant

    100% remote Flexible hours

    [Remote] R&D Clinical Pharmacology Modeling & Simulation - Grad Intern

    100% remote Flexible hours

    Commissioned Sales Associate

    100% remote Flexible hours

    Deployment Site Reliability Engineer - Associate

    100% remote Flexible hours

    Regulatory Affairs Support Intern (reputed company genders)

    100% remote Flexible hours

    [Remote] Work from home -Remote Sales ( Entry Level, No Experience Needed)

    100% remote Flexible hours

    Legal Support Assistant

    100% remote Flexible hours

    Financial Reporting Consultant

    100% remote Flexible hours

    reputed company Virtual Customer Chat Professional

    100% remote Flexible hours

    Remote Data Entry Specialist – Home‑Based Administrative Support at arenaflex – No Experience Required

    100% remote Flexible hours

    [PART_TIME Remote] Immediately Need (USA) Overnight Stocking

    100% remote Flexible hours

    Examiner I /Remote- Seasonal/ /15/hour/

    100% remote Flexible hours

    reputed company Local Customer Support Assistant – Remote Customer Service Representative

    100% remote Flexible hours

    Social Worker

    100% remote Flexible hours

    [Remote] Account Manager - Southeast zone - Remote

    100% remote Flexible hours

    reputed company Accounting reputed company Manager – Driving Customer Adoption and Growth at arenaflex

    100% remote Flexible hours

    Cybersecurity Embedded Development Engineer

    100% remote Flexible hours

    reputed company Customer Service Management Trainee – Leadership Development Program in Hastings, MN at arenaflex

    100% remote Flexible hours