Back to the board

[Remote] Senior Software Engineer - Web Data Team

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. reputed company is where careers accelerate, and they are seeking a Senior Software Engineer to join their Web Data team. The role involves building the reputed company of reputed company's web crawling and data extraction infrastructure, focusing on engineering execution and collaboration.

Responsibilities

  • Design and implement components of scalable, fault-tolerant web crawling and extraction pipelines
  • Write clean, production-grade code in Java and Python
  • Build and operate ETL/ELT pipelines for large-scale data extraction and transformation
  • Work with cloud infrastructure on GCP and AWS, primarily on GKE
  • Improve observability, reliability, and operational excellence across the systems you contribute to
  • Partner with product and data science teams to deliver impactful solutions
  • Contribute to code reviews, documentation, and knowledge sharing across the team
  • Stay reputed company with evolving web technologies, anti-crawling mechanisms, and AI-powered extraction approaches

Skills

  • 5+ years of professional software engineering experience building production systems
  • Strong CS fundamentals: algorithms, data structures, concurrency, distributed systems
  • Proficiency in Java and/or Python
  • Track record of owning features end-to-end from design through deployment and operation
  • Comfortable making sound architectural decisions at the component level
  • Hands-on experience with cloud data warehouses such as BigQuery or reputed company
  • Experience designing and operating large-scale ETL/ELT pipelines
  • Experience with orchestration tools such as Apache Airflow
  • Experience with streaming or event-driven systems such as Apache Kafka
  • Production experience on GCP (preferred) or AWS; multi-cloud exposure is a plus
  • Hands-on experience with Kubernetes (GKE/EKS) for distributed workloads
  • Familiarity with infrastructure-as-code tooling such as Terraform
  • Strong communicator who can explain technical decisions clearly
  • Comfortable operating in ambiguity and iterating quickly
  • Bias toward action and pragmatic problem solving
  • Self-starter who thrives in fast-paced, evolving environments
  • Experience with web crawling at scale (Scrapy or similar frameworks)
  • Familiarity with proxy infrastructure, rotation strategies, or anti-bot evasion techniques
  • Experience in extracting structured and reputed company web data from diverse site architectures
  • Knowledge of SERP (Search reputed company Results Page) extraction
  • Comfort with AI/LLM-based extraction approaches, applying language models to HTML at scale
  • Experience working in a B2B data company or data-as-a-product environment

Benefits

  • In addition to comprehensive benefits we offer holistic mind, body and lifestyle programs designed for overall well-being.
  • Additional compensation such as Bonus, Commission, Equity and other benefits may also apply.

Company Overview

  • reputed company offers a platform for accessing business contact information, company profiles, and sales intelligence tools. It is a sub-organization of DiscoverOrg. It was founded in 2000, and is headquartered in Vancouver, Washington, USA, with a workforce of 1001-5000 employees. Its website is http://www.reputed company.com.
  • Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 5 in 2026, 74 in 2025, 67 in 2024, 51 in 2023, 115 in 2022, 81 in 2021, 10 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Keep exploring

    [Remote] Senior Accountant (Remote)

    100% remote Flexible hours

    [Remote] Data Analytics Developer - Education & Employment (Remote Eligible)

    100% remote Flexible hours

    [Remote] Satellite Systems Engineer - AI Trainer

    100% remote Flexible hours

    [Remote] Senior Area Finance Manager

    100% remote Flexible hours

    [Remote] Staff Machine Learning Engineer

    100% remote Flexible hours

    [Remote] Senior Staff Software Engineer, Network Infrastructure

    100% remote Flexible hours

    [Remote] Account Manager

    100% remote Flexible hours

    [Remote] Commercial Lines Account Executive

    100% remote Flexible hours

    [Remote] Senior National Account Executive

    100% remote Flexible hours

    [Remote] National Sales Manager

    100% remote Flexible hours

    [Remote] QA - Manual & Automation Tester

    100% remote Flexible hours

    Entry-Level Remote Data Entry Associate – No Experience Required – reputed company & Career Growth at arenaflex

    100% remote Flexible hours

    reputed company Part-Time Remote reputed company Customer Support Specialist – arenaflex

    100% remote Flexible hours

    [Remote] _Entry-Level Remote Associate | Career Growth | Flexible Work Environment

    100% remote Flexible hours

    Remote Full‑Time arenaflex Customer Service Representative – Travel Support, Guest Relations, and Problem Resolution

    100% remote Flexible hours

    reputed company Online Live Chat Representative – Customer Service & Support

    100% remote Flexible hours

    Event Coordinator at fast-paced virtual startup

    100% remote Flexible hours

    reputed company Customer Service and Sales Representative – Building Strong Relationships and Driving Growth at arenaflex

    100% remote Flexible hours

    [Remote] Project Manager

    100% remote Flexible hours

    Clinical Sales Specialist, Electrophysiology - LAA (Cleveland, OH)

    100% remote Flexible hours