Back to the board

[Remote] Data Engineer, Web Scraping

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. 10a Labs is a company focused on safety and threat-intelligence for AI systems, collaborating with prominent technology platforms and companies. The Data Engineer role involves designing and optimizing data pipelines, conducting web scraping, and collaborating with teams to develop actionable insights and tools.

Responsibilities

  • Design, implement, and optimize end-to-end data pipelines for scraping and processing structured and unstructured data using Google Cloud Platform (or similar) and best practices
  • Conduct ad hoc web scraping and data collection to support research and intelligence initiatives
  • Prepare data for further analysis, including data cleaning, transformation, anonymization, and masking
  • Contribute to the development of internal and external APIs, following best practices
  • Collaborate with ML engineers, other data engineers, and software developers to deliver actionable insights and functional tools, including internal and external dashboards, APIs, and data dumps; and
  • Drive other critical initiatives

Skills

  • Degree (or equivalent work experience) in Computer Science, Engineering, Information Science, Data Science or a related field (graduate degree preferred)
  • 2+ years of professional experience in data engineering or a closely related field
  • Ability to communicate complex technical ideas clearly to non-technical audiences
  • Proficiency in Python, SQL
  • Experience with web scraping/crawling (e.g., Beautiful Soup, Selenium, Scrapy)
  • Experience with Google Cloud Platform (or similar), including storage and database services (e.g., Cloud Storage, CloudSQL, Cloud Spanner) and workflow orchestration (e.g., Cloud Composer/Airflow, Cloud Run, Pub/Sub)
  • Experience building and managing data pipelines, especially for text data
  • Comfort working in fast-moving, high-impact environments, such as startups, AI research labs, or security-focused teams

Benefits

  • Performance-based annual bonus
  • Support for conferences, continuing education, or leadership training
  • Fully remote, U.S.-based
  • Comprehensive health, dental, and vision coverage
  • Generous PTO and paid holiday schedule
  • 401(k) plan

Company Overview

  • 10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and leading global technology platforms. It was founded in 2021, and is headquartered in , with a workforce of 11-50 employees. Its website is https://10alabs.com/.
  • Apply To This Job

    Keep exploring

    [Remote] Senior Legal Operations Specialist (Contract)

    100% remote Flexible hours

    [Remote] Remote Legal Recruiter

    100% remote Flexible hours

    [Remote] Sr. Fraud Data Analyst (Remote USA)

    100% remote Flexible hours

    [Remote] Online | Entry Level | Customer Support Coordinator | Hotels

    100% remote Flexible hours

    [Remote] Field Account Manager

    100% remote Flexible hours

    [Remote] Regional Business Development Manager-BA Healthcare (Southeast region)

    100% remote Flexible hours

    [Remote] Senior Software Engineer, AI Product Insights

    100% remote Flexible hours

    [Remote] Senior Account Executive (Public Relations)

    100% remote Flexible hours

    [Remote] Data Analyst, Data Analytics

    100% remote Flexible hours

    [Remote] Senior Full-Stack Software Engineer, Clinical Intelligence

    100% remote Flexible hours

    Senior Project Manager, Publishing Operations job at Princeton University Press in Princeton, NJ

    100% remote Flexible hours

    Salesforce Administrator

    100% remote Flexible hours

    Document Specialist - ROMANIA/UK/EU (CP08Ti620)

    100% remote Flexible hours

    Experienced Customer Success Associate - Live Chat Support Specialist (Entry Level / No Prior Experience Required)

    100% remote Flexible hours

    Golang Developer Web3 Experiance

    100% remote Flexible hours

    Politics & Government Specialist – Freelance AI Trainer Project

    100% remote Flexible hours

    Medical Writer II – Healthcare Copywriter (contract)

    100% remote Flexible hours

    Nurse Educator – Clinical Development Nurse I (Case Management ) Seattle/Bellevue/Everett/Renton

    100% remote Flexible hours

    Associate Strategy Analyst

    100% remote Flexible hours

    Associate Attorney – Insurance Defense (Trucking Defense & Commercial Auto) Dallas, TX

    100% remote Flexible hours