[Remote] Senior Site Reliability Engineer, Observability

100% remote, flexible hours, hiring now

Note: This is a remote position open to candidates in the USA. Chainlink is the industry-standard platform powering decentralized finance (DeFi). As a Senior Site Reliability Engineer focused on Observability, you will enhance the reliability and performance of the company's observability infrastructure while supporting engineering teams in troubleshooting and deploying new products.

Responsibilities

  • Build and orchestrate a modern OTEL-based observability platform
  • Support multiple telemetry types, such as metrics, logs, and traces
  • Define and support modern observability governance and solve observability problems at scale
  • Ensure reliability and performance exceed our defined SLAs
  • Work with engineers from across the company to help troubleshoot issues, deploy new products and services, and increase velocity while decreasing cognitive load
  • Design and deploy monitoring/observability services that detect issues and alert the team when action is needed
  • Ingest, aggregate, transform, and utilize data from a multitude of sources in our real-time data pipeline
  • Enhance the availability, performance, and supportability of our observability infrastructure
  • Create processes around alert response operations and support the team to ensure the reliable delivery of data
  • Provide recommendations to ensure sufficient metrics are collected to create alerts with every new feature release
  • Champion reliability by taking the time to do your work right the first time

Skills

  • 7+ years of relevant professional experience. You have probably worked on a DevOps, infrastructure, SRE, and/or platform team before
  • Ability to develop software beyond the scope of typical infrastructure requirements and configurations
  • Experience programming in C, C++, Java, Python, Go, Perl, or Ruby
  • Expert knowledge of designing, developing, and managing large real-time systems
  • Experience with monitoring and logging. You know how to export metrics using Prometheus, have built a Grafana dashboard or two, and have experience with a centralized logging solution like the ELK Stack, Splunk, or the Grafana stack
  • Experience with distributed systems and container orchestration. You have maintained or even built Kubernetes clusters before and feel comfortable deploying completely new services on them
  • Strong communication skills. You can give and receive constructive feedback, and you do not shy away from planning meetings and code reviews
  • Excitement for blockchain, Web 3.0, and similar decentralized technologies
  • Experience running infrastructure in the blockchain space
  • Ability to scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Experience working remotely in a distributed team
  • A strong desire to grow and challenge yourself. We would expect you to constantly find ways to improve and automate services to reduce toil

Company Overview

  • Chainlink Labs provides open-source blockchain solutions and specializes in the development and integration of Chainlink. It was founded in 2014 and is headquartered in San Francisco, California, USA, with a workforce of 501-1000 employees. Its website is https://chainlinklabs.com/.