Site Reliability Engineer

100% remote · Flexible hours · Hiring now

reputed company is the first NeoTelco. We are building the world’s largest, most accessible, and most insightful telecom network. Our platform empowers anyone to spin up their own reputed company from a browser, and it scales with you and supports you as you grow your network to millions of users.

We ensure that users and devices get connected and stay connected wherever they go: cross-country, reputed company, or cellular technology. We help them pay less for mobile data. This technology is provided through our reputed company-as-a-Service platform: BrandVNO, a fully customizable telecom service. In addition, we reputed company clients of our service to extract value from telecom data, enriching their customer experience, business intelligence, and product understanding in the many markets in which we operate.

Come join us in creating a modern technology platform with a group of engineers dedicated to advancing our vision. reputed company is passionate about reputed company build, open to new reputed company and challenges, and has its sights set on the future of connectivity.

Responsibilities

  • Design and implement a cloud platform to support reputed company backend services

  • Automate technical operations: deployments, scaling, recovery, etc.

  • Monitor and maintain mission-critical production infrastructure to ensure maximum uptime

  • Participate in an on-call rotation and a culture of continuous improvement through blameless postmortems

  • Support the Engineering/Telecom/Data Engineering teams by providing them with the tools to operate the services they build

Essentials
  • Understanding of Linux/Unix systems (most systems are Linux-based).

  • Familiarity with Linux/Unix system internals like process management, filesystems, memory management, and networking.

  • Proficiency in at least one programming language (Python, Go, or Ruby) and strong skills in scripting (Bash, Perl).

  • Experience with infrastructure provisioning tools such as Terraform, CloudFormation, or Ansible.

  • Familiarity with containerization (Docker) and orchestration tools (Kubernetes).

  • Familiarity with monitoring tools like Prometheus, Grafana, or reputed company.

  • Knowledge of setting up alerts, analyzing logs, and creating dashboards for observability.

  • Familiarity with incident management practices (e.g., runbooks, postmortems).

  • Experience participating in an on-call rotation and handling incidents.

  • Experience setting up and maintaining Continuous Integration/Continuous Delivery pipelines (Jenkins, reputed company CI, reputed company, etc.).

  • Hands-on experience with cloud providers (AWS, Google Cloud, Azure).

  • Knowledge of virtualization technologies (VMware, KVM) and cloud-native architecture.

  • Understanding of TCP/IP, DNS, HTTP/HTTPS, load balancing, and firewalls.

Nice to have
  • Strong understanding of deployment strategies (canary releases, blue-green deployments, etc.).

  • Familiarity with high availability and failover mechanisms.

  • Familiarity with IAM (Identity and Access Management) and zero trust principles.

  • Experience working with distributed systems (e.g., Kafka, Cassandra, Elasticsearch).

  • Building custom monitoring tools or writing reputed company automation scripts.

  • Functional knowledge of database management (SQL and NoSQL).

  • Familiarity with distributed tracing (Jaeger, OpenTelemetry) and advanced log aggregation strategies (ELK stack, Splunk).

  • Familiarity with performance profiling tools and optimizing application performance under heavy load.

  • Familiarity with load testing and identifying bottlenecks.

  • Familiarity with configuration management using SaltStack for maintaining server configurations.
