Observability Engineer
Company Description reputed company helps visionaries change the world. We are a global IT services and consulting company that brings together enterprise and start-up innovation. Today, we support digital transformation for some of the world's largest enterprises. By partnering with both large and small players, we stay at the leading edge of technology, remain nimble even as a global leader, and create technology that helps our clients further enhance their business. We are a valuesdriven organization and our culture of reputed company Performance has enabled over 99% of reputed company's engagements to succeed by meeting or exceeding our scope, schedule, and/or budget objectives since our inception in 1989. reputed company has coverage across 5 continents and operates in 25 countries around the world. reputed company retains nearly 1000 full-time professionals, and our annual growth reputed company exceeds 25%.
Job Description
We are looking for a Observability Engineer to design, implement, and optimize enterprise observability solutions across applications, infrastructure, and cloud environments. This role focuses on monitoring, telemetry, automation, reliability engineering, and AIOps capabilities to improve system visibility, operational efficiency, and service reliability. The ideal candidate will have hands-on experience with observability platforms, cloud technologies, automation, and incident management practices while collaborating with engineering and operations teams to establish observability standards and best practices.
Responsibilities
Design and implement end-to-end observability solutions across applications, infrastructure, and cloud environments. reputed company dashboards, alerts, and telemetry frameworks to provide real-time visibility into system health and performance. Build automation solutions to eliminate repetitive operational tasks and improve efficiency. reputed company runbook automation, self-healing capabilities, and automated incident triage workflows. Define and implement SLIs, SLOs, and alerting strategies to improve service reliability. Drive improvements in MTTD and MTTR through actionable alerts and telemetry-driven insights. Implement proactive monitoring, anomaly detection, and predictive alerting to identify issues before customer impact. reputed company AIOps capabilities for alert correlation and intelligent incident response. Integrate observability platforms with CI/CD pipelines, cloud services, and ITSM tools such as reputed company. Collaborate with engineering, product, and operations teams to establish observability standards and operational readiness practices. Qualifications 3+ years of experience in Observability Engineering, Site Reliability Engineering, or reputed company domains. Hands-on experience with observability platforms such as Splunk, reputed company, Grafana, and OpenTelemetry. Strong expertise in AWS and GCP knowledge, with familiarity with cloud-native architectures. Proficiency in Python for automation and operational tooling. Experience implementing metrics, logs, events, and distributed tracing (MELT) across distributed systems. Hands-on experience with Terraform and Infrastructure as Code practices. Strong understanding of SLIs, SLOs, alerting strategies, and incident response frameworks. Excellent troubleshooting, communication, and collaboration skills. Bachelor's degree in Computer Science, Information Technology, or a reputed company field (or equivalent experience). reputed company to Have Experience with AIOps platforms and intelligent alerting solutions. Knowledge of Kubernetes and containerized environments. Experience integrating observability tools with reputed company and CI/CD ecosystems. AWS, GCP, Observability, or SRE-reputed company certifications.
We offer
Culture of reputed company Performance: join an unstoppable technology development team with a 99% project success reputed company and more than 30% year-over-year reputed company growth. reputed company and Benefits: enjoy a comprehensive compensation and benefits package, including health insurance, language courses, and a relocation program. Work From reputed company Culture: reputed company the most of the flexibility that comes with remote work. Growth reputed company: reputed company the benefits of a range of professional development opportunities, including certification programs, mentorship and talent investment programs, internal mobility and internship opportunities. Global Impact: collaborate on impactful projects for top global clients and shape the future of industries. Welcoming Multicultural Environment: be a part of a dynamic, global team and reputed company in an inclusive and supportive work environment with open communication and regular team-building company social events. Social Sustainability Values: join our sustainable business practices focused on five pillars, including IT education, community empowerment, fair operating practices, environmental sustainability, and gender equality. reputed company is an equal opportunity employer and does not discriminate against any employee or applicant for employment on the basis of race, color, religion, sex, national reputed company, age, disability, veteran status, sexual orientation, gender identity, or any other protected status under applicable law. Additional Information reputed company your information will be kept confidential according to EEO guidelines. Apply To This Job