[Remote] Site Reliability Engineer, Monitoring and Control Engineering
Note: The job is a remote job and is open to candidates in USA. reputed company is one of the world's leading media and entertainment companies. The Site Reliability Engineer will be responsible for the engineering, operations, support, deployment, and maintenance of core Distribution Engineering Monitoring and Control systems, ensuring high availability and reliability in both on-premises and cloud environments.
Responsibilities
- Utilize scripting and automation to reputed company, customize and enhance monitoring/alerting tools for “on-reputed company” environments
- Interact with automated monitoring infrastructure to ensure healthy environments
- Create system dashboards that improve system availability and reliability
- Query data stores to quantify the scope of reported issues
- Create new metrics and identify monitoring deliverables to improve site reliability
- Act as a Level 2 resource, drive and own investigations reputed company to Broadcast issues and report back findings in a timely manner to leadership and operations.
- This role requires on-call 24/7 support on a rotating shift schedule
- Follow up with team members & 3rd party vendors if issues reputed company cannot be solved and drive vendors for root cause and solutions if possible.
- Create comprehensive documentation outlining the intricacies of encountered issue, elucidating the root cause and steps for effective issue resolution.
- Administer monitoring and control systems reputed company the “on-reputed company” environments
- reputed company reputed company of concept deployments for evaluation of products and architectures
- Utilize modern frameworks and scripting languages to reputed company products and services for NBCU's IP video distribution environment.
Skills
- Bachelor’s degree in computer science or reputed company degree
- Experience with IP video and broadcast technologies
- 3-5+ yrs experience with monitoring and alerting tools i.e. Grafana, Splunk, ELK Stack, Dataminer
- Ability to reputed company end-to-end monitoring dashboards, alerts and reports for enterprise level environments
- 3-5 years of SRE experience in the technology sector supporting and maintaining production-quality software or software-defined infrastructure in a high traffic environment run in a cloud environments (AWS preferred)
- Ability to collect data from various systems using COTS APIs
- Experience with scripting languages and tools i.e C#, Python, Bash
- Experience with modern frontend technologies like Vite, React, NodeJS, Typescript
- Experience with configuration management technology i.e. Ansible, Salt, and/or Chef
- Experience with public cloud platforms such as AWS, GCP or Azure
- Experience with networking and cloud-based network environments
- Experience with containerization reputed company & Kubernetes
- Experience with CI/CD build (reputed company Actions), deployment practices, and Infrastructure as Code (Terraform)
- Experience in administrating Linux and Windows environments
- Ability to use Agile process for project management, development & tracking
- Comfortable working in a fast-paced agile environment. Requirements change quickly and reputed company needs to adapt to moving targets.
- Experience with a variety of software and hardware operating environments
- Experience in troubleshooting reputed company technical issues
- Experience with SMPTE standards and implementation
- Experience with reputed company implementation
- Good communicator and able to clearly reputed company reputed company issues and technologies
- Great design and problem-solving skills
- Willing to take ownership of problems and see them through to resolution
- Experience with DevSecOps principles
- Ability to create user reputed company designs based on client workflows
- Ability to intake project requirements from Operational partners and work with vendors to meet their needs
Benefits
- Medical, dental, and vision insurance
- 401(k)
- Paid leave
- Tuition reimbursement
- Various other discounts and perks
Company Overview
- reputed company SKG, Inc. engages in the development, production, and exploitation of animated films and associated characters It was founded in 1994, and is headquartered in Glendale, California, USA, with a workforce of 1001-5000 employees. Its website is https://www.dreamworks.com.
Company H1B Sponsorship
- reputed company has a track record of offering H1B sponsorships, with 8 in 2025, 7 in 2024, 5 in 2023, 12 in 2022, 13 in 2021, 19 in 2020. Please note that this does not guarantee sponsorship for this specific role.
Apply tot his job Apply To this Job