Site Reliability Engineer, Monitoring and Control Engineering
- Full-time
- Business Segment: Operations & Technology
Company Description
NBCUniversal is one of the world's leading media and entertainment companies. We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to life through our theme parks and consumer experiences. We own and operate leading entertainment and news brands, including NBC, NBC News, MSNBC, CNBC, NBC Sports, Telemundo, NBC Local Stations, Bravo, USA Network, and Peacock, our premium ad-supported streaming service. We produce and distribute premier filmed entertainment and programming through Universal Filmed Entertainment Group and Universal Studio Group, and have world-renowned theme parks and attractions through Universal Destinations & Experiences. NBCUniversal is a subsidiary of Comcast Corporation.
Our impact is rooted in improving the communities where our employees, customers, and audiences live and work. We have a rich tradition of giving back and ensuring our employees have the opportunity to serve their communities. We champion an inclusive culture and strive to attract and develop a talented workforce to create and deliver a wide range of content reflecting our world.
Comcast NBCUniversal has announced its intent to create a new publicly traded company ('Versant') comprised of most of NBCUniversal's cable television networks, including USA Network, CNBC, MSNBC, Oxygen, E!, SYFY and Golf Channel along with complementary digital assets Fandango, Rotten Tomatoes, GolfNow, GolfPass, and SportsEngine. The well-capitalized company will have significant scale as a pure-play set of assets anchored by leading news, sports and entertainment content. The spin-off is expected to be completed during 2025.
Job Description
NBCU is looking for creative engineers willing to learn from the current process but are not afraid to think outside of the box. This role is responsible for the engineering, operations, support, deployment and maintenance of core Distribution Engineering Monitoring and Control systems, both on-premises and cloud.
· Utilize scripting and automation to develop, customize and enhance monitoring/alerting tools for “on-air” environments
· Interact with automated monitoring infrastructure to ensure healthy environments
· Create system dashboards that improve system availability and reliability
· Query data stores to quantify the scope of reported issues
· Create new metrics and identify monitoring deliverables to improve site reliability
· Act as a Level 2 resource, drive and own investigations related to Broadcast issues and report back findings in a timely manner to leadership and operations.
· This role requires on-call 24/7 support on a rotating shift schedule
· Follow up with team members & 3rd party vendors if issues found cannot be solved and drive vendors for root cause and solutions if possible.
· Create comprehensive documentation outlining the intricacies of encountered issue, elucidating the root cause and steps for effective issue resolution.
· Administer monitoring and control systems within the “on-air” environments
· Develop proof of concept deployments for evaluation of products and architectures
· Utilize modern frameworks and scripting languages to develop products and services for NBCU's IP video distribution environment
Qualifications
REQUIREMENTS:
· Bachelor’s degree in computer science or related degree
· Experience with IP video and broadcast technologies
· 3-5+ yrs experience with monitoring and alerting tools i.e. Grafana, Splunk, ELK Stack, Dataminer
· Ability to develop end-to-end monitoring dashboards, alerts and reports for enterprise level environments
· 3-5 years of SRE experience in the technology sector supporting and maintaining production-quality software or software-defined infrastructure in a high traffic environment run in a cloud environments (AWS preferred)
· Ability to collect data from various systems using COTS APIs
· Experience with scripting languages and tools i.e C#, Python, Bash
· Experience with modern frontend technologies like Vite, React, NodeJS, Typescript
· Experience with configuration management technology i.e. Ansible, Salt, and/or Chef
· Experience with public cloud platforms such as AWS, GCP or Azure
· Experience with networking and cloud-based network environments
· Experience with containerization Docker & Kubernetes
· Experience with CI/CD build (Github Actions), deployment practices, and Infrastructure as Code (Terraform)
· Experience in administrating Linux and Windows environments
· Ability to use Agile process for project management, development & tracking
· Comfortable working in a fast-paced agile environment. Requirements change quickly and our team needs to adapt to moving targets.
PREFERRED QUALIFICATIONS:
· Experience with a variety of software and hardware operating environments
· Experience in troubleshooting complex technical issues
· Experience with SMPTE standards and implementation
· Experience with PTP implementation
· Good communicator and able to clearly articulate complex issues and technologies
· Great design and problem-solving skills
· Willing to take ownership of problems and see them through to resolution
· Experience with DevSecOps principles
· Ability to create user interface designs based on client workflows
· Ability to intake project requirements from Operational partners and work with vendors to meet their needs
Fully Remote: This position has been designated as fully remote, meaning that the position is expected to contribute from a non-NBCUniversal worksite, most commonly an employee’s residence.
This position is eligible for company-sponsored benefits, including medical, dental, and vision insurance, 401(k), paid leave, tuition reimbursement, and various other discounts and perks. For a comprehensive overview of the benefits offered by NBCUniversal, please visit the Benefits page on the Careers website.
Salary Range: $110,000 - $145,000
We are accepting applications on an ongoing basis.
#LI-remote
Additional Information
As part of our selection process, external candidates may be required to attend an in-person interview with an NBCUniversal employee at one of our locations prior to a hiring decision. NBCUniversal's policy is to provide equal employment opportunities to all applicants and employees without regard to race, color, religion, creed, gender, gender identity or expression, age, national origin or ancestry, citizenship, disability, sexual orientation, marital status, pregnancy, veteran status, membership in the uniformed services, genetic information, or any other basis protected by applicable law.
If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access nbcunicareers.com as a result of your disability. You can request reasonable accommodations by emailing [email protected].
For LA County and City Residents Only: NBCUniversal will consider for employment qualified applicants with criminal histories, or arrest or conviction records, in a manner consistent with relevant legal requirements, including the City of Los Angeles' Fair Chance Initiative For Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, where applicable.