Manager, Site Reliability Engineering - Engineering Effectiveness
- Seattle, WA, USA
- Employees can work remotely
Twitter serves the public conversation by encouraging people all over the world to connect, learn, debate and solve problems together. We believe conversation can change the world, and that’s why Tweeps (that’s what we call Twitter employees) come to work everyday.
Who We Are
Twitter’s Site Reliability Engineering (SRE) team focuses on the reliability, scalability, and performance of Twitter’s production environment. SREs are Software and Systems Engineers who specialize in large-scale distributed systems, low-level systems, and associated automation, tooling, and processes.
SREs drive operational excellence across Twitter by enabling partner teams across Product, Platform, and Infrastructure to up-level the health of their services in production, achieve desired reliability outcomes, and empowering them to self-serve in key areas of building, operating, and maintaining production services. This is achieved through strong partnerships and continuous collaboration with partner engineering teams.
We’re looking for a technical and industry-experienced engineering manager to join us and lead a team of talented SREs to partner with our Engineering Effectiveness product teams. The candidate must have experience building, operating, and driving reliability for production systems.
Engineering Effectiveness helps engineers at Twitter enjoyably iterate faster, from ideas and code, to shiping high-quality products. Our vision is to enable software development at the scale of Twitter, but with the ease and speed of a startup. We do this by supporting the entire software development lifecycle (SDLC), including core developer tools and environments, build tools, code review, source version control, CI/CD pipelines, education, and documentation.
What You’ll Do:
We believe passion and personality matter; as such, we need leaders that can manage diverse, smart, and driven engineers while balancing day to day people management with moving the business forward both technically and culturally.
As an Engineering Manager of SREs, you will lead a team of SREs who are working to keep Twitter reliable and scalable. Your responsibilities include, but are not limited to:
Drive cross-team and cross-org alignment around reliability initiatives between platform and site reliability engineering teams
Partner with other Engineering Managers across Twitter to achieve reliability outcomes for their services
Establish standard practices and processes for planning and prioritizing reliability work
Driving a culture of reliability, and ensuring teams are aligned around common priorities and approaches
Participate in deep technical design discussions within your team, and across partner teams, and ensure that we're building the right systems and keeping the quality high
Mentor, grow, and empower your team by giving them the skills, confidence and motivation to make decisions independently that lead to their personal and professional success, and enable them to become technical leaders.
Help the individuals on your team to build and execute personal development plans that align with Twitter’s goals and objectives, and understand how their work fits into the bigger picture.
Scale the team up by sourcing and hiring talented SREs both externally and internally.
Manage SREs in Twitter offices around the world and remotely
Take an active role in driving and evolving the roadmap for the SRE Org
- You have worked in organizations with established reliability engineering practices
You will bring a strong perspective and demonstrated track record of establishing collaborative partnerships to drive change
You have experience formulating a team's technical strategy and roadmap, and you've collaborated and partnered effectively with several other teams.
You have 3+ years of software engineering, reliability, and/or operations engineering experience in a highly customer-focused environment.
You have 1+ years experience successfully managing a distributed team of 5-8 engineers on large-scale projects that included technical deep-dives and production troubleshooting in the areas of: distributed systems, code, networking, storage, and operating systems.
You possess strong leadership skills and the ability to motivate teams.
You can provide a strong technical vision for systems and infrastructure teams.
You have knowledge of source code management, build, integration, and deployment systems such as Git and Jenkins.
You have successfully taken projects from inception to production, and are comfortable diving in to provide leadership for major projects when needed.
You have experience forming strong partnerships and continuous collaboration with engineering teams in Product and Infrastructure.
You are capable of leading a discussion with senior management, and are able to tailor the level of technical detail to suit your audience.
B.S. in Computer Science or equivalent experience.
We are committed to an inclusive and diverse Twitter. Twitter is an equal opportunity employer. We do not discriminate based on race, ethnicity, color, ancestry, national origin, religion, sex, sexual orientation, gender identity, age, disability, veteran, genetic information, marital status or any other legally protected status.
San Francisco applicants: Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
All your information will be kept confidential according to EEO guidelines.