Engineering Manager - Cortex Platform Infrastructure
- San Francisco, CA, USA
- Employees can work remotely
Twitter is what’s happening and what people are talking about right now. For us, life's not about a job, it's about purpose. We believe real change starts with conversation. Here, your voice matters. Come as you are and together we'll do what's right (not what's easy) to serve the public conversation.
Who we are
The ML Infrastructure team’s mission is to provide Twitter’s ML Engineers with the orchestration tools and compute capacity to reliably run state-of-the-art ML experiments. We provide this capability through managed Kubeflow clusters built on top of Google Kubernetes Engine, running in Google Cloud Platform. We partner with our sister teams within Cortex Platform to provide an end-to-end ML experimentation platform experience.
Cortex Platform empowers internal teams to efficiently leverage ML by providing a platform and by unifying, educating, and advancing the state of the art in ML technologies within Twitter. We win when our customers win by helping our users stay informed, share and discuss what matters; by serving the public conversation. We’re building an AI-first company and every major initiative is increasingly dependent on the successful application of machine learning. Cortex is at the nexus of this evolution.
We are building one of the strongest machine learning platforms in the world by marrying the latest ML industry practices with engineering excellence and the need to perform at Twitter scale. Our customers are all the ML engineers at Twitter. Our goal is to provide a unified tooling ecosystem that lets these engineers focus on what they are good at, building ML models with novel approaches, while the platform abstracts away the complexities of bringing these models into a production environment.
What success looks like
We look for someone who can lead a team of diverse, smart, and driven engineers distributed across multiple locations. We need leaders who take an active role in shaping the future of Twitter engineering while embodying our principles. A successful engineering manager will:
Own your team’s deliverables and ensure we develop scalable, highly-available infrastructure that enables product engineers to experiment and rapidly iterate on products that delight our users and customers.
Take responsibility for the team’s short-term and long-term strategy. Define the team's roadmap, success metrics (OKRs), and priorities in close collaboration with product management, other engineering teams and multi-functional partners.
Maintain a balance between building sustainable, high-impact projects and shipping things quickly.
Lead a team of talented machine learning software engineers who like to ship code and tackle hard engineering problems.
Build cohesive, high-functioning teams that thrive in a culture of trust, respect, and inclusion.
Balance autonomy with guidance by giving your team the tools, context, confidence, and motivation to make decisions effectively and independently.
Have the technical capacity to partner with tech leads and be comfortable diving into the fray to help drive resolution in the case of incidents.
We care deeply about:
Engineering excellence such as good design abstractions, API stability, unit testing, leading best practices for other engineers to follow, and solid documentation.
Staying abreast of, and compatible with, a quickly shifting technology landscape for ML platform components and related open source solutions.
Creating the best ML Platform environment for Twitter that provides an exceptional developer experience for our engineering customers.
Encouraging engineering creativity and innovative solutions.
Our current projects include:
Establishing Kubeflow as a managed offering at Twitter
Enabling and sustaining GCP Infra/Platform components for broader use in Cortex Platform, e.g. AI Platform, Dataflow, Dataproc, etc.
Improving operations of essential ML Platform services
ML Training Service
Continuous model training and deployment
Who you are
You have a technical background with experience in building ML Platform tools and infrastructure and are able to contribute to planning and design discussions. You have a collaborative working style with a strong focus on disciplined execution and results.
You are familiar with capacity planning for machine learning workloads at a large scale.
You have experience leading engineers up to staff level and you’re skilled at coaching and guiding your directs through their career development.
You have led projects with a complicated stakeholder space and can build organizational alignment.
You believe in building both teams and infrastructure that scale.
You have a customer- and impact-focused mindset in building software.
You like to ground decisions in data and reasoning and solve root causes of problems rather than surface issues.
You ensure the team fully understands the goals and objectives of the company and how their work fits into the bigger picture.
You seek diverse perspectives to drive bottom-up innovation and create buy-in from all technical stakeholders inside and outside the team.
6+ years of software engineering experience.
Experience leading platform software teams; ML Platform teams a plus.
2 or more years of experience managing a team of 4 or more engineers building complex production-grade software systems.
B.S., M.S., or Ph.D. in computer science or machine learning.
Experience driving impact through cross-functional leadership.
Exceptional written and verbal communication skills.
Understanding of the costs of software development, tech debt, and long-term maintenance.
You are passionate about technology and can keep up with the latest trends.
Experience working with distributed teams across multiple time zones.
All your information will be kept confidential according to EEO guidelines. We are committed to an inclusive and diverse Twitter. Twitter is an equal opportunity employer. We do not discriminate based on race, color, ethnicity, ancestry, national origin, religion, sex, gender, gender identity, gender expression, sexual orientation, age, disability, veteran status, genetic information, marital status or any legally protected status.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Notice (Colorado Equal Pay for Equal Work Act)
The expected salary range for this role to be performed in Colorado is USD$167,000.00 - USD$234,000.00. Starting pay for the successful applicant will depend on a variety of job-related factors, which may include education, training, experience, location, business needs, or market demands. This range may be modified in the future.
This job is also eligible for participation in Twitter’s Performance Bonus Plan and Equity Incentive Plan subject to the terms of the applicable plans and policies.
Twitter offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, a 401(k) program with employer match, and generous time off for vacation, sick time, and parental leave. Twitter's benefits prioritize employee wellness and progressive support for our diverse workforce.