Staff Product Manager, Compute for ML Training

  • Full-time

Company Description

Twitter is what’s happening and what people are talking about right now. For us, life's not about a job, it's about purpose. We believe real change starts with conversation. Here, your voice matters. Come as you are and together we'll do what's right (not what's easy) to serve the public conversation.

 

Our company's purpose is to serve the public conversation. The Platform organization builds the cloud infrastructure which helps serve that conversation by enabling us to deliver a reliable service to all our customers across the world at high quality. We operate a distributed system in a hybrid cloud environment through our own data centers (on-prem), GCP (Google Cloud Platform), and AWS (Amazon Web Services) (https://techcrunch.com/2020/12/15/twitter-taps-aws-for-its-latest-foray-into-the-public-cloud/). This includes bare metal and containerized services for messaging, graph storage, database clusters, etc.

 

Job Description

Build the foundation for rapid machine learning (ML) experiments. 

The Platform Product team is a critical team in Twitter’s Core Tech organization that provides direction and a product lens on the internal tools, platforms, as well as Twitter’s compute, network, and storage infrastructures on premises and on public clouds. Within the Platform Product team, the Compute for Data and ML Product team leads the creation and operation of a managed compute platform as a service that enables Twitter’s ML practitioners to experiment, prototype and take to production various ML capabilities that make Twitter, Twitter. Our mission is to provide a scalable, reliable, performant, efficient and secure orchestration platform that abstracts our users from the complexity of infrastructure management on hybrid- and multi- clouds.

 

We are looking for a Staff Product Manager to build the future of Compute Platform for ML experiments and training. Your customers are Twitter’s researchers, data scientists, data engineers and ML engineers running millions of data analytics, ML experiments and model training jobs. In partnership with the Cortex Data Platform and ML Platform teams, you will be responsible for delivering products and features that drive infrastructure flexibility, performance, efficiency, and manageability for ML experiments and training. You will

  • Build a deep understanding of our customers and their needs.

  • Define and track metrics to measure product success and business impact.

  • Dive deep on technical requirements, evaluating trade-offs and drive roadmap prioritization. 

  • Collaborate with cross-functional teams to plan and implement our product roadmap. 

  • Grow adoption of our managed ML infrastructure across the company. 

  • Communicate product status and impact to customers, engineering partners and leadership.

 

A few other things we value:

 

Challenge - We solve some of the industry’s hardest problems. Come to be challenged, learn, and thrive as an engineer.

 

Diversity - Diversity makes us a better organization and team. We value diverse backgrounds, ideas, and experiences.

 

Work, Life, Balance - We work hard, but we believe with hard work should come balance.

 

Qualifications

 

  • 10+ years of software engineering, solution architect or technical product or program management experience, in which at least 5 years spent in a product management role.

  • 3+ years experience shipping AI/ML infrastructure products in production at large scale. 

  • Experienced in products built on large-scale distributed systems in both public clouds and on-prem data centers.  

  • Experienced in products that optimize the performance and efficiency of AI/ML infrastructures.  

  • Solid understanding of data and machine learning workload characteristics, end to end workflow, and ecosystem.  

  • Solid understanding of Kubernetes, Kubeflow, GCP, AWS, as well as data center infrastructure considerations for CPU, GPU and other AI accelerators. 

  • MBA or Master/PhD degree in Computer Science, Computer Engineering or related fields. 

Additional Information

All your information will be kept confidential according to EEO guidelines.

Notice (Colorado Equal Pay for Equal Work Act)

The expected salary range for this role to be performed in Colorado is USD$191,000.00 - USD$267,000.00. Starting pay for the successful applicant will depend on a variety of job-related factors, which may include education, training, experience, location, business needs, or market demands. This range may be modified in the future.

This job is also eligible for participation in Twitter’s Performance Bonus Plan and Equity Incentive Plan subject to the terms of the applicable plans and policies.

Twitter offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, sick time, and parental leave. Twitter's benefits prioritize employee wellness and progressive support to our diverse workforce.

Privacy Policy