Sr. Staff Software Engineer, Systems Infrastructure
- Full-time
- Workplace Type: Hybrid
- Career Track & Grade: IC5/10
- Department: Engineering
Company Description
LinkedIn is the world’s largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover exciting opportunities, build necessary skills, and gain valuable insights every day. We’re also committed to providing transformational opportunities for our own employees by investing in their growth. We aspire to create a culture that’s built on trust, care, inclusion, and fun – where everyone can succeed. Join us to transform the way the world works.
Job Description
This role will be based in Sunnyvale or Mountain View, CA.
At LinkedIn, our approach to flexible work is centered on trust and optimized for culture, connection, clarity, and the evolving needs of our business. The work location of this role is hybrid, meaning it will be performed both from home and from a LinkedIn office on select days, as determined by the business needs of the team.
LinkedIn’s AI Infrastructure organization is responsible for building the foundational platforms that power AI across LinkedIn. The LLM Serving team builds the critical infrastructure that enables efficient, reliable, and large-scale deployment of large language models and other advanced AI models in production.
This team sits at the center of LinkedIn’s AI platform, owning the layer between model training and production serving. The work focuses on making large-scale models run faster, cheaper, and more efficiently on GPUs at LinkedIn scale. The team builds and extends high-performance serving infrastructure and contributes to leading open-source technologies such as SGLang, vLLM, and related model serving frameworks.
We are looking for a Senior Staff Software Engineer with deep expertise at the intersection of systems, machine learning, GPU infrastructure, and large-scale inference. This is a highly technical, high-leverage role for someone who enjoys going deep into how models interact with runtimes, compilers, and hardware, and who wants to drive meaningful improvements in performance, cost, latency, and scalability across LinkedIn’s AI systems.
Responsibilities
Lead the design, development, and optimization of LinkedIn’s large-scale LLM serving infrastructure
Drive performance improvements across AI inference systems, including latency, throughput, GPU utilization, and cost efficiency
Build and scale online and offline inference systems for LLMs and other AI models
Optimize model execution across the full stack, including model architecture, runtime, compiler, kernel, and hardware layers
Drive model optimization techniques such as quantization, pruning, compression, batching, and memory optimization
Improve GPU efficiency through low-level systems work, including kernel-level optimization, runtime tuning, and hardware-aware performance improvements
Partner closely with ML, infrastructure, and product teams to identify serving bottlenecks and improve end-to-end model performance
Contribute to and/or extend open-source LLM serving frameworks such as SGLang, vLLM, Triton, or similar technologies
Set technical direction for model serving, inference performance, and next-generation AI infrastructure design
Mentor engineers and influence technical strategy across AI Infrastructure
Qualifications
Basic Qualifications
BA/BS degree in Computer Science or related technical field, or equivalent practical experience
8+ years of experience in software engineering, distributed systems, infrastructure, or machine learning systems
Experience building or optimizing large-scale production ML systems, model serving platforms, or AI infrastructure
Experience with GPU-based systems, CUDA, kernel optimization, or hardware-aware performance tuning
Experience with large-scale inference systems, including latency, throughput, reliability, and cost optimization
Experience with deep learning frameworks such as PyTorch, TensorFlow, or similar
Experience programming in one or more systems languages such as C++, Go, Python, or Java
Preferred Qualifications
Deep experience with LLM serving infrastructure, AI inference platforms, or large-scale model deployment systems
Familiarity with or contributions to open-source serving frameworks such as vLLM, SGLang, Triton, TensorRT, Ray, or similar technologies
Experience with ML compilers, runtimes, or graph optimization frameworks such as XLA, TVM, TensorRT, Triton, or similar
An understanding of model optimization techniques such as quantization, pruning, compression, batching, caching, and memory optimization
Experience improving GPU utilization and cost/performance efficiency for large-scale ML workloads
Experience building high-performance online or offline inference pipelines
An understanding of distributed systems, scheduling, resource management, and large-scale infrastructure operations
Experience operating across the stack from model-level optimization to runtime, compiler, kernel, and hardware-level performance improvements
Experience influencing technical direction across teams and partnering effectively with ML researchers, infrastructure engineers, and product teams
Suggested Skills
AI/ML Systems and Infrastructure
GPU and Performance Optimization
Model Serving and Inference Systems
Distributed Systems
Technical Leadership
LinkedIn is committed to fair and equitable compensation practices.
The pay range for this role is $198,000 to $326,000. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to skill set, depth of experience, certifications, and specific work location. This may be different in other locations due to differences in the cost of labor.
The total compensation package for this position may also include annual performance bonus, stock, benefits and/or other applicable incentive compensation plans. For more information, visit https://careers.linkedin.com/benefits.
Additional Information
Equal Opportunity Statement
We seek candidates with a wide range of perspectives and backgrounds and we are proud to be an equal opportunity employer. LinkedIn considers qualified applicants without regard to race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other legally protected class.
LinkedIn is committed to offering an inclusive and accessible experience for all job seekers, including individuals with disabilities. Our goal is to foster an inclusive and accessible workplace where everyone has the opportunity to be successful.
If you need a Reasonable Accommodation to search for a job opening, apply for a position, or participate in the interview process, connect with us and describe the specific Accommodation requested for a disability-related limitation.
Fill out an Accommodation request here: https://app.smartsheet.com/b/form/b660a0327d044969abfd7a4e73d15c36
Reasonable accommodations are modifications or adjustments to the application or hiring process that would enable you to fully participate in that process. Examples of reasonable accommodations include but are not limited to:
- Documents in alternate formats or read aloud to you
- Having interviews in an accessible location
- Being accompanied by a service dog
- Having a sign language interpreter present for the interview
A request for an accommodation will be responded to within three business days. However, non-disability related requests, such as following up on an application, will not receive a response.
LinkedIn will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure is (a) in response to a formal complaint or charge, (b) in furtherance of an investigation, proceeding, hearing, or action, including an investigation conducted by LinkedIn, or (c) consistent with LinkedIn's legal duty to furnish information.
San Francisco Fair Chance Ordinance
Pursuant to the San Francisco Fair Chance Ordinance, LinkedIn will consider for employment qualified applicants with arrest and conviction records.
Pay Transparency Policy Statement
As a federal contractor, LinkedIn follows the Pay Transparency and non-discrimination provisions described at this link: https://lnkd.in/paytransparency.
Global Data Privacy Notice and Compliance Posters for Job Candidates
Please use this link to access documents that provide information about how LinkedIn handles the personal data of employees and job applicants, as well as the E-Verify Participation Notice and the Department of Justice Immigrant and Employee Rights Section Right to Work posters: https://www.linkedin.com/legal/candidate-portal.
By clicking the link above or any third-party link within this posting, you are leaving this site and going to a third-party website where the third-party website's terms and privacy policy apply