Senior DevOps Engineer, OpenStack Team
- Full-time
Company Description
About Mirantis
Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure, and sovereign infrastructure for modern AI, machine learning, and data-intensive applications. By combining open source innovation with deep expertise in Kubernetes orchestration, Mirantis empowers platform engineering teams to deliver composable, production-ready developer platforms across any environment—on-premises, in the cloud, at the edge, or in sovereign data centers. As enterprises navigate the growing complexity of AI-driven workloads, Mirantis delivers the automation, GPU orchestration, and policy-driven control needed to manage infrastructure with confidence and agility. Committed to open standards and freedom from lock-in, Mirantis ensures that customers retain full control of their infrastructure strategy.
Job Description
We are looking for a Senior DevOps Engineer to join the MOSK (Mirantis OpenStack for Kubernetes) product team, working at the intersection of OpenStack, Kubernetes, and product engineering. In this role, you will help design, validate, and evolve a cloud-native OpenStack platform that enables customers to build and operate enterprise-grade private clouds at scale.
As a senior engineer, you will act as a technical authority on OpenStack and MOSK behavior, contribute to architectural decisions, support complex customer scenarios, and help ensure stable, high-quality product releases.
Key Responsibilities
Feature Design & Development
Contribute to Product Roadmap: Actively influence the MOSK roadmap and backlog by providing technical input on feature feasibility, architectural scalability, and long-term maintainability. Evaluate incoming customer requests for specific features by analyzing upstream OpenStack code, specifications, blueprints, and community documentation to assess feature maturity, stability, limitations, and operational caveats.
Lifecycle Management Engineering: Design, implement, and document lifecycle management for OpenStack on Kubernetes — a core MOSK capability — with emphasis on non-disruptive upgrades, patching, scaling, rollback strategies, and feature compatibility across upgrades.
Feature Validation & Integration: Drive validation of enabled features by defining test scenarios, validating upstream code behavior in simulated or production-like deployments, and then integrating feature support into MOSK’s lifecycle management.
Technical Design Reviews: Review and refine product designs, APIs, and implementations to ensure robustness, security, and a high-quality developer experience.
Engineering Leadership: Lead design discussions, planning sessions, and retrospectives; help the team to uphold high standards for software quality, architectural rigor, and operational readiness.
Product Engineering & Automation
Frameworks & Tooling Development: Design, implement, and maintain core frameworks, libraries, and internal tools that underpin MOSK development, testing, and lifecycle operations.
Advanced Automation: Build and evolve automation for deploying and managing complex OpenStack-on-Kubernetes environments used in development, testing, and validation.
CI/CD Architecture: Design, build, and maintain CI/CD pipelines and integrations with external systems to streamline build, test, validation, upgrade, and release workflows.
Operational Reliability
Tier-3 Escalation Support: Debug and resolve complex issues in customer production environments involving Kubernetes, OpenStack, networking, storage, and distributed systems.
Upstream Troubleshooting & Contributions: Investigate issues in OpenStack services, analyze root causes at the code level, and propose or contribute minor fixes upstream to improve product stability and supportability.
Performance & Scalability Analysis: Identify and analyze performance bottlenecks across hardware and software layers; provide fixes, optimizations, and tuning recommendations.
Operational Readiness: Manage and improve development, staging, and validation environments that closely mirror customer deployments to ensure release stability.
Product Knowledge Authority: Act as an authoritative technical source for MOSK and OpenStack behavior, providing accurate, engineering-level input to documentation, field, and enablement teams to ensure correctness and consistency of product documentation and guidance. Participate in technical reviews, demos, and knowledge-sharing sessions to support engineering and other customer-facing teams.
Qualifications
Experience & Education
At least 5 years of practical administration experience in Linux (CentOS, Ubuntu, RHEL) as a server platform.
Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent work experience.
Experience in software development, Systems Administration, or SRE role.
Technical Skills
Operating Systems: Deep understanding of the OS, ability to debug and tune underperforming systems, and troubleshoot software and hardware issues.
Networking: Strong knowledge of networking technologies, including protocols, firewalls, and VPNs. Experience with managing and configuring networking equipment.
Automation Tools: Practical experience with SaltStack, Puppet, Ansible, or Chef for medium and large environments.
Infrastructure as Code: Practical experience with Terraform.
Containerization: Be an expert in Docker and Kubernetes in production environments. Thorough understanding and practical experience with Kubernetes operators and Helm.
CI/CD: Strong experience with Jenkins and building dependable build/release pipelines.
Familiarity with code-review systems
Cloud Platforms: Understanding of principles of working in cloud environments such as OpenStack, AWS, GCP, or Azure.
Scripting & Programming: Strong scripting skills in Bash, Groovy, and Python (at least one is required). Mid-level experience with Go or C is valuable.
Preferred Qualifications (Will be a plus)
Deep understanding of OpenStack architecture and experience with running it in production.
Knowledge and experience with SDN (Software Defined Networking) solutions - Open Virtual Network, Open vSwitch.
Experience in managing and using Open Search, Grafana, and Prometheus stacks.
Contribution to open-source projects.
Soft Skills
English language proficiency at least at an intermediate level (spoken and written).
Good customer-facing communication skills and a sense of diplomacy.
Ability to work with geographically distributed international teams in a dynamic environment.
Analytical, troubleshooting, and problem-solving skills.
Additional Information
What does Mirantis offer you?
Work in a global, collaborative, remote-first culture that rewards initiative and execution.
Play a pivotal role in shaping the next era of cloud and AI modernisation.
Manage high-impact enterprise accounts with immediate opportunity for growth.
Work with exceptionally passionate, talented and engaging colleagues, helping Fortune 500 and Global 2000 customers implement next-generation cloud technologies.
Be a part of cutting-edge, open-source innovation.
Thrive in the high-energy environment of a young company where openness, collaboration, risk-taking, and continuous growth are valued.
Professional development and training.
Attend conferences and working groups.
Customized workstation (macOS, Windows).
Professional development and training.
Competitive compensation, performance incentives, and opportunities for advancement.
It is understood that Mirantis, Inc. may use automated decision-making technology (ADMT) for specific employment-related decisions. Opting out of ADMT use is requested for decisions about evaluation and review connected with the specific employment decision for the position applied for. You also have the right to appeal any decisions made by ADMT by sending your request to [email protected]
By submitting your resume, you consent to the processing and storage of your personal data in accordance with applicable data protection laws, for the purposes of considering your application for current and future job opportunities.
We are a Leader for Container Management in G2 (#2 after AWS)!