Senior QA Engineer – Performance & Reliability Job Description
- Full-time
- Department: Verification
- Compensation: USD100000 - USD160000 - yearly
Company Description
Axiado is an AI-enhanced security processor company redefining the control and management of every digital system. The company was founded in 2017, and currently has 60 employees. At Axiado, developing great technology takes more than talent: it takes amazing people who understand collaboration, respect each other, and go the extra mile to achieve exceptional results. It takes people who have the passion and desire to disrupt the status quo, deliver innovation, and change the world. If you have this type of passion, we invite you to apply for this job.
Job Description
Axiado is a top-tier manufacturer of TCU (Trusted Control/Compute Unit) solutions. We are hiring a Senior QA Engineer – Performance & Reliability to lead the performance characterization and reliability validation of our Secure TCU System, ensuring it meets rigorous data center standards.
In this role, you will own the test design, execution, and deep-dive analysis for performance and reliability, working closely with development teams to identify bottlenecks and resolve complex system-level issues.
Key Responsibilities
Performance & Reliability Strategy
Test Design & Execution: Design and execute comprehensive test plans for performance benchmarking, stress testing, longevity/endurance testing, and thermal/power characterization of TCU/BMC systems.
Workload Analysis: Analyze system behavior under various heavy workloads to identify performance bottlenecks in throughput, latency, and resource utilization (CPU, Memory, PCIe).
Reliability Validation: Conduct Mean Time Between Failures (MTBF) prediction, long-duration stability tests, and error injection campaigns to validate system robustness.
Deep Dive & Issue Resolution
Root Cause Analysis: Lead the deep-dive investigation of performance degradation and reliability failures. Use advanced debugging tools (oscilloscopes, logic analyzers, firmware traces) to isolate issues.
Developer Collaboration: Work directly with firmware and hardware engineers to reproduce complex bugs, analyze crash dumps, and verify fixes.
Infrastructure Enhancement: Develop and maintain automated performance testing frameworks and reporting dashboards to track regression and trends over time.
Reporting & Leadership
Reporting: Generate detailed performance assessment reports and reliability analysis metrics for stakeholders.
Mentorship: Mentor junior engineers on performance testing methodologies and system debugging techniques.
Qualifications
Experience: 5+ years of experience in embedded system testing, with a strong focus on performance verification and reliability engineering.
System Knowledge: Deep understanding of TCU, BMC, HMC, RoT (Root of Trust), Secure Boot, TPM, HSM, PCIe (Gen4/5), DDR memory, and networking protocols.
Performance Tools: Proficiency with performance profiling tools, traffic generators, and standard benchmarks (e.g., SPEC, IOzone, iperf). Experience with thermal and power measurement tools.
Programming: Strong scripting skills in Python for test automation and data analysis; familiarity with C/C++ for code analysis is a plus.
Operating Systems: Strong Linux/Unix skills, including kernel tuning, system monitoring, and log analysis.
Tools: Experience with CI/CD pipelines (Jenkins, GitLab CI) and version control (Git).
Education: BS/MS degree in Computer Science, Electrical Engineering, or a related field.
Ways to Stand Out
Experience with AI-driven log analysis or anomaly detection tools to predict reliability issues.
Background in validation of high-speed interfaces (PCIe, CXL) and memory subsystems (DDR5/LPDDR5).
Experience with data center server architecture and thermal management.
Knowledge of industry reliability standards (e.g., Telcordia, JEDEC).
Note: We do not expect candidates to meet every single requirement. A strong core of these skills with a problem-solving mindset is what we value most.
Additional Information
Additional Information
Axiado is committed to attracting, developing, and retaining the highest caliber talent in a diverse and multifaceted environment. We are headquartered in the heart of Silicon Valley, with access to the world's leading research, technology and talent.
We are building an exceptional team to secure every node on the internet. For us, solving real-world problems takes precedence over purely theoretical problems. As a result, we prefer individuals with persistence, intelligence and high curiosity over pedigree alone. Working hard and smart, continuous learning and mutual support are all part of who we are.
Axiado is an Equal Opportunity Employer. Axiado does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status or any other basis covered by appropriate law. All employment is decided on the basis of qualifications, merit, and business need.