Sr. Resilience Engineer, Incident Management
- Remote, CA
- Employees can work remotely
We’re looking for a Senior Resilience Engineer to join Procore’s Incident Management and Resilience Engineering (IMRE) team within the Cloud Platform Engineering department. In this role, you’ll work directly with Engineering, Customer Success, and Product teams to help them better understand their technology, people, processes, and organization through the lens of incidents. You’ll drive the adoption of modern reliability practices like Service Level Objectives (SLOs), error budgets, fault-tolerant design patterns, incident retrospectives, chaos testing, and end-to-end ownership. If you’re interested in an exciting opportunity to have a significant impact on our internal systems—join us!
This position reports to the Manager, Cloud Platform Incident Management, and can be based at our Carpinteria, CA headquarters or at our New York City or Austin, TX offices. Remote candidates will be considered based on experience, with the expectation of occasional travel to these offices. We’re looking for someone to join our team immediately.
What you will do:
- Own Procore’s full incident response lifecycle, from defining and updating processes to coaching teams on incident management
- Evolve Procore toward a “learn and adapt” approach
- Drive post-incident investigations and analysis by conducting interviews, identifying contributing factors, and reviewing incident response
- Lead initiatives that improve processes and the customer experience
- Identify and promote the disciplines that help Procore evolve as a learning organization
- Stay up to date on topics like cognitive systems engineering, safety science, resilience engineering, UX research, human-computer interaction, organizational psychology, and cultural anthropology
What we are looking for:
- BS or MS degree; technical certifications are a plus
- 5+ years of combined experience as a Software Engineer and DevOps Engineer, with coding knowledge in an object-oriented language
- Strong experience documenting and driving process improvements
- Experience in engineering or operations, specifically participating in incidents
- Experience with version control systems, CI/CD, distributed applications, and service-oriented architectures
- Strong technical writing skills, code literacy, and cross-functional communication skills
- Experience working with observability and operations teams is preferred
If you'd like to stay in touch and be the first to hear about new roles at Procore, join our Talent Community.
Procore Technologies is building the software that builds the world. We provide cloud-based construction management software that helps clients more efficiently build skyscrapers, hospitals, retail centers, airports, housing complexes, and more. At Procore, we have worked hard to create and maintain a culture where you can own your work and are encouraged and given resources to try new ideas. Check us out on Glassdoor to see what others are saying about working at Procore.
We are an equal opportunity employer and welcome builders of all backgrounds. We thrive in a diverse, dynamic, and inclusive environment. We do not tolerate discrimination against employees on the basis of age, color, disability, gender, gender identity or expression, marital status, national origin, political affiliation, race, religion, sexual orientation, veteran status, or any other classification protected by law.
Perks & Benefits
You are a person with dreams, goals, and ambitions, both personal and professional. That's why we believe in providing benefits that not only match our Procore values (Openness, Optimism, and Ownership) but also enhance the lives of our team members. Here are just a few of our benefit offerings: generous paid vacation, employee stock purchase plan, enrichment and development programs, and friends and family events.