Senior Software Engineer - Site Reliability
- Full-time
Company Description
Job Title: Site Reliability Engineer (SRE)
Job Type: Full-time
Level: IC2 (Senior Software Engineer)
About the Role
As a Site Reliability Engineer, you will ensure the availability, scalability, and performance of mission-critical database systems. This role requires a blend of software engineering, infrastructure expertise, and database administration to enhance the reliability and efficiency of our data platforms. You will collaborate closely with application developers, SREs, and platform teams to automate operations, optimize performance, and implement best practices for database resilience in cloud-native environments.
Job Description
Key Responsibilities
Database Performance & Reliability: Optimize and fine-tune databases to ensure high availability, scalability, and efficiency.
Infrastructure as Code & Automation: Automate database provisioning, scaling, and maintenance using Terraform, Ansible, or Kubernetes operators.
Kubernetes & Containerized Databases: Deploy and manage databases in Kubernetes environments using operators, StatefulSets, and Helm.
Incident Response & Root Cause Analysis: Proactively monitor, troubleshoot, and resolve database-related incidents to minimize downtime.
Capacity Planning & Observability: Implement robust monitoring and alerting for database health and performance using Prometheus, Grafana, or Datadog.
Security & Compliance: Enforce database security best practices, access controls, and ensure compliance with industry standards.
Disaster Recovery & High Availability: Design and maintain backup, replication, clustering, and failover strategies.
Cross-functional Collaboration: Work closely with engineering, SRE, and platform teams to improve database reliability and service SLAs.
Qualifications
Technical Skills & Experience
Hands-on experience with relational (MySQL, PostgreSQL, SQL Server) and/or NoSQL (Cassandra, MongoDB, DynamoDB) databases.
Experience deploying and managing databases in cloud-native environments (AWS, GCP, Azure) with tools like Vitess, KubeDB, or CrunchyData.
Proficiency in Python, Go, or Bash for infrastructure automation.
Experience with Terraform, Ansible, or Helm for managing database infrastructure.
Strong experience with Prometheus, Grafana, Datadog, or New Relic for monitoring distributed systems and databases.
Additional Information
At Freshworks, we are creating a global workplace that enables everyone to find their true potential, purpose, and passion irrespective of their background, gender, race, sexual orientation, religion and ethnicity. We are committed to providing equal opportunity for all and believe that diversity in the workplace creates a more vibrant, richer work environment that advances the goals of our employees, communities and the business.