Senior Software Engineer - Site Reliability

  • Full-time

Company Description

Job Title: Site Reliability Engineer (SRE)
Job Type: Full-time
Level: IC2 (Senior Software Engineer)

About the Role

As a Site Reliability Engineer, you will ensure the availability, scalability, and performance of mission-critical database systems. This role requires a blend of software engineering, infrastructure expertise, and database administration to enhance the reliability and efficiency of our data platforms. You will collaborate closely with application developers, SREs, and platform teams to automate operations, optimize performance, and implement best practices for database resilience in cloud-native environments.

  •  

Job Description

Key Responsibilities

  • Database Performance & Reliability: Optimize and fine-tune databases to ensure high availability, scalability, and efficiency.

  • Infrastructure as Code & Automation: Automate database provisioning, scaling, and maintenance using Terraform, Ansible, or Kubernetes operators.

  • Kubernetes & Containerized Databases: Deploy and manage databases in Kubernetes environments using operators, StatefulSets, and Helm.

  • Incident Response & Root Cause Analysis: Proactively monitor, troubleshoot, and resolve database-related incidents to minimize downtime.

  • Capacity Planning & Observability: Implement robust monitoring and alerting for database health and performance using Prometheus, Grafana, or Datadog.

  • Security & Compliance: Enforce database security best practices, access controls, and ensure compliance with industry standards.

  • Disaster Recovery & High Availability: Design and maintain backup, replication, clustering, and failover strategies.

  • Cross-functional Collaboration: Work closely with engineering, SRE, and platform teams to improve database reliability and service SLAs.

Qualifications

Technical Skills & Experience

  • Hands-on experience with relational (MySQL, PostgreSQL, SQL Server) and/or NoSQL (Cassandra, MongoDB, DynamoDB) databases.

  • Experience deploying and managing databases in cloud-native environments (AWS, GCP, Azure) with tools like Vitess, KubeDB, or CrunchyData.

  • Proficiency in Python, Go, or Bash for infrastructure automation.

  • Experience with Terraform, Ansible, or Helm for managing database infrastructure.

  • Strong experience with Prometheus, Grafana, Datadog, or New Relic for monitoring distributed systems and databases.

Additional Information

At Freshworks, we are creating a global workplace that enables everyone to find their true potential, purpose, and passion irrespective of their background, gender, race, sexual orientation, religion and ethnicity. We are committed to providing equal opportunity for all and believe that diversity in the workplace creates a more vibrant, richer work environment that advances the goals of our employees, communities and the business.

Privacy Policy