SRE Engineer (Cloud Infrastructure)
- Full-time
- Time Type: Full time
Company Description
Engineered in Berlin. Driving the Future of Europe.
At AUTODOC, we are proud to be a homegrown European success story. Founded in Berlin in 2008 and still headquartered in the heart of the continent today, we have evolved from a local specialist into Europe’s leading online platform for the automotive aftermarket.
Our strength lies in our unique identity: we are a digital-first, e-commerce powerhouse with strong German roots and an expansive reach across 27 European countries. This continental focus allows us to blend our heritage with high-tech innovation, staying agile and closely connected to the markets we serve.
Today, our team of more than 5,500 professionals from over 50 different countries is redefining mobility through cutting-edge technology. Bring your talent and expertise to a company that is building a sustainable, tech-driven future for drivers everywhere.
Come join us and impact the future of AUTODOC!
Job Description
Our Cloud Infrastructure department consists of 4 specialized SRE teams, and we are currently looking for an engineer to join one of them. Each team is dedicated to a specific area, supporting its own group of services and development units.
In this role, you will be part of a team responsible for an area that supports over 200 services running in GCP/GKE. As an SRE, you will balance between maintaining high system availability for your domain and engineering new solutions to enhance our global infrastructure.
Responsibilities
- Service Ownership: Act as the primary point of contact for developers within your domain, handling service-related queries in chats and managing SRE-specific tasks.
- Infrastructure Evolution: Maintain and improve current cloud infrastructure, ensuring high availability and scalability.
- Embedded DevOps: Integrate SRE/DevOps best practices into the development lifecycle, from architecture planning to deployment.
- Innovation & PoC: Research, develop, and implement new infrastructure tools; conduct Proof of Concept (PoC) projects to drive technical excellence.
- Automation: Partner with the Automation team to build efficient CI/CD pipelines and custom automated workflows.
- Reliability & Metrics: Participate in developing quality metrics (SLIs/SLOs) and maintain comprehensive project documentation.
- On-call Support: Join the on-call rotation to ensure 24/7 stability of our mission-critical services.
Qualifications
- 3+ years as a SRE/DevOps Engineer.
- Proven experience with containerization and orchestration tools, Kubernetes is the must (GKE is preferred).
- Knowledge of SRE/DevOps methodologies, such as CI/CD, IaC, gitOps, etc.
- Knowledge of at least one tool from the gitOps approach (FluxCD is preferred).
- Experience in Cloud based infrastructures (GCP is preferred).
- Research and troubleshooting skills.
- Experience in administering and tuning relational and columnar databases, specifically PostgreSQL, MySQL, and ClickHouse.
- Experience in deployment and maintenance of distributed high-load systems.
- Experience in development of fault-tolerance mechanisms - clustering, replication, scaling approaches, etc.
- Configuration of monitoring solutions (Grafana, VictoriaMetric (operator) are preferred).
- Good scripting skills (bash / python are preferred).
Will be as a plus:
- Configuration of logging/tracing solutions (open telemetry stack, ViktoriaLogs, Grafana Loki, Grafana Tempo are preferred).
- Hands-on knowledge of maintaining and scaling Elasticsearch.
- Proficiency with message brokers and event-streaming platforms such as Kafka and RabbitMQ.
- Proficiency with GitlabCI.
- Proficiency in developing, maintaining, and refactoring complex Helm charts.
- Experience in migrating applications to Kubernetes.
- Deep understanding of Linux-like OS processes.
- Experience in implementing security controls in containerized environments.
- Boundless desire to automate any processes with an emphasis on improving security.
- Excellent communication skills
- Spoken English
Additional Information
What do we offer?
- Stable employment in the fast-growing international company
- International career in a multicultural environment with lots of opportunities to grow
- Annual vacation of 28 calendar days and 1 additional day off on your birthday
- Mental Wellbeing Program – providing you and your immediate family members with free and confidential mental and physical health support services for a wide range of personal and work-related issues
- Opportunities for advancement, further trainings (over 650 courses on soft and hard skills on our e-learning platform) and coaching
- Free English and German language classes
- Flexible working hours and hybrid work
Join us today and let’s create a success story together!