Senior Observability Platform Engineer - Kubernetes, Prometheus Stack, ELK (All genders)
- Full-time
- Department: Product & Tech
Company Description
Our mission started more than 20 years ago, when we dedicated ourselves to creating moments of happiness between pets and pet parents across 30 European countries. Today, we stand strong as Europe’s leading online pet platform, delivering moments of happiness to more than 9 million pet parents each 🐶🐈
Job Description
We are looking for a Senior Platform Engineer with a deep passion for IT observability who excels at handling vast datasets and navigating intricate IT infrastructures. In this role, you will work at the forefront of cloud-native and open-source technologies. Your contributions will be instrumental in continuously improving the performance of our e-commerce infrastructure and keeping it running around the clock.
Our Platform Engineering team plays a pivotal role in managing the Container Orchestration Platform and the Observability Platform, which are closely interlinked. These components are critical parts of the Core Platform Stack, serving our entire IT department.
Leveraging our well-established DevOps and Infrastructure as Code approach, we are committed to advancing our platform, exploring new avenues, and providing essential guidance to our technical users and DevOps teams.
You will be responsible for:
Work closely with our Container Orchestration Platform to maintain and improve Observability on our production systems.
Improve our central Logging Platform based on the Elastic (ELK) Stack.
Design and improve additional services for Monitoring & Alerting, Tracing and APM.
Drive automation to help our teams to improve their time-to-market.
Provide solutions by making use of open-source tools or implementing your own.
Support our DevOps teams in using our products and services in the most efficient way.
Ensure 24/7 availability of our platform services and take part in incident response as part of our occasional platform on-call duty.
Qualifications
At least 3 years of Observability Platform Engineering experience.
University degree in Computer Science or equivalent.
Systematic problem-solving abilities coupled with distinct communication and collaboration skills.
Extensive knowledge in several of the following areas:
Kubernetes: deploying and managing workloads as well as using Helm charts and creating your own
Terraform: infrastructure provisioning
Elasticsearch: managing clusters at scale, configuring index templates for efficient storage and search, handling index lifecycles, etc.
Prometheus: configuring metric scraping and storage, setting up Alertmanager notification routing and notification receivers, etc.
Log processing: log normalization and enrichment
Log and Time Series Data visualization: analyzing data and creating versatile dashboards using Kibana and Grafana
Thanos: centralized long-term metric storage of a Kubernetes multi-cluster setup
Jaeger: distributed tracing
Be familiar with Git, CI/CD tools and pipelines.
Have coding skills, preferably in Python and Bash.
Have a good understanding of network infrastructure and services such as DNS, DHCP, firewalls and load balancing.
Be able to debug and optimize code and automate routine tasks.
Engage in and improve the whole lifecycle of services, from inception and design to deployment, operation and refinement.
Be open-minded, aligned to business needs and solution-oriented, and keen on working in cross-functional teams.
Speak fluent English.
Additional Information
With more than 1,000 passionate professionals located across 10 European offices, we believe our success comes from working together and leveraging our international strengths. Expect to work in a hybrid environment, collaborating with colleagues in different locations remotely or face-to-face at the office.
Our benefits:
🐾 20% discount in our zooplus shop
📖 Internal and external training
🎈 Team events
✈️ 28 vacation days and days off on 24th and 31st of December
🏋️ Corporate rates at a local gym chain (Body & Soul)
📱 Company mobile phone for work and personal use
Want to know more? Learn more about who we are and what we do and visit our LinkedIn company profile.
zooplus is committed to equal opportunity. We value and embrace diversity and inclusion of all Team Members.