Site Reliability Engineer
- Chiasso, Switzerland
lm group is a publicly traded multinational Group, among the worldwide leaders in the online travel industry, and we operate a portfolio of well-known brands such as lastminute.com, Bravofly, Rumbo, Volagratis,Crocierissime, Jetcost and Hotelscan.
Every month, the Group reaches across all its websites and mobile apps (in 17 languages and 40 countries) 43 million unique users that search for and book their travel and leisure experiences. More than 1,200 people enjoy working with us and contribute to provide our audience with a comprehensive and inspiring offering of travel related products and services
To support and participate in company-wide Continuous Deployment introductions we are looking for a Site Reliability Engineer with certified experience as SRE for our Technology department based in Chiasso
“Hope is not a strategy. Engineering solutions to design, build, and maintain efficient large-scale systems is a true strategy, and a good one.”
Key Responsibilities will include
As Site Reliability Engineers we are responsible for the availability, performances, monitoring, and incident response of the platform and services running on multiple environments.
Improve infrastructure automation and automate repetitive tasks and build a scalable infrastructure
Improve and envolve the Self-Service Capabilities to developers and other stakeholders
Collaborate closely with architects, developers, database administrators in order to handle the reliability and scalability of the infrastructure.
Working closely with the Infrastructure team to define and implement solutions necessary for the success of the development teams.
Participate in periodic on-call duties
Skills and Experience
+ 3 years experience as DevOps
Experience with Linux operating systems (Ubuntu, RHEL) internals and administration (e.g., filesystems, inodes, system calls) and networking (e.g., TCP/IP, routing, network topologies)
Exposure to configuration management tools like Puppet, Ansible, Terraform and their best practices
Knowledge of Docker and Kubernetes
Experience in Virtualization technologies
Good Knowledge with languages like Python, Ruby, GO and sh/bash as well
Production Support Experience (Systems administration/deployments)
Knowledge of cloud solutions (AWS, Google Cloud)
Good understanding of webservers/load balancers (Apache HTTPd, Nginx, F5)
Solid understanding of change management and incident management processes
Ability to debug and optimise code and automate routine tasks
Familiarity with Centralised logs solutions (Fluentd, Logstash, Splunk)
Some exposure to Agile methodologies like Kanban, Scrum, XP.
Focus on Security and Compliance
Experience in data centre operations and management on middle size environments (500-1000 servers)
● Travel domain experience
● Certifications in the area of expertise (OS and App Server related)
● Familiarity with microservices, contracts, REST interfaces
● Good understanding of hybrid cloud architecture
● Familiarity Continuous delivery and deployment tools like Jenkins, GoCD, Spinnaker
● Experience in the programming language Java
● Good communication skills, written and verbal
● Enthusiasm to learn new technologies
● Attitude to teamwork and ability to work in multi-location teams