DevOps Engineer - Level II (Remote-US)
- New York, NY, USA
- Employees can work remotely
At Casebook PBC, our software makes people’s lives better. Our company is committed to empowering community well-being through the delivery of adaptive, research-based and practice-driven technology. Designed to “help the helpers,” our innovative and award-winning SaaS solutions help improve outcomes in human services.
We are looking for an Infrastructure / DevOps engineer, who will play a central role managing a mission-critical SaaS infrastructure in the AWS cloud. Our ideal candidate has a passion for infrastructure automation, “Infrastructure as a Service” design, and is committed to continuous improvement, solving problems as part of a diverse team that includes engineers, QA testers, a release manager, product managers, designers and client stakeholders. He or she is meticulous about user facing infrastructure work, and has a service oriented ethic focused on meeting the needs of both developers and customers. Responsibilities include building and integrating tools enabling automated software development pipeline, automated software deploys, database backups and restores, software updates, and driving our infrastructure and service delivery platform forward.
What You’ll Do (Responsibilities):
Creating and integrating tools to improve automation, monitoring and troubleshooting in the Amazon cloud and Amazon GovCloud
Configuring and managing a variety of internally developed and third-party applications and services, across multiple environments
Familiarity with infrastructure-as-code with tools such as Ambassador, Kubernetes, Terraform, CloudFormation and Ansible
Maintain and enhance containerization, CI and deployment with GitHub, Gitlab, Jenkins
Enhancing system reliability, stability, performance and security through advanced disaster recovery processes, horizontal scaling and caching
Monitoring, troubleshooting and resolving issues for application, database, Elastic Search, Kafka, RDS, Redis, Data Warehouse, AWS EMR, AWS Lambda, PostgreSQL, and background job servers.
Participating in 24/7 on-call rotation
Contributing to decision making to address issues that affect application uptime, security, performance or deployment schedules
Supporting data integrity processes on testing and production environments
What You Have (Skills and Experience):
3+ years of experience as an Infrastructure engineer supporting a multi-tier web application in production
Deep knowledge of AWS cloud architecture and services
Experience with at least one infrastructure-as-code DSL
Solid understanding and experience with software application builds and deployments
Excellent communication skills and interpersonal skills
Linux system admin experience
Experience with PostgreSQL (design, configuration, replication, backups, migrations and upgrades)
Strong scripting skills (Ansible, Ruby, Bash, Python)
Operational experience with Elastic Stack (Elastic Search, Logstash, Kibana)
Knowledge of the IP protocol suite and associated troubleshooting, especially cloud networking and security group management
Familiar with application monitoring and insight tools, such as DataDog, New Relic and RollBar
Experience working in a zero-downtime environment
Advanced knowledge of the following highly preferred: software development lifecycle, agile methodologies, Git, Ruby on Rails, Pivotal Tracker
Passionate and motivated about DevOps work, enhancing system reliability and optimizing operational processes
Everyone's career and life history are unique. If you're not a perfect match, let us know why you're the right person for the position along with your specific traits or experience that can help Casebook in its mission of improving outcomes in human services through technology.