Site Reliability Engineer
- Full-time
- Clearance: Top Secret/SCI
Company Description
At RED GATE we do everything we can to serve our clients:
Using the right technical skills, unique methodologies, best practices, and integrated technology, we help clients implement bold solutions. New approaches to emerging and evolving threats. Non-traditional ways to overcome entrenched obstacles. Advantage through opportunity. If you have a serious challenge or problem, we can help you solve it. The below job description provides details on how this role will help to serve our clients.
Job Description
The Red Gate Group is seeking a Site Reliability Engineer to support DTRA. This hybrid role combines on-site and remote work, where you’ll enhance system resilience and efficiency for the DoD by building a robust infrastructure. By leveraging your expertise in Kubernetes, Ansible, AWS Cloud Migration, and Cloudera, you’ll build in redundancy, implement monitoring tools, and automate processes to reduce toil. This position offers the opportunity to guide junior engineers and expand your knowledge base while contributing to innovative cloud migration solutions.
Responsibilities:
- Develop resilient infrastructure for the DoD.
- Implement monitoring tools and automate routine tasks.
- Build or modify Ansible playbooks with Bash scripts.
- Troubleshoot and resolve issues related to CI/CD pipeline failures.
- Collaborate with application development teams across the software development life cycle.
Qualifications
Required Skills & Qualifications
Active TS/SCI
5+ years of experience with working in Linux environments
5+ years of experience with troubleshooting, triaging, and resolving issues related to CI/CD pipeline failures or slowness on production Enterprise environments
Experience with developing enterprise cloud-native solutions involving Kubernetes, Docker, Cloudera, AWS, Jenkins, or RHEL Systems
Experience in working with application development teams across the software development life cycle and creating solutions to complex problems in a collaborative team environment
Ability to build or modify Ansible playbooks with Bash scripts
Active DoD 8570 Level II Security Certification, including Security+
Desired Skills & Qualifications
Experience with Python and Go, Microservices, Serverless, MLOps, AIOps, Cloudera, and Kubernetes
Experience with Big Data stack using Hadoop, Spark, Accumulo or MongoDB, and Solr or Elasticsearch
Experience with software development processes and code management tools and processes
Experience with declarative Infrastructure as Code tools, including Puppet, Terraform, and Ansible
Experience with GitOps and CI/CD tools, including ArgoCD, Gitlab CI, or Jenkins
Possession of excellent verbal and written communication skills
Additional Information
The Red Gate Group, Ltd. is an Equal Opportunity/Affirmative Action Employer. The Red Gate Group, Ltd. considers applicants without regard to race, color, religion, age, national origin, ancestry, ethnicity, gender, gender identity, gender expression, sexual orientation, marital status, veteran status, disability, genetic information, citizenship status, or membership in any other group protected by federal, state, or local law. EEO is the Law