Site Reliability / DevOps Engineer - Remote Work

  • Full-time

Company Description

About Us: 

PromptCloud is a Data as a Service company that helps businesses harness the power of data. Our technology fuels some of the most interesting big data projects in the world. We are a small bunch of people working towards shaping the imminent data-driven future by solving some of its fundamental and toughest challenges.

Job Description

.What does this role entail?

From designing a distributed system, improving the architecture for scale, optimizing the performance, provisioning the servers, monitoring the metrics, and troubleshooting the bugs, which means own the entire infrastructure.

Responsibilities:

  • Design, build, and maintain core infrastructure pieces that allow PromptCloud to scale 10X in a year(We scrape millions of urls and process TBs of data every hour).
  • Collaborate with business and product development teams to implement the infrastructure required to support the goals set for upcoming years.
  • Lead cross-organizational efforts by collaborating with different teams to diagnose operational surprises and carry forward improvements
  • Foresee challenges and take corrective action accordingly.

You’ll fit in, if

You want to work on:

  • Designing and deploying large scale systems, distributed infrastructure systems
  • Maintaining and improving service-oriented architecture and web services
  • Adapt monitoring for an increasingly dynamic environment of servers and services.

You have experience in:

  • 4+ years of experience in the related domains.
  • Configuration management systems such as Ansible.
  • Load balancing and reverse proxies such as Nginx, HAProxy
  • Configuring and maintaining SQL databases like MySQL for high availability
  • Configuring and maintaining Time-series databases and Key/Value stores (Redis, ElasticSearch cluster)
  • Writing bash shell script.
  • Deep understanding of UNIX/LINUX related environments.
  • Prior experience in mentoring and team lead/team management role.

Qualifications

Bonus points if you have experience in:

Worked on any of these tools Selenium, GGR, Proxies, AWS, GCE, Jenkins, RabbitMq, Resque

Team handling and project management.

Additional Information

To know more about this position you can reach me out on parvesh (at) promptcloud.com