MT SRE Admin
- Full-time
Company Description
Sutherland is seeking an attentive and goal-oriented person to join us as a Full Stack Developer for one of the core engineering initiatives.
Job Description
Must have / Required Skills:
- Site Reliability Engineering Practices.
- Good understanding of microservices, Kubernetes, Docker, AWS Cloud, Oracle/IBM/Tomcat application servers, and Dynatrace.
- Good understanding of business flows, customer experience, KPIs, and SLAs.
- Good understanding of logging frameworks and tools such as Elastic/OpenSearch, Logstash, and Kibana, or Splunk.
- Experience in troubleshooting JVM failures, JDBC connection leaks and service integration failures
- Experience with any application monitoring tool such as Dynatrace, AppDynamics, or Grafana.
Good to have:
- 8+ years of experience in IT
- Experience working in Telecom Domain
- In-depth knowledge of configuring, tuning, and maintaining Java application servers and microservices on the Kubernetes platform
- Strong understanding of SDLC
- Experience working with CI/CD pipelines using FlexDeploy, Jenkins, Artifactory, etc.
Job Description
- Working experience with web servers, application servers, Java Message Service (JMS queues and topics), and containerized microservices.
- Good understanding of the Kubernetes platform and service mesh technologies such as Istio, NGINX, etc.
- Hands-on experience with AWS services such as EC2, ALB, and NLB, and with RabbitMQ.
- Responsible for application reliability and for defining SLAs, SLIs, and SLOs.
- Capacity planning, JDBC tuning and performance tuning.
- Should be able to provide requirements for performance and chaos tests and analyze their results.
- Strong experience with and understanding of SOAP and REST web services.
- Strong log analysis skills. Should be able to distinguish system errors from business errors.
- Assess and implement best practices for Observability and tracing.
- Strong incident management and problem management skills.
- Working experience with APM tools like Dynatrace, Datadog, AppDynamics, etc., including dashboard creation.
- Strong knowledge of load balancers, HTTP/HTTPS protocols, and networking concepts.
- Collaborate with multiple teams for Incident resolution.
- Experience in automation to reduce MTTR.
Additional Information
- Primary Skill: Microservice and Kubernetes application support and troubleshooting
- Secondary Skill: AWS, SRE
- Required Start Date: 15-June-2024
- Demand type (Employee or Contractor): Employee
- Level 1 - Interview panel 1: Brahmayya Kandara and Rajesh Kancherla
- Level 2 - Interview panel 1: Nithin Vemireddy and Vivek Kaila
- Package: 12-15 LPA
- SOW name if created in the system: TBD
- Offshore / Onshore: Offshore