Senior AWS Observability Engineer
- Full-time
Company Description
Derex Technologies Inc specializes in IT consulting, staffing solutions, and software services. Globally headquartered in Harrison, New Jersey, since 1996, Derex delivers the highest-quality technology professionals and an array of customized IT talent solutions designed to improve productivity and drive results for global clients throughout North America.
With over two decades of unparalleled experience, Derex provides support to its clientele across industries such as Systems Integration, Banking and Finance, Telecommunications, Pharmaceutical and Life Sciences, Energy, Healthcare, Technology, Transportation, and local and federal government agencies.
Job Description
Role: Senior AWS Observability Engineer
Location: Erie, PA (On-Site)
Experience: 10+ Years
Primary Skill
- AWS Observability and Monitoring tooling and installation.
- Integration with observability tools such as Splunk Observability Cloud, OpenTelemetry, AppDynamics, Datadog, Dynatrace, Amazon CloudWatch, and AWS CloudTrail.
About the Role:
We are seeking a highly skilled, hands-on, and experienced Observability Engineer with a focus on AWS Observability and Monitoring to join our dynamic team.
In this role, you will be responsible for installing agents and integrating applications running on AWS with cloud-native observability tools. You will collaborate closely with clients and internal teams to understand requirements, translate them into solutions, and present these solutions to clients.
Your expertise will be crucial in guiding clients through their cloud observability journey, from initial assessment and planning to integration and optimization. As a Senior AWS Observability Engineer, you will stay current with the latest AWS services and best practices to ensure our solutions remain cutting-edge. You will work with clients, engineering centers, delivery teams, and various business lines to deliver high-quality monitoring solutions that drive business value.
The ideal candidate will have a strong background in AWS, be hands-on, possess excellent problem-solving skills, and have a passion for leveraging cloud technologies.
Key responsibilities:
- Design, develop, and maintain an observability strategy that covers application performance, user experience, and system health on AWS and on-premises, using tools such as Splunk Observability Cloud, CloudWatch, and OpenTelemetry.
- Design, configure, and maintain AppDynamics dashboards tailored to business needs, giving clear visibility into key metrics and KPIs.
- Implement OpenTelemetry SDKs and APIs across applications to collect traces, metrics, and logs. Ensure consistent instrumentation in .NET, Java, COTS, and other applications in the environment.
- Integrate and configure Splunk Observability Cloud to capture detailed trace data, enabling a comprehensive view across distributed applications.
- Ensure dashboards highlight trends, bottlenecks, and critical alerts for quick incident response.
- Define alerts and thresholds in both AppDynamics and Splunk to detect anomalies early.
- Extensive hands-on experience with installing necessary agents on servers, virtual machines, and AWS-supported services, and forwarding logs to a centralized location with configured aggregators.
- Deep understanding of logs, metrics, and tracing services and their capabilities.
- Conduct assessments related to AWS platform and observability, providing detailed comparative analysis, cost metrics, advantages of various monitoring tools, and setting benchmarks.
- Develop and maintain documentation processes by creating templates for observability, including assessment checklists, questionnaires, presentations, and proofs of concept, with a hands-on approach.
- Participate in sessions and workshops for clients and internal team members on observability, delivering high-quality presentations.
- Participate in solution design and proposal development activities.
- Proven experience in IT monitoring and observability with a focus on cloud environments.
Requirements:
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- At least 10 years of IT experience, with a minimum of 6 years focused on AWS Cloud with an emphasis on observability.
- Extensive experience implementing end-to-end unified observability using OpenTelemetry and Splunk Observability Cloud in multi-cloud environments.
- Experience designing, configuring, and maintaining AppDynamics dashboards tailored to business needs.
- Extensive experience with AWS services and a thorough understanding of compute, storage, networking, security, and database services in the cloud such as EC2, S3, VPCs, Network Flow Logs, RDS, etc.
- Expertise in AWS monitoring solutions such as Amazon CloudWatch and AWS CloudTrail.
- Proficiency in AWS-supported monitoring solutions such as Dynatrace, AppDynamics, Datadog, and Sumo Logic.
- Experience in monitoring infrastructure, APIs, microservices, JVMs, and RUM, with the ability to create necessary dashboards for visualization.
- Experience integrating with notification systems and incident management systems.
- Understanding of self-healing concepts and AIOps.
- Proficiency in programming languages such as Python, Java, or Go.
- Ability to work independently and as part of a team, demonstrating leadership when required.
- Relevant cloud certifications are required.
Regards,
Manoj
Derex Technologies Inc
Contact: 973-834-5005 Ext 206
Additional Information
All your information will be kept confidential according to EEO guidelines.