Sr Analyst IT Operations (Smartops/Enterprise Platforms)

  • Full-time
  • Location: India - Hyderabad

Job Description

Mattel India is seeking a motivated Junior Monitoring Engineer to support the configuration, maintenance, and daily operations of key enterprise monitoring tools (SCOM, Nagios, SolarWinds, AlertSite) in a SOX-regulated infrastructure environment. You will assist senior engineers and leads in providing proactive visibility into servers, networks, applications, cloud resources, and critical business systems (e.g., financial, supply chain, e-commerce). This role ensures early detection of issues, compliance with SOX IT general controls (ITGCs, e.g., audit logging, performance monitoring, anomaly detection), and high availability to support iconic toy brands worldwide.

Key Responsibilities 

  • Assist in the day-to-day configuration, tuning, and maintenance of monitoring platforms: 

      • Microsoft SCOM (System Center Operations Manager): Management packs, agents, alerts, dashboards, and reporting.

      • Nagios (or Nagios-based tools like Icinga): Plugins, host/service checks, NRPE, notifications.

      • SolarWinds (Orion platform): NPM, SAM, NCM modules for network/server/app monitoring, alerts, and polling.

      • AlertSite (SmartBear): Synthetic monitoring, transaction checks, API/web monitoring, global nodes, and alert integrations.

  • Monitor infrastructure health, performance metrics, availability, and alerts across on-prem data centers, hybrid/cloud environments (AWS/Azure), and SOX-critical systems. 

  • Respond to alerts and incidents during 24x7 shifts and on-call rotations: Perform initial triage, acknowledge or escalate, execute basic remediation (e.g., restarts, threshold adjustments), and document actions in ticketing systems (ServiceNow/Jira).

  • Support SOX compliance: Configure monitors for audit-relevant events (e.g., unauthorized changes, access anomalies, SLA breaches), generate compliance reports/evidence, and assist in control testing/remediation. 

  • Perform routine tasks: Agent deployments, threshold tuning, false-positive reduction, dashboard/report creation, and monitor health checks. 

  • Collaborate with senior engineers, infrastructure, security, and application teams to implement new monitors, integrate tools (e.g., with PagerDuty, Splunk, ServiceNow), and improve alerting accuracy. 

  • Document configurations, procedures, runbooks, and SOX-related artifacts; participate in knowledge sharing and team handovers. 

  • Contribute to proactive monitoring improvements, capacity planning insights, and post-incident reviews to enhance system reliability. 

  • Support disaster recovery drills and ensure monitoring continuity during changes/upgrades. 

Required Skills & Experience 

  • 2–5 years of hands-on experience in IT monitoring/operations, with exposure to at least 2–3 of the following tools: SCOM, Nagios/Icinga, SolarWinds Orion, AlertSite (or similar synthetic monitoring tools like Dynatrace Synthetics/New Relic). 

  • Basic to intermediate knowledge of: 

      • Windows/Linux server monitoring, network devices, applications, and cloud resources.

      • Alert configuration, thresholding, notifications, and integration with incident tools.

      • Performance metrics (CPU, memory, disk, response times) and troubleshooting basics.

  • Understanding of SOX compliance in IT environments: Monitoring for ITGCs, logging/auditing, change detection, and regulatory reporting basics (preferred but trainable). 

  • Experience in 24x7 production support environments, including shift work/on-call, incident response, and high-availability monitoring. 

  • Familiarity with ITIL processes, ServiceNow/Jira for ticketing, and basic scripting (PowerShell/Bash/Python for simple automation tasks).

  • Good analytical skills, attention to detail, and ability to work collaboratively in a global team. 
