Analyst IT Operations(SCOM Engineer – Enterprise Monitoring & Infrastructure Operations)

  • Full-time
  • Location: India - Hyderabad

Job Description

Mattel India is seeking an experienced SCOM Engineer to design, maintain, and optimize Microsoft System Center Operations Manager (SCOM) environments in a SOX-regulated infrastructure setup. You will provide proactive visibility, performance monitoring, alerting, and stability across servers, applications, networks, cloud resources, and critical business systems (e.g., financial reporting, supply chain, e-commerce). This role ensures early detection of issues, adherence to SOX ITGCs (e.g., monitoring for unauthorized changes, SLA breaches, anomalies impacting financial controls), and high availability to support global toy brand operations with minimal disruptions.

Key Responsibilities

  • Design, deploy, configure, and optimize SCOM (current versions, e.g., SCOM 2019/2022 or Azure-integrated): Management packs (MPs), agents, overrides, rules, monitors, discoveries, and distributed setups.
  • Implement comprehensive monitoring for infrastructure (Windows/Linux servers, VMware virtualization, storage, networks) and applications (custom .NET/web/SQL, cloud resources in Azure/AWS).
  • Develop and tune alerting: Thresholds, state-based monitors, notifications (email, PagerDuty/ServiceNow integrations), dashboards, reports, and escalation policies to reduce noise and ensure actionable insights.
  • Monitor real-time health, performance metrics (CPU, memory, disk, response times), availability, and trends; generate compliance/performance reports for leadership and audits.
  • Provide 24x5 support during core hours for SCOM operations, tuning, troubleshooting, and enhancements; participate in weekend on-call rotation for critical incidents, alert triage, agent restarts, or failover procedures.
  • Support SOX compliance: Configure SCOM for ITGC-relevant monitoring (e.g., configuration drift, access anomalies, performance impacting financial systems), assist in audit evidence collection (logs/reports), control testing, and remediation of findings.
  • Integrate SCOM with other tools (e.g., ServiceNow for ticketing, Splunk for advanced analytics, Azure Monitor) and automate routine tasks (e.g., via PowerShell scripting).
  • Perform root-cause analysis on incidents, contribute to post-incident reviews, and drive continuous improvement (e.g., MP optimization, scaling, reducing false positives).
  • Collaborate with Infrastructure, Security, Compliance, Application, and Dev teams to define monitoring SLAs, requirements, and best practices in a regulated environment.
  • Document configurations, management packs, procedures, runbooks, and SOX-related artifacts; support knowledge sharing and team handovers.

Required Skills & Experience

  • 3–5 years of hands-on experience administering, designing, and optimizing Microsoft SCOM in enterprise/production environments.
  • Strong expertise in:
  • SCOM architecture, agent management, management pack authoring/customization, overrides, and reporting.
  • Monitoring Windows servers, .NET/SQL applications, virtualization (VMware/Hyper-V), networks, and cloud (Azure preferred).
  • Alerting, dashboards (e.g., SCOM Console, Web Console), and performance tuning.
  • Solid understanding of SOX compliance in IT monitoring: ITGCs, change/access monitoring, logging/auditing, anomaly detection, and regulatory evidence/reporting.
  • Experience in 24x5/24x7 operations with on-call (including weekends), incident response, and high-availability monitoring.
  • Proficiency in PowerShell scripting for SCOM automation/tasks, plus familiarity with ITIL processes, ServiceNow/Jira, and integrations.
  • Knowledge of infrastructure technologies: Windows Server, Active Directory, SQL, networking, cloud platforms, and virtualization.
  • Strong troubleshooting, analytical, and documentation skills in a global, regulated context.

Preferred Qualifications

  • Experience in consumer goods, retail, manufacturing, or entertainment sectors (e.g., monitoring supply chain, DTC/e-commerce, or financial systems).
  • Certifications: Microsoft Certified: Azure Administrator, SCOM-specific (e.g., Microsoft Operations Manager), ITIL Foundation, or compliance-focused (CISA/CRISC).
  • Exposure to hybrid monitoring (SCOM + Azure Monitor, SolarWinds, Nagios) or additional Microsoft tools (e.g., Intune, SCCM).
  • Scripting/automation in regulated environments.
Privacy Notice