NOC System Engineer

  • Full-time
  • Department: Data Center

Company Description

PubMatic is a digital advertising technology company for premium content creators. The PubMatic platform empowers independent app developers and publishers to control and maximize their digital advertising businesses. PubMatic’s publisher-first approach enables advertisers to maximize ROI by reaching and engaging their target audiences in brand-safe, premium environments across ad formats and devices. Since 2006, PubMatic has created an efficient, global infrastructure and remains at the forefront of programmatic innovation.  Headquartered in Redwood City, California, PubMatic operates 13 offices and nine data centers worldwide.

Job Description

The Network Operations Center (NOC) team member will be responsible for first and second level support for a wide variety of systems and processes supporting our platform. This individual will perform routine tasks to maintain system, application and network services. In this role the individual is required to work shifts (rotating shifts). Supporting, troubleshooting of internal and external systems relying on pre-established guidelines is required in this role.

Responsibilities

  • Proactively ensures the highest levels of systems and infrastructure availability.
  • Perform daily system monitoring, verifying the integrity and availability of systems and key processes, reviewing system and application logs, and verifying completion of scheduled jobs.
  • Provides support for operations to resolve critical issues quickly.
  • This may include occasional off-hours and weekend work and periodic on-call support.
  • Be a main point of contact for communication ensuring that all stakeholders are aware of NOC services status and issues, coordinate activities, P1 and P2 incidence on time and as per defined SLA.
  • The primary responsibility will be creating and implementing integrated communications.
  • Collate feedback on process performance.
  • Measure and monitor the effectiveness of processes to ensure consistent value delivery.
  • Works with operations and engineering teams to perform incident root cause analysis.
  • Assist in developing, and documenting actions to assure problem resolution or to implement corrective/preventive action and document resolution;
  •  Monitors and tests application performance for potential bottlenecks, identify possible solutions, and work with developers to implement those fixes.
  • Translates business requirements into system specifications and functional designs by producing documents on how to implement changes to the system.
  •  Oversees the execution of testing strategies applicable to the systems of interest.
  •  Oversees and recommends systems enhancements to the business issues and process challenges for the organization.
  • Performs other duties as may be assigned by management.
  • Identify opportunities to improve efficiency of support and monitoring
  • The Ideal Candidate will possess the Following Additional Education and Experience
  • Works on issues of diverse scope where analysis of a situation or data requires evaluation of a variety of factors, including an understanding of current business trends.
  • Follows processes and operational policies in selecting methods and techniques for obtaining solutions.
  • Willing to learn and develop new skills.
  • Good self-awareness. Actively seeks out tasks that help develop skills and knowledge.
  • Ability to work on own initiative. Actively seeks ways of improving existing systems and processes.
  • ‘Can do’ attitude. Flexible and adaptable approach to problem solving.
  • Co-operate with other teams. Actively encourage strong working relationships with other teams.

Qualifications

  • College diploma or university degree in the field of computer science, information sciences, or related field preferred.
  • At least 3 to 5 years of work experience.
  • RHCSA, RHCE, CCNA and ITIL certifications recommended.
  • Exceptional knowledge of computer hardware, including Super micro and Dell servers.
  • Knowledge for Monitoring tools like Nagios, Observium etc.
  • Basic knowledge of the OSI model, switching and internet routing technologies (to junior network administrator level).
  • Experience with switching and internet routing technologies.
  • Strong Linux network diagnostic skills.
  • Knowledge of Linux kernel, command line and system diagnostics (to junior sysadmin level).
  • Experience configuring internet applications such as Apache, nginx, BIND.
  • Basic understanding of network monitoring concepts and management tools.
  • Experience of network monitoring tools and protocols
  • Exposure to scripting basics. Experience of scripting (Perl, Bash) and another web application
  • Experience of analyzing system and network performance using monitoring and graphical data.
  • Ability to assess faults, prioritize, respond and escalate accordingly.
  • Experience of diagnosing network and service issues, following them through to resolution.
  • Capable of multi-tasking, good time

Additional Information

PubMatic is proud to be an equal opportunity employer; we don’t just value diversity, we promote and celebrate it. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

All your information will be kept confidential according to EEO guidelines.