Platform AI Observability Engineer

  • Full-time
  • Department: Others
  • Job Category: Data & Analytics

Company Description

NCS is a leading technology services firm that operates in over 20 cities across the Asia Pacific region, providing consulting, digital services, technology solutions, and more. We believe in harnessing the power of technology to achieve extraordinary things, creating lasting value and impact for our communities, partners, and people. Our diverse workforce of approximately 14,000 has delivered large-scale, mission-critical, and multi-platform projects for governments and enterprises in Singapore and the APAC region.

Job Description

As a Platform AI Observability Engineer, you will be part of the AI Center of Excellence (AI COE), a centralized platform organization responsible for building and operating enterprise-grade AI platforms used by multiple internal teams. You will own and evolve the observability, quality, and safety layer of the AI COE platform, with a strong focus on LLMOps and AgentOps. The role centers on monitoring, evaluating, and governing AI behavior in production so that AI systems remain reliable, transparent, and responsible at scale. This is a platform ownership role, not a model training or application development role.

What will you do?

AI Observability & Monitoring

  • Design and operate observability frameworks for LLM- and agent-based workloads

  • Define metrics, logs, and traces for AI inference, agent execution, and tool invocation

  • Build and maintain dashboards and alerts to support AI operations and governance

AI Quality, Safety & Reliability

  • Enable continuous monitoring of AI quality, safety, and behavioral correctness

  • Support continuous evaluation, regression detection, and drift analysis for AI systems

  • Implement and evolve quality and safety checks for LLM outputs and agent behavior

Platform Enablement & Collaboration

  • Collaborate with AI, platform, and security teams to embed observability, quality, and safety by default

  • Contribute shared standards and best practices for AI observability and governance across the platform

  • Support adoption of observability capabilities by internal engineering and project teams

Qualifications

The ideal candidate should possess:

  • 3–5 years of experience in observability, SRE, platform engineering, or AI operations roles

  • Hands-on experience supporting production-grade AI or ML systems

  • Strong understanding of LLMOps and AgentOps concepts, including tracing, continuous evaluation, and feedback loops

  • Experience with AI observability tools (e.g. Langfuse, or other prompt evaluation and agent tracing frameworks)

  • Strong observability fundamentals using OpenTelemetry (e.g. OpenTelemetry Collector, Grafana Alloy)

  • Experience building dashboards and alerts using Grafana with backends such as Prometheus, Loki, and Tempo

  • Ability to distinguish between infrastructure metrics (e.g. latency, GPU saturation) and semantic AI metrics (e.g. hallucination rate, tone, relevance)

  • Experience operating observability stacks in Kubernetes or OpenShift environments

  • Familiarity with log aggregation platforms (e.g. ELK) in enterprise settings

  • Strong analytical mindset with attention to AI behavior, quality, and risk

  • Platform ownership mentality with a strong focus on reliability and long-term value

  • Clear and effective communication across engineering, AI, and governance teams

Additional Information

Why Join NCS 

  • Lead high-impact AI management consulting programs for major enterprises and public sector clients.  

  • Shape enterprise strategies and governance frameworks that drive real transformation.  

  • Work with a talented, multidisciplinary team in a collaborative environment.  

  • Competitive compensation and strong professional development support.  

We are driven by our AEIOU beliefs (Adventure, Excellence, Integrity, Ownership, and Unity), and we seek individuals who embody these values in both their professional and personal lives. We are committed to our Impact: Valuing our clients, Growing our people, and Creating our future.

Together, we make the extraordinary happen.

Learn more about us at ncs.co and visit our LinkedIn career site.

Scam Alert

We are aware of fraudulent job offers and impersonations of NCS recruiters. Phishing emails using convincing-looking but fake addresses are also commonly used to trick you into thinking that they come from official NCS sources.

Please note that all official communications from NCS Group will only be sent from verified corporate email addresses. Always check that the sender’s email address ends with the genuine NCS domain, @ncs.com.sg, and beware of extra letters, symbols, or misspellings. When in doubt, verify the sender’s identity by contacting us at [email protected].