Cloud Operation Center Engineer - Bilingual in Korean

  • Full-time

Company Description

Cloud Operation Center Engineer R&R (Onsite / Full-Time)

Bilingual in Korean MUST!

Salary: $70K+ jr / $90K mid + DOE (open to negotiation)

Position Overview

The Cloud Operation Center Engineer is responsible for operating and supporting Private Cloud infrastructure, including Network, Firewall, Load Balancer, Storage, Linux OS, and Monitoring systems.

This role focuses on ensuring stable operations of an OpenStack-based Private Cloud environment, performing infrastructure incident analysis, vendor support coordination, monitoring improvements, and automation initiatives.

     

        Job Description

        Roles & Responsibilities

        1. Firewall Operations & Troubleshooting

        • Create, modify, and manage Palo Alto firewall policies
        • Review traffic logs and validate firewall policy changes
        • Perform network troubleshooting using packet capture / traffic dumps
        • Diagnose issues based on NAT, security policies, and session states

        2. Load Balancer Operations

        • Configure and manage Citrix ADC VPX-based load balancers (LB/CLB)
        • Manage VIPs, services, service groups, and health checks
        • Resolve LB incidents such as backend server failures and connection issues
        • Analyze logs using nstrace, tcpdump, and related tools

        3. OpenStack Private Cloud Operations

        • Provision, deploy, and manage OpenStack instances
        • Manage volumes, shared volumes, networks, and security groups
        • Troubleshoot instance boot failures, network issues, and volume attachment problems
        • Support compute node failures and live migration operations

        4. Server Operations & Vendor Support

        • Operate Dell servers used as OpenStack compute nodes
        • Monitor server health via iDRAC and perform basic maintenance
        • Handle hardware failures and coordinate with Dell support
        • Support hardware replacement and vendor service activities

        5. Storage Operations

        • Manage NetApp ONTAP storage systems
        • Monitor nodes, SVMs, volumes, and aggregates
        • Analyze performance metrics such as latency, IOPS, and throughput
        • Respond to storage incidents and performance degradation issues

        6. Linux OS Operations

        • Perform Linux system administration and incident troubleshooting
        • Handle root password changes, mount issues, repository issues, etc.
        • Resolve boot failures, filesystem issues, and network interface problems
        • Analyze system logs and systemd service issues

        7. Monitoring & Observability

        • Operate monitoring platforms such as Zabbix, Grafana, and Prometheus
        • Monitor servers, networks, storage, OpenStack, LB, and firewall systems
        • Manage alerts, dashboards, metrics, and triggers
        • Perform root cause analysis using logs, metrics, and alerts

        8. Automation & Operational Improvement

        • Automate deployments and updates using Ansible
        • Automate repetitive operational tasks
        • Build workflow automation using Microsoft Teams / Power Automate
        • Automate alerts, reporting, approval, and request workflows
        • Develop scripts using Python, Shell, or PowerShell
        • Leverage AI coding tools for scripting, log analysis, and documentation

         

          Qualifications

           

          • Experience in Linux OS administration and troubleshooting
          • Understanding of TCP/IP, routing, NAT, and firewall policies
          • Hands-on experience with Palo Alto or similar firewalls
          • Experience with Citrix ADC VPX or similar load balancers
          • Experience with OpenStack or private cloud environments
          • Experience with NetApp ONTAP or enterprise storage systems
          • Experience with Dell servers and iDRAC-based operations
          • Packet capture / traffic dump troubleshooting experience
          • Experience with Ansible or scripting for automation
          • Ability to perform root cause analysis using logs, metrics, and network data

           

          Preferred Qualifications

          • Experience with Zabbix, Grafana, or Prometheus
          • OpenStack component experience (Nova, Neutron, Cinder, Glance)
          • NetApp ONTAP CLI and performance tuning experience
          • Citrix ADC nstrace / tcpdump troubleshooting experience
          • Microsoft Teams / Power Automate workflow automation experience
          • Scripting skills in Python, Shell, or PowerShell
          • Understanding of REST API, Webhook, JSON, YAML
          • Git-based version control experience
          • Experience with AI-assisted development tools (ChatGPT, Copilot, etc.)
          • Experience using AI for RCA, log analysis, and automation

          Key Skills

          OpenStack, Palo Alto Firewall, Citrix ADC VPX, NetApp ONTAP, Dell Server/iDRAC, Linux Administration, Monitoring (Zabbix/Grafana/Prometheus), Ansible, Automation Tools, Python/Shell/PowerShell, TCP/IP Networking, Packet Analysis, Incident Troubleshooting, Vendor Coordination, AI-assisted Operations

          Additional Information

          All your information will be kept confidential according to EEO guidelines.

          By clicking the link above or any third-party link within this posting, you are leaving this site and going to a third-party website where the third-party website's terms and privacy policy apply

          Privacy Notice