Vertex AI platform Engineer- Alpharetta, GA

  • Contract

Job Description

Hi,

 

Job Title: Vertex AI Platform Engineer

Location: Alpharetta, GA

Duration: 6 months Contract 

 

We have below job opening.

If you are interested and your experience match with job description.

Please send your updated resume with below details....Asap

 

Expected rate all inc on W2/1099 :

Visa status:

Current Location:

Availability:

Bachelor degree and years of passing :

Master degree and years of passing :

 

Job Description:

 

Maintain and optimize the operational stability and performance of the Vertex AI environment

Monitor health and performance of Vertex AI services including notebooks, pipelines, endpoints, and managed instances

Troubleshoot and resolve issues related to scheduling, testing, and configuration

Collaborate with DevOps teams to implement automated deployment and testing processes

Ensure new Vertex AI features (e.g., GenAI) are properly configured and integrated

Triage and resolve support tickets related to Vertex AI platform issues

Perform root cause analysis to identify and prevent future problems

Develop and maintain documentation on incident resolution procedures

Investigate and address performance bottlenecks in the Vertex AI environment

Implement monitoring and alerting systems to proactively identify potential issues

Collaborate with AI/ML teams to optimize resource utilization and cost efficiency

Stay up-to-date on new Vertex AI features and releases

Plan and execute platform upgrades and enhancements

Work with AI/ML teams to assess the impact of new features on existing workflows

Strong knowledge of Vertex AI and its components

Experience with containerization and orchestration technologies such as Docker and Kubernetes

Familiarity with LLMs and anomaly detection techniques

Proficiency in Python and other scripting languages

Experience with cloud monitoring and logging tools like Stackdriver

Familiarity with DevOps practices and tools including CI/CD pipelines

Ability to quickly diagnose and resolve complex technical issues

Strong analytical and troubleshooting skills

Proactive approach to identifying and preventing potential problems

Ability to effectively communicate technical concepts to both technical and non-technical stakeholders

Excellent written and verbal communication skills

Ability to collaborate effectively with cross-functional teams

Experience with machine learning frameworks such as TensorFlow or PyTorch

Knowledge of data engineering and pipeline management

Understanding of security best practices for AI/ML platforms

 

Additional Information

All your information will be kept confidential according to EEO guidelines.