Vertex AI platform Engineer- Alpharetta, GA
- Contract
Job Description
Hi,
Job Title: Vertex AI Platform Engineer
Location: Alpharetta, GA
Duration: 6 months Contract
We have below job opening.
If you are interested and your experience match with job description.
Please send your updated resume with below details....Asap
Expected rate all inc on W2/1099 :
Visa status:
Current Location:
Availability:
Bachelor degree and years of passing :
Master degree and years of passing :
Job Description:
Maintain and optimize the operational stability and performance of the Vertex AI environment
Monitor health and performance of Vertex AI services including notebooks, pipelines, endpoints, and managed instances
Troubleshoot and resolve issues related to scheduling, testing, and configuration
Collaborate with DevOps teams to implement automated deployment and testing processes
Ensure new Vertex AI features (e.g., GenAI) are properly configured and integrated
Triage and resolve support tickets related to Vertex AI platform issues
Perform root cause analysis to identify and prevent future problems
Develop and maintain documentation on incident resolution procedures
Investigate and address performance bottlenecks in the Vertex AI environment
Implement monitoring and alerting systems to proactively identify potential issues
Collaborate with AI/ML teams to optimize resource utilization and cost efficiency
Stay up-to-date on new Vertex AI features and releases
Plan and execute platform upgrades and enhancements
Work with AI/ML teams to assess the impact of new features on existing workflows
Strong knowledge of Vertex AI and its components
Experience with containerization and orchestration technologies such as Docker and Kubernetes
Familiarity with LLMs and anomaly detection techniques
Proficiency in Python and other scripting languages
Experience with cloud monitoring and logging tools like Stackdriver
Familiarity with DevOps practices and tools including CI/CD pipelines
Ability to quickly diagnose and resolve complex technical issues
Strong analytical and troubleshooting skills
Proactive approach to identifying and preventing potential problems
Ability to effectively communicate technical concepts to both technical and non-technical stakeholders
Excellent written and verbal communication skills
Ability to collaborate effectively with cross-functional teams
Experience with machine learning frameworks such as TensorFlow or PyTorch
Knowledge of data engineering and pipeline management
Understanding of security best practices for AI/ML platforms
Additional Information
All your information will be kept confidential according to EEO guidelines.