Job Description

Hiring for AI/ML Ops, GPU acceleration, and AI inference (TensorRT, ONNX)


  • Build and maintain containerized applications using OpenShift, OpenShift AI, Kubernetes, and Helm charts.
  • Integrate and optimize inference engines such as Triton and vLLM for scalable model serving.
  • Lead model deployment, monitoring, and lifecycle management in production environments.
  • Implement monitoring and alerting solutions using Grafana and Prometheus.
  • Collaborate on GenAI and LLM projects, including Agentic AI initiatives.
  • Automate CI/CD pipelines and infrastructure using Jenkins, Ansible, Groovy, and Terraform.
  • Develop automation scripts and tools in Python.
  • Architect, deploy, and manage AI/ML solutions on AWS Cloud; experience with Bedrock and SageMaker is a plus.
  • Build and enhance the AI Platform (both on-premises and in public cloud), making it scalable, high-performance, and resilient.
  • Contribute to fut...

Ready to Apply?

Take the next step in your AI career. Submit your application to Jobworld Management Consultancy LLC today.