Job Description

Job Responsibilities :

We are hiring a Senior / Lead Software Engineer to design and build an AI/ML platform capable of high-throughput training and inference across local and cloud GPU environments. This role focuses on systems architecture, GPU acceleration, performance engineering, and reliable operation of AI workloads at scale. You will lead engineering initiatives, define platform architecture, and collaborate closely with ML and hardware teams.

Responsibilities

System Architecture & Core Engineering

  • Design and implement the architecture for model training, fine-tuning, and serving.
  • Build platform components that support heterogeneous compute environments (GPUs, NPUs, accelerators).
  • Develop and optimize high-performance inference stacks using frameworks such as vLLM, SGLang, TensorRT-LLM, or Triton.
  • Develop APIs, CLI tools, and backend services for model lifecycle management.
  • Local & Cloud G...

    Ready to Apply?

    Take the next step in your AI career. Submit your application to Razer today.

    Submit Application