Job Description

Description and Requirements

Job Responsibilities:
1. Design and develop AI local inference code to support large-scale deployment scenarios.
2. Conduct architecture design and implement acceleration solutions for heterogeneous hardware platforms (including iGPU and d-GPU).
3. Optimize inference performance by applying advanced technical methods such as operator fusion, memory bandwidth reduction, quantization, and mixed precision.
4. Engage in device perception algorithm development and LLM performance fine-tuning.
5. Drive technological innovation in inference acceleration algorithms and maintain technical leadership in the field.

Job Requirements:
1. Profound professional knowledge and practical experience in AI local inference code development and large-scale deployment.
2. Solid expertise in architecture design of heterogeneous hardware platforms (iGPU/d-GPU) and hands-on experience in implementing corresponding acceleration solutions.
3. Proficient in applying advance...

Ready to Apply?

Take the next step in your AI career. Submit your application to Lenovo today.
