
普通员工/个人贡献者
AI 估算 · 40k–80k
高级深度学习性能架构师,技能稀缺,GPU和AI领域人才需求旺盛,NVIDIA作为行业龙头提供极具竞争力的薪资。
该职位负责分析新型深度学习网络(如LLM),识别并原型化性能优化机会,影响英伟达当前和下一代推理产品的软硬件架构
BS, MS, or PhD in a relevant field (CS, EE, Math, etc.) or equivalent experience. 5+ years’ work experience. Excellent C/C++ programming and software build skills. Experience in kernel development and performance tuning on GPUs (or other accelerators). Familiarity with typical deep learning SW frameworks (e.g., Torch/JAX/TensorFlow/TensorRT) and popular AI models (e.g., LLM and AIGC models). Familiarity and background with hardware frameworks for deep learning applications.
Analyze brand-new DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next-gen inference products. Develop prototypes of the fastest kernels on present and future NVIDIA GPUs. Define hardware and software setups along with measurements to evaluate performance, power consumption, and accuracy in current and upcoming chips. Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.
Experience in the performance optimization of DL workloads. Experience with MLIR and AI compiler development.
优点
缺点 / 挑战
暂无明显挑战项
顶级AI芯片公司的高阶技术岗,技术前沿、薪资优厚,但工作强度大、现场办公。
该职位来自上市公司英伟达,薪资水平在行业中处于高位,且福利待遇全面,能较好满足补偿性动机。
岗位涉及最前沿的AI加速技术和GPU架构,技术成长空间极大,且有机会影响下一代硬件设计。
职位要求现场办公,未提及弹性工作或WLB信息,且技术岗高强度加班较常见,生活化动机满足程度一般。
英伟达在AI基础设施领域具有核心影响力,工作成果直接推动行业进步,使命感和行业前景较强。