搜索建议:

外企
远程办公
remote
french
项目经理
兼职
marketing
sourcing
data
国外工作
product manager
实习
finance
澳門
香港
顺德区
上海市
Shanghai
澳門
屯門區
Chengdu
湖北省
Guangzhou City
離島區
Shenzhen
申请

Deep Learning Performance Architect

NVIDIA
Shanghai, 上海市
全职
3周前

NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning performance architect to join our AI performance modelling, analysis and optimization efforts. In this position, you will have a chance to work on DL performance modelling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads. You will make your contributions to our dynamic technology focused company.

What you'll be doing:

  • Analyze state-of-the-art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products.

  • Develop analytical models for the state-of-the-art deep learning networks and algorithm to innovate processor and system architectures design for performance and efficiency.

  • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uniprocessor and multiprocessor configurations.

  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.

What we need to see:

  • BS, MS or PhD in relevant discipline (CS, EE, Math, etc.) or equivalent experience.

  • 5+ years’ work experience.

  • Experience with popular AI models (e.g., LLM and AIGC models)

  • Be familiar with typical deep learning SW framework (e.g., Torch/JAX/TensorFlow/TensorRT)

  • Knowledge and experience on hardware architectures for deep learning applications

Ways to stand out from the crowd:

  • Background with CUDA and GPU computing systems

  • Experience on performance modelling or optimization of DL workloads

保存 申请
举报职位
其他职位推荐:

Deep Learning Performance Architect Intern - 2025

NVIDIA
Shanghai, 上海市
  • Unlock Architectural Insights: Analyze GPU workloads to...
  • AI-Powered Automation: Build AI/ML-driven tools to automate...
2周前

Senior Performance Software Engineer, Deep Learning Libraries

NVIDIA
Shanghai, 上海市
We are now looking for a Senior Performance Software Engineer for Deep Learning Libraries! Do you enjoy tuning parallel...
3周前

Senior Infrastructure Software Engineer, Deep Learning Libraries

NVIDIA
Shanghai, 上海市
  • Building scalable automation for build, test, integration,...
  • Developing throughout the software stack, from the user...
3周前

Senior Project Manager / Senior Architect / Architect / Junior Architect

BIG (Bjarke Ingels Group)
Shanghai, 上海市
  • Lead the management of small- to large-scale and complex...
  • Coordinate project schedules, deliverables, and client...
1周前

LLM High-Performance Optimization Architect

AMD
Beijing, 北京市
  • Develop and implement LLM training and inference frameworks...
  • Analyze and optimize accuracy and performance issues during...
2周前