搜索建议:

外企
远程办公
remote
communication
行政
marketing
兼职
trainee
实习
procurement
国外工作
finance
part time
香港
上海市
澳門
顺德区
澳門
Shanghai
荃灣區
北區
Chengdu
四川省
饶平县
Guangzhou City

Senior Technical Account Manager - GPU (北上广深)

亚马逊
澳門
2天前

DESCRIPTION

  • Hiring location: Beijing, Shanghai, Guangzhou, Shenzhen, Hong Kong(visa sponsorship provided)

Would you like to join one of the fastest-growing teams within Amazon Web Services (AWS) and help shape the future of GPU optimization and high-performance computing? Join us in helping customers across all industries to maximize the performance and efficiency of their GPU workloads on AWS while pioneering innovative optimization solutions.

As a Senior Technical Account Manager (Sr. TAM) specializing in GPU Optimization in AWS Enterprise Support, you will play a crucial role in two key missions: guiding customers' GPU acceleration initiatives across AWS's comprehensive compute portfolio, and spearheading the development of optimization strategies that revolutionize customer workload performance.

Key Job Responsibilities
  • Build and maintain long-term technical relationships with enterprise customers, focusing on GPU performance optimization and resource allocation efficiency on AWS cloud or similar cloud services.
  • Analyze customers’ current architecture, models, data pipelines, and deployment patterns; create a GPU bottleneck map and measurable KPIs (e.g., GPU utilization, throughput, P95/P99 latency, cost per unit).
  • Design and optimize GPU resource usage on EC2/EKS/SageMaker or equivalent cloud compute, container, and ML services; implement node pool tiering, Karpenter/Cluster Autoscaler tuning, auto scaling, and cost governance (Savings Plans/RI/Spot/ODCR or equivalent).
  • Drive GPU partitioning and multi-tenant resource sharing strategies to reduce idle resources and increase overall cluster utilization.
  • Guide customers in PyTorch/TensorFlow performance tuning (DataLoader optimization, mixed precision, gradient accumulation, operator fusion, torch.compile) and inference acceleration (ONNX, TensorRT, CUDA Graphs, model compression).
  • Build GPU observability and monitoring systems (nvidia-smi, CloudWatch or equivalent monitoring tools, profilers, distributed communication metrics) to align capacity planning with SLOs.
  • Ensure compatibility across GPU drivers, CUDA, container runtimes, and frameworks; standardize change management and rollback processes.
  • Collaborate with cloud provider internal teams and external partners (NVIDIA, ISVs) to resolve cross-domain complex issues and deliver repeatable optimization solutions.

-

About the team
AWS Global Services includes experts from across AWS who help our customers design, build, operate, and secure their cloud environments. Customers innovate with AWS Professional Services, upskill with AWS Training and Certification, optimize with AWS Support and Managed Services, and meet objectives with AWS Security Assurance Services. Our expertise and emerging technologies include AWS Partners, AWS Sovereign Cloud, AWS International Product, and the Generative AI Innovation Center. You’ll join a diverse team of technical experts in dozens of countries who help customers achieve more with the AWS cloud.

Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Inclusive Team Culture
AWS values curiosity and connection. Our employee-led and company-sponsored affinity groups promote inclusion and empower our people to take pride in what makes us unique. Our inclusion events foster stronger, more collaborative teams. Our continual innovation is fueled by the bold ideas, fresh perspectives, and passionate voices our teams bring to everything we do.

Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.


BASIC QUALIFICATIONS

  • 5+ years in cloud technical support, solutions architecture, or customer success management, with at least 3 years of hands-on experience in GPU/accelerated computing platforms.
  • In-depth understanding of GPU instance families (e.g., AWS G/P/H series) or similar offerings from other cloud providers, AMI/driver/CUDA/container compatibility management, and cloud storage/network performance tuning (e.g., S3 I/O, EBS/Instance Store equivalents, preprocessing pipelines). Proficient in scheduling GPU workloads with EKS or equivalent Kubernetes-based orchestration services, including node pool tiering, resource quotas, elastic scaling, and auto-recovery strategies. Experienced in multi-GPU/multi-node distributed computing (NCCL, topology awareness, tensor parallelism, pipeline parallelism) with expertise in communication optimization for large-scale AI training and inference.
  • Skilled in PyTorch/TensorFlow performance analysis and optimization, including DataLoader tuning, mixed precision, operator fusion, and inference acceleration toolchains (ONNX, TensorRT, CUDA Graphs).
  • Experienced in cost and capacity governance, familiar with Savings Plans, RI, ODCR, Spot, Capacity Blocks, and right-sizing strategies or their equivalents in other cloud platforms.
  • Demonstrated cross-functional communication and influence skills, capable of driving technical solutions with data and business objectives.

PREFERRED QUALIFICATIONS

  • AWS Solutions Architect Professional, Machine Learning Specialty, or DevOps Professional certification or equivalent credentials from other cloud providers.
  • Hands-on experience with NVIDIA ecosystem software and toolchains (CUDA/cuDNN/NCCL, TensorRT, CUDA Graphs) and proven ability to maintain performance consistency across versions and platforms.
  • Delivered quantifiable performance improvements (GPU throughput, latency reduction, cost savings) with demonstrated benchmarking and regression testing methodology.
  • Proven repeatable optimization results in LLM inference, batch AI training, real-time video processing, or high-performance computing (HPC).
  • Contributions to open source projects (Run:ai, Ray, vLLM, DeepSpeed, Kubeflow, etc.) or published technical articles, whitepapers, or performance benchmarking.
  • Experience with Infrastructure as Code (Terraform, AWS CDK **or equivalent cloud development frameworks**), Helm Charts, baseline container image management, and DevOps automation.
  • Able to present performance-business tradeoffs and results to senior stakeholders using PR/FAQ documents, architecture diagrams, and capacity/cost reports.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Job details

    CHN, Guangzhou
    CHN, Shanghai
    CHN, Beijing
    CHN, Shenzhen

    Support

    Solutions Architect
申请
保存
举报职位
其他职位推荐:

Ass.Account Manager, Imaging&IGT

Philips
澳門, 澳門
一个极具职业发展前途的机会。飞利浦全公司的市场营销能力在不断地提高。 成功应聘这个在多元化环境中以市场开发为己任的职位,将让您在自己的长期职业生涯、飞利浦公司内部的其他领域甚至其他行业中获得众多发展机会。 作为雇主,我们也希望为您提供最好的待遇来回馈您的付出。...
1天前

Senior Account Sales, Ultrasound

Philips
Beijing, 北京市
一个极具职业发展前途的机会。飞利浦全公司的市场营销能力在不断地提高。 成功应聘这个在多元化环境中以市场开发为己任的职位,将让您在自己的长期职业生涯、飞利浦公司内部的其他领域甚至其他行业中获得众多发展机会。 作为雇主,我们也希望为您提供最好的待遇来回馈您的付出。...
1天前

AI ML GPU Optimization Software Engineer

AMD
Beijing, 北京市
  • Develop & Optimize Models: Design and optimize deep...
  • Collaborate with GPU Library Teams: Work closely with...
2周前

Senior Associate Operator, Production, Process Expert

Celanese
顺德区, 广东省
2. 具备优秀的生产一线的工艺知识来监控,预测,汇报,消除,解决或者升级可能影响4个核心原则的问题
1周前

Local Marketing Manager

美国雅培
離島區, 香港
1. 确保在推广周期内,根据市场策略执行相关的推广计划。配合区域销售团队开展学术活动,如城市会,院内会,和KOL 活动等; 2. 和市场部密切配合,确保所有的推广资料,符合产品定位,传递最新的产品信息; 3....
1周前

Technical Artist, Tools & Rigging (Contract)

Riot Games, Inc.
上海市
有在 DCC 软件(如 Maya,Motion Builder 和 Unreal Engine等)中进行内容创作和工具开发的实际经验 熟悉现代脚本语言及其 UI 框架,如 Python 和 PyQt/PySide
3周前

SC-District Manager-Guangzhou

Sanofi
澳門
  • 根据区域销售目标和营销计划,制定销售计划
  • 跟进辖区销售活动及销售业绩
  • 进行A&P、OP 预算规划和控制。...
2周前

AI ML GPU Optimization Software Development Engineer

AMD
Shanghai, 上海市
  • Develop GPU Kernels: Create and optimize GPU kernels to...
  • Develop & Optimize Models: Design and optimize deep...
3周前

Cloud Services Sales Specialist

NTT DATA
Shanghai, 上海市
As a Cloud Client Partner at NTT DATA, your role will focus on identifying, developing, and closing managed service and...
1天前

Order Processing Assistant

DELO
Shanghai, 上海市
  • Process, arrange and track all sales orders with customers
  • Process, arrange and track all purchase orders with HQ ...
1天前