搜索建议:

外企
远程办公
remote
marketing
国外工作
实习
english teacher
account
兼职
前端
行政
finance
sustainability
香港
澳門
顺德区
Shanghai
上海市
澳門
福建省
广东省
Shenzhen
大埔區
Chengdu
重庆市

Senior Applied Scientist (Copilot Platform AML Team)

微软
Beijing, 北京市
3天前
The Copilot Platform AML Team is driving the next generation of intelligent assistant infrastructure, powering Microsoft Copilot experiences across the enterprise. Our mission is to build foundational language models that make Copilot more helpful, responsive, and accessible to millions of users worldwide.
We are looking for Applied Scientists to pioneer innovations in scalable training and inference optimization for both Small and Large Language Models (SLMs/LLMs). In this role, you will directly shape the core platform capabilities of Copilot, influencing how organizations interact with AI-driven assistants every day.
Our work spans the entire model lifecycle—from supervised fine-tuning to advanced post-training techniques such as instruction tuning, reinforcement learning, and alignment. We also push the boundaries of model efficiency with cutting-edge compression strategies, including GPTQ, AWQ, and pruning, to deliver faster, more cost-effective inference at scale.
If you’re passionate about creating intelligent assistant systems that combine deep model expertise with world-class engineering, and want to shape the future of enterprise AI, we’d love to have you on our team.

Responsibilities

Model Optimization & Deployment:
Design and implement efficient workflows for training, distillation, and fine-tuning Small and Large Language Models (SLMs), leveraging techniques such as LoRA, QLoRA, and instruction tuning.
Apply model compression strategies—including quantization (e.g., GPTQ, AWQ) and pruning—to reduce inference costs and improve latency.
Optimize LLM inference performance using frameworks like vLLM and TensorRT-LLM (TRT-LLM) to enable scalable, low-latency deployment.
Build robust and scalable inference systems tailored to heterogeneous production environments, with a strong focus on performance, cost-efficiency, and stability.
Evaluation & Data Management:
Develop evaluation datasets and metrics to assess model performance in real-world product scenarios.
Build and maintain end-to-end machine learning pipelines encompassing data preprocessing, training, validation, and deployment.
Cross-functional Collaboration:
Collaborate closely with product managers, engineers, and research scientists to translate business needs into impactful AI solutions, driving real-world adoption and seamless product integration.

Qualifications

Basic Qualifications:
Master’s degree or above (or equivalent experience) in Computer Science, Engineering, Mathematics, Physics, or a related field.
Strong programming skills with hands-on experience in managing large-scale data and machine learning pipelines.
Deep understanding of open-source ML frameworks such as PyTorch, vLLM, and TensorRT-LLM (TRT-LLM).
Solid knowledge of model optimization techniques, including quantization, pruning, and efficient inference.
Preferred Qualifications:
1+ years of experience optimizing LLM inference using frameworks like vLLM or TRT-LLM.
Practical experience in model compression and deployment within production systems.
Experience designing agentic AI systems, such as multi-agent orchestration, tool usage, planning, and reasoning.

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
保存 申请
举报职位
其他职位推荐:

Senior Applied Scientist (Copilot Platform AML Team)

Microsoft
Beijing, 北京市
We also push the boundaries of model efficiency with cutting-edge compression strategies, including GPTQ, AWQ, and pruning, to...
3天前

Principal Applied Scientist

微软
Beijing, 北京市
  • Ph.D. (or equivalent research experience) in Computer...
  • 6+ years of experience in AI/ML, with a focus on LLMs,...
2周前

Senior Associate Operator, Production, Process Expert

Celanese
顺德区, 广东省
2. 具备优秀的生产一线的工艺知识来监控,预测,汇报,消除,解决或者升级可能影响4个核心原则的问题
1周前

Senior Applied Scientist--M365

Microsoft
Suzhou City, 江苏省
  • Drive data exploration and analysis by collecting initial...
  • Build and evaluate ML models by running modeling tools on...
1周前

Principal Applied Scientist (LLMs)

微软
Beijing, 北京市
  • Ph.D. (or equivalent research experience) in Computer...
  • 6+ years of experience in AI/ML, with a focus on LLMs,...
3天前

Scientist

Huntsman Corp
Shanghai, 上海市
  • Carry out lab evaluations to identify value proposition and...
  • Investigate customer needs and develop an in-depth...
6天前

Senior Concept Artist, Environment (Contract)

Riot Games, Inc.
上海市
  • 与场景概念负责人紧密合作构思场景概念设计
  • 为游戏关卡中的关键场景、地貌、建筑、生态与文化元素等提供高质量的layout设计/单体设计/氛围概念设计图,以适应不同开...
  • 通过设计传达场景的历史背景与世界观设定, 参与定义艺术基调与世界构造逻辑...
2周前

Reception Supervisor

Marriott International
Beijing, 北京市
Coordinate with Housekeeping to track readiness of rooms for check-in Count bank at the beginning and end of shift Balance and...
2天前

Lead, Solution Architect, FCSO

Standard Chartered Bank
澳門
  • Responsible for Solution Architecture for Financial Crime...
  • Responsible to work with other Solution Architects across...
2天前

平台与技术负责人 Head of Marketplace & Tech

Siemens
澳門
  • 制定并执行平台与技术战略,聚焦沉浸式电商与AI技术底座,快速迭代销售工具、科技资讯、社群及工业AI发展基地等核心功能。
  • 设计与发展Xcelerator Marketplace整体架构,满足业务发展需要。...
2天前