将语言模型转化为现实应用
我们正在为全球用户构建 AI 系统。当前正处于 AI 变革的关键时期 —— 本项目团队致力于构建能够真正落地、创造现实世界最大影响与使用量的 AI 应用。
该职位为全球岗位,采用灵活混合办公模式 —— 结合远程办公与总部现场协作。你将与产品、工程、运营、基础设施和数据等地区团队紧密合作,共同构建并扩展具有影响力的 AI 解决方案。
为什么这个岗位重要
你将运行并优化最前沿的开源模型、设计推理框架,并将 AI 功能稳定上线。你的工作将确保我们的模型不仅具备智能,还能在规模化场景中保持安全性、可靠性与性能表现。
你的职责
高效运行并管理开源大模型,优化推理的成本与可靠性
确保在 GPU、CPU 与内存资源之间的高性能与稳定性
实时监控与排查推理性能问题,确保低延迟与高吞吐量
与工程团队协作,实现可扩展、可靠的模型服务架构
我们正在寻找这样的你:
喜欢主导项目并独立推动落地
相信“清晰来自行动” —— 原型、测试、迭代,而非等待完美计划
在初创环境中依然冷静高效 —— 不惧从零开始或变化快速
重视速度 —— 优先交付有价值的产品,而非追求完美版本
视反馈与失败为成长的一部分 —— 持续进阶自己的技能
拥有谦逊、进取心与执行力,并在协作中带动他人前进
任职要求
有使用 vLLM、HuggingFace TGI 等模型推理平台的经验
熟悉 GPU 调度与资源编排,掌握 Kubernetes、Ray、Modal、RunPod、LambdaLabs 等工具
具备根据流量动态监控推理延迟、成本并高效扩展系统的能力
熟悉为后端工程师设置推理 API 接口的流程与规范
你将获得
扁平化团队结构与真实项目主导权
全程参与产品方向与决策制定
灵活办公制度
高影响力角色,跨产品、数据与工程多团队协作
顶尖市场薪酬 + 绩效奖金
全球化产品开发机会
丰厚福利:住房租赁补贴、优质公司食堂、加班餐补
健康、牙科与视力保险
全球差旅保险(适用于你与家属)
无限制、弹性带薪休假
团队与文化
我们是一支高密度、高绩效的团队,专注于高质量产品与全球影响力。我们像主人一样承担责任,重视速度、清晰与极致执行。如果你渴望成长并追求卓越,欢迎加入我们!
关于 BJAK
BJAK 是东南亚最大的保险聚合平台,服务用户超过 800 万,且由员工全资持股。公司总部位于马来西亚,在泰国、台湾与日本设有业务。我们通过 Bjak.com 帮助数百万用户获取透明且可负担的金融保障。
我们通过 API、自动化与 AI 等前沿科技,简化复杂金融产品,致力于打造下一代智能金融系统。
如果你对构建真正落地的 AI 系统充满热情,并希望在高影响力环境中快速成长,我们期待与你相遇!
-
Transform Language Models into Real-World Applications
We’re building AI systems for a global audience. We are living in an era of AI transition - this new project team will be focusing on building applications to enable more real world impact and highest usage for the world.
This role is a global role with hybrid work arrangement - combining flexible remote work with in-office collaboration at our HQ. You’ll work closely with regional teams across product, engineering, operations, infrastructure and data to build and scale impactful AI solutions.
Why This Role Matters
You’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production. Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.
What You’ll Do
Run and manage open-source models efficiently, optimizing for cost and reliability
Ensure high performance and stability across GPU, CPU, and memory resources
Monitor and troubleshoot model inference to maintain low latency and high throughput
Collaborate with engineers to implement scalable and reliable model serving solutions
What Is It Like
Likes ownership and independence
Believe clarity comes from action - prototype, test, and iterate without waiting for perfect plans.
Stay calm and effective in startup chaos - shifting priorities and building from zero doesn’t faze you.
Bias for speed - you believe it’s better to deliver something valuable now than a perfect version much later.
See feedback and failure as part of growth - you’re here to level up.
Possess humility, hunger, and hustle, and lift others up as you go.
Requirements
Experience with model serving platforms such as vLLM or HuggingFace TGI
Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, LambdaLabs
Ability to monitor latency, costs, and scale systems efficiently with traffic demands
Experience setting up inference endpoints for backend engineers
What You’ll Get
Flat structure & real ownership
Full involvement in direction and consensus decision making
Flexibility in work arrangement
High-impact role with visibility across product, data, and engineering
Top-of-market compensation and performance-based bonuses
Global exposure to product development
Lots of perks - housing rental subsidies, a quality company cafeteria, and overtime meals
Health, dental & vision insurance
Global travel insurance (for you & your dependents)
Unlimited, flexible time off
Our Team & Culture
We’re a densed, high-performance team focused on high quality work and global impact. We behave like owners. We value speed, clarity, and relentless ownership. If you’re hungry to grow and care deeply about excellence, join us.
About Bjak
BJAK is Southeast Asia’s #1 insurance aggregator with 8M+ users, fully owned by its employees. Headquartered in Malaysia and operating in Thailand, Taiwan, and Japan, we help millions of users access transparent and affordable financial protection through Bjak.com. We simplify complex financial products through cutting-edge technologies, including APIs, automation, and AI, to build the next generation of intelligent financial systems.
If you're excited to build real-world AI systems and grow fast in a high-impact environment, we’d love to hear from you.