将语言模型转化为现实应用
我们正在为全球用户打造 AI 系统。当前正处于 AI 变革时代 —— 本项目团队专注于构建真正落地、产生现实世界影响力与大规模使用率的应用。
该职位为全球岗位,支持灵活混合办公模式 —— 结合远程办公与总部现场协作。你将与产品、工程、运营、基础设施与数据等区域团队紧密合作,共同构建并扩展具有深远影响的 AI 解决方案。
为什么这个职位重要
你将参与高质量数据集的构建与管理,为大语言模型的微调、模型安全性、内容可信度提供基础支撑。你的工作直接决定了模型是否能在规模化应用中保持智能、可靠、安全。
你的职责
收集、清洗与预处理用户生成的文本与图像数据,用于大模型微调
设计并管理可扩展的数据标注流程,结合众包平台与内部标注团队
构建与维护用于内容审核的自动化数据集(例如:安全与不安全内容的识别)
与研究员及工程师协作,确保数据集具有高质量、多样性,并对齐模型训练目标
我们正在寻找这样的你:
喜欢拥有主导权并独立完成任务
相信“清晰来自行动” —— 原型、测试与迭代优于完美计划
能在初创节奏下保持高效 —— 不惧优先级变化或从零开始构建系统
有速度偏好 —— 更愿意及时交付有价值的成果,而不是追求完美却迟迟不落地
将反馈与失败视为成长的机会 —— 渴望不断提升自己
拥有谦逊、求知欲与实干精神,并乐于帮助他人共同进步
任职要求
有为机器学习或大模型微调准备数据集的成熟经验
精通文本与图像数据的清洗、预处理与转换流程
熟悉数据标注工作流,并能对标注数据进行质量控制与管理
熟悉内容审核类数据集的构建与维护(例如安全性、合规性、过滤机制)
精通脚本编程(如 Python、SQL),并能处理大规模数据管道
你将获得
扁平化组织架构与真实项目主导权
全程参与产品方向制定与共识决策过程
灵活办公制度
高影响力岗位,跨产品、数据与工程多团队协作
顶尖市场薪酬与绩效奖金制度
全球化产品开发机会
丰厚福利:住房补贴、高质量公司食堂、加班餐补
健康、牙科与视力保险
全球差旅保险(适用于你与家属)
无限制、弹性带薪休假
团队与文化
我们是一支高密度、高绩效的团队,专注于打造高质量产品,影响全球用户。我们像主人一样思考与行动,重视执行速度、沟通清晰与极致责任感。如果你渴望成长并追求卓越,欢迎加入我们!
关于 BJAK
BJAK 是东南亚最大的保险聚合平台,拥有超过 800 万用户,并实现员工持股制。总部位于马来西亚,业务覆盖泰国、台湾、日本等地。我们通过 Bjak.com,帮助数百万用户获取透明、可负担的金融保障服务。
我们通过 API、自动化与 AI 等前沿技术,简化复杂金融产品,致力于打造下一代智能金融系统。
如果你希望在现实世界中构建真正有影响力的 AI 系统,并在高成长环境中实现快速突破,欢迎加入我们!
-
Transform Language Models into Real-World Applications
We’re building AI systems for a global audience. We are living in an era of AI transition - this new project team will be focusing on building applications to enable more real world impact and highest usage for the world.
This role is a global role with hybrid work arrangement - combining flexible remote work with in-office collaboration at our HQ. You’ll work closely with regional teams across product, engineering, operations, infrastructure and data to build and scale impactful AI solutions.
Why This Role Matters
You’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production. Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.
What You’ll Do
Collect, clean, and preprocess user-generated text and image data for fine-tuning large models
Design and manage scalable data labeling pipelines, leveraging both crowdsourcing and in-house labeling teams
Build and maintain automated datasets for content moderation (e.g., safe vs unsafe content)
Collaborate with researchers and engineers to ensure datasets are high-quality, diverse, and aligned with model training needs
What Is It Like
Likes ownership and independence
Believe clarity comes from action - prototype, test, and iterate without waiting for perfect plans.
Stay calm and effective in startup chaos - shifting priorities and building from zero doesn’t faze you.
Bias for speed - you believe it’s better to deliver something valuable now than a perfect version much later.
See feedback and failure as part of growth - you’re here to level up.
Possess humility, hunger, and hustle, and lift others up as you go.
Requirements
Proven experience preparing datasets for machine learning or fine-tuning large models
Strong skills in data cleaning, preprocessing, and transformation for both text and image data
Hands-on experience with data labeling workflows and quality assurance for labeled data
Familiarity with building and maintaining moderation datasets (safety, compliance, and filtering)
Proficiency in scripting (Python, SQL) and working with large-scale data pipelines
What You’ll Get
Flat structure & real ownership
Full involvement in direction and consensus decision making
Flexibility in work arrangement
High-impact role with visibility across product, data, and engineering
Top-of-market compensation and performance-based bonuses
Global exposure to product development
Lots of perks - housing rental subsidies, a quality company cafeteria, and overtime meals
Health, dental & vision insurance
Global travel insurance (for you & your dependents)
Unlimited, flexible time off
Our Team & Culture
We’re a densed, high-performance team focused on high quality work and global impact. We behave like owners. We value speed, clarity, and relentless ownership. If you’re hungry to grow and care deeply about excellence, join us.
About Bjak
BJAK is Southeast Asia’s #1 insurance aggregator with 8M+ users, fully owned by its employees. Headquartered in Malaysia and operating in Thailand, Taiwan, and Japan, we help millions of users access transparent and affordable financial protection through Bjak.com. We simplify complex financial products through cutting-edge technologies, including APIs, automation, and AI, to build the next generation of intelligent financial systems.
If you're excited to build real-world AI systems and grow fast in a high-impact environment, we’d love to hear from you.