Site Reliability Engineer (SRE)

Apple
Shanghai
3 weeks ago
We are looking for an SRE who can help lead the next generation of products we create. Our infrastructure team is responsible for architecting, building, and scaling a distributed system that enables Apple to manufacture every product. We manage hundreds of bare-metal servers and thousands of client machines across 20+ data centers. You should strive to make life easier for everyone who uses these systems, including developers, technical support, on-location teams, and end users. This means automating everything from deployment workflows to CI/CD to monitoring and alerting systems.

Description

This is a rare opportunity to put your signature on how Apple manufactures everything. We need you to help take our system to the next level, working closely with the manufacturing design and mechanical engineering teams on new products. We don’t expect you to be a manufacturing expert, but we guarantee that within the first 6 months you will become one. You’ll be working with the world’s best engineers to help them build the products we all want. Our current stacks are diverse and evolving combinations of old and new, closed and open source technologies. We are not looking for a solution for now; we are looking for the best solution for tomorrow. We are an ambitious team that takes smart risks and challenges everything, including each other. None of us is the best at everything, but each of us is the best at something. As we scale and evolve the supporting infrastructure for such diverse technologies, it becomes crucial to understand the entire stack in order to maintain, investigate, log, monitor, optimize, and expand our services.

Minimum Qualifications
  • 3 years of experience managing server infrastructure across multiple data centers
  • Proficient in Linux, command-line tools, and general system debugging
  • Proficient in configuration management tools such as Ansible
  • Experience using Docker for production services
  • Experience deploying and managing observability tools such as Prometheus, Grafana and the ELK stack
  • Experience using a CI/CD system like Jenkins
  • Good communication skills in written and spoken English

Preferred Qualifications
  • Experience managing bare-metal hardware (PXE boot, kickstart)
  • Experience with one or more: Golang, Python, SQL, HTTP, TCP/IP
  • Experience managing database servers such as PostgreSQL including replication across multiple data centers

