Director of Site Reliability Engineering, AI Infrastructure
Oracle
Santa Clara, CA, United States
2024-07-31 06:25:25
- MS or BS in Computer Science, or equivalent experience.
- 5+ years of experience managing technology teams.
- 10+ years of software engineering experience.
- Proven experience as a Director of Site Reliability Engineering or a similar leadership role, with a track record of successfully managing and scaling SRE teams.
- Strong knowledge of cloud infrastructure, distributed systems, and network architecture.
- Demonstrated ability to manage and prioritize multiple projects and initiatives in a fast-paced, dynamic environment.
- Excellent problem-solving and troubleshooting skills, with the ability to analyze complex systems and identify areas for improvement.
- Strong leadership and communication skills, with the ability to effectively collaborate with cross-functional teams and influence decision-making at all levels of the organization.
- Experience with NVIDIA training technologies (CUDA, NCCL).
- Working familiarity with networking protocols (TCP/IP, UDP, HTTP) and standard network architectures.
- Strong technical knowledge in distributed systems, high performance computing, and GPU systems.
- Experience with AI model training infrastructure.
Career Level - M4