Skip to content
Skip to content
DevOps Jobs
A

SRE Manager, ML Operations - Apple Ads

Apple

Location
Onsite (Cupertino, California)
Employment
Full-time
Level
Senior Level
Posted 1 week ago

About the Role

Apple Ads is seeking a senior engineering leader to manage its Site Reliability Engineering team, which is responsible for the Ad Serving infrastructure. This role offers the opportunity to shape the future of how services are built and run at Apple's global scale with a high degree of operational precision.

Skills

Site Reliability Engineering Distributed Systems Engineering Leadership Operating System Principles Networking Fundamentals Systems Management Monitoring Alerting Error Budgets Fault Analysis AWS ML Systems GPU Cluster Optimization Digital Advertising Production Engineering Cross-functional Leadership

Full job details

At Apple, we believe in the power of technology to enrich people's lives. Everything we build is designed to empower people, including our advertising platform. We deliver ads in a way that benefits both customers and advertisers — helping people discover content, supporting creators, and protecting and respecting everyone’s privacy. Our technology makes advertising possible on the App Store, Apple News, Stocks, and Apple TV. We help developers and marketers of all sizes drive app discovery across the App Store. Our display ads on Apple News and Stocks let advertisers promote their products alongside trusted content in a brand-safe environment, while supporting publishers and journalists. Sponsorship integrations and experiences in live sports on Apple TV help advertisers connect with captivated audiences. Everything we do is with the unwavering commitment to privacy you expect from Apple. Because when advertising is done right, it benefits everyone. We are seeking a senior engineering leader and experienced professional to lead our Site Reliability Engineering team. This team is responsible for Ad Serving infrastructure that serves as the front door of Apple Ads.

Description


You will be an accomplished builder and leader of teams looking to take on your next challenge. You know SRE and you know what it will take to run services at Apple scale with a high degree of operational precision. This role will position you to help craft the future of how we build and run our services on a global scale. You will have the technical skills to go deep and retain the ability to focus on higher-level business and product goals. We hire high quality leaders and engineers with a diverse set of experiences and abilities for positions on Apple.

Minimum Qualifications


10+ years experience with large scale distributed systems Demonstrable success leading engineering teams - ideally SRE or Production Engineering Knowledge of core operating system principles, networking fundamentals, and systems management Understanding of SRE principals, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts Strong leadership capabilities, with excellent problem-solving and decision-making skills. 5+ years professional experience in an engineering leadership position

Preferred Qualifications


Bachelors or Master’s degree in computer science or equivalent field with 10+ years of experience Experience managing infrastructure in AWS Experience building and operating large-scale distributed systems or ML systems in production. Experience partnering with Product, ML Platform, Ads Serving, Data Science, and cross-functional stakeholders to deliver complex initiatives Experience managing and optimizing GPU based clusters. Prior experience in digital advertising industry is a huge plus.