Service Reliability Engineer, G&A Solutions Engineering
1 week ago
Do you have a passion for ensuring the reliability, scalability, and performance of critical services? Are you a highly motivated and expert engineer with a strong understanding of Site Reliability Engineering (SRE) principles and a desire to automate and improve processes? Join Apple's General and Administrative (G&A) Solutions Engineering team as a Service Reliability Engineer and play a vital role in supporting our global, mission-critical production systems.
Description
You'll be at the forefront of maintaining the health, stability, and efficiency of our services, working with a diverse range of technologies and platforms. You will collaborate with Engineers, Data Engineers, DBAs, and network specialists to proactively identify and resolve potential issues, automate repetitive tasks, and drive continuous improvement initiatives. Your expertise will directly impact the reliability of our systems, enabling Apple to deliver innovative products and services to our customers.","responsibilities":"Proactively monitor service performance, identify potential bottlenecks, and implement solutions to optimize efficiency and resilience
Lead incident response efforts, driving rapid resolution and conducting thorough root cause analysis (RCA)
Develop and implement automation strategies to streamline operational tasks, improve service resilience, and reduce manual intervention
Apply SRE principles to maintain highly reliable and scalable service infrastructure
Collaborate closely with development teams to ensure that new services are designed for operational perfection, incorporating best practices for monitoring, alerting, and scalability
Contribute to the creation and maintenance of comprehensive documentation, including run-books, service level objectives (SLOs)
Participate in on-call rotations, providing 24/7 support for critical services and responding to incidents with a sense of urgency
Find opportunities for process improvement and drive initiatives to enhance the efficiency and effectiveness of the service reliability team
Champion a culture of continuous learning and knowledge sharing within the team
Define and supervise key service level indicators (SLIs) to measure and improve service reliability
Preferred Qualifications
Familiarity with CI/CD pipelines and DevOps practices
Experience with database technologies (e.g., MySQL, PostgreSQL, NoSQL databases)
Knowledge of ITIL frameworks and incident management processes
Experience with vibe coding
Understanding of Linux/Unix system administration
Experience with configuration management tools (Ansible, Chef, Puppet)
Minimum Qualifications
4+ years of experience in a Site Reliability Engineering, DevOps, or related role, supporting large-scale, enterprise-level services
Strong proficiency in at least one programming language (e.g., Python, Java, Go) and scripting languages (e.g., Bash, PowerShell)
Experience with cloud platforms (e.g., AWS, Azure, GCP) and cloud-native technologies (e.g., Kubernetes, Docker)
Hands-on experience with monitoring and alerting tools (e.g., Prometheus, Grafana, Splunk, Datadog)
Bachelor's degree in Computer Science or work related equivalent experience
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .
-
Director of Product Marketing, Solutions
2 weeks ago
Austin, Texas, United States Culture Amp Full timeJoin us on our mission to make a better world of work.Culture Amp is the world's leading employee experience platform, revolutionizing how 25 million employees across more than 6,500 companies create a better world of work. Culture Amp empowers companies of all sizes and industries to transform employee engagement, drive performance management, and develop...
-
Security Engineer, G&A Solutions Engineering
7 days ago
Austin, Texas, United States Apple Full timeApple is where individual creativities capture together, contributing to the values that lead to phenomenal work. Every new product we build, service we compose, or Apple Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it...
-
Security Engineer, G&A Solutions Engineering
6 days ago
Austin, Texas, United States Apple Full timeApple is where individual creativities capture together, contributing to the values that lead to phenomenal work. Every new product we build, service we compose, or Apple Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it...
-
Site Reliability Engineer
7 hours ago
Austin, Texas, United States KNIME AG Full timeToo much data, not enough insight?We get it. At KNIME, we build software that helps people clean, combine, and understand their data: fast, efficiently, and without code.And with our focus on Data Analytics & AI, we empower everyone to turn complex challenges into clear, actionable insights.You can help make that happen.We're not just an open-source data...
-
Principal Mechanical Reliability Engineer
4 days ago
Austin, Texas, United States Dell Technologies Full timeMechanical Engineering leads and delivers the development of innovative and compliant mechanical design solutions, as well as cross-functional interfaces for desktop, portable and server computer systems and peripherals. Our team conducts the analysis, feasibility studies and testing of mechanical products, instruments, subassemblies and packaging for new...
-
Test Reliability Engineer
4 days ago
Austin, Texas, United States Saronic Technologies Full timeSaronic Technologies is a leader in revolutionizing defense autonomy at sea, dedicated to developing state-of-the-art solutions that enhance maritime operations for the Department of Defense (DoD) through autonomous and intelligent platforms.We are seeking a Test Reliability Engineer to drive key system verification, test, and integration initiatives for our...
-
Principal Electrical Reliability Engineer
2 days ago
Austin, Texas, United States Dell Technologies Full time $148,000 - $164,000Principal Electrical Reliability EngineerOur Electrical Engineering team puts the spark into the full hardware development lifecycle, from concept to production. It takes experts in system architecture definition, design, analysis, prototyping, sourcing & the debugging and validation of layouts or routes to deliver state-of-the-art products for a changing...
-
Principal Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Collins Aerospace Full timeDate Posted: Country:United States of AmericaLocation:HTX99: Field Office - TX Remote Location, Remote City, TX, 73301 USAPosition Role Type:RemoteU.S. Citizen, U.S. Person, or Immigration Status Requirements:Must be authorized to work in the U.S. without the company's immigration sponsorship now or in the future. The company will not offer immigration...
-
Senior Site Reliability Engineer
5 days ago
Austin, Texas, United States AMBA Full timeAMBA is seeking an experienced Senior Site Reliability Engineer to join our IT TeamAbout AMBASince 1981, AMBA has been a trusted provider of essential coverage for retired public servants nationwide. Our reach extends to diverse groups, including hardworking public employees, state retirees, educators, military personnel, trade professionals, firefighters,...
-
Senior Site Reliability Engineer
2 days ago
Austin, Texas, United States Charles Schwab Full timeYour opportunityAt Schwab, you're empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us challenge the status quo and transform the finance industry together.We believe in the importance of in-office collaboration and fully intend for the selected candidate for this role to work on site in the...