Software Engineering Manager II, Site Reliability Engineering, Cloud

1 month ago


New York, New York, United States Google Full time
Minimum

Qualifications:

Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
8 years of experience with data structures or algorithms.
5 years of experience with software development in one or more programming languages.
3 years of experience managing people or teams, leading projects, and designing, analyzing, and troubleshooting distributed systems.
Preferred

Qualifications:

Master's degree in Computer Science or Engineering.
1 year of people management experience.

About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.

SRE ensures that Google's services both our internally critical and our externally-visible systems have reliability, uptime appropriate to users' needs and a fast rate of improvement.

Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance.
Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation.

On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google, while using your expertise in coding, algorithms, complexity analysis and large-scale system design.

SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success.
Our organization brings together people with a wide variety of backgrounds, experiences and perspectives.
We encourage them to collaborate, think big and take risks in a blame-free environment.

We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.


To learn more:


check out our books on Site Reliability Engineering or read a career profile about why a Software Engineer chose to join SRE.

As an Engineering Manager, you'll lead a team and be responsible for products globally, providing technical leadership to key projects and empowering and developing teams to do the same.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running.

From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible.

We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them.

We keep our networks up and running, ensuring our users have the best and fastest experience possible.
Responsibilities Lead a team of Software/Systems Engineers on projects for users and be directly responsible for uptime.
Own end-to-end availability and performance of key services and build automation to prevent problem recurrence.
Automate response to all non-exceptional service conditions.
Lead by example, mentor the team and establish credibility through quality technical execution.
Manage on-call rotations across continents, using a follow-the-sun model.
Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services.
.

Estimated Salary:
$20 to $28 per hour based on qualifications.

  • New York, New York, United States Betterment Full time

    About the RoleWe are seeking a highly skilled Cloud Reliability Engineer to join our team at Betterment. As a Staff Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based systems.Key ResponsibilitiesDesign and implement scalable and reliable cloud native solutions using AWSDevelop...


  • New York, New York, United States Russell Tobin & Associates Full time

    Job Description:As a Site Reliability Engineer at Russell Tobin & Associates, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure. We are seeking a highly skilled and experienced engineer to join our team and contribute to the design, implementation, and maintenance of our cloud-based systems.Key...


  • New York, New York, United States Instabase Full time

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...


  • New York, New York, United States Kyndryl Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Cloud Infrastructure team at Kyndryl. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based services.Key ResponsibilitiesDesign and Implement Monitoring and Logging Systems: Develop and...


  • New York, New York, United States Hebbia Full time

    About HebbiaHebbia is a cutting-edge technology company that specializes in developing Artificial General Intelligence (AGI) solutions. Our mission is to empower users to collaborate with AI on complex tasks and validate responses, rather than blindly trusting them.Job DescriptionAs a highly skilled Site Reliability Engineer, you will play a critical role in...


  • New York, New York, United States Abnormal Security Full time

    Job OverviewAbnormal Security is seeking a talented Software Engineer II to enhance our Cloud Infrastructure team. This team plays a crucial role in managing our operations within the public cloud, ensuring that our cloud usage is secure, dependable, and efficient while catering to the needs of our engineering teams.Key ResponsibilitiesThe selected candidate...


  • New York, New York, United States Betterment Full time

    About the RoleBetterment is seeking a highly skilled Staff Site Reliability Engineer to join our team. As a Staff Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based systems.Key ResponsibilitiesDesign and implement scalable and reliable cloud-based systems using AWS, Docker,...


  • New York, New York, United States Celonis GmbH Full time

    We're Celonis, the global leader in Process Mining technology and one of the world's fastest-growing SaaS firms. We believe there is a massive opportunity to unlock productivity by placing data and intelligence at the core of business processes - and for that, we need you to join us.**The Team:**Site Reliability Engineering**The Role:**+ You will be part of...


  • New York, New York, United States Celonis GmbH Full time

    About the RoleWe're seeking a highly skilled Senior Cloud Software Engineer to join our Site Reliability Engineering team at Celonis GmbH. As a key member of our team, you will be responsible for designing, implementing, and managing cloud-based applications and platforms that meet the highest standards of reliability and scalability.Key...


  • New York, New York, United States Streaming Talent Full time

    Streaming Talent is seeking a highly skilled Site Reliability Engineer to join our client's US team. As a key member of the Site Reliability Team, you will be responsible for ensuring the smooth operation of the company's Content Delivery Network.The ideal candidate will have a strong background in cloud technologies, with experience working with Kubernetes...


  • New York, New York, United States Diverse Lynx Full time

    About the Role:Diverse Lynx is seeking a highly skilled Cloud Reliability Engineer to join our team. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability and efficiency of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and improve system reliabilityDevelop and maintain...


  • New York, New York, United States FLOAT LLC Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at FLOAT LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our infrastructure, enabling our engineering teams to focus on delivering high-quality software to our customers.Key ResponsibilitiesContinuously...


  • New York, New York, United States Formation Bio Full time

    Advancements in AI and drug discovery are creating more candidate drugs than the industry can progress because of the high cost and time of clinical trials. Recognizing that this development bottleneck may ultimately limit the number of new medicines that can reach patients, Formation Bio, founded in 2016 as TrialSpark Inc., has built technology platforms,...


  • New York, New York, United States CLS Group Full time

    About CLS GroupCLS Group is a pivotal entity within the global foreign exchange (FX) ecosystem, facilitating secure and efficient currency transactions for numerous counterparties. Our systems handle trillions of dollars in currency flows daily, enhancing the safety and cost-effectiveness of FX operations.Our state-of-the-art global settlement...


  • New York, New York, United States CLS Group Full time

    About CLS GroupCLS Group stands as a pivotal entity within the global foreign exchange (FX) landscape. Serving a multitude of counterparties, CLS enhances the safety, efficiency, and cost-effectiveness of FX transactions. Our systems facilitate the movement of trillions of dollars in currency daily.Developed by the market for the market, our unparalleled...


  • New York, New York, United States CLS Group Full time

    About CLS GroupCLS Group stands as a pivotal entity within the global foreign exchange (FX) ecosystem. Our services are leveraged by numerous counterparties, ensuring that FX transactions are executed with enhanced safety, efficiency, and cost-effectiveness. Each day, trillions of dollars in currency transactions flow through our robust systems.Designed by...


  • New York, New York, United States CLS Group Full time

    About CLS GroupCLS Group is a pivotal entity within the global foreign exchange (FX) ecosystem, serving a multitude of counterparties. Our systems facilitate the secure, efficient, and cost-effective flow of trillions of dollars in currency transactions daily.Designed by the market for the market, our unparalleled global settlement infrastructure mitigates...


  • New York, New York, United States IRIS Software Group Full time

    About IRIS Software Group:IRIS Software Group stands as one of the foremost privately owned software enterprises in the UK. For over four decades, we have delivered cutting-edge administrative solutions to a diverse range of clients, including businesses, charitable organizations, and public sector entities. Our workforce has grown to nearly 3000...


  • New York, New York, United States STAND 8 Technology Services Full time

    Job Description**Job Summary:**We are seeking a highly skilled Site Reliability Engineer with expertise in Building Management Systems (BMS) software, specifically Tridium Niagara. The ideal candidate will have a strong focus on setting up, configuring, and troubleshooting BMS software, as well as monitoring and enabling continuous operations.Key...


  • New York, New York, United States DriveWealth Full time

    Job OverviewDriveWealth is a pioneering B2B fintech company committed to making financial independence accessible globally. Our vision is realized through an API-driven platform that empowers partners to deliver seamless investing and trading experiences to clients worldwide, all from their mobile devices.Our technology equips partners with a modern,...