Senior Service Reliability Engineer

1 day ago


Aliso Viejo CA United States Sony Playstation Full time

Why PlayStation?

PlayStation isn’t just the Best Place to Play — it’s also the Best Place to Work. Today, we’re recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation5, PlayStation4, PlayStationVR, PlayStationPlus, acclaimed PlayStation software titles from PlayStation Studios, and more.

PlayStation also strives to create an inclusive environment that empowers employees and embraces diversity. We welcome and encourage everyone who has a passion and curiosity for innovation, technology, and play to explore our open positions and join our growing global team.

The PlayStation brand falls under Sony Interactive Entertainment, a wholly-owned subsidiary of Sony Corporation.

Senior Service Reliability Engineer

Future Technology Group

As a part of Sony Computer Entertainment, the Future Technology Group (FTG) is leading the cloud gaming revolution, putting console-quality video games on any device, from TVs to consoles to mobile devices and beyond.

Our Service Reliability Engineering team plays a significant role in delivering on the promise of a great cloud gaming experience to our customers. We do this by influencing design and operational decisions towards the overall stability of the gaming service. Our SREs focus on three main things: overall ownership of production, production code quality, and deployments. The successful candidate will be self-directed and able to participate in the way we make decisions at different levels.

We expect our SREs to have opinions on the state of our service and provide critical feedback during different phases of the operational lifecycle. We are engaged throughout the S/W development lifecycle, ensuring the operational readiness and stability.

Requirements

  • Minimum of 7+ years working experience in Software Development and/or Linux Systems Administration role.
  • Strong interpersonal, written and verbal communication skills.
  • Available to be scheduled in on-call rotation.

Skills & Knowledge

  • Proficient as a Linux Production Systems Engineer, with experience managing large scale Web Services infrastructure.
  • Development experience in one or more of the following programming languages:
    • Python (preferred)
    • Bash, Go, Java, C++, or Rust
  • In addition, experience with at least 3 of the following topics:
    • Distributed data storage at scale (Hadoop, Ceph)
    • NoSQL at scale (MongoDB, Redis, Cassandra)
    • Data Aggregation technologies (ElasticSearch, Kafka)
    • Scaling and running traditional RDBMS (PostgreSQL, MySQL) with High Availability
    • Monitoring & Alerting (Prometheus, Grafana), and Incident Management toolsets
    • Kubernetes and/or AWS (deployment and management)
    • Software Distribution (Package management and distribution at scale)
    • Configuration Management (ansible, saltstack, puppet, chef)
    • S/W Performance analysis and load testing (QA or SDET experience: a plus)

Responsibilities

  • Taking a leadership role in ongoing improvements in Reliability and Scalability
  • Work closely with SRE Management to define KPIs, processes and drive continuous improvement
  • Influence the architecture and implementation of solutions within the division
  • Mentor more junior SRE staff and enable them for success
  • Act as a voice to represent SRE in the wider organization
  • Represent the operational scalability of solutions in the wider division
  • Lead small-scale projects from inception to implementation
  • Design platform-wide solutions and provide technical leadership during their implementation
  • Demonstrate a high-level of organizational skills and initiative in the role
#J-18808-Ljbffr

  • Aliso Viejo, CA, United States Sony Interactive Entertainment Full time

    Why PlayStation? PlayStation isn't just the Best Place to Play - it's also the Best Place to Work. Today, we're recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation5, PlayStation4, PlayStationVR, PlayStationPlus, acclaimed PlayStation software titles from PlayStation Studios, and...


  • Sunnyvale, CA, United States Tbwa ChiatDay Inc Full time

    Figure is an AI Robotics company developing a general purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in Sunnyvale, CA and require 5 days/week in-office collaboration. We are looking for a Senior Reliability Test Engineer in charge of designing and executing test...


  • Aliso Viejo, California, United States PlayStation Global Full time

    At PlayStation Global, we're shaping the future of gaming. As a Senior Site Reliability Engineer, you'll play a crucial role in ensuring the stability and scalability of our cloud gaming services.About UsWe're not just a company - we're a community that empowers employees and celebrates diversity. Our mission is to deliver unparalleled entertainment...


  • Los Angeles, CA, United States Management Recruiters of Raleigh Full time

    Our client, a Global Petrochemical & Plastics Company, has an excellent opportunity in its world-class ethane cracker and polymers facility for a Senior Instrumentation Reliability Engineer.This brand new, state of the art facility is among the largest of its kind in the US and is one of the most extensively instrumented facilities in the world. The site...


  • Aliso Viejo, CA, United States Sony Interactive Entertainment Full time

    Why PlayStation? PlayStation isn't just the Best Place to Play - it's also the Best Place to Work. Today, we're recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation5, PlayStation4, PlayStationVR, PlayStationPlus, acclaimed PlayStation software titles from PlayStation Studios, and...


  • Aliso Viejo, United States TBWA\Chiat\Day Full time

    Senior Software Engineer (Linux/FreeBSD Network Driver) Why PlayStation? PlayStation isn’t just the Best Place to Play — it’s also the Best Place to Work. Today, we’re recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation5, PlayStation4, PlayStationVR, PlayStationPlus, acclaimed...


  • Aliso Viejo, CA, United States Tbwa ChiatDay Inc Full time

    Senior Software Engineer (Embedded Software) Why PlayStation? PlayStation isn’t just the Best Place to Play — it’s also the Best Place to Work. Today, we’re recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation5, PlayStation4, PlayStationVR, PlayStationPlus, acclaimed...


  • Richardson, TX, United States Celestica Full time

    SummaryThe Senior Reliability Engineer, works in cross functional teams with designers, customers and manufacturing engineering and project leaders to ensure products designed can meet reliability specifications. Define the reliability testing strategy, reliability test plan and conduct tests. Complete a stress based MTBF analysis of products, thus providing...

  • Reliability Engineer

    1 month ago


    Goleta, CA, United States Raytheon Full time

    Date Posted:2024-10-04Country:United States of AmericaLocation:CA602: Goleta (RVS) Bldg B Cortona Drive Building B01, Goleta, CA, 93117 USAPosition Role Type:OnsiteAt Raytheon, the foundation of everything we do is rooted in our values and a higher calling - to help our nation and allies defend freedoms and deter aggression. We bring the strength of more...


  • Chicago, IL, United States WEX Inc. Full time

    Senior Staff Site Reliability Engineer Apply to locations: Chicago, IL; Bay Area, CA; San Francisco, CA. About the Role The WEX Site Reliability Engineering (SRE) team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and...


  • Aliso Viejo, California, United States RxSight, Inc. Full time

    About RxSight, Inc.RxSight is a leading ophthalmic medical technology corporation headquartered in California. We are dedicated to revolutionizing the premium cataract surgery experience by providing innovative solutions for surgeons and patients.Job DescriptionThe Senior Software Quality Assurance Engineer for Medical Devices will play a crucial role in...


  • Mountain View, CA, United States VLink Inc Full time

    Senior Site Reliability Engineer- Only local to Mountain View, CA or Bellevue, WAOnly USC/GC/EAD- W2 onlyNO C2CContractRemoteJob Description:Primary:Ability to code in Python or GoLinux Admin (System Administration & Network Configuration)Debugging & Troubleshooting (Application and Infrastructure) production performance issuesKnowledge of MQ (Message Queue...


  • Chicago, IL, United States WEX, Inc. Full time

    The WEX Site Reliability Engineering (SRE) team is seeking an entry-level Site Reliability Engineer Level 1 who is passionate about learning and growing in the field of software development and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits...


  • Sunnyvale, CA, United States Apple Inc. Full time

    To view your favorites, sign in with your Apple Account. Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. The people here at Apple don’t just create products —...


  • Boston, MA, United States Wasabi Technologies Inc. Full time

    At Wasabi, we’re a proven collection of pioneers, visionaries and disruptive doers. We see things differently than our competitors, and we make our mark in the industry by challenging the norm and delivering the unexpected and improbable. We’re a fast-growing company taking the Cloud Storage industry by storm and recognized as one of the best places to...


  • Redmond, WA, United States Amazon Full time

    Senior Reliability Engineer, Project Kuiper Job ID: 2768100 | Amazon Kuiper Manufacturing Enterprises LLC Project Kuiper is an initiative to increase global broadband access through a constellation of 3,236 satellites in low Earth orbit (LEO). Its mission is to bring fast, affordable broadband to unserved and underserved communities around the world. Project...


  • Cupertino, CA, United States Apple Full time

    Senior Site Reliability Engineer, Object Storage Cupertino, California, United States Software and Services The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And...


  • Cupertino, CA, United States Apple Inc. Full time

    The Media Platforms SRE team under the Apple Service Engineering division is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. They do it on a massive scale, meeting Apple’s high expectations with high...


  • Cupertino, CA, United States Apple Inc. Full time

    Senior Site Reliability Engineer, Object Storage The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on a massive scale, meeting Apple’s high...

  • Reliability Engineer

    4 weeks ago


    Goleta, CA, United States Raytheon Careers Full time

    Goleta (RVS) Bldg B01 6825 Cortona Drive Building B01, Goleta, CA, 93117 USA*Position Role Type:* OnsiteAt Raytheon, the foundation of everything we do is rooted in our values and a higher calling - to help our nation and allies defend freedoms and deter aggression. We bring the strength of more than 100 years of experience and renowned engineering...