Site Reliability Engineer

1 month ago


San Mateo, California, United States 2K Full time
Who We Are

Founded in 2005, 2K Games is a global video game company, publishing titles developed by some of the most influential game development studios in the world. Our studios, responsible for developing 2K's portfolio of world-class games across multiple platforms, include Visual Concepts, Firaxis, Hangar 13, CatDaddy, Cloud Chamber, and HB Studios. Our portfolio of titles is expanding due to our global strategic plan, building and acquiring exciting studios whose content continues to inspire all of us. 2K publishes titles in today's most popular gaming genres, including sports, shooters, action, role-playing, strategy, casual, and family entertainment.

Our team of engineers, marketers, artists, writers, data scientists, producers, thinkers, and doers are the professional publishing stewards of our growing library of critically acclaimed franchises such as NBA 2K, Battleborn, BioShock, Borderlands, The Darkness, Mafia, Sid Meier's Civilization, WWE 2K, and XCOM.

At 2K, we pride ourselves on creating an inclusive work environment, which means encouraging our teams to Come as You Are and do your best work. We are dedicated to diversity and inclusion, and want our community of candidates to reflect this commitment. We encourage all qualified applicants to explore our global positions.

2K is headquartered in Novato, California and is a wholly owned label of Take-Two Interactive Software, Inc. (NASDAQ: TTWO).

About the Team: Site Reliability Engineering (SRE)

The 2K Site Reliability team is responsible for the operations and infrastructure of all consumer-facing production systems and developer-facing systems at 2K Games, including NBA 2K game services, customer-facing account services, and websites. This team handles systems and services spanning multiple datacenters, both terrestrial and cloud-based.

What We Need

We are looking for an expert engineer who is passionate about building multi-datacenter infrastructure and services. Robust systems and problem-solving skills are required as we develop solutions for game studios and support data centers around the world alongside a group of outstanding engineers. In this role, you will collaborate with network engineers, systems architects, and development staff to support our gamers and the needs of the business.

What You Will Do
  • Build and operate highly resilient systems in a global multi-datacenter and cloud environment serving game and consumer services
  • Develop tools for the management and automation of the systems and service infrastructure
  • Define and implement standards that will impact systems, services, and multiple software environments
  • Diagnose and resolve technical issues from both internal and external customers and drive improvements to prevent them from recurring
  • Participate in Site Reliability Engineering's on-call rotation
Who We Believe Will Be an Outstanding Fit

You are eager to work in a fast-paced environment with other highly skilled engineers who are passionate about service availability and health.

You are moved by the idea of building data center infrastructure services from greenfield to implementation.

Required Qualifications
  • 6+ years of demonstrated influence across one or more teams for large scale projects that drive impact and improvement across the organization
  • 6+ years of experience in an SRE role for online services in a multi-region, multi-cloud environment with specific experience in reliability and resiliency
  • 6+ years of developing tools for automation of processes or augmenting off the shelf tool functionality
  • 6+ years of AWS and/or GCP cloud experience running highly elastic mission critical workloads
  • 6+ years of coding experience in at least one of Python, Ruby, Java, or Go and a good understanding of code management
  • 6+ years of experience using Infrastructure as Code tools like Terraform, Pulumi, or others
  • Extensive knowledge of software build, test, and deploy processes using Git, Jenkins, Puppet, Ansible, Docker/containers, and Kubernetes
  • Experience with system analysis and troubleshooting
  • Ability to serve as a mentor to junior engineers and provide technical leadership to the organization
Bonus Points
  • Prior hands-on experience running large-scale multiplayer video games
  • Experience designing and crafting software for systems and network automation
  • Debugging, code optimization, and routine task automation skills
  • Demonstrated ability to decompose sophisticated problems. Ability to engage in lateral investigations.

As an equal opportunity employer, we are committed to ensuring that qualified individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform their essential job functions, and to receive other benefits and privileges of employment. Please contact us if you need reasonable accommodation.

Please note that 2K Games and its studios never use instant messaging apps or personal email accounts to contact prospective employees or conduct interviews and when emailing, only use accounts.

The pay range for this position in California at the start of employment is expected to be between $75,500 and $111,740 per year. However, base pay offered is based on market location, and may vary further depending on individualized factors for job candidates, such as job-related knowledge, skills, experience, and other objective business considerations. Subject to those same considerations, the total compensation package for this position may also include other elements, including a bonus and/or equity awards, in addition to a full range of medical, financial, and/or other benefits. Details of participation in these benefit plans will be provided if an employee receives an offer of employment. If hired, employee will be in an 'at-will position' and the company reserves the right to modify base salary (as well as any other discretionary payment or compensation or benefit program) at any time, including for reasons related to individual performance, company or individual department/team performance, and market factors.



