Current jobs related to Site Reliability Engineer - San Francisco - Ellation, Inc.

Site Reliability Engineer

1 month ago

San Francisco, United States Apollo Solutions Full time

Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking artifical inteligence business who are making major developments in how we use AI/ML for gaming/security. They are working closely with government contracts as well as gaming consoles companys and are now searching for an SRE to join their growing team. The Site Reliability...
Site Reliability Engineer

3 weeks ago

San Francisco, United States Bun Full time

Bun is an open-source JavaScript tooling company focused on making programming simpler. We've raised $26 million from top investors in Silicon Valley, are among the top GitHub repositories and have a growing community of 33,000 Discord members.We're hiring an experienced Site Reliability Engineer to scale and maintain the infrastructure that builds and tests...
Site Reliability Engineer

1 month ago

San Francisco, United States Unreal Gigs Full time

Are you passionate about building and maintaining resilient systems that ensure high availability and performance? Do you excel at automating processes, troubleshooting complex issues, and creating systems that scale smoothly? If you're ready to take on the challenge of ensuring reliable, efficient, and secure system operations, our client has the perfect...
Site Reliability Engineer

2 months ago

San Francisco, United States New York Technology Partners Full time

Must Have's in the order of preference.Typical Java/J2EE experience between 6 and 10 yearsApplication Production Support(SRE - Site Reliability Engineering) with 3+ years - Preferably in e-commerce domainHands-on experience in any of the UI Frameworks(AngularJS, VueJS etc) - 1+ years
Site Reliability Engineer

2 months ago

san francisco, United States New York Technology Partners Full time

Must Have's in the order of preference.Typical Java/J2EE experience between 6 and 10 yearsApplication Production Support(SRE - Site Reliability Engineering) with 3+ years - Preferably in e-commerce domainHands-on experience in any of the UI Frameworks(AngularJS, VueJS etc) - 1+ years
Site Reliability Engineer

3 days ago

San Francisco Bay Area, United States Bun Full time

Bun is an open-source JavaScript tooling company focused on making programming simpler. We've raised $26 million from top investors in Silicon Valley, are among the top GitHub repositories and have a growing community of 33,000 Discord members.We're hiring an experienced Site Reliability Engineer to scale and maintain the infrastructure that builds and tests...
Site Reliability Engineer

2 weeks ago

San Francisco, United States Asystem Full time

Particle is a startup based in the San Francisco Bay Area. We are seeking candidates who are self-starters, adaptable, and flexible in a startup environment. As a team of veteran technologists from Twitter, Tesla, Periscope, and more, we are developing a next-generation news platform to redefine your daily intake of news. We value active engagement in...
Site Reliability Engineer

1 month ago

San Francisco, United States Perplexity AI Full time

Perplexity is seeking a Site Reliability Engineer (SRE) to join our small team in revolutionizing the way people search and interact with the internet. You will be responsible for leading the design, implementation, and scaling of the infrastructure and systems that support our web and mobile products. The ideal candidate should have experience in designing...
Site Reliability Engineering Lead

5 days ago

San Francisco, California, United States Springshot Full time

Springshot lives at the intersection between technology and humanity. We assimilate and simplify the complex, striving to provide users with easy-to-use web and mobile interfaces that present the right information at the right time so they can make the right decision or take the right physical action, including through robotics and autonomous machines.This...
Site Reliability Engineer

17 hours ago

San Jose, United States Avance Consulting Full time

Role: Site Reliability EngineerLocation: San Jose, CA - OnsiteDuration: Full Time (Permanent Role)We are seeking a Skied Site Renaulty Engineer (SRE) with expertise in Github Actions, AWS DevOps, nem Charts, and YAML Configuration. The real candidate will be responsible based applications. You will work closely with development teams to implement and manage...
Site Reliability Engineer

21 hours ago

San Jose, United States Avance Consulting Full time

Role: Site Reliability EngineerLocation: San Jose, CA - OnsiteDuration: Full Time (Permanent Role)We are seeking a Skied Site Renaulty Engineer (SRE) with expertise in Github Actions, AWS DevOps, nem Charts, and YAML Configuration. The real candidate will be responsible based applications. You will work closely with development teams to implement and manage...
Senior Site Reliability Engineer

1 month ago

San Francisco, United States Focal Systems Full time

Location: San Francisco - hybrid (1-2 days per week)Salary: $170-190k + stockCompany DescriptionFocal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. We are a Deep Learning first company. Our mission is to automate and optimize brick and mortar retail...
Site Reliability Engineer

2 weeks ago

San Jose, United States EVONA Full time

Site Reliability Engineer (SRE)Location: San Francisco Bay AreaRole Overview:We are seeking a highly skilled Site Reliability Engineer (SRE) to join a dynamic team at a rapidly growing technology company. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of mission-critical systems, while implementing automation...
Site Reliability Engineer

1 month ago

San Jose, United States EVONA Full time

Site Reliability Engineer (SRE)Location: San Francisco Bay AreaRole Overview:We are seeking a highly skilled Site Reliability Engineer (SRE) to join a dynamic team at a rapidly growing technology company. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of mission-critical systems, while implementing automation...
Senior Staff Site Reliability Engineer

2 months ago

San Francisco, United States WEX, Inc. Full time

About the RoleThe WEX Site Reliability Engineering (SRE) team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits Reliability organization which supports our internal...
Site Reliability Engineer

1 week ago

San Jose, California, United States Syntricate Technologies Full time

Job Title: Site Reliability EngineerAbout the Job: Syntricate Technologies is seeking an experienced Site Reliability Engineer to join our team in San Jose, CA. As a key member of our infrastructure team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Responsibilities:Design, implement, and...
Lead DevOps/Site Reliability Engineer

4 weeks ago

San Francisco, United States Indotronix International Corporation Full time

Pay Rate:- W2 Rate $ 61.75 Looking in PST time zone, preferred to be local to SF and willing to go into office occasionally, but okay with Remote (needs to hive high work ethic!) Lead DevOps/Site Reliability Enginee Looking for a resource more senior in the DevOps space, with a leaning toward site reliability engineering. Docker containers,...
Site Reliability Engineer II

4 weeks ago

San Francisco, CA, United States Earnest Current Job Openings Full time

The Site Reliability Engineer II position will report to the Lead Cloud Engineer. As an SRE II Engineer, you will: Set up and maintain comprehensive monitoring, create and refine playbooks, build dashboards, and adopt industry-standard practices to enhance the reliability and resilience of our site and systems. Develop and manage IaC to ensure reliable,...
Sr Site Reliability Engineer

1 month ago

San Francisco, United States Federal Reserve Bank of San Francisco Full time

Company: Federal Reserve Bank of San FranciscoJob Description:While the SF Fed is a Reserve Bank, we're not what you might expect. We're unreserved here. That means we seek new and diverse perspectives. We spark conversations and encourage debate. We build opportunity. We pursue careers that are true to ourselves. We are looking for people who want to help...
Staff Site Reliability Engineer

2 months ago

San Francisco, United States Ellation, Inc. Full time

Who We Are We're a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our collection...

Site Reliability Engineer

2 months ago

San Francisco, United States Ellation, Inc. Full time

Who We Are

We‘re a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our collection of brands.

About the Team

The Site Reliability Engineering (SRE) team is dedicated to ensuring the reliability, scalability, and performance of our data infrastructure. We focus on standardizing and implementing monitoring and alerting across all datastores to track key metrics like errors, latency, and throughput, and to ensure critical systems are covered. Our team also leads efforts to keep databases up-to-date, implements Infrastructure as Code (IaC) for high availability and performance, and automates key processes to enhance operational efficiency.

We lead and evangelize the principle of 100% automation. Additionally, we define and document operational requirements, develop incident response processes, and automate monitoring and compliance checks to maintain a secure and reliable data environment. By continuously improving load testing and optimizing data governance practices, we support the overall health and efficiency of our data systems.

About the Role

Crunchyroll is growing and changing, presenting unique challenges and opportunities to support millions of anime fans around the world. The Data Engineering team provides seamless help to our internal stakeholders, ensuring an exceptional experience for all Crunchyroll fans.

As a Staff Site Reliability Engineer for the Data Engineering team, you will be responsible for maintaining and enhancing the reliability of our data infrastructure. Your work will directly impact the availability and performance of our data services, enabling the organization to better decisions. You will collaborate closely with data engineers, and software engineers to develop and drive 100% automation, best practices for deep monitoring and alerting. This role will report to our Director of Data Engineering. While it is preferred for this role to sit in one of our offices, fully remote is also an option in the United States.

About You

Bachelor‘s degree in Computer Science, Information Technology, or a related field.
12+ years of experience in site reliability engineering, database operations, or a related role with a focus on data platforms, data stores, data operations.
Extensive experience with AWS cloud platform and their data-related services.
Proficiency in monitoring tools (e.g., Datadog, CloudWatch, DevOps Guru, DB Performance Insights).
Proficiency in one or more programming languages (e.g. Python, Java)
Proficiency in automation frameworks (e.g., Terraform, Cloud Formation).
Strong understanding of various performance metrics both at a high level and at a low level like Disk/IO saturation.
Experience in identifying and eliminating the bottlenecks in the system.
Strong understanding of database internals like types of indexes, schemas, query plans.
Strong understanding of database systems (e.g., SQL, NoSQL) and experience in managing large-scale data infrastructures.
Strong understanding and hands-on implementation of CI/CD pipelines and DataOps practices.
Experience with data governance, compliance, and lifecycle management.
Ability to own and execute projects while effectively collaborating with the team to influence and shape the vision of the data engineering organization.

Why you will love working at Crunchyroll

Not only will you get to work with fun, passionate and inspired colleagues, you will also...

Receive a great compensation package including salary plus performance bonus earning potential, paid annually.
Enjoy flexible PTO and time off policies allowing you to take the time you need to be your whole self.
Appreciate the generous medical, dental, vision, STD, LTD, and life insurance options for you and your family.
Take advantage of our health saving account HSA program plus health care and dependent care FSA programs.
Love that we offer an employer match on our 401(k) plan.
Receive employer paid commuter benefit (for eligible employees)
Appreciate the generous support program for new parents
Obtain pet insurance and some of our offices are pet friendly

#LifeAtCrunchyroll #LI-Remote

#J-18808-Ljbffr

Americas

Europe

Asia / Oceania

Africa

Current jobs related to Site Reliability Engineer - San Francisco - Ellation, Inc.

Site Reliability Engineer