Current jobs related to Senior Site Reliability Engineer - San Diego - Platform Science


  • San Francisco, California, United States Infused Solutions Full time

    Senior Site Reliability EngineerInfused Solutions is seeking a highly skilled Senior Site Reliability Engineer to join their IT infrastructure team. Our client is a market leader in the San Francisco area, and we are looking for a talented individual with expertise in Microsoft Azure and a strong background in software engineering.Key Responsibilities:Design...


  • San Francisco, California, United States Tampa Gardens Senior Living Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Team. As a key member of our team, you will be responsible for deploying, managing, optimizing, and upgrading the systems that run Sight Machine software.You will work closely with our Development Engineering team to ensure the stability,...


  • San Francisco, California, United States smartrecruiters - JobBoard Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for leading a team of site reliability engineers who work to keep Twitter reliable and scalable.Responsibilities:Lead a team of site reliability engineers to...


  • San Diego, California, United States Intellipro Group Full time

    Job Title: Senior Bilingual Site Reliability EngineerWe are seeking a highly skilled Senior Bilingual Site Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for designing and implementing scalable solutions to meet our expanding data and business needs.Responsibilities:Collaborate with cross-functional teams to...


  • San Francisco, California, United States Outdefine Full time

    About the JobOutdefine is seeking a skilled Senior Site Reliability Engineer to join our team. As a key member of our Infrastructure team, you will be responsible for ensuring the reliability and scalability of our blockchain-based services.Key ResponsibilitiesRun internal Chainlink and Blockchain nodesProvide enterprise-level blockchain connectivity to...


  • San Francisco, California, United States Autodesk Full time

    {"Responsibilities": "As a Senior Site Reliability Engineer at Autodesk, you will be responsible for leading the development and maintenance of robust cloud infrastructure to support millions of daily users. You will automate processes to improve system reliability and introduce best practices in continuous integration and deployment. You will also lead...


  • San Francisco, California, United States SingleStore Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at SingleStore. As a key member of our engineering team, you will be responsible for designing, building, and running elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments.Key Responsibilities:Help drive...


  • San Francisco, California, United States Infused Solutions Full time

    Job Title: Senior Site Reliability EngineerWe are seeking an experienced Senior Site Reliability Engineer to join our team at Infused Solutions. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable, high-availability infrastructure for our platform.Key Responsibilities:Architect and manage...


  • San Francisco, California, United States Twitter Full time

    Job DescriptionAt Twitter, we're committed to delivering a seamless and reliable experience for our users. As a Senior Site Reliability Engineer, you'll play a critical role in ensuring the stability and scalability of our services.ResponsibilitiesLead a team of site reliability engineers to design, implement, and maintain scalable and reliable...


  • San Francisco, California, United States Astranis Full time

    Astranis MissionAstranis is revolutionizing global connectivity by developing the next generation of smaller, more cost-effective spacecraft. Our mission is to bridge the digital divide and connect the four billion people worldwide who lack internet access.Job SummaryWe are seeking a highly motivated and experienced Senior Site Reliability Engineer to join...


  • San Diego, California, United States Qualcomm Full time

    Job Title: Site Reliability EngineerJoin Qualcomm as a Site Reliability Engineer and be part of a highly collaborative team focused on provisioning and maintaining infrastructure and services with stability, sustainability, and security always on your mind.About the RoleWe are seeking a skilled Site Reliability Engineer to join our team. As a Site...


  • San Diego, California, United States Qualcomm Full time

    Job Title: Site Reliability EngineerAt Qualcomm, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the stability, scalability, and security of our infrastructure and services.Key Responsibilities:Monitor system health and detect anomaliesInvestigate and...


  • San Francisco, California, United States RevenueCat Full time

    About RevenueCat:RevenueCat is a mission-driven, remote-first company that is building the standard for mobile subscription infrastructure. We're a close-knit, product-driven team that strives to live our core values: Customer Obsession, Always Be Shipping, Own It, and Balance.We're looking for a Senior Site Reliability Engineer to help design, build, and...


  • San Francisco, California, United States Twitter Full time

    Job DescriptionAt Twitter, we're committed to delivering a seamless and reliable experience for our users. As a Senior Site Reliability Engineer, you'll play a critical role in ensuring the stability and scalability of our infrastructure.ResponsibilitiesLead a team of site reliability engineers to design, implement, and maintain scalable and reliable...


  • San Diego, California, United States Qualcomm Full time

    Job Title: Site Reliability EngineerAt Qualcomm, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the stability, sustainability, and security of our infrastructure and services.Key Responsibilities:Monitor system health and detect anomalies to prevent service...


  • San Jose, California, United States F5 Full time

    Job SummaryF5 is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you will play a pivotal role in ensuring the reliability and scalability of our distributed cloud product.Key ResponsibilitiesDesign and implement automation solutions to reduce toil and improve operational efficiencyParticipate in...


  • San Diego, California, United States Insight Global Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will play a critical role in ensuring the high availability and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and highly available cloud...


  • San Diego, California, United States Insight Global Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will play a critical role in ensuring the high availability and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and highly available cloud...


  • San Diego, California, United States Commserve Technologies Inc Full time

    Job Title: Site Reliability EngineerAt Commserve Technologies Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our enterprise-level applications.Key Responsibilities:Configure, architect, and maintain...


  • San Diego, California, United States Commserve Technologies Inc Full time

    Job Title: Site Reliability EngineerAt Commserve Technologies Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our enterprise-level applications.Key Responsibilities:Configure, architect, and maintain...

Senior Site Reliability Engineer

2 months ago


San Diego, United States Platform Science Full time

Who We Are At Platform Science, we’re working to connect everything that moves. Founded in 2015, we are an open IoT platform that partners with innovative fleets, application developers, vehicle manufacturers, and equipment providers in the transportation industry to deliver revolutionary solutions to supply chain professionals across the globe. Our employees are an engaging, diverse group of people who believe in the power of great ideas. We hire people with different experiences and perspectives to build a company culture that fuels growth through innovation. We value thoughtful actions and empathy for others. We approach challenges with resiliency and creativity, while encouraging transparency because, no matter our backgrounds or responsibilities, we are one team. About the Role We are looking for a qualified Senior SRE to join our team in San Diego, CA (or remote). You will be hired to solve operational problems and provide support to development teams for critical business applications in production. Our focus is to ensure reliability in all production services and enable dev teams to measure their reliability to effectively make decisions. The SRE team has the unique opportunity to work with all aspects of our platform. We run entirely in the cloud i.e. AWS, Azure and GCP. Our applications and services are containerized and serverless. If you’re excited about learning and supporting new technologies and many different types of products (including mobile apps, hardware, websites, messaging queues, serverless pipelines, and more), and working with an incredibly talented team, then this is the position for you As a Senior SRE, you have a software development background or systems background with strong coding skills. Ideal candidates want to deeply understand how our systems work from the infrastructure level, their dependencies to other systems, to the customer experience, and how to mitigate risk. You are comfortable with giving and taking technical direction. You are a great communicator and self-starter who strives to make the company and our technologies better. Essential Responsibilities The Sr SRE is expected to develop and enhance the Continuous Integration/Continuous Deployment (CI/CD) pipelines, along with refining release management processes and associated toolsets. Maintain Helm charts to streamline application deployment and management. Establish standardized observability solutions to empower development teams in efficiently managing their applications. Lead the effort in promoting and prioritizing reliability, driving achievement of uptime goals, and mentoring colleagues in SRE best practices. Conduct comprehensive Production Readiness Reviews, working with teams to identify and establish Service Level Indicators and Service Level Objectives (SLIs/SLOs), and ensure high-quality and dependable services. Design and develop software solutions to address operational challenges effectively to improve system stability and reliability. Fulfill on-call duties, providing expert support to development teams for mission-critical applications in production environments. Improve the resiliency of applications and systems using chaos engineering. Education and Experience Possess 5+ years of hands-on experience in SRE or Platform Engineering roles. Demonstrated expertise (2+ years) with automation technologies like Jenkins, ArgoCD, or similar. Experience with Kubernetes (2+ years), Helm, and Docker within production environments. Proficiency with current software development lifecycle (SDLC) concepts and best practices, CI/CD pipelines, and test-driven development. Experience with AWS, encompassing proficiency in EKS, IAM, autoscaling, networking, and load balancing/request routing in a production environment. Proficient in Python, Bash, Nodejs, and/or Go. Proficient with distributed tracing methodologies and observability tools such as Prometheus, ELK, or Datadog. Strong emphasis on documentation and fostering knowledge-sharing practices within the team and organization. Track record of successfully training and mentoring engineers. Proven expertise in optimizing performance and managing costs within cloud environments. Sound understanding of SLI/SLO concepts and adherence to SRE best practices. Bachelors in Computer Science or related field. Platform Science Benefits Highlights The company offers various benefits to regular, full-time employees including: Medical, dental, and vision insurance. Short-term and long-term disability insurances. AD&D and life insurance. 401k plan. Paid vacation, sick leave and holidays. Six weeks of paid parental leave. #J-18808-Ljbffr