Principal Site Reliability Engineer

2 weeks ago


Palo Alto, United States JPMorgan Chase & Co. Full time

Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact.As a Principal Site Reliability Engineer at JPMorgan Chase within the Enterprise Technology, AI/ML & Data Platforms division, you will utilize your expertise to create innovative solutions that improve critical incident management and streamline the software development lifecycle throughout the organization. Your role will involve overseeing, designing, and deploying infrastructure components to enhance reliability and ensure operational efficiency. Job responsibilitiesArchitect and implement observability platforms and tools for proactive detection and continuous improvement.Lead the design and development of core observability services, including metrics pipelines and log aggregation.Leverage modern technologies such as Open Telemetry and AI/ML for anomaly detection and automated insights.Collaborate with engineering and SRE teams to define service-level objectives (SLOs) and error budgets.Provide technical leadership and mentorship to engineering teams, ensuring best practices in system design.Champion observability as a first-class concern in the software development lifecycle.Influence platform strategy and roadmap through deep technical insight and alignment with business priorities.Write advanced documentation and create executive presentations that translate technical issues into business impact.Participate in industry professional forums and monitor relevant industry technologies and standards.Lead medium to large projects by bringing together the proper perspective and integrating feedback from team members.Participate in support responsibilities for coverage of critical applications.Required qualifications, capabilities, and skillsFormal training or certification on site reliability engineering concepts and 10+ years applied experience.Ability to determine how each system relates to each other and build automation to improve reliability.Experience with translating research, analysis, and tests into business recommendations.Ability to balance and be accountable for the work of multiple architects and designers.Understands and leads partnerships across job functions to develop efficient systems.Engages team members and expresses complex ideas with appropriate level of detail, while providing constructive feedback.Self-motivated and able to work well under pressure with minimal supervision.Ability to tackle a problem by using a logical, systematic, sequential approach.Preferred qualifications, capabilities, and skillsExperience with cloud-native instrumentation and streaming data platforms.Influence technology and policy decisions while fostering commitment and confidence in team members.Develop effective solutions and analyze competitive positions by considering market trends.Support the introduction of innovative methods and communicate clearly to persuade audiences.Demonstrate concern and meet the needs of both internal and external customers.#LI-RB3



  • Palo Alto, United States JPMorgan Chase & Co. Full time

    Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact.As a Principal Site Reliability Engineer at JPMorgan Chase within the Enterprise Technology, AI/ML & Data Platforms division, you will utilize your expertise...


  • Palo Alto, United States JPMorganChase Full time

    Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact.As a Principal Site Reliability Engineer at JPMorgan Chase within the Enterprise Technology, AI/ML & Data Platforms division, you will utilize your expertise...


  • Palo Alto, United States Xai Full time

    About xAIxAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational...


  • Palo Alto, United States FLUIX Full time

    FLUIX is building the AI operating system that plans, designs, and optimizes AI infrastructure. We are based in Silicon Valley. We specialize in providing AI-driven solutions for data centers and power providers, leveraging cutting-edge Machine Learning (ML) and Artificial Intelligence (AI) technologies. Our mission is to double America’s compute capacity...


  • Palo Alto, CA, United States Xai Full time

    Job Description Job Description About xAI xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate...


  • Palo Alto, CA, United States Xai Full time

    Job Description Job Description About xAI xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate...


  • Palo Alto, United States Theklicker Full time

    Company Description theklicker is an online platform specializing in electronic product price comparison, enabling users to browse prices across multiple booking sites effortlessly. We are dedicated to being a one-stop solution for purchasing electronic products. With a focus on delivering the best user experience, theklicker empowers users to make informed...


  • Palo Alto, United States Xai Full time

    About xAIxAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational...


  • Palo Alto, United States Archetype AI Full time

    Get AI-powered advice on this job and more exclusive features. About Archetype AI Archetype AI is developing the world's first AI platform to bring AI into the real world. Formed by an exceptionally high-caliber team from Google, Archetype AI is building a foundation model for the physical world, a real-time multimodal LLM for real life, transforming...


  • Palo Alto, California, United States xAI Full time

    About xAIxAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational...