Principal Software Engineer, Site Reliability Engineering

1 day ago


San Francisco CA United States Salesforce, Inc. Full time
Software Engineering PMTS

remote type: Office Tech-Flexible

locations: California - San Francisco, Washington - Bellevue

time type: Full time

posted on: Posted 3 Days Ago

job requisition id: JR266855

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

About Salesforce

We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good – you’ve come to the right place.

Job Details

(Lead/Principal/Architect) Software Engineer - Availability Engineering
Our Availability engineering teams are responsible for driving ‘best in class’ availability. You will work with delivery teams deploying customer-facing/supporting software across a multi-substrate engineering platform that collectively ships hundreds of features to production for tens of millions of users across all industries every day. Our users count on our applications and platforms to be highly reliable, lightning fast, supremely secure, and to preserve all of their customizations and integrations every time we ship. You will need deep experience with concurrency, large scale systems, proficiency with solving real-world data management challenges, a strong understanding of how to craft solutions that are highly available, and a proven ability to design, develop, and optimize the core back-end systems.

What you’ll be doing:

  • As part of a specialist unit focused on availability and resilience, you will embed with delivery teams, acting in a Lead capacity, creating bandwidth and prioritizing a focus on corrective and proactive availability measures.
  • You will be contributing to designing, developing, debugging, and operating resilient applications and platforms deployed across distributed systems that run across thousands of compute nodes in multiple data centers.
  • You will champion resiliency best practices; Observability tool integration, horizontal/vertical sizing & auto-scaling, release rollback & recovery workflows, integration tests and validation procedures for applications running on self-host infra as well as public cloud platforms such as AWS, GCP, Azure & Alibaba.
  • Using and contributing to open source technology (Spinnaker, Zookeeper, etc.).
  • Developing/leverage Infrastructure-as-Code using Terraform.
  • Building/integrating with APIs and microservices deployed on containerization frameworks such as Kubernetes, Docker, Mesos, etc.
  • Resolving complex technical issues and driving innovations that improve system availability, resilience, and performance.
  • You have experience balancing live runtime management, feature delivery, and retirement of technical debt.
  • Participate in the team’s on-call rotation to address complex problems in real-time and keep services operational and highly available.

Required Skills:

  • A related technical degree required (master's preferred).
  • 15+ years of hands-on software development experience.
  • 5+ years in a Tech Lead, Principal or Architect capacity.
  • Ability to reverse engineer solutions via independent code and architecture review, envision, define and then contribute to delivery of availability improvement refactoring projects.
  • Mastery of one or more object-oriented delivery with languages such as Java, Golang, APEX, Python.
  • Deep experience working with core web technologies: HTTP, JSON, REST, XML.
  • Proficiency with databases including Oracle or other relational and/or NoSQL solutions.
  • Experience owning and operating multiple instances of a critical service.
  • Running critical infrastructure services; monitoring, alerting, logging, tracing, and reporting.
  • Subject matter expertise on Service ownership best practices, SLO/I/A definition, driving proactive operational awareness and experience with Incident/Problem management.
  • Thorough knowledge of Agile development methodology with experience in both Test/Behavioral Driven Development practices.

Accommodations

If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.

Posting Statement

At Salesforce we believe that the business of business is to improve the state of our world. Each of us has a responsibility to drive Equality in our communities and workplaces. We are committed to creating a workforce that reflects society through inclusive programs and initiatives such as equal pay, employee resource groups, inclusive benefits, and more. Learn more about Equality at and explore our company benefits at .

Salesforce is an Equal Employment Opportunity and Affirmative Action Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Salesforce does not accept unsolicited headhunter and agency resumes. Salesforce will not pay any third-party agency or company that does not have a signed agreement with Salesforce .

Salesforce welcomes all.

Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records.

For Washington-based roles, the base salary hiring range for this position is $204,400 to $296,400. For California-based roles, the base salary hiring range for this position is $223,000 to $323,400. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, benefits. More details about our company benefits can be found at the following link: .

#J-18808-Ljbffr

  • Sunnyvale, CA, United States Microsoft Full time

    There has never been a more exciting time to be working in healthcare at Microsoft. Our Health & Life Sciences Solutions organization is an interdisciplinary team of product managers, designers, engineers, and clinicians who are designing, developing and deploying next-generation healthcare solutions powered by the Microsoft Cloud for healthcare...


  • San Francisco, CA, United States salesforce Full time

    To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category: Software Engineering About Salesforce: We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every...


  • San Francisco, United States salesforce Full time

    To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.Job Category: Software EngineeringAbout Salesforce:We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every...


  • San Francisco, United States Apollo Solutions Full time

    Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking artifical inteligence business who are making major developments in how we use AI/ML for gaming/security. They are working closely with government contracts as well as gaming consoles companys and are now searching for an SRE to join their growing team. The Site Reliability...


  • San Francisco, United States salesforce Full time

    To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.Job Category: Software EngineeringAbout Salesforce:We’re Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every...


  • San Francisco, United States WEX Full time

    The WEX Site Reliability Engineering (SRE) team is seeking an entry-level Site Reliability Engineer Level 1 who is passionate about learning and growing in the field of software development and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits...


  • Newton, MA, United States Intelliswift Software Full time

    Title : Site Reliability EngineerLocation : Newton, MA HybridDuration : 6 MonthsPay rate : $38.73 per hour on W2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and...


  • san francisco, United States Understanding Recruitment Full time

    Principal Software EngineerUS Tech start-up - Fully Remote$180k + BenefitsWe're excited to share an opportunity with a fast-growing, heavily-backed live shopping platform based on the West Coast, currently valued at over $100M!They're on the lookout for a Principal Software Engineer with expertise in Full Stack Engineering (React.js/Node.js) and a focus on...


  • San Francisco, United States Understanding Recruitment Full time

    Principal Software EngineerUS Tech start-up - Fully Remote$180k + BenefitsWe're excited to share an opportunity with a fast-growing, heavily-backed live shopping platform based on the West Coast, currently valued at over $100M!They're on the lookout for a Principal Software Engineer with expertise in Full Stack Engineering (React.js/Node.js) and a focus on...


  • San Francisco, CA, United States Withorb Full time

    Mission Orb is on an ambitious mission to provide every business with the infrastructure to unlock their revenue. Best-in class businesses find ways to effectively align their monetization to product usage—whether that's through seats, consumption, feature limits, or usage-based tiers. Orb brings that opportunity to every software company. We are...


  • San Francisco, United States Ellation, Inc. Full time

    Who We AreWe‘re a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...


  • San Francisco, United States Ellation, Inc. Full time

    Who We AreWe‘re a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...


  • Sunnyvale, CA, United States Apple Inc. Full time

    Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the...


  • San Francisco, CA, United States Autodesk, Inc. Full time

    Principal Software Engineer page is loaded Principal Software Engineer Apply locations San Francisco, CA, USA California, USA - Remote time type Full time posted on Posted 30+ Days Ago job requisition id 24WD81607 Job Requisition ID # 24WD81607 Position Overview Autodesk's pre-construction bidding application is powered by the builder's network, a...


  • Sunnyvale, CA, United States Apple Inc. Full time

    Software Engineering Manager, Site Reliability Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can...


  • San Francisco, CA, United States Mistral AI Full time

    About Mistral At Mistral AI, we are a tight-knit, nimble team dedicated to bringing our cutting-edge AI technology to the world. Our mission is to make AI ubiquitous and open. We are creative, low-ego, team-spirited, and have been passionate about AI for years. We hire people who thrive in competitive environments, because they find them more fun to work...


  • San Diego, CA, United States Cubic Corporation Full time

    Hello! To apply to the job you were interested in, please create a Workday account. If you already have an account, please sign in. We look forward to learning more about you! Principal Software Engineer Locations: San Diego, California Time Type: Full time Posted On: Posted 3 Days Ago Job Requisition ID: REQ_41191 Business Unit: Cubic Defense Company...


  • Chicago, IL, United States WEX, Inc. Full time

    The WEX Site Reliability Engineering (SRE) team is seeking an entry-level Site Reliability Engineer Level 1 who is passionate about learning and growing in the field of software development and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits...


  • Del Mar, CA, United States Softworld, a Kelly Company Full time

    Job Title: 80553 - Principal Software EngineerJob Location: San Diego CA 92121Onsite Requirements:Looking for someone with end-to-end ownership of the software development processEngineers to be responsible for building, testing, deploying, and operating the services they developSomeone with strong Python skillsNeed for in-depth AWS experience, including...


  • San Jose, CA, United States Zscaler, Inc. Full time

    Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...