Principal Software Engineer, Site Reliability Engineering
1 day ago
remote type: Office Tech-Flexible
locations: California - San Francisco, Washington - Bellevue
time type: Full time
posted on: 3 Days Ago
job requisition id: JR266855
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.
About Salesforce
We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good – you’ve come to the right place.
Job Details
(Lead/Principal/Architect) Software Engineer - Availability Engineering
Our Availability engineering teams are responsible for driving ‘best in class’ availability. You will work with delivery teams deploying customer-facing/supporting software across a multi-substrate engineering platform that collectively ships hundreds of features to production for tens of millions of users across all industries every day. Our users count on our applications and platforms to be highly reliable, lightning fast, supremely secure, and to preserve all of their customizations and integrations every time we ship. You will need deep experience with concurrency and large-scale systems, proficiency in solving real-world data management challenges, a strong understanding of how to craft highly available solutions, and a proven ability to design, develop, and optimize core back-end systems.
What you’ll be doing:
- As part of a specialist unit focused on availability and resilience, you will embed with delivery teams, acting in a Lead capacity, creating bandwidth and prioritizing a focus on corrective and proactive availability measures.
- You will be contributing to designing, developing, debugging, and operating resilient applications and platforms deployed across distributed systems that run across thousands of compute nodes in multiple data centers.
- You will champion resiliency best practices: observability tool integration, horizontal/vertical sizing and auto-scaling, release rollback and recovery workflows, and integration tests and validation procedures for applications running on self-hosted infrastructure as well as public cloud platforms such as AWS, GCP, Azure, and Alibaba.
- Using and contributing to open source technology (Spinnaker, Zookeeper, etc.).
- Developing and leveraging Infrastructure-as-Code using Terraform.
- Building/integrating with APIs and microservices deployed on containerization frameworks such as Kubernetes, Docker, Mesos, etc.
- Resolving complex technical issues and driving innovations that improve system availability, resilience, and performance.
- Balancing live runtime management, feature delivery, and retirement of technical debt.
- Participating in the team’s on-call rotation to address complex problems in real time and keep services operational and highly available.
Required Skills:
- A related technical degree required (master's preferred).
- 15+ years of hands-on software development experience.
- 5+ years in a Tech Lead, Principal or Architect capacity.
- Ability to reverse engineer solutions via independent code and architecture review, and to envision, define, and then contribute to the delivery of availability-improvement refactoring projects.
- Mastery of object-oriented delivery in one or more languages such as Java, Golang, APEX, or Python.
- Deep experience working with core web technologies: HTTP, JSON, REST, XML.
- Proficiency with databases including Oracle or other relational and/or NoSQL solutions.
- Experience owning and operating multiple instances of a critical service.
- Experience running critical infrastructure services: monitoring, alerting, logging, tracing, and reporting.
- Subject matter expertise on Service ownership best practices, SLO/I/A definition, driving proactive operational awareness and experience with Incident/Problem management.
- Thorough knowledge of Agile development methodology, with experience in both Test-Driven and Behavior-Driven Development practices.
Accommodations
If you require assistance due to a disability when applying for open positions, please submit a request via this Accommodations Request Form.
Posting Statement
At Salesforce we believe that the business of business is to improve the state of our world. Each of us has a responsibility to drive Equality in our communities and workplaces. We are committed to creating a workforce that reflects society through inclusive programs and initiatives such as equal pay, employee resource groups, inclusive benefits, and more. Learn more about Equality at and explore our company benefits at .
Salesforce is an Equal Employment Opportunity and Affirmative Action Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Salesforce does not accept unsolicited headhunter and agency resumes. Salesforce will not pay any third-party agency or company that does not have a signed agreement with Salesforce.
Salesforce welcomes all.
Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records.
For Washington-based roles, the base salary hiring range for this position is $204,400 to $296,400. For California-based roles, the base salary hiring range for this position is $223,000 to $323,400. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. More details about our company benefits can be found at the following link: .
-
Principal Site Reliability Engineer
1 day ago
Sunnyvale, CA, United States Microsoft Full time
There has never been a more exciting time to be working in healthcare at Microsoft. Our Health & Life Sciences Solutions organization is an interdisciplinary team of product managers, designers, engineers, and clinicians who are designing, developing and deploying next-generation healthcare solutions powered by the Microsoft Cloud for healthcare...
-
Principal Site Reliability Engineer
23 hours ago
San Francisco, CA, United States salesforce Full time
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category: Software Engineering About Salesforce: We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every...
-
Principal Site Reliability Engineer
2 weeks ago
San Francisco, United States salesforce Full time
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category: Software Engineering About Salesforce: We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every...
-
Site Reliability Engineer
1 week ago
San Francisco, United States Apollo Solutions Full time
Site Reliability Engineer. Apollo Solutions have partnered with a groundbreaking artificial intelligence business who are making major developments in how we use AI/ML for gaming/security. They are working closely with government contracts as well as gaming console companies and are now searching for an SRE to join their growing team. The Site Reliability...
-
Principal Site Reliability Engineer
4 days ago
San Francisco, United States salesforce Full time
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category: Software Engineering About Salesforce: We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every...
-
Site Reliability Engineer
3 weeks ago
San Francisco, United States WEX Full time
The WEX Site Reliability Engineering (SRE) team is seeking an entry-level Site Reliability Engineer Level 1 who is passionate about learning and growing in the field of software development and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits...
-
Site Reliability Engineer
4 weeks ago
Newton, MA, United States Intelliswift Software Full time
Title: Site Reliability Engineer. Location: Newton, MA Hybrid. Duration: 6 Months. Pay rate: $38.73 per hour on W2. We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and...
-
Principal Software Engineer
3 weeks ago
San Francisco, United States Understanding Recruitment Full time
Principal Software Engineer. US Tech start-up - Fully Remote. $180k + Benefits. We're excited to share an opportunity with a fast-growing, heavily-backed live shopping platform based on the West Coast, currently valued at over $100M! They're on the lookout for a Principal Software Engineer with expertise in Full Stack Engineering (React.js/Node.js) and a focus on...
-
Site Reliability Engineer
2 days ago
San Francisco, CA, United States Withorb Full time
Mission: Orb is on an ambitious mission to provide every business with the infrastructure to unlock their revenue. Best-in-class businesses find ways to effectively align their monetization to product usage, whether that's through seats, consumption, feature limits, or usage-based tiers. Orb brings that opportunity to every software company. We are...
-
Site Reliability Engineer
3 weeks ago
San Francisco, United States Ellation, Inc. Full time
Who We Are: We’re a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...
-
Software Engineer, Site Reliability
1 day ago
Sunnyvale, CA, United States Apple Inc. Full time
Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the...
-
Principal Software Engineer
2 days ago
San Francisco, CA, United States Autodesk, Inc. Full time
Locations: San Francisco, CA, USA; California, USA - Remote. Time type: Full time. Posted on: 30+ Days Ago. Job requisition ID: 24WD81607. Position Overview: Autodesk's pre-construction bidding application is powered by the builder's network, a...
-
Sunnyvale, CA, United States Apple Inc. Full time
Software Engineering Manager, Site Reliability. Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can...
-
Site Reliability Engineer
2 days ago
San Francisco, CA, United States Mistral AI Full time
About Mistral: At Mistral AI, we are a tight-knit, nimble team dedicated to bringing our cutting-edge AI technology to the world. Our mission is to make AI ubiquitous and open. We are creative, low-ego, team-spirited, and have been passionate about AI for years. We hire people who thrive in competitive environments, because they find them more fun to work...
-
Principal Software Engineer
24 hours ago
San Diego, CA, United States Cubic Corporation Full time
Principal Software Engineer. Locations: San Diego, California. Time Type: Full time. Posted On: 3 Days Ago. Job Requisition ID: REQ_41191. Business Unit: Cubic Defense. Company...
-
Site Reliability Engineer
21 hours ago
Chicago, IL, United States WEX, Inc. Full time
The WEX Site Reliability Engineering (SRE) team is seeking an entry-level Site Reliability Engineer Level 1 who is passionate about learning and growing in the field of software development and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits...
-
Principal Software Engineer
1 month ago
Del Mar, CA, United States Softworld, a Kelly Company Full time
Job Title: 80553 - Principal Software Engineer. Job Location: San Diego CA 92121. Onsite Requirements: Looking for someone with end-to-end ownership of the software development process; engineers to be responsible for building, testing, deploying, and operating the services they develop; someone with strong Python skills; need for in-depth AWS experience, including...
-
Site Reliability Engineering Intern
21 hours ago
San Jose, CA, United States Zscaler, Inc. Full time
Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...