We have other current jobs related to this field that you can find below


  • Dallas, United States Themesoft Inc. Full time

    Role: Site Reliability EngineerLocation: Dallas, TexasFull TimeSalary: $140,000 + Bonus+ BenefitsThe Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment....


  • Dallas, United States Themesoft Inc. Full time

    Role: Site Reliability EngineerLocation: Dallas, TexasFull TimeSalary: $140,000 + Bonus+ BenefitsThe Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment....


  • Dallas, United States Themesoft Inc. Full time

    The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment. The roleMonitor application performance, take steps to improve overall application performance...


  • Dallas, United States Diamondpick Full time

    Hi,Hope you are doing well.Please find the below JD.Title: SRE EngineerLocation: Dallas, TX Type of Hire: Full TimeJob Description:The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a...


  • Dallas, United States Appspace Full time

    Your Role as a Site Reliability Engineer: Our Cloud Operations team seeks a Site Reliability Engineer who is passionate about problem-solving, automating, and maintaining Appspace’s Cloud Platform to support the needs of our Engineering and Customer Care teams. The ideal candidate will see manual work as an opportunity to exercise automation, will...


  • Dallas, United States VDart Inc Full time

    Job DescriptionJob DescriptionTitle: SRE / Site Reliability EngineerLocation: TX/Dallas Hybrid/OnsiteDuration: 1 YearSkillsHelp build a Site Reliability Engineering culture by sharing your best practices, approaches, documentation, and code with other engineering teams.Apply automation and software to any tasks or parts of the system that would benefit from...


  • Dallas, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Location: Dallas, TX//Onsite Duration: Full Time-Only Job Description Responsible for ensuring the reliability of systems, minimizing downtime, and maintaining service-level objectives (SLOs). Developing, automating, and implementing automation tools to streamline processes, deploy applications, and manage...


  • Dallas, United States Motion Recruitment Full time

    Job Description Our client, an independent services business that focuses on delivering a unified operating model for cloud, data, IoT and managed services, is looking for a Site Reliability Engineer who will be accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment. This...


  • Dallas, United States Signify Health Full time

    How will this role have an Impact? Join Signify Health's vibrant Site Reliability Engineering team as a Site Reliability Engineer. We're seeking passionate individuals from diverse technical backgrounds. Reporting to the Manager of Site Reliability Engineering, we offer a collaborative environment that values each team member's unique contribution and...


  • Dallas, United States Saxon Global Full time

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and a flare for technical communication, prioritization . You will work directly...


  • Dallas, United States Signify Health Full time

    Job DescriptionJob DescriptionHow will this role have an Impact?Join Signify Health's vibrant Site Reliability Engineering team as a Site Reliability Engineer. We're seeking passionate individuals from diverse technical backgrounds. Reporting to the Manager of Site Reliability Engineering, we offer a collaborative environment that values each team...


  • Dallas, United States Dice Full time

    Dice is the leading career destination for tech experts at every stage of their careers. Our client, Galaxy i Technologies, Inc., is seeking the following. Apply via Dice today! Site Reliability Engineer Location: Dallas TX Onsite Full Time Skill: Site Reliability Engineer Ensures supported applications are functioning and available by minimizing downtime...


  • Dallas, United States VIZIO Full time

    About the Team: VIZIO releases firmware & software for millions of customers in a time efficient manner. Our goal is to maintain 99.9% uptime for our customers. We are seeking a Site Reliability Engineer to join our expanding organization. The Site Reliability Engineer will report to the Manager, DevOps Security and will play a crucial role in enhancing the...


  • Dallas, United States Motion Recruitment Partners LLC Full time

    Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This person will be responsible for monitoring their application performance, making suggestions to improve performance and stability, and taking the lead on implementing those improvements. The ideal...


  • Dallas, United States Diverse Lynx Full time

    Role : Site Reliability Engineer/Devops Engineer Location : Dallas TX (Onsite) Duration: Full-time Job Description Skill: Site Reliability Engineer Ensures supported applications are functioning and available by minimizing downtime and maximizing performance. Provides technical expertise to the stakeholders and end user ensuring continuous...


  • Dallas, United States JPMorganChase Full time

    Job Description There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Enterprise technology, Infrastructure platforms team, you...


  • Dallas, Texas, United States JPMorganChase Full time

    Job Description There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Enterprise technology, Infrastructure platforms team, you will solve...


  • Dallas, United States Apple Full time

    Site Reliability Engineering (SRE) Manager - Apple Service Engineering Austin, Texas, United States Software and Services Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish! Join...


  • Dallas, United States Motion Recruitment Full time

    Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This person will be responsible for monitoring their application performance, making suggestions to improve performance and stability, and taking the lead on implementing those improvements. The ideal...


  • Dallas, United States Motion Recruitment Full time

    Job Description Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This individual will oversee the functionality and performance of their application, coming up with ideas to make it more stable and efficient, and leading the implementation of those...

Manager Site Reliability Engineer

2 months ago


Dallas, United States Sana Commerce Inc Full time
Company Description

At Sana Commerce we're committed to an inclusive environment and recognize that our diverse workforce is one of our greatest strengths.

It all started in 2007, with a pizza and a plan. Sana Commerce is an e-commerce platform designed to help manufacturers, distributors and wholesalers succeed by fostering lasting relationships with customers who depend on them. We're a fast-growing SaaS company that allows you to take ownership of your career.

At Sana Commerce, we're looking for a Manager SRE to build & manage our global SRE team that manages and monitors all installed systems, environments and infrastructure and resolves issues that come in through our notification system.

What you'll get:
  • The opportunity to make an impact at a fast-growing SaaS scale-up;
  • Up to 3 weeks "work from anywhere" per year;
  • A global and customized onboarding program (9,1/10 rated by previous hires);
  • A hybrid working model - 3 days from the office, 2 day from home.
  • Great and varied healthcare plans for you to choose.
Job Description

What you'll be doing:
  • Leading the SRE team, setting objectives, and guiding the team towards achieving high reliability while balancing cost and performance SLAs.
  • Collaborating with platform & product engineering teams to embed reliability and operational best practices into the software development lifecycle.
  • Developing and implementing SRE policies and practices, including service level objectives (SLOs), service level indicators (SLIs), and error budgets.
  • Driving automation across operations to reduce toil, improve system performance, ensure scalability, with a reasonable amount of allergic response towards repetitive manual work.
  • Overseeing incident management, post-mortem analyses, and root cause investigations to prevent future outages and enhance system reliability.
  • Facilitating capacity planning and scalability exercises to manage growth and ensure the efficient use of resources.
  • Facilitating disaster recovery plans & testing to ensure business continuity for our customers' webstores.
  • Encouraging a culture of continuous improvement by mentoring team members and fostering innovation within the team.
  • Staying up to date with the latest trends and technologies in SRE and advocating for their adoption where appropriate.
Qualifications

What you'll bring:
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
  • At least 5 years of experience in Site Reliability Engineering, with 2+ years in a leadership or management role.
  • Proven expertise in cloud computing platforms (e.g., AWS, Azure, GCP) and experience with container orchestration (e.g., Kubernetes).
  • A deep understanding of network protocols, load balancing, and high availability configurations.
  • Experience in applying software development solutions to SRE and familiarity with programming languages such as (preferably) PowerShell and C# or else Python, Go, Java etc.
  • Experience with automation tools, infrastructure as code (e.g., Terraform, Ansible).
  • Proficiency in monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) and in implementing comprehensive monitoring solutions. Dynatrace knowledge is a plus.
  • Excellent problem-solving skills, with a proven ability to tackle complex issues under pressure.
  • Outstanding leadership qualities, with a track record of mentoring and developing high-performing teams.
  • Exceptional communication and collaboration skills, capable of working effectively with cross-functional teams.


Additional Information

#LI-JS1

#LI-Hybrid