Senior Site Reliability Engineer

4 weeks ago


Dallas, United States Saxon Global Full time

Job Summary:

We are looking for a Site Reliability Engineer (SRE) who will be responsible for ensuring the reliability, availability, and performance of our production systems. As an SRE, you will work closely with cross-functional development and engineering teams to design and implement tools and processes that automate the deployment, observability, and troubleshooting of the applications and infrastructure supporting the deployment of new Android tablets to the stores.

This individual must be skilled and have professional experience with the core functions of Site Reliability Engineering including deployments, observability, monitoring, telemetry, and automation.

In your resume, please be sure to call out your experience in these areas and how your technical experience matches the requirements below.

Responsibilities:

Ensure the reliability, availability, and performance of our production systems as we scale

Develop and maintain monitoring and alerting systems to detect and respond to incidents in a timely manner

Occasionally support planned deployment rollouts that may require working off-hours during store closures (there is no on-call rotation)

Work with cross-functional teams to plan and execute scaling initiatives

Develop and maintain documentation of processes, procedures, and technical configurations

Requirements:

Strong written and verbal communication skills with peers, technical leads, project managers and product owners

Must be able to collaborate with customers and cross-functional teams to design, test, and validate deliverables that meet or exceed expectations

Self-starter and highly motivated individual who is well-organized

Bachelor's degree in Computer Science or related field

5+ years of experience as a Site Reliability Engineer

Strong experience with automation tools and with automation scripting in Python

Experience with containerization technologies such as Docker and Kubernetes

Experience with cloud platforms such as Azure or AWS

Experience with monitoring and logging tools such as Datadog, Prometheus, Grafana or Splunk

Strong understanding of networking, security, and systems administration

Excellent problem-solving skills and attention to detail

Must be available to work core hours PST.

Preferred qualifications:

Experience with distributed systems and supporting a large retail business

Experience with infrastructure as code tools such as Terraform or CloudFormation

Experience with CI/CD tools such as Jenkins

Experience with incident ticketing systems such as ServiceNow, and with Jira for tracking stories

Familiarity with Agile/Scrum methodologies and DevOps principles

If you are passionate about ensuring the reliability and availability of systems in our stores and enjoy collaborating with cross-functional teams to solve complex problems, we encourage you to apply for this exciting opportunity as an SRE.


