Site Reliability Engineer

3 weeks ago


Dallas, United States ConsultUSA Full time

Description:

Our client has an immediate need for a Site Reliability Engineer, who will be responsible for enabling engineering teams with guidance and tools to deliver frequent, high quality and reliable components as part of our digital platform


Requirements:

  • Bachelor’s degree in Engineering, Computer Science, or a related field
  • 5-7 years of experience with AWS, Kubernetes and container based architecture, designs and solutions
  • Experience in a continuous delivery model similar role at an organization that has adopted the SRE model
  • Experience in AWS CloudFormation, Hashicorp Terraform, and Ansible for automated infrastructure and platform provisioning
  • Experience with source code management system (e.g., GIT/Bitbucket)
  • Experience with CI/CD tools and pipeline development like (e.g., Jenkins, Maven, Nexus, Python, Ruby, Groovy)
  • Background in production operations and support at scale with a proven track record of maintaining highly available and performant cloud platforms
  • Experience in programming and scripting languages to contribute to application development and automation (e.g., Java, NodeJS, Python)
  • Solid debugging / problem solving skills including ability to investigate and remedy software bugs if necessary for application developed in Java and NodeJS
  • Knowledge of deployment pipelines for languages like (e.g., Node.js, Java, Springboot)
  • Experience with Security best practices (SSH, Certificate management, AWS-IAM, or standards such as PCI)
  • Experience with Infrastructure as a Service / Cloud computing (e.g., Amazon AWS, Google Compute Engine, etc.) is a plus
  • Knowledge of Docker, containerization technologies, Spinnaker, and cloud orchestration a strong plus
  • Experience with usage of test frameworks like (e.g., JMeter, Cucumber, Selenium) a plus

Responsibilities:

  • Design and implement business critical cloud based Platform solutions with automation-first mindset, observability, container design patterns and best of breed cloud tools and architecture practices
  • Collaboratively solves business and technology problems in partnership with key stakeholders from Digital Platform team, security, enterprise architecture and product owners
  • Contribute to container, microservices application code base and architecture with a focus on optimization for performance, reliability, scalability, security, observability and cost
  • Develop and implement solutions for non-functional requirements with a focus on automation, Whitebox monitoring, and modularity for broad re-use across system components
  • Design and implement CI/CD deployment pipelines and test automation frameworks based on best practices to enable frequent, high quality releases
  • Define and implement application deployment strategies based on application type
  • Operations
  • Assist with guiding, growing and training agile engineering teams to optimize service quality and ensure adoption of container, microservices, and operational best practices
  • Ensure the effective capture of application telemetry, logging and monitoring of all aspects of system and application behavior to facilitate fast detection and issue resolution
  • Design and develop operational tools and services needed to effectively operate system components at scale
  • Understanding and adherence to operational processes ensuring audit-ability, risk and compliance with ISO and industry standards (includes Incident, Problem and Change Management)
  • Continually evaluate service and infrastructure usage to effectively manage performance, capacity and cost – automating solutions, removing toil wherever possible
  • Participate as a member of the broader SRE community to develop tools and services that enable automated operations
  • Support
  • Contribute to technical documentation required to guide on-call engineers and on-board team members
  • Maintain system wide health and proactively seek out potential issues, address with component teams
  • Proactively and continuously drive system wide quality improvements by undertaking thorough root cause analysis for major incidents with component engineering teams
  • Provide training and coaching in a capacity as Subject Matter Expert to other engineers


Why Work for ConsultUSA:

  • ConsultUSA offers competitive salaries, major medical (PPO or HDHP w/ HSA), dental, and vision insurance plans, and 401k plan with immediate eligibility for both salary and hourly employees
  • ConsultUSA hosts several outings and events, holiday and summer parties, and volunteer opportunities throughout the year for employees
  • We will work with you to obtain training for in-demand technologies and prepare you for industry-recognized certification exams
  • ConsultUSA offers Business Analysis and Project Management training through our Project Management Institute (PMI)® award-winning sister company, PMCentersUSA

How to Apply:

To submit your application, please click the “Apply Now” button located at the top and bottom of the page.

ConsultUSA is committed to providing equal employment opportunities (EEO) to all qualified employees and applicants for employment without regard to race, color, religion, gender identity or expression, sexual orientation, national origin, age, disability, genetic information, marital status, pregnancy, ancestry, or status as a covered veteran as well as any other prohibited criteria under any applicable federal, state, and local laws applicable to ConsultUSA.

For a complete listing of all ConsultUSA jobs please visit www.consultusa.com



  • Dallas, United States Onwardpath Full time

    SRE (Site Reliability Engineer)Dallas, TX – Hybrid (Local Candidates Only)6+ Months Contract Job Description (SRE)• Collaborating closely with engineering teams on building and enhancing tooling and automation solutions for faster resolution of issues impacting SLO’s and averting incidents altogether when possible.• Collaborating with the customers...


  • Dallas, United States Collabera Full time

    Description Home Search Jobs Job Description Site Reliability Engineer Contract: Dallas, Texas, US Salary: $60.00 Per Hour Job Code: 350552 End Date: 2024-07-14 Days Left: 29 days, 3 hours left Apply Job Title: Cloud DevOps Engineer/Site Reliability EngineerDuration of project: 6+ Months + possible Extension Location: Remote Role Description: ...


  • Dallas, United States Collabera Full time

    Description Home Search Jobs Job Description Site Reliability Engineer Contract: Dallas, Texas, US Salary: $60.00 Per Hour Job Code: 350552 End Date: 2024-07-14 Days Left: 28 days, 3 hours left Apply Job Title: Cloud DevOps Engineer/Site Reliability EngineerDuration of project: 6+ Months + possible Extension Location: Remote Role Description: ...


  • Dallas, United States Maarut Inc Full time

    Job Description: Collaborating closely with engineering teams on building and enhancing tooling and automation solutions for faster resolution of issues impacting SLO’s and averting incidents altogether when possible. Collaborating with the customers to understand their pain points around Supportability and SLO attainment and formulate strategies for...


  • Dallas, United States Saicon Consultants Full time

    Site Reliability Engineer (Buffer) Location:Dallas, TX Posted On: 11/08/2023 Requirement Code: 66074 Requirement Detail Job Description: Site Reliability Engineer (Buffer) • Bachelor's Degree in Computer Science or related; or equivalent combination of education and experience • 5~~@~~ yrs overall experience in Software Application Development &...


  • Dallas, United States Bayone Full time

    Role: Site Reliability EngineerLocation: Dallas, TX ( Hybrid role)Type: 6 months+Bill Rate:$110/HR LOCAL candidates only. Third party C2C is ok too. RESPONSIBILITIES:- Design, build, and maintain highly available and scalable applications deployed in Azure.- Develop and maintain automation tools and scripts to streamline deployment and maintenance tasks.-...


  • Dallas, United States Saxon Global Full time

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and a flare for technical communication, prioritization . You will work directly...


  • Dallas, Texas, United States Cognizant Technology Solutions Full time

    Sr. Site Reliability Engineer (SRE)Cognizant's Digital Engineering practice is seeking a highly qualified Sr. Site Reliability Engineer with 10+ years plus experience developing and building high-performing, scalable, enterprise applications. You will be part of a digital software team that works on high-demand applications. Our engineers have a passion for...


  • Dallas, Texas, United States Cognizant Technology Solutions Full time

    Sr. Site Reliability Engineer (SRE)Cognizant's Digital Engineering practice is seeking a highly qualified Sr. Site Reliability Engineer with 10+ years plus experience developing and building high-performing, scalable, enterprise applications. You will be part of a digital software team that works on high-demand applications. Our engineers have a passion for...


  • Dallas, Texas, United States Veradigm (formerly Allscripts) Full time

    Welcome to Veradigm Our Mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our Vision is a Connected Community of Health that spans continents and borders. With the largest community of clients in healthcare, Veradigm is able to deliver an...


  • Dallas, United States Saxon Global Full time

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and a flare for technical communication, prioritization . You will work directly...


  • Dallas, United States Saxon Global Full time

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and a flare for technical communication, prioritization . You will work directly...


  • Dallas, United States Maarut Inc Full time

    Job Description:Collaborating closely with engineering teams on building and enhancing tooling and automation solutions for faster resolution of issues impacting SLO’s and averting incidents altogether when possible.Collaborating with the customers to understand their pain points around Supportability and SLO attainment and formulate strategies for...


  • Dallas, United States Saxon Global Full time

    Job Summary: We are looking for a Site Reliability Engineer (SRE) who will be responsible for ensuring the reliability, availability, and performance of our production systems. As an SRE, you will work closely with cross development and engineering teams to design and implement tools and processes to automate deployment, observability, and troubleshooting...


  • Dallas, United States Saxon Global Full time

    Job Summary: We are looking for a Site Reliability Engineer (SRE) who will be responsible for ensuring the reliability, availability, and performance of our production systems. As an SRE, you will work closely with cross development and engineering teams to design and implement tools and processes to automate deployment, observability, and troubleshooting...


  • Dallas, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Location: Dallas, TX//Onsite Duration: Full Time-Only Job Description Responsible for ensuring the reliability of systems, minimizing downtime, and maintaining service-level objectives (SLOs). Developing, automation and implementing automation tools to streamline processes, deploy applications, and manage...


  • Dallas, Texas, United States Saxon Global Full time

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and a flare for technical communication, prioritization . You will work directly...


  • Dallas, United States STIAOS Technologies Full time

    We are looking for Site Reliability Engineer for our client location in Dallas TX with following Skills: *Java Spring boot *Kubernetes *eCommerce experience Required. Key Responsbilities: *Working with the Applications, Engineering, Platform, Operations and infrastructure and Cloud teams to ensure we are a premier software delivery organization. *Drive...


  • Dallas, Texas, United States STIAOS Technologies Full time

    We are looking for Site Reliability Engineer for our client location in Dallas TX with following Skills: *Java Spring boot *Kubernetes *eCommerce experience Required. Key Responsbilities: *Working with the Applications, Engineering, Platform, Operations and infrastructure and Cloud teams to ensure we are a premier software delivery organization. *Drive...


  • Dallas, United States STIAOS Technologies Full time

    We are looking for Site Reliability Engineer for our client location in Dallas TX with following Skills: *Java Spring boot *Kubernetes *eCommerce experience Required. Key Responsbilities: *Working with the Applications, Engineering, Platform, Operations and infrastructure and Cloud teams to ensure we are a premier software delivery organization. *Drive...