Site Reliability Engineer

2 weeks ago


St George, United States TCN Full time

TCN is looking for a Site Reliability Engineer to join our team in Saint George, Utah. The Site Reliability Engineer works as part of a team to analyze, troubleshoot, deploy, monitor, and maintain TCN’s large production environment with global scale. These significant responsibilities are completed while continually thinking about reliability, scalability, resilience, security, and performance. The Site Reliability Engineer’s responsibilities are critical to the continuity of the services provided to TCN’s clients.

The ideal candidate will have at least three (3) years' experience working in a Linux environment as a System Administrator, Site Reliability Engineer, or a similar role. 

Responsibilities

  • Designs and deploys software/systems - Collaborates with development teams to throughout the product life cycle, including but not limited to engaging in the design, development, deployment, and ongoing delivery of services; assists in ensuring the development of software and systems that increase product reliability and organizational efficiency
  • Manages solutions and ensures resistance to failure - Deploys and manages solutions to manage platform infrastructure as we continue to grow our global scale; ensuring resistance to failure
  • Troubleshoots - Troubleshoots complicated, cross platform incidents for OS, networking, and database in a cloud-based SaaS environment; ability to handle live production incidents, debug and troubleshoot application and infrastructure issues, and follow and implement best practices 
  • Post-incident evaluation - Participates in post-incident evaluations and ensures permanent closure of incidents
  • Monitors performance | Improves application stability - Monitors application performance and takes steps to improve application performance and stability; follows through with implementation
  • Conducts analysis and development improvements - Conducts system analysis, configuration management, and development improvements for system software performance, availability, and reliability
  • Identifies application patterns and analytics in support of better service level objectives
  • Incident response - Participates in 24x7 incident response and on-call rotation
  • Shares best practices - Shares understanding of Site Reliability Engineering culture across organization; shares knowledge of best practices, approaches, documentation, and code with team members and other teams

Qualifications 

  • Bachelor’s degree in computer science, information technology, or related field of study
  • Not less than three (3) years’ experience in a Linux environment as a System Administrator, Site Reliability Engineer, or similar role
  • Demonstrated advanced knowledge of networking protocols, including but not limited to IP routing (static/BGP/OSPF), TCP/UDP fundamentals, security (TLS, IPSEC), and common application protocols
  • Demonstrated advanced knowledge of Linux operating environment including storage, network, and container subsystems
  • Proven skills in incident management and root cause analysis
  • Demonstrated experience with Google Cloud Platform (APIs and CLIs)
  • Experience with configuration management tools
  • Experience with scripting and automation in commonly used languages, including but not limited to Bash, Ruby, and Python
  • Familiarity with programming languages used for DevOps/Continuous Delivery, including but not limited to Go, Java, and Node.Js 
  • Experience with distributed storage, containers, containerizing applications, and container orchestration (Kubernetes)
  • Excellent communication skills, both oral and written; ability to adapt message/style to fit audience (i.e., ability to communicate technical concepts to a non-technical audience)
  • Strong interpersonal skills with the ability to work with all levels of management and employees; ability to gain credibility, provide effective customer service, and foster positive working relationships with internal and external stakeholders 
  • Excellent attention to detail; ability to work accurately and to identify, analyze, prevent, and solve problems 

About TCN

TCN is a fast-growing technology company and provides all its services over the internet in a cloud-based software-as-a-service model. TCN's technology stack and culture are positive and forward-thinking. When you join TCN, you are joining a dedicated team of professionals. Employees often describe our culture as friendly, collaborative, flexible, and fast-paced. To learn more, visit our .

Our benefits include:

  • Medical Insurance (HDHP with HSA)
  • Dental Insurance
  • Vision Insurance
  • Life Insurance 
  • 401k with employer match
  • Competitive salary
  • Paid time off 
  • Paid holidays (11 scheduled)
  • Weekly lunches; free drinks and snacks
  • Casual dress and flexible work environment

Powered by JazzHR



  • St Albans, United States The Chemical Engineer Full time

    The world needs fresh and innovative solutions. We need YOU! Where the chemistry happens… Our team is searching for a Reliability Engineer to work at our Attapulgus, GA site for the BASF Catalyst division which is the world's leading supplier of environmental and process catalysts. The group offers exceptional expertise in the development of technologies...


  • Saint George, United States TCN Full time

    Job DescriptionJob DescriptionTCN is looking for a Site Reliability Engineer to join our team in Saint George, Utah. The Site Reliability Engineer works as part of a team to analyze, troubleshoot, deploy, monitor, and maintain TCN’s large production environment with global scale. These significant responsibilities are completed while continually thinking...


  • St Louis, United States Hire Talent Full time

    Job Description: The client supports in key business lines related to financial management for the federal government. As a Contract Site Reliability Engineer you will report to a Manager and be part of a team that provides software engineering services to a forecasting system owned by the client This role will assist with designing and enhancing existing...


  • St Louis, United States PRI Global Full time

    As per the client guidelines we have to submit only W2 for this role. So, please avoid sending C2C profilesThe requirement as followsJob title: Site Reliability EngineerDuration: 12+ monthsLocation: REMOTERequired:• 3+ years experience with Dynatrace in the following areas:• Developing custom dashboards, alerts, and reports• Proven experience with...


  • St. Louis, United States PRI Global Full time

    As per the client guidelines we have to submit only W2 for this role. So, please avoid sending C2C profilesThe requirement as followsJob title: Site Reliability EngineerDuration: 12+ monthsLocation: REMOTERequired:• 3+ years experience with Dynatrace in the following areas:• Developing custom dashboards, alerts, and reports• Proven experience with...


  • St Louis, United States Equifax Full time

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. SRE is also an...


  • St Albans, United States Equifax Full time

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. SRE is also an...


  • St Louis, United States Penn Foster Inc Full time

    Company Description Take a seat on the rocket ship and join us as Site Reliability Engineer to help people succeed across the world. We’re a global team of builders, listeners and problem-solvers who are relentlessly focused on making life simple, so our customers can get back to growing their business, engaging consumers and doing what they love. Job...


  • St Albans, United States Jettycloud Full time

    Site Reliability Engineer (Core backend team) Location: Georgia Bulgaria JettyCloud looks for IT professionals on behalf of RingCentral to join its team in Sofia . RingCentral Bulgaria is a new European branch of an American company RingCentral. JettyCloud is a software R&D center that works for RingCentral. RingCentral builds a high-available cloud-based...


  • St Petersburg, United States Jobs for Humanity Full time

    Job Description Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Science Travel Percentage : 5 - 10% Job Description Every day, our teams innovate across the world of finance. We collaborate to work smarter, while making a difference. We believe in diversity and...


  • St. Petersburg, United States FIS Global Full time

    Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Science Travel Percentage : 5 - 10%Job DescriptionEvery day, our teams innovate across the world of finance. We collaborate to work smarter, while making a difference. We believe in diversity and inclusivity, giving a voice to...


  • St Albans, United States BASF SE Full time

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Select how often (in days) to receive an alert: The world needs fresh and innovative solutions. We need YOU! Where the chemistry happens… Our team is searching for a Reliability Engineer to work at our Attapulgus, GA site for the BASF Catalyst division which is the...


  • St Paul, United States Reckitt Benckiser Full time

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Select how often (in days) to receive an alert: Oversees the assessment and reliability planning of the production equipment. Provides timely completion of technical processes related to reliability of equipment and procedures to minimize the risk and impact of...


  • St Louis, United States LTIMindtree Full time

    About US:LTIMindtree is a global technology consulting and digital solutions company that enables enterprises across industries to reimagine business models, accelerate innovation, and maximize growth by harnessing digital technologies. As a digital transformation partner to more than 750 clients, LTIMindtree brings extensive domain and technology expertise...


  • St. Louis, United States LTIMindtree Full time

    About US:LTIMindtree is a global technology consulting and digital solutions company that enables enterprises across industries to reimagine business models, accelerate innovation, and maximize growth by harnessing digital technologies. As a digital transformation partner to more than 750 clients, LTIMindtree brings extensive domain and technology expertise...


  • St Paul, United States PURIS FlowX Full time

    Description The Reliability Engineer’s (RE) purpose is to drive reliability performance, in compliance with all applicable standards, procedures, and policies. The RE maintains a long-term, strategic focus on opportunities that impact Dawson’s measured asset capability . They use reliability technology and tools to facilitate development of reliability...


  • Saint George, United States TCN Full time

    Job DescriptionJob DescriptionTCN is looking for a Site Operations Engineer to join our team in Saint George, Utah. The Site Operations Engineer works as part of a team to analyze, troubleshoot, deploy, monitor, and maintain TCN’s large production environment with global scale. These significant responsibilities are completed while continually thinking...

  • Reliability Engineer

    3 weeks ago


    St James, United States The Mosaic Company Full time

    Are You Our Next Reliability Engineer-Multiple Level Applicants Welcome?Join our dynamic team at the forefront of the global digital acceleration as a Reliability Engineer I, II, III or Sr to work at our Mosaic Uncle Sam Plant. The successful candidate will find opportunities to improve equipment reliability and to implement those changes through failure...

  • Reliability Engineer

    3 weeks ago


    St James, United States The Mosaic Company Full time

    Are You Our Next Reliability Engineer-Multiple Level Applicants Welcome?Join our dynamic team at the forefront of the global digital acceleration as a Reliability Engineer I, II, III or Sr to work at our Mosaic Uncle Sam Plant. The successful candidate will find opportunities to improve equipment reliability and to implement those changes through failure...


  • St Paul, United States Staffingine LLC Full time

    Position:- Reliability Engineer Location:- Carrollton, GA Job Description: The Reliability Engineer champions the implementation of reliability best practices. This role will also evaluate new best-known methods and technologies to continuously improve equipment reliability. Key aspects of this position include: Working with various plant teams and areas to...