Site Reliability Engineer, Americas

3 months ago


Atlanta, United States Canonical Full time
Job Description

Next-gen operations at scale, with pure Python infra-as-code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops.

We run hundreds of private cloud, Kubernetes, and application clusters for customers across physical and public cloud estates, and we are raising the bar on what's possible with automation by embracing a universal operator pattern and model-driven operations.

To succeed in this role you need to believe in automation as a pure software engineering problem, not a hack-it-till-it-works-for-me problem. You need to be interested in the scientific approach to operations at scale, driven by metrics and code, and you need to be able to learn the entire stack, from bare metal networking and kernel up to serverless and open source applications.

Site Reliability Engineer

Our site reliability engineers bring Python software-engineering skills and rigour to the operations domain. We practice devsecops from bare metal to application. We architect and run OpenStack, Kubernetes and software-defined storage, and we enable devsecops for applications running on that infrastructure too.

To become a member of this team, you need to be a software engineer fluent in Python, with a genuine interest in the full open source infrastructure stack from metal to containers and the ability to work in a high-pressure operations environment with mission-critical services for global brand-name customers.

As a member of the team you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure. We drive upgrades to keep our customers on the latest, best solutions.

What Canonical Offers
  • Technical management team that understands the details of what we are developing
  • A culture of openness and inclusiveness
  • Helpful and talented engineers who are world-class experts in many fields
  • Teams focused on good work-life balance with long average retention rates
  • A wide range of engineering disciplines and career paths that can move between divisions
  • Fully remote company for career growth without relocation requirements
Requirements
  • Software Engineering or Computer Science degree
  • Linux experience and familiarity with Linux networking and storage
  • Python software development experience
  • Demonstrated drive for continual learning
  • Devops experience
Nice to haves
  • Experience with OpenStack or Kubernetes deployment or operations

We hope that you'll join us in helping to shape and build the future of free software together.

Of course we also offer...

  • Learning and Development
  • Annual Compensation Review
  • Recognition Rewards
  • Annual Leave
  • Priority Pass for travel
  • Flexible working option
About Canonical

Canonical is a pioneering tech firm that is at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT and the cloud, we are changing the world on a daily basis. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do.

Canonical has been a remote-first company since its inception in 2004. Work at Canonical is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game. Canonical provides a unique window into the world of 21st-century digital business.

Canonical is an equal opportunity employer

We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.



