Site Reliability Engineer

1 month ago


Raleigh, United States Bandwidth Full time

Site Reliability Engineer (Raleigh, NC)

Duties:

  • Work closely with leadership and internal partners to ensure that software meets security, SLA, performance, and capacity requirements.
  • Set up and maintain monitoring tools and systems that detect issues with Datadog Monitors and alert through OpsGenie.
  • Configure Datadog and Grafana alerts and Application Health Monitors to notify the team when anomalies or problems occur.
  • Work closely with other Site Reliability Engineers, DevOps Engineers, and System Administrators to achieve common goals.
  • Analyze system performance data using Snowflake to plan for capacity upgrades or optimizations.
  • Ensure the system can handle expected growth in traffic and data by using these tools to track application lag and behavior.
  • Manage Kubernetes clusters and OpenShift environments for deploying and scaling containerized applications.
  • Implement and manage infrastructure using Ansible, and maintain version-controlled infrastructure code in GitLab for consistency and repeatability.
  • Use Terraform and Ansible scripts to define and provision infrastructure resources in a repeatable and automated manner.
  • Create and maintain Ansible playbooks to automate routine tasks, configurations, and deployments.
  • Use GitHub Actions for CI/CD to continuously build and deploy code, and implement CI/CD pipelines to streamline application updates.
  • Build and maintain deployment pipelines using Ansible playbooks; ensure smooth, reliable deployments and rollback procedures; and create production releases, using ServiceNow to track records.
  • Maintain detailed documentation on system architecture, configurations, and processes in Confluence, and share knowledge and best practices with team members.
  • Plan resource allocation using Red Hat OpenShift, including servers, storage, and network capacity, following the Kubernetes architecture to ensure the system is equipped to handle traffic spikes and growth.
  • Develop and test disaster recovery plans to ensure data and service availability in case of major failures or disasters, creating the supporting tools in Go.
  • Work closely with development teams to promote a DevOps culture and ensure reliability is built into software from the start by following best practices.
  • Meet weekly with other Site Reliability Engineers to share knowledge, solve complex problems, and touch base on all open points.
  • Monitor and manage cloud resource costs in AWS to optimize spending while maintaining performance.

Required: Master’s degree or foreign equivalent in Computer Science, Electrical Engineering, or related field of study plus 2 years of experience in the job offered or related position. Must have 2 years of experience with:

  • Infrastructure and networking concepts including virtualization, load balancing, and DNS.
  • At least one of the following cloud infrastructure technologies: AWS, Google Cloud, Azure.
  • REST APIs using one or more of the following: JSON, XML, YAML.
  • Designing, building, and operating large-scale production systems.
  • Continuous Integration and Continuous Deployment (CI/CD) concepts and technologies using one or more of the following: Jenkins, GHA, Circle.
  • Containerization technologies (Docker, Docker Compose, Docker Swarm, Kubernetes).
  • Configuration and management techniques in large distributed environments.
  • Monitoring and observability techniques with one or more of the following tools: Datadog, Sensu, New Relic, Nagios.
  • General use of open-source databases: MySQL, Postgres, Redis, Cassandra.
  • Unix/Linux administration, troubleshooting, and shell scripting.
  • One or more of the following programming languages: Python, Java, Go, Rust, or similar.
  • Source control (Git, GitHub) and feature branching strategies.
  • Automating infrastructure, testing, and deployment using tools such as Ansible, Chef, or Terraform.
  • Infrastructure as Code paradigm.

In the alternative, will accept a Bachelor’s degree or foreign equivalent in Computer Science, Electrical Engineering, or related field of study plus 5 years of experience in the job offered or related position. Must have 2 years of experience with:

  • Infrastructure and networking concepts including virtualization, load balancing, and DNS.
  • At least one of the following cloud infrastructure technologies: AWS, Google Cloud, Azure.
  • REST APIs using one or more of the following: JSON, XML, YAML.
  • Designing, building, and operating large-scale production systems.
  • Continuous Integration and Continuous Deployment (CI/CD) concepts and technologies using one or more of the following: Jenkins, GHA, Circle.
  • Containerization technologies (Docker, Docker Compose, Docker Swarm, Kubernetes).
  • Configuration and management techniques in large distributed environments.
  • Monitoring and observability techniques with one or more of the following tools: Datadog, Sensu, New Relic, Nagios.
  • General use of open-source databases: MySQL, Postgres, Redis, Cassandra.
  • Unix/Linux administration, troubleshooting, and shell scripting.
  • One or more of the following programming languages: Python, Java, Go, Rust, or similar.
  • Source control (Git, GitHub) and feature branching strategies.
  • Automating infrastructure, testing, and deployment using tools such as Ansible, Chef, or Terraform.
  • Infrastructure as Code paradigm.

Submit resumes to: Bandwidth, Inc, 2230 Bandmate Way, Raleigh, NC 27607, Attn: Kellie Sigmon, Sr. Manager People Services or apply at www.bandwidth.com/careers/openings/. Must reference “Site Reliability Engineer” when applying.
