Site Reliability Engineer

2 months ago


Raleigh, United States Cisco Full time

Who We Are

Today’s results-oriented business environment is more than that – it’s a period of disruption between the pandemic, global business change and internal process complexity. For us to focus on simplicity and the best customer experience, we need great talent and the right skillsets to be successful. This is now a mantra for our Cisco leadership team and for us. The Digital Enterprise Solutions team is changing the way we run Cisco’s operations by improving the power of technology, the best of business processes and outstanding data insights. Together, we will Reinvent the Cisco experience. Show the world how to Reinvent applications and demonstrate the future of the Internet to showcase the power of Cisco: our people, products, processes, systems, and data. Please join us and make this journey together Be part of IT Cloud Strategy efforts passionate about modernization and re-engineering of on-prem file storage, block storage and backup infrastructure with Distributed Storage, including distributed block storage, object storage, file storage and SAN Storage.

Who You'll Work With

You’ll be part of Storage SRE team passionate about automating and modernizing Cisco Storage Infrastructure (NetApp, EMC, Ceph, SolidFire) as part of IT transformation to cloud strategy. This team is comprised of Architects, Design Engineers, SME's and software developers organized in agile teams with daily or weekly scrum meetings using JIRA as a tool to track all activities with scrum or Kanban approach.

Who You Are

You are an articulate communicator with effective verbal and written skills, capable of engaging successfully with team members and stakeholders alike. Your strong analytical and problem-solving abilities set you apart, enabling you to offer creative alternatives, conduct in-depth root cause analyses, and present thoughtful proposals. With a customer solutions mindset, you bring a blend of strong interpersonal skills and leadership qualities to the table, always ready to guide and inspire those around you. As a Site Reliability Engineer, you have demonstrated leadership by spearheading projects that achieved significant infrastructure enhancements. Your extensive experience in cloud planning, migration, and implementation with platforms like AWS, GCP, and Azure is a testament to your expertise. In addition, you possess a robust background in DevOps practices as well as a deep understanding of object storage and archival solutions. You are a forward-thinking leader who has consistently delivered large-scale infrastructure improvements and is poised to bring your comprehensive skill set to our team.

What You’ll Do

As a Site Reliability Engineer, you will play a crucial role in managing and optimizing SRE operations for an array of storage solutions including distributed block storage, object storage, file storage, and SAN Storage. You will focus on automating processes, leveraging a DevOps and SRE mindset to foster user self-service capabilities and system self-recovery features, and will actively participate in agile infrastructure software development meetings. You have a solid understanding of Linux operations and expertise in storage systems running on both virtual and physical infrastructures. You are a subject matter expert for the implementation of continuous integration and continuous delivery (CI/CD) pipelines for storage infrastructure as code. You will manage revision control and automate testing, provisioning, and deprovisioning workflows within the storage domain, ensuring smooth operations across both public and private cloud platforms. You have the ability to handle complex provisioning tasks, advanced maintenance, data replication, disaster recovery, data migration, and the creation of comprehensive documentation for Storage and Backup environments.

You will interact with storage vendors to configure and fine-tune storage systems, troubleshoot issues, and determine the root cause to prevent future problems. In terms of Capacity Management, you will automate the monitoring and reporting of storage usage and trends, facilitating accurate capacity planning. You will take part in ongoing projects and evaluate new technologies related to storage solutions and migrations, code upgrades, and data protection and management activities. You will be responsible for establishing guidelines and best practices for storage standards on public cloud platforms, including AWS, Azure, and GCP Storage services such as S3, EBS, FSx, EFS, and other related offerings, ensuring a secure, efficient, and scalable storage infrastructure.

Minimum Qualifications:

7+ years of experience supporting enterprise storage Experience implementing, optimizing, and managing global enterprise scale storage, including Enterprise Scale SAN and NAS and Software Defined storage. Enterprise Backup solutions and Replication technologies, software defined storage and Public Cloud platform and protocols. Experience with cloud and on-prem enterprise storage environments such as EMC VMAX/PowerMax, Data domain, SRDF, SnapMirror, NetApp, Veritas Netbackup, VMware, EC2, EBS, FSx or S3, S3 Glacier or S3 Glacier Deep Archive Experience with cloud-based software development tools and methodologies such as Git, CI/CD, CodeDeploy, CodePipeline, Jenkins, Build Automation and Testing Experience in programming languages such as, Python, PowerShell, Ruby, GoLang or Bash Experience with infrastructure and configuration management tools such as, RedHat, CentOS/Windows, VMware, Puppet or Ansible

Preferred Qualifications:

Bachelors Degree in STEM

Why Cisco

#WeAreCisco. We are all unique, but collectively we bring our talents to work as a team, to develop innovative technology and power a more inclusive, digital future for everyone. How do we do it? Well, for starters – with people like you

Nearly every internet connection around the world touches Cisco. We’re the Internet’s optimists. Our technology makes sure the data traveling at light speed across connections does so securely, yet it’s not what we make but what we make happen which marks us out. We’re helping those who work in the health service to connect with patients and each other; schools, colleges, and universities to teach in even the most challenging of times. We’re helping businesses of all shapes and sizes to connect with their employees and customers in new ways, providing people with access to the digital skills they need and connecting the most remote parts of the world – whether through 5G, or otherwise.

We tackle whatever challenges come our way. We have each other’s backs, we recognize our accomplishments, and we grow together. We celebrate and support one another – from big and small things in life to big career moments. And giving back is in our DNA (we get 10 days off each year to do just that). #J-18808-Ljbffr



  • Raleigh, United States Red Hat Full time

    About the Job. Red Hat is seeking a Site Reliability Engineer (SRE) to develop, scale, and operate our OpenShift managed cloud services. OpenShift is Red Hats enterprise Kubernetes distribution. As an SRE you will contribute to running OpenShift at Reliability Engineer, Liability, Reliability, Engineer, Reliability, Monitoring, Technology


  • Raleigh, United States Associates Systems LLC Full time

    Site Reliability Engineer Required Experience & Skills: Due to the work you’ll perform and interactions with DoD programs you will need to be a US citizen with the ability to obtain and maintain a DoD Secret Security Clearance BS in Computer Science, Engineering, Applied Mathematics, or a related technical field along with 7-9 years relevant work...


  • Raleigh, North Carolina, United States Associates Systems LLC Full time

    Essential Qualifications for Site Reliability Engineer:As part of your responsibilities and interactions with defense programs, you must be a US citizen capable of obtaining and maintaining a DoD Secret Security Clearance.A Bachelor’s degree in Computer Science, Engineering, Applied Mathematics, or a similar technical discipline is required, along with 7-9...


  • Raleigh, North Carolina, United States Veradigm® Full time

    Welcome to Veradigm. Our mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our vision is a connected community of health that spans continents and borders. With the largest community of clients in healthcare, Veradigm is able to deliver an...


  • Raleigh, United States Booz Allen Hamilton Full time

    The Opportunity: Everyone is trying to “harness the power of the cloud,” but not everyone knows how. As a site reliability engineer, you know how to build resilient platforms that meet customer needs and take advantage of the power of containerization both in the cloud and on premises. What if you could use your engineering skills to improve warfighter...


  • Raleigh, United States Allscripts Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today’s healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...


  • Raleigh, United States Veradigm Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today's healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...


  • Raleigh, North Carolina, United States Celonis Full time

    About Celonis: Celonis stands as the global frontrunner in Process Mining technology and is recognized as one of the fastest-growing SaaS companies worldwide. We are dedicated to harnessing the potential of data and intelligence to enhance productivity within business operations, and we invite you to be a part of this journey. Role Overview: Join a...


  • Raleigh, North Carolina, United States Ally Full time

    General InformationReference Number: 17885Remote Work: NoAbout Ally and Your CareerAt Ally Financial, our success is intrinsically linked to the success of our employees. We prioritize the well-being of our team members, recognizing their diverse interests, families, and aspirations. Our commitment to work-life balance, health, and inclusivity is reflected...


  • Raleigh, United States Veradigm® Full time

    Welcome to Veradigm! Our Mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our Vision is a Connected Community of Health that spans continents and borders. With the largest community of clients in healthcare, Veradigm is able to deliver an...


  • Raleigh, North Carolina, United States Citrix Systems Inc Full time

    Location: Fully on-site in Raleigh, NC.About Our TeamAre you passionate about working in a dynamic and agile environment? If you thrive in a setting that encourages innovation and collaboration, we want to hear from you. Our team is embarking on an exciting journey as we transition back to our roots, focusing on our SaaS offerings and positioning ourselves...


  • Raleigh, United States Delta System and Software Full time

    Job Title: Site Reliability Engineer Location: Cary, NC Day 1 onsite requirement Permanent hire - Must have good knowledge on Google Cloud Platform (GCP) - Required to have hands-on experience in defining and creating CUJ, SLO, SLI, and Error Budgeting based on NFR - S...


  • Raleigh, United States Cisco Full time

    Who We Are Today’s results-oriented business environment is more than that – it’s a period of disruption between the pandemic, global business change and internal process complexity. For us to focus on simplicity and the best customer experience, we need great talent and the right skillsets to be successful. This is now a mantra for our Cisco...


  • Raleigh, North Carolina, United States Biogen Idec Full time

    Job OverviewPosition SummaryThe Senior Reliability Engineer plays a crucial role in applying Reliability Engineering principles to enhance the design specifications and operational efficiency of essential assets throughout the organization. This position involves the development of analytical techniques to assess the reliability of components, machinery, and...


  • Raleigh, North Carolina, United States Biogen Idec Full time

    Job OverviewAbout the PositionThe Senior Reliability Engineer is responsible for implementing Reliability Engineering principles to enhance design specifications and operational efficiency of essential assets throughout the organization. This role involves developing analytical techniques to assess the reliability of components, machinery, and processes. The...


  • Raleigh, United States Biogen Idec Full time

    Job Description About This Role The Sr. Reliability Engineer I applies Reliability Engineering methodologies to optimize design requirements and performance of critical assets across the site. Originates and develops analysis methods for determining reliability of components, equipment and processes. Acquires data and analyzes the data. Prepares and...


  • Raleigh, North Carolina, United States Veradigm® Full time

    Welcome to Veradigm. Our mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our vision is a connected community of health that spans continents and borders. With the largest community of clients in healthcare, Veradigm is able to deliver an...

  • Reliability Engineer

    3 weeks ago


    Raleigh, United States Amentum Full time

    Amentum is seeking a Reliability Engineer to join our team in Winston Salem, NC! Typical work schedule is 1st Shift, 7:00 am – 3:30 pm; hours may vary based on business demand. Weekend hours may be scheduled to support our 24/7 operation. The Reliability Engineer acts as a Lean Maintenance SME and adds support to maintenance teams with development of...

  • Reliability Engineer

    2 months ago


    Raleigh, United States Amentum Full time

    Amentum is seeking a Reliability Engineer to join our team in Winston Salem, NC! Typical work schedule is 1st Shift, 7:00 am – 3:30 pm.; hours may vary based on business demand. Weekend hours may be scheduled to support our 24/7 operation. The Reliability Engineer acts as a Lean Maintenance SME and adds support to maintenance teams with development of...


  • Raleigh, United States Cisco Full time

    Who We Are Today’s business environment is more than that – it’s a period of disruption between the pandemic, global business change and internal process complexity. For us to focus on simplicity and the best customer experience, we need great talent and the right skills to be successful. This is now a mantra for our Cisco leadership team and for us....