Senior Site Reliability Engineer, FedRAMP

2 weeks ago


San Francisco, United States Cisco Systems, Inc. Full time

Who We Are

Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network - even the ones they don't own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to proactively detect, diagnose, and remediate issues - before they impact end-user experiences.

ThousandEyes is deeply integrated across the entire Cisco technology portfolio and beyond, helping customers deploy at scale while also delivering AI-powered assurance insights within Cisco's leading Networking, Security, Collaboration, and Observability portfolios.

About The Role

The FedRAMP SRE team is focused on our Federal region's platform. The team is responsible for all aspects of the Federal region's infrastructure and operations, such as availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning, with a strong focus on security. The job is to handle the Federal region's core infrastructure services, maintaining a constantly growing infrastructure capable of handling a very high volume of incoming data per day. We believe in operations/infrastructure/everything as code which makes our distributed team efficient, functional and very effective.

We're looking for talented engineers with a software or operations background, experienced in designing and operating large-scale highly available distributed systems in the cloud. You must be willing to work closely with our application development teams to ensure the reliability, performance and security of our infrastructure.

What You'll Do

  • Join forces with the software engineers to ensure that the ThousandEyes platform's Federal region infrastructure and services are designed and optimized for availability, latency, and performance.
  • Design, implementation, and management of FedRAMP-compliant infrastructure and systems.
  • Establish and maintain processes for continuous monitoring, logging, and auditing of systems to ensure compliance with FedRAMP controls.
  • Collaborate and partner with security teams to identify and remediate vulnerabilities, conduct security assessments, and implement necessary security controls.
  • Design and implement dynamic infrastructure solutions to run our platform's infrastructure as we grow and continue scaling (think multi-region scale).
  • Drive and build automation enabling our infrastructure and platforms to scale effortlessly, with a special focus on FedRAMP systems.
  • Know the latest industry best practices, evolving security threats, and updates to FedRAMP guidelines, and apply this knowledge to improve the security posture of our systems.
  • Design, deploy, and maintain cloud-native services in AWS that are elastic and resilient to failure.
  • Participate in and contribute to improving our 24x7 incident response and on-call rotation.
  • Capacity planning for the infrastructure and platform and help teams prepare for growth.

Qualifications

  • 5+ years of experience.
  • Experience building and/or operating FedRAMP environments.
  • Experience identifying and analyzing cyber security risks.
  • Solid understanding of the FedRAMP framework, its controls, and compliance requirements.
  • Familiarity with security standard processes, vulnerability management, and incident response processes.
  • Ability to write high-quality code in Python, Go, or equivalent languages.
  • Ability to build and implement scalable and well-tested solutions.
  • Good understanding of Unix/Linux systems, the kernel, system libraries, file systems, and client-server protocols.
  • Knowledge of cloud providers, ideally AWS.
  • Infrastructure as Code skills, ideally with Terraform, Puppet, and Kubernetes.
  • Good communication and documentation skills.
  • Solid sense of ownership, drive, and enthusiastic attention to detail.

The successful applicant will be performing work in FedRAMP environments, and therefore, must be a U.S. Person (i.e. U.S. citizen, U.S. national, lawful permanent resident, asylee, or refugee). This position may also perform work that the U.S. government has specified can only be performed by a U.S. citizen on U.S. soil.

#J-18808-Ljbffr

  • San Francisco, United States Cisco Systems, Inc. Full time

    Senior Site Reliability Engineer, FedRAMPCisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network – even the ones they don’t own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to...


  • San Francisco, United States Cisco Systems Full time

    Who We Are Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network – even the ones they don’t own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to proactively detect, diagnose, and...


  • San Francisco, United States Cisco Full time

    Who We Are Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network - even the ones they don’t own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to proactively detect, diagnose, and...


  • San Francisco, United States Cisco Full time

    Who We Are The name ThousandEyes was born from two big ideas: the power to see things not ordinarily possible and the ability to collect insights from a multitude of vantage points. As organizations rely more on cloud services and the Internet, the network has become a black box they can't understand. Our Internet and cloud intelligence platform...


  • San Francisco, California, United States Cisco Full time

    About CiscoCisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network.Job SummaryWe are seeking an experienced Senior Site Reliability Engineer to join our FedRAMP SRE team. As a key member of this team, you will be responsible for designing and operating large-scale...


  • San Francisco, United States Cisco Full time

    Senior Software Engineer, Endpoint (FedRAMP) Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network – even the ones they don’t own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to...


  • San Francisco, United States Cisco Systems, Inc. Full time

    Senior Software Engineer, Endpoint (FedRAMP) Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network – even the ones they don’t own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to...


  • San Francisco, United States Cisco Systems, Inc. Full time

    Senior Software Engineer, Endpoint (FedRAMP)Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network – even the ones they don’t own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to...


  • San Francisco, United States WEX Full time

    The WEX Site Reliability Engineering (SRE) team is seeking an entry-level Site Reliability Engineer Level 1 who is passionate about learning and growing in the field of software development and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits...


  • San Francisco, United States WEX, Inc. Full time

    About the RoleThe WEX Site Reliability Engineering (SRE) team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits Reliability organization which supports our internal...


  • San Francisco, California, United States WEX Inc Full time

    The WEX Site Reliability Engineering team is looking for a motivated Site Reliability Engineer to join our Benefits Reliability organization. As a member of our team, you will be responsible for ensuring the reliability, performance, and security of our systems.Key Responsibilities:Learning and Development: Participate in training and mentorship programs to...


  • San Francisco, United States WEX Full time

    About the Role The WEX Site Reliability Engineering (SRE) team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits Reliability organization which supports our internal...


  • San Francisco, United States Apollo Solutions Full time

    Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking artifical inteligence business who are making major developments in how we use AI/ML for gaming/security. They are working closely with government contracts as well as gaming consoles companys and are now searching for an SRE to join their growing team. The Site Reliability...


  • San Francisco, United States Mindlance Full time

    Job Brief: As a Senior Software Delivery Engineer, you'll be on a team building a secure, compliant SaaS platform for Federal government-led construction projects. This will involve building and adopting tools to build, test, and deploy software to run in a dedicated environment that meets all controls for, and is authorized for use at, the FedRAMP Moderate...


  • San Francisco, United States Focal Systems Full time

    Location: San Francisco - hybrid (1-2 days per week)Salary: $170-190k + stockCompany DescriptionFocal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. We are a Deep Learning first company. Our mission is to automate and optimize brick and mortar retail...


  • San Jose, United States NInfo Systems, Inc. Full time

    Company DescriptionNInfo Systems Inc. is a Certified minority-owned national IT Recruiting and Solutions provider with two decades of experience. It works with Fortune 500 corporations, mid-sized companies, Boutique Consulting companies, startups, SME-level organizations, Federal/ State agencies, and tier-one vendors.Role: Senior Reliability Engineer, Hybrid...


  • San Francisco, United States Ellation, Inc. Full time

    Who We AreWe‘re a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...


  • San Francisco, United States Ellation, Inc. Full time

    Who We AreWe‘re a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...


  • San Francisco, United States Unreal Gigs Full time

    Are you passionate about building and maintaining resilient systems that ensure high availability and performance? Do you excel at automating processes, troubleshooting complex issues, and creating systems that scale smoothly? If you're ready to take on the challenge of ensuring reliable, efficient, and secure system operations, our client has the perfect...


  • San Francisco, United States Cisco Systems, Inc. Full time

    Who We AreCisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network - even the ones they don't own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to proactively detect, diagnose, and...