Staff Site Reliability Engineer

1 month ago


Newton MA USA, United States CyberArk Full time
About CyberArk

CyberArk is the global leader in Identity Security, providing the most comprehensive security offering for any identity - human or machine - across business applications, distributed workforces, hybrid cloud workloads, and throughout the DevOps lifecycle. The world's leading organizations trust CyberArk to help secure their most critical assets.

Job Description

CyberArk is seeking a FedRAMP Staff Site Reliability Engineer to bring their knowledge, excitement, and energy to the team. If you have worked in the cloud solving large-scale problems, bringing visibility into your platform, and accomplishing true automation, we want you on the team. Driven and excited to innovate is what we need, all while allowing you to grow professionally and creating strong relationships that will last a lifetime.

Key Responsibilities
  • Architect, lead, and design future deployment and management automation for CyberArk's cloud-based infrastructure and software.
  • Provide guidance to Site Reliability Engineers on managing the reliability and performance of SaaS environments and building automation to prevent recurring issues.
  • Architect, develop, and guide the team with the use of cloud configuration management, deployment, and compliance tools such as CloudFormation, Helm, Kubernetes, Terraform, Salt, and Ansible across both Windows and Linux environments.
  • Ensure cloud-based architectures meet availability and recoverability requirements.
  • Implement best practices for cloud-based monitoring, alerting, and observability using tools like PagerDuty, CloudWatch, Grafana, Datadog, and OpenSearch.
  • Support and guide tooling initiatives that enhance team output and reliability.
  • Develop and continuously improve automation of manual processes.
  • Collaborate with engineering and product teams to identify areas for improvement, prepare architecture roadmaps, and advocate to the Product Management group.
  • Respond to production incidents and participate in on-call rotations.
Qualifications
  • B.S. in Computer Science or equivalent experience.
  • Minimum 5 years of experience managing AWS infrastructure.
  • Minimum of 7 years in a senior, architect, or technical lead role of site reliability, systems engineering, or software development.
  • A deep understanding of Site Reliability, infrastructure, and Cloud Platforms.
  • Solid understanding/experience of web services, databases, and relating infrastructure/architectures.
  • Previous experience with FedRAMP or DOD compliance requirements and audits.
  • Strong level of scripting and automation expertise, using Python or an equivalent language.
  • Proven track record of managing reliability and performance for large-scale, enterprise-level SaaS environments.
  • Strong analytical and problem-solving abilities, with a proactive approach to identify and mitigate issues.
  • Extensive experience designing and managing AWS infrastructure components including VPC, ELB/ALB, IAM, KMS, EC2, Route53, AWS Config, CloudTrail, CloudFormation across both AWS commercial and GovCloud Regions.
  • Must be a U.S Citizen or Green Card Holder for meeting with FedRAMP High authorized access requirements.


  • Newton, MA, USA, United States Intelliswift Software Inc Full time

    Site Reliability EngineerWe are seeking a skilled Site Reliability Engineer to join our dynamic team at Intelliswift Software Inc. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities:System Monitoring and Incident Response: Monitor...


  • Newton, MA, USA, United States Intelliswift Full time

    Job Title: Site Reliability EngineerJob Summary:We are seeking a skilled Site Reliability Engineer to join our dynamic team at Intelliswift. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities:System Monitoring and Incident...


  • Newton, MA, USA, United States Cypress HCM Full time

    Job SummaryWe are seeking a skilled Site Reliability Engineer to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities:System Monitoring and Incident Response: Monitor system health, performance metrics, and...


  • Newton, MA, USA, United States Software Guidance and Assistance, Inc. Full time

    Job Title: Site Reliability EngineerSoftware Guidance and Assistance, Inc. (SGA) is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure.Key Responsibilities:System Monitoring and Incident Response:...


  • Newton, MA, United States Intelliswift Full time

    Site Reliability Engineer 2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team at Intelliswift. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities: System Monitoring and Incident Response:...


  • Newton, Massachusetts, United States Software Guidance and Assistance, Inc. Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Software Guidance and Assistance, Inc. (SGA). As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud-based systems.Responsibilities:Monitor system health, performance...


  • Boston, MA , USA, United States Insight Global Full time

    Site Reliability Engineering ManagerA leading retail company in the $7 billion industry is seeking a Site Reliability Engineering Manager to lead a team of 7-10 Site Reliability Engineers in Boston, MA.Key Responsibilities:Lead a team of Site Reliability Engineers in supporting and monitoring production for the eCommerce platform.Develop and implement...


  • Newton, United States Cypress HCM Full time

    Site Reliability Engineer 2 Description:Reason: Special Project |Department: Stock US Eng | 6 MonthsJob SummaryWe are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance....


  • Newton, Massachusetts, United States Software Guidance & Assistance Full time

    Job Title: Site Reliability EngineerSoftware Guidance & Assistance, Inc. (SGA) is seeking a skilled Site Reliability Engineer to join our team for a contract assignment with a premier SaaS client in Newton, MA.Responsibilities:Monitor system health, performance metrics, and availability, and respond promptly to incidents and outages to ensure minimal...


  • Newton, United States Intelliswift Software Full time

    Title : Site Reliability EngineerLocation : Newton, MA HybridDuration : 6 MonthsPay rate : $38.73 per hour on W2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and...


  • Newton, Massachusetts, United States Akraya Inc. Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at Akraya Inc. The ideal candidate will possess expertise in system monitoring, infrastructure management, automation, and a keen interest in enhancing system reliability.Key ResponsibilitiesEnsure system health and responsiveness to incidents with minimal downtime.Optimize...


  • Newton, Massachusetts, United States NextDeavor Full time

    Site Reliability Engineer 2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team at NextDeavor. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities: Monitor system health, performance metrics, and...


  • Newton, MA, United States Intelliswift Software Full time

    Title : Site Reliability EngineerLocation : Newton, MA HybridDuration : 6 MonthsPay rate : $38.73 per hour on W2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and...


  • Newton, United States Software Guidance & Assistance Full time

    Software Guidance & Assistance, Inc., (SGA), is searching for a Site Reliability Engineer for a contract assignment with one of our premier SaaS clients in Newton, MA. Responsibilities : System Monitoring and Incident Response: Monitor system health, performance metrics, and availability. Respond promptly to incidents and outages, ensuring minimal...


  • Newton, United States Software Guidance & Assistance Full time

    Software Guidance & Assistance, Inc., (SGA), is searching for a Site Reliability Engineer for a contract assignment with one of our premier SaaS clients in Newton, MA or Fully Remote. Responsibilities : System Monitoring and Incident Response: Monitor system health, performance metrics, and availability. Respond promptly to incidents and outages,...


  • Newton, United States Intelliswift Full time

    Job ID: 24-05261 Site Reliability Engineer Newton, MA Hybrid 6 Months $38.73 per hour on W2 We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance. You will play a...


  • Newton, Massachusetts, United States Intelliswift Full time

    Job Title: Site Reliability Engineer 2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team at Intelliswift. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities:System Monitoring and Incident...


  • Washington, DC , USA, United States Radius Networks Inc Full time

    About Radius Networks IncRadius Networks Inc is the global leader in location technology solutions, powering some of the world's largest restaurant, grocery, retail, and hospitality brands with its Flybuy platform. Flybuy helps companies deliver a seamless customer experience, boost loyalty, and drive efficient staff operations.Job SummaryWe're seeking a...


  • Newton, United States TalentBridge Full time

    Role: Site Reliability Engineer 2Location: Newton MADuration: Long Term Job Summary: We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance. You will play a crucial...


  • Newton, Massachusetts, United States Cypress HCM Full time

    Job DescriptionWe are seeking a skilled Site Reliability Engineer 2 to join our team at Cypress HCM. As a key member of our infrastructure team, you will play a crucial role in ensuring the seamless operation of our services.Key Responsibilities:System Monitoring and Incident Response: Monitor system health, performance metrics, and availability. Respond...