Staff Site Reliability Engineer
1 month ago
CyberArk is the global leader in Identity Security, providing the most comprehensive security offering for any identity - human or machine - across business applications, distributed workforces, hybrid cloud workloads, and throughout the DevOps lifecycle. The world's leading organizations trust CyberArk to help secure their most critical assets.
Job DescriptionCyberArk is seeking a FedRAMP Staff Site Reliability Engineer to bring their knowledge, excitement, and energy to the team. If you have worked in the cloud solving large-scale problems, bringing visibility into your platform, and accomplishing true automation, we want you on the team. Driven and excited to innovate is what we need, all while allowing you to grow professionally and creating strong relationships that will last a lifetime.
Key Responsibilities- Architect, lead, and design future deployment and management automation for CyberArk's cloud-based infrastructure and software.
- Provide guidance to Site Reliability Engineers on managing the reliability and performance of SaaS environments and building automation to prevent recurring issues.
- Architect, develop, and guide the team with the use of cloud configuration management, deployment, and compliance tools such as CloudFormation, Helm, Kubernetes, Terraform, Salt, and Ansible across both Windows and Linux environments.
- Ensure cloud-based architectures meet availability and recoverability requirements.
- Implement best practices for cloud-based monitoring, alerting, and observability using tools like PagerDuty, CloudWatch, Grafana, Datadog, and OpenSearch.
- Support and guide tooling initiatives that enhance team output and reliability.
- Develop and continuously improve automation of manual processes.
- Collaborate with engineering and product teams to identify areas for improvement, prepare architecture roadmaps, and advocate to the Product Management group.
- Respond to production incidents and participate in on-call rotations.
- B.S. in Computer Science or equivalent experience.
- Minimum 5 years of experience managing AWS infrastructure.
- Minimum of 7 years in a senior, architect, or technical lead role of site reliability, systems engineering, or software development.
- A deep understanding of Site Reliability, infrastructure, and Cloud Platforms.
- Solid understanding/experience of web services, databases, and relating infrastructure/architectures.
- Previous experience with FedRAMP or DOD compliance requirements and audits.
- Strong level of scripting and automation expertise, using Python or an equivalent language.
- Proven track record of managing reliability and performance for large-scale, enterprise-level SaaS environments.
- Strong analytical and problem-solving abilities, with a proactive approach to identify and mitigate issues.
- Extensive experience designing and managing AWS infrastructure components including VPC, ELB/ALB, IAM, KMS, EC2, Route53, AWS Config, CloudTrail, CloudFormation across both AWS commercial and GovCloud Regions.
- Must be a U.S Citizen or Green Card Holder for meeting with FedRAMP High authorized access requirements.
-
Site Reliability Engineer
4 weeks ago
Newton, MA, USA, United States Intelliswift Software Inc Full timeSite Reliability EngineerWe are seeking a skilled Site Reliability Engineer to join our dynamic team at Intelliswift Software Inc. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities:System Monitoring and Incident Response: Monitor...
-
Site Reliability Engineer
4 weeks ago
Newton, MA, USA, United States Intelliswift Full timeJob Title: Site Reliability EngineerJob Summary:We are seeking a skilled Site Reliability Engineer to join our dynamic team at Intelliswift. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities:System Monitoring and Incident...
-
Site Reliability Engineer
4 weeks ago
Newton, MA, USA, United States Cypress HCM Full timeJob SummaryWe are seeking a skilled Site Reliability Engineer to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities:System Monitoring and Incident Response: Monitor system health, performance metrics, and...
-
Site Reliability Engineer
4 weeks ago
Newton, MA, USA, United States Software Guidance and Assistance, Inc. Full timeJob Title: Site Reliability EngineerSoftware Guidance and Assistance, Inc. (SGA) is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure.Key Responsibilities:System Monitoring and Incident Response:...
-
Site Reliability Engineer 2
3 weeks ago
Newton, MA, United States Intelliswift Full timeSite Reliability Engineer 2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team at Intelliswift. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities: System Monitoring and Incident Response:...
-
Site Reliability Engineer
4 weeks ago
Newton, Massachusetts, United States Software Guidance and Assistance, Inc. Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Software Guidance and Assistance, Inc. (SGA). As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud-based systems.Responsibilities:Monitor system health, performance...
-
Site Reliability Engineering Manager
4 weeks ago
Boston, MA , USA, United States Insight Global Full timeSite Reliability Engineering ManagerA leading retail company in the $7 billion industry is seeking a Site Reliability Engineering Manager to lead a team of 7-10 Site Reliability Engineers in Boston, MA.Key Responsibilities:Lead a team of Site Reliability Engineers in supporting and monitoring production for the eCommerce platform.Develop and implement...
-
Site Reliability Engineer
2 months ago
Newton, United States Cypress HCM Full timeSite Reliability Engineer 2 Description:Reason: Special Project |Department: Stock US Eng | 6 MonthsJob SummaryWe are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance....
-
Site Reliability Engineer
4 weeks ago
Newton, Massachusetts, United States Software Guidance & Assistance Full timeJob Title: Site Reliability EngineerSoftware Guidance & Assistance, Inc. (SGA) is seeking a skilled Site Reliability Engineer to join our team for a contract assignment with a premier SaaS client in Newton, MA.Responsibilities:Monitor system health, performance metrics, and availability, and respond promptly to incidents and outages to ensure minimal...
-
Site Reliability Engineer
4 weeks ago
Newton, United States Intelliswift Software Full timeTitle : Site Reliability EngineerLocation : Newton, MA HybridDuration : 6 MonthsPay rate : $38.73 per hour on W2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and...
-
Site Reliability Engineer
4 weeks ago
Newton, Massachusetts, United States Akraya Inc. Full timeJob SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at Akraya Inc. The ideal candidate will possess expertise in system monitoring, infrastructure management, automation, and a keen interest in enhancing system reliability.Key ResponsibilitiesEnsure system health and responsiveness to incidents with minimal downtime.Optimize...
-
Site Reliability Engineer 2
3 weeks ago
Newton, Massachusetts, United States NextDeavor Full timeSite Reliability Engineer 2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team at NextDeavor. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities: Monitor system health, performance metrics, and...
-
Site Reliability Engineer
7 days ago
Newton, MA, United States Intelliswift Software Full timeTitle : Site Reliability EngineerLocation : Newton, MA HybridDuration : 6 MonthsPay rate : $38.73 per hour on W2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and...
-
Site Reliability Engineer
4 weeks ago
Newton, United States Software Guidance & Assistance Full timeSoftware Guidance & Assistance, Inc., (SGA), is searching for a Site Reliability Engineer for a contract assignment with one of our premier SaaS clients in Newton, MA. Responsibilities : System Monitoring and Incident Response: Monitor system health, performance metrics, and availability. Respond promptly to incidents and outages, ensuring minimal...
-
Site Reliability Engineer
4 weeks ago
Newton, United States Software Guidance & Assistance Full timeSoftware Guidance & Assistance, Inc., (SGA), is searching for a Site Reliability Engineer for a contract assignment with one of our premier SaaS clients in Newton, MA or Fully Remote. Responsibilities : System Monitoring and Incident Response: Monitor system health, performance metrics, and availability. Respond promptly to incidents and outages,...
-
Site Reliability Engineer 2
6 days ago
Newton, United States Intelliswift Full timeJob ID: 24-05261 Site Reliability Engineer Newton, MA Hybrid 6 Months $38.73 per hour on W2 We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance. You will play a...
-
Site Reliability Engineer 2
4 weeks ago
Newton, Massachusetts, United States Intelliswift Full timeJob Title: Site Reliability Engineer 2We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team at Intelliswift. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance.Key Responsibilities:System Monitoring and Incident...
-
Site Reliability Engineer
4 weeks ago
Washington, DC , USA, United States Radius Networks Inc Full timeAbout Radius Networks IncRadius Networks Inc is the global leader in location technology solutions, powering some of the world's largest restaurant, grocery, retail, and hospitality brands with its Flybuy platform. Flybuy helps companies deliver a seamless customer experience, boost loyalty, and drive efficient staff operations.Job SummaryWe're seeking a...
-
Site Reliability Engineer 2
4 weeks ago
Newton, United States TalentBridge Full timeRole: Site Reliability Engineer 2Location: Newton MADuration: Long Term Job Summary: We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance. You will play a crucial...
-
Site Reliability Engineer 2
3 weeks ago
Newton, Massachusetts, United States Cypress HCM Full timeJob DescriptionWe are seeking a skilled Site Reliability Engineer 2 to join our team at Cypress HCM. As a key member of our infrastructure team, you will play a crucial role in ensuring the seamless operation of our services.Key Responsibilities:System Monitoring and Incident Response: Monitor system health, performance metrics, and availability. Respond...