Senior Cloud Reliability Engineer
4 days ago
We are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at AEG. As a key member of our Cloud Operations team, you will be responsible for leading and mentoring our SRE and TechOps teams with a focus on automation to drive accountability, efficiency, and continuous improvement.
Key Responsibilities- Build and Maintain Observability Frameworks: Develop and implement observability frameworks to monitor the health and performance of our services, ensuring uptime and reliability.
- Incident Response and On-Call Support: Be the first line of defense in troubleshooting and resolving incidents without relying on runbooks, using strong problem-solving skills.
- API Testing: Perform thorough API testing for published content using tools like Postman and Cypress to ensure accuracy and performance.
- Infrastructure as Code: Utilize Terraform for managing infrastructure, including ServiceNow integrations, and automate workflows.
- Monitoring and Logging: Leverage Datadog, or equivalent tools such as New Relic or Splunk, to set up monitoring, logging, and alerting systems.
- Collaboration and Communication: Work closely with cross-functional teams, including developers, operations, and product managers, to ensure seamless integration and deployment of services.
- AWS Resources Management: Manage and optimize AWS resources, including EKS and ECS, to ensure scalability and cost-efficiency.
- CI/CD Pipeline Management: Use GitLab pipelines for continuous integration and deployment, ensuring smooth and automated delivery of code changes.
- Integration: Integrate tools like ServiceNow with Slack or Asana to streamline workflows and enhance team communication.
- 7+ years of experience in Cloud Expertise and Technical Operations, with 5+ years in architecting and managing cloud solutions (AWS, Azure, Google Cloud).
- Proven background in complex technology operations environments, including infrastructure, network, security, and incident management.
- Proficiency in implementing automation tools and a proven ability to drive automation excellence within the organization.
- Strong team leadership skills, with experience in managing or mentoring roles within technology operations (ITSM/ITOM).
- Cloud First mindset, with experience in AWS and familiarity with GCP and Azure.
- Programming Languages: Proficiency in Python, with familiarity in Go, React/React Native.
- Infrastructure as Code: Experience with Terraform.
- API Data Quality checks and Frontend Testing: Hands-on experience with Cypress, Postman, and monitoring tools like Datadog (or equivalents like New Relic or Splunk).
- Cloud Infrastructure: Strong understanding of AWS services, particularly EKS and ECS.
- CI/CD Pipelines: Experience with GitLab for managing pipelines and automating deployments.
- Observability: Expertise in setting up and maintaining observability frameworks to monitor and improve system reliability.
- Troubleshooting: Excellent problem-solving and analytical abilities.
- Startup Mentality: Comfortable in a fast-paced environment where wearing multiple hats is the norm.
- Collaborative: Strong team player with excellent communication skills.
- Curiosity and Passion: A genuine passion for technology, with a strong desire to learn and explore new tools and methodologies.
- Love of Learning: While a formal computer science degree is required, a solid foundation in coding and problem-solving, whether self-taught or through experience, is essential.
We offer a competitive salary range of $130,000 - $155,000, comprehensive benefits, and a hybrid Office/Remote Work Schedule to promote Work-Life balance. We are an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status.
-
Senior Cloud Site Reliability Engineer
2 weeks ago
New York, New York, United States Federal Reserve Bank Full timeJob SummaryWe are seeking a highly skilled Senior Cloud Site Reliability Engineer to join our Enterprise Support Infrastructure Engineering team within the Bank's Technology Group. As a key member of our team, you will be responsible for providing infrastructure technology capabilities to enable cloud hosting for business applications.Key...
-
Senior Cloud Site Reliability Engineer
1 week ago
New York, New York, United States Federal Reserve Bank Full timeJob SummaryThe Federal Reserve Bank of New York is seeking a Senior Cloud Site Reliability Engineer to join our Enterprise Support Infrastructure Engineering team. As a key member of our team, you will be responsible for providing infrastructure technology capabilities to enable cloud hosting for business applications.You will work closely with our service...
-
Senior Cloud Engineer
2 weeks ago
New York, New York, United States Bloomberg Full timeJob Title: Senior Cloud EngineerBloomberg's Public Cloud Solutions team is seeking a highly skilled Senior Cloud Engineer to join our team. As a Senior Cloud Engineer, you will be responsible for designing, building, and maintaining cloud-based infrastructure and applications.Key Responsibilities:Design and implement scalable cloud-based solutions across...
-
Staff Cloud Reliability Engineer
2 weeks ago
New York, New York, United States Celonis GmbH Full timeAbout the RoleCelonis is seeking a highly skilled Cloud Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for designing, implementing, and managing cloud-based applications and platforms that meet our high standards for reliability and scalability.Key ResponsibilitiesDesign and implement cloud-based applications...
-
Cloud Service Reliability Engineer
4 weeks ago
New York, New York, United States Forhyre Full timeJob Title: Cloud Service Reliability EngineerWe are seeking a skilled Cloud Service Reliability Engineer to join our team at Forhyre. As a Cloud Service Reliability Engineer, you will be responsible for designing, implementing, and maintaining systems that ensure the reliability and efficiency of our cloud-based services.Key Responsibilities:Develop and...
-
Cloud Service Reliability Engineer
4 weeks ago
New York, New York, United States Forhyre Full timeJob Title: Cloud Service Reliability EngineerWe are seeking a skilled Cloud Service Reliability Engineer to join our team at Forhyre. As a Cloud Service Reliability Engineer, you will be responsible for designing, implementing, and maintaining systems on premise or in the cloud, focusing on identity and access management, cloud computing...
-
Cloud Reliability Engineering Manager
3 weeks ago
New York, New York, United States Syndio Full timeJob Title: Cloud Reliability Engineering ManagerAt Syndio, we're seeking a seasoned Cloud Reliability Engineering Manager to join our team. As a key member of our engineering leadership, you will be responsible for defining and driving a vision for cloud reliability engineering, aligning it with broader engineering and business goals.About the RoleThis is a...
-
Site Reliability Engineer
2 weeks ago
New York, New York, United States Diverse Lynx Full timeJob Title: Site Reliability Engineer - Cloud Expert Job Summary: We are seeking a highly skilled Site Reliability Engineer with expertise in cloud engineering to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems. Responsibilities: *...
-
Cloud Reliability Engineering Manager
4 weeks ago
New York, New York, United States Syndio Full timeJob Title: Cloud Reliability Engineering ManagerAt Syndio, we're seeking a seasoned Cloud Reliability Engineering Manager to join our team. As a key member of our engineering leadership, you will be responsible for defining and driving a vision for cloud reliability engineering, aligning it with broader engineering and business goals.About the RoleThis is a...
-
Cloud Security Site Reliability Engineer
1 month ago
New York, New York, United States Citigroup Inc Full timeJob Title: Cloud Security Site Reliability EngineerCitigroup Inc. is seeking a highly skilled Cloud Security Site Reliability Engineer to join our team. As a key member of our Cloud Security team, you will be responsible for ensuring the security and reliability of our cloud-based systems and applications.Job Summary:The Cloud Security Site Reliability...
-
Senior Cloud Engineer
2 weeks ago
New York, New York, United States Comcast Full timeJob DescriptionComcast is seeking a highly skilled Senior Cloud Engineer to join our team. As a key member of our cloud infrastructure team, you will be responsible for designing, architecting, and implementing scalable and reliable cloud infrastructure on AWS.Key Responsibilities:Design and implement cloud infrastructure solutions using AWS services,...
-
Senior Cloud Engineer
1 week ago
New York, New York, United States Squarespace Full timeAbout the RoleWe are seeking an experienced Senior Cloud Engineer to join our Compute team. As a key member of our infrastructure engineering team, you will be responsible for designing, building, and maintaining our cloud-based infrastructure. Your expertise will help us ensure the reliability and scalability of our systems, enabling us to deliver...
-
Senior Site Reliability Engineer, Compute
2 weeks ago
New York, New York, United States Squarespace Full timeAbout the RoleWe are seeking an experienced Senior Site Reliability Engineer to join our Compute team. As a key member of our infrastructure engineering team, you will play a critical role in ensuring the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign, develop, and maintain scalable and highly available cloud-based...
-
Senior IT Site Reliability Engineer
4 weeks ago
New York, New York, United States Hudson River Trading Full timeSenior IT Site Reliability EngineerHudson River Trading (HRT) is a leading financial services company that leverages a scientific approach to trading. We are seeking a highly skilled Senior IT Site Reliability Engineer to join our IT Solutions Delivery team.This team is responsible for developing and maintaining the corporate productivity stack for the...
-
Senior Site Reliability Engineer
2 weeks ago
New York, New York, United States Major League Soccer Full timeJob Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Major League Soccer. As a key member of our technical operations team, you will be responsible for ensuring the reliability, performance, and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement...
-
Cloud and Site Reliability Engineering Manager
4 weeks ago
New York, New York, United States Syndio Full timeJob OverviewSyndio is seeking a highly skilled Cloud and Site Reliability Engineering Manager to join our team. As a key member of our engineering leadership team, you will be responsible for leading a team of SREs, PE's, and COE's in designing, implementing, and operating production systems using best practices in automation, monitoring, and...
-
Senior Cloud Infrastructure Engineer
4 weeks ago
New York, New York, United States Sigma Software Full timeJob Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Sigma Software. As a key member of our engineering team, you will be responsible for designing, building, and maintaining our cloud infrastructure and observability solutions.About the Role:This is an exciting opportunity to work...
-
New York, New York, United States Syndio Full timeJob Title: Cloud and Site Reliability Engineering ManagerAt Syndio, we're seeking a highly skilled Cloud and Site Reliability Engineering Manager to join our team. As a key member of our engineering leadership, you will be responsible for defining and driving a vision for our cloud platform, ensuring it is scalable, reliable, and secure.About the RoleThis is...
-
Senior Cloud Software Engineer
1 week ago
New York, New York, United States Mark43 Full timeAbout the RoleWe are seeking a highly skilled Senior Cloud Software Engineer to join our team at Mark43. As a key member of our engineering team, you will be responsible for designing, developing, and deploying cloud-based software solutions that meet the needs of our customers.Key Responsibilities:Design and develop scalable, secure, and reliable...
-
Cloud and Site Reliability Engineering Manager
4 weeks ago
New York, New York, United States Syndio Full timeJob DescriptionEmpower organizations to achieve fairness and equity in the workplace by joining Syndio, a Series-C technology company committed to creating diverse and inclusive workplaces. As a Cloud and Site Reliability Engineering Manager, you will play a critical role in defining and driving a vision for the organization's cloud platform, ensuring it is...