SRE Lead Engineer
3 weeks ago
We are seeking a highly skilled SRE Lead Engineer to help lead transformational initiatives within IT operations, encompassing development as well. As a crucial figure in this role, you will participate in designing and implementing cutting-edge SRE solutions, driving the transformation of IT operations organizations to adopt an engineering-centric approach.
- Participate in the design and architecture of reliable, scalable, and high-performance systems and services with a focus on operational excellence, availability, and performance.
- Evangelize SRE evolution within IT operations and promote a culture of engineering excellence and best practices.
- Define best practices and principles for SRE, including incident management, monitoring, alerting, and automation.
- Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind.
- Implement monitoring systems to assess the performance of applications and infrastructure, and proactively identify areas for optimization.
- Understand incident and problem management processes, post-mortems, and drive improvements to prevent future incidents.
- Analyze resource utilization patterns and forecast future capacity needs to ensure optimal performance and cost-efficiency.
- Ensure that SRE practices align with security and compliance requirements and implement measures to protect systems and data.
- Operational excellence with a focus on automation and developing tools to streamline operational tasks and increase efficiency.
- Provide guidance and mentorship to SRE teams, fostering skill development, and building a strong and capable SRE practice.
- Ability to develop close relationships with other operational teams to integrate SRE practices and drive overall operational improvements across the enterprise.
- Stay up to date on industry trends, new technologies, and best practices in SRE and apply relevant advancements to the organization.
Qualifications:
- Around 10-12 years of SRE hands-on experience with cloud technologies, development, SRE toolsets, and automation.
- Strong hands-on experience with any Cloud Technology (AWS): Control Tower, Project Setup, Creating Accounts, RDS, SSO.
- Solid understanding and hands-on experience with Docker/Kubernetes.
- Should have good experience with Linux Commands, GitLab CICD Setup, and Terraform (state management, etc).
- Monitoring & alerting setup experience with Splunk, Prometheus, Grafana, Kibana, ELK, etc.
- Hands-on APM Tool/s experience, preferably Datadog or AppDynamics or Dynatrace.
- Good understanding of Observability Framework leveraging programmatic SLI/SLO blueprints to standardize the collection of golden signals.
- Should have automation (data refresh, releases, DB snapshots) experience using Ansible or any other scripting languages.
- Experience with following languages (Groovy-DSL, Java, Python, Yaml, and microservices architecture).
- Good understanding and hands-on experience with MQ, Kafka.
- Experience with Databases (Oracle, MySQL).
Good to have:
- Any of the relevant professional certifications - Certified Site Reliability Engineer (CSRE), Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer Professional, Google Cloud Professional; DevOps Engineer.
-
SRE Lead Engineer
4 weeks ago
New York, New York, United States Atika Technologies Full timeJob Title: SRE Lead EngineerWe are seeking a highly skilled SRE Lead Engineer to help lead transformational initiatives within IT operations, encompassing development as well.Key Responsibilities:Participate in the design and architecture of reliable, scalable, and high-performance systems and services with a focus on operational excellence, availability,...
-
SRE Engineering Manager
3 weeks ago
New York, New York, United States Hispanic Technology Executive Council Full timeAs a key member of the Hispanic Technology Executive Council, we are seeking an exceptional SRE Engineering Manager to drive operational excellence and foster collaboration within our Site Reliability Engineering team.This role requires a unique blend of technical expertise, leadership abilities, and exceptional organisational skills to improve the...
-
SRE Delivery Lead
3 weeks ago
New York, New York, United States Wells Fargo Full timeAbout this role:Wells Fargo is seeking an experienced SRE Delivery Lead to direct our Site Reliability Engineering team. The ideal candidate will oversee the reliability, scalability, and efficiency of our infrastructure and services, driving innovation and best practices across the organization.Key Responsibilities: Develop and execute the overall SRE...
-
Principal Engineer SRE
4 weeks ago
New York, New York, United States LSEG (London Stock Exchange Group) Full timeJob Title: Principal Engineer SREAre you ready to take on a challenging role in the fast-paced world of finance and technology? If you have a passion for innovation, a thirst for challenge, and a love for all things fintech, then this might be the perfect opportunity for you.LSEG (London Stock Exchange Group) is a leading global financial markets...
-
SRE Engineer Position in Chicago
4 weeks ago
New York, New York, United States Spruce Infotech Full timeJob Title: SRE Engineer Position in ChicagoAbout the Role:We are seeking an experienced SRE Engineer to join our team at Spruce Infotech. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and highly available systems. Your expertise in Linux systems, networking, monitoring, databases,...
-
SRE Engineer Position in Chicago
3 weeks ago
New York, New York, United States Spruce Infotech Full timeJob Opportunity for SRE EngineerWe are seeking an experienced SRE Engineer to join our team at Spruce Infotech in Chicago. The ideal candidate will have a strong background in Linux systems, networking, monitoring, databases, containers, and cloud technologies.Key ResponsibilitiesDesign and implement automations to auto-remediate issues in production,...
-
SRE Engineering Manager
4 weeks ago
New York, New York, United States Citigroup Inc Full timeJob DescriptionAs a seasoned Engineering Manager, you will play a pivotal role in driving operational excellence within the Site Reliability Engineering (SRE) team. Your primary objective will be to improve the productivity of engineers, create effective reporting mechanisms, and ensure timely and budget-conscious completion of work. You will be responsible...
-
Software Engineering/SRE Team Leader
4 weeks ago
New York, New York, United States Bloomberg Full timeAbout BloombergBloomberg is the market leader for financial data and workflows globally, servicing hundreds of thousands of financial professionals across a wide variety of roles.Job DescriptionWe are seeking an experienced engineering leader to join our Platform Security group. The successful candidate will be responsible for leading a team of engineers in...
-
SRE Architect
3 weeks ago
New York, New York, United States CAPGEMINI ENGINEERING Full timeAt Capgemini Engineering, we're looking for an experienced SRE Lead to join our team. As a key member of our organization, you'll be responsible for leading the transformation of clients through technology by leveraging your knowledge and curiosity of current and future enterprise technologies, methods, and approaches.Key Responsibilities:Enable clients to...
-
Lead SRE Engineer, Post Trade Solutions
4 weeks ago
New York, New York, United States LSEG (London Stock Exchange Group) Full timeJob SummaryThe successful candidate for the Site Reliability Engineer role will be reporting to the Head of Client Digital Technology, Post Trade. They will develop an AWS cloud-based infrastructure supporting the Equities matching and confirmation service. The SRE will be part of an Engineering team provisioning the required platform, using Infrastructure...
-
Principal SRE Leader
4 weeks ago
New York, New York, United States SS&C Technologies Full timeJob Title: Principal SRESS&C Technologies is seeking a highly skilled Principal SRE to lead our team of security engineers in enhancing the security posture of our Cloud First AWS platforms.Key Responsibilities:Manage and mentor a team of security engineers, fostering a collaborative and innovative environment.Provide technical guidance and support to team...
-
SRE Architect
4 weeks ago
New York, New York, United States Capgemini Full timeJob DescriptionCapgemini is seeking an experienced SRE Lead to join our team. As a key member of our Cloud Operations team, you will be responsible for leading the transformation of clients through technology by leveraging your knowledge and curiosity of current and future enterprise technologies, methods, and approaches.Key Responsibilities:Enable clients...
-
Software Engineering/SRE Team Leader
3 weeks ago
New York, New York, United States Bloomberg Full timeBloomberg is a leading financial data and workflows company, serving hundreds of thousands of professionals worldwide. Our Platform Security group works closely with Information Security and Security Operations to protect our information and products, enable secure development, provide data and intelligence for system protection, and ensure workplace agility...
-
Senior Lead SRE Engineer, Post Trade Solutions
4 weeks ago
New York, New York, United States LSEG (London Stock Exchange Group) Full timeJob DescriptionWe are seeking a highly skilled Senior Lead SRE Engineer to join our Post Trade Solutions team. As a key member of our engineering team, you will be responsible for designing, building, and maintaining a cloud-based infrastructure to support our Equities matching and confirmation service.The successful candidate will have a strong background...
-
SRE Architect
4 weeks ago
New York, New York, United States CAPGEMINI ENGINEERING Full timeJob DescriptionAs a seasoned SRE Architect at Capgemini Engineering, you will be responsible for leading the transformation of clients through technology by leveraging your knowledge and curiosity of current and future enterprise technologies, methods, and approaches. Enable clients to navigate and scale adoption of New IT methodologies and operating models...
-
SRE Support Specialist
4 weeks ago
New York, New York, United States Huntress Talent Full timeJob OverviewWe are seeking a highly skilled SRE Support Engineer to join our team at Huntress Talent. The ideal candidate will have a strong technical background, solid analytical skills, and proven problem-solving experience.Key ResponsibilitiesAct as a product expert, reviewing and responding to software and hardware issues, evaluating product...
-
SRE Architect
4 weeks ago
New York, New York, United States Capgemini Full timeJob DescriptionCapgemini is seeking an experienced SRE Lead to join our team. As an SRE Lead, you will be responsible for leading the transformation of clients through technology by leveraging your knowledge and curiosity of current and future enterprise technologies, methods, and approaches.Key Responsibilities:Enable clients to navigate and scale adoption...
-
SRE Architect
4 weeks ago
New York, New York, United States Capgemini Full timeJob Description:Capgemini is seeking an experienced SRE Lead to join our team. As an SRE Lead, you will be responsible for leading the transformation of clients through technology by leveraging your knowledge and curiosity of current and future enterprise technologies, methods, and approaches.Key Responsibilities:Enable clients to navigate and scale adoption...
-
Reliability Engineering Lead
4 days ago
New York, New York, United States Trumid Full timeAbout UsTrumid is a pioneering fintech that's revolutionizing fixed income trading. Our cutting-edge electronic solutions are empowering us to grow rapidly, and we're seeking exceptional talent to redefine the intersection of technology and finance.The OpportunityWe're looking for a Lead Site Reliability Engineer (SRE) to ensure our systems' reliability,...
-
Senior Software Engineer/SRE
4 weeks ago
New York, New York, United States Bloomberg Full timeAbout the RoleWe are seeking a highly skilled Software Engineer to join our team at Bloomberg, where you will play a critical role in designing and developing predictive data models for our system capacity. As a member of our SRE team, you will be responsible for building systems capable of early detection of issues through metrics and signals, and...