Site Reliability Engineer
4 weeks ago
We're a leading provider of integrated payment solutions, dedicated to securing and optimizing payments for global commerce. Our mission is to deliver a complete payments optimization platform that sets the standard for tokenization and transaction routing.
Job OverviewWe're seeking a skilled Site Reliability Engineer to join our team. As a key member of our IT operations team, you'll be responsible for ensuring the reliability and scalability of our systems, leveraging development skills and a mindset to drive automation and continuous integration and delivery.
Key Responsibilities- Manage and resolve complex production incidents, applying a systematic problem-solving approach and a strong sense of ownership.
- Participate in Agile stories to streamline and enhance day-to-day operations, creating and utilizing technical procedural documentation.
- Proactively monitor applications and infrastructure, influencing resiliency and scalability in production environments.
- Conduct Root Cause Analysis on critical production outages, developing and implementing mitigation strategies.
- Utilize production support expertise to influence new designs, architectures, standards, and methods, maintaining stability and availability for large-scale distributed systems.
- Identify and implement opportunities for automation of routine maintenance tasks, data gathering, and resolution of common issues.
- Develop new skills and technical expertise, sharing knowledge with others and building software and systems to manage platform infrastructure and applications.
- Gather and analyze operating systems/applications metrics to assist in performance tuning and fault finding.
- Participate in system design consulting, platform management, capacity planning, testing & release procedures.
- Bachelor's Degree in Computer Science or relevant experience.
- In-depth understanding of web service protocols and REST API design and consumption.
- Experience with container and serverless computing, Microsoft Azure/AWS developer/architecture certifications preferred.
- Skilled in Cloud/PaaS Environments, LAN, WAN, Network Security.
- Proficient in building reliable, scalable, enterprise systems, identifying root-cause sources of instability in high-traffic, large-scale distributed systems.
- Linux administration, troubleshooting, and performance tuning experience.
- Understanding of observability principles, tools, and practices that promote observability.
- Experience with continuous integration tools, trouble-shooting skills that span systems, network, and code.
- Ability to implement, administer, and troubleshoot network infrastructure devices, including firewalls and load balancers.
- Configuration management and orchestration experience.
This position will be hybrid in the Lehi, Utah area.
-
Site Reliability Engineer
4 weeks ago
Reston, Virginia, United States WideNet Consulting Group Full timeJob Title: Site Reliability EngineerAbout the RoleWe are seeking an experienced Site Reliability Engineer to join our team at WideNet Consulting Group. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities· Monitor and analyze system performance...
-
Site Reliability Engineer
4 weeks ago
Reston, Virginia, United States Microsoft Corporation Full timeJob Title: Site Reliability EngineerAt Microsoft, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud services.Key Responsibilities:Design, develop, and deliver software engineering solutions to serve and...
-
Site Reliability Engineer
3 weeks ago
Reston, Virginia, United States Blue Sky Innovative Solutions Full timeJob Title: Site Reliability EngineerBlue Sky Innovative Solutions is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure. Your expertise in Red Hat Linux Automation and DevOps practices will be essential in...
-
Site Reliability Engineer
3 weeks ago
Reston, Virginia, United States Microsoft Corporation Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Microsoft Corporation. As a key member of our Office 365 government cloud team, you will be responsible for designing, developing, and delivering software engineering solutions to serve and protect our O365 government clouds.Key ResponsibilitiesOwn deployment,...
-
Reston, Virginia, United States Microsoft Full timeTransforming the Future of Cloud ServicesAt Microsoft, we're committed to being cloud-first, and we're looking for talented Site Reliability Engineers to help us shape the future of cloud services. As a Site Reliability Engineer, you'll play a critical role in designing and implementing scenarios for our customers, ensuring the reliability and scalability of...
-
Site Reliability Engineer
3 weeks ago
Reston, Virginia, United States Blue Sky Innovative Solutions Full timeJob Title: Site Reliability EngineerBlue Sky Innovative Solutions is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure. You will be responsible for designing, implementing, and maintaining our cloud...
-
Reston, Virginia, United States Microsoft Corporation Full timeTransforming the Future of Cloud ServicesAt Microsoft, we're committed to being cloud-first, and we're looking for talented Site Reliability Engineers to help design and implement scenarios for our customers. As a Site Reliability Engineer, you'll play a critical role in shaping the future of cloud services and ensuring the reliability and scalability of our...
-
Site Reliability Engineer
3 weeks ago
Reston, Virginia, United States Intelligent Waves Full timeJob DescriptionIntelligent Waves is seeking a highly skilled Site Reliability Engineer to join our team in Reston, VA. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud-based systemsCollaborate with...
-
Site Reliability Engineer
3 weeks ago
Reston, Virginia, United States Infosys Full timeAbout the RoleInfosys is seeking a highly skilled Site Reliability Engineer to join our team. As an SRE Consultant, you will play a critical role in defining business process consulting solutions that enable our clients to meet the changing needs of the global landscape.Key ResponsibilitiesDefine problems, propose, and create solutions to drive business...
-
Site Reliability Engineer
4 weeks ago
Reston, Virginia, United States Insight Global Full timeJob Title: Site Reliability EngineerNVIDIA is seeking a seasoned Site Reliability Engineer to join its Infrastructure, Planning and Processes organization. This is an ON PREM data center role that requires experience with Linux operating systems and an understanding of Kubernetes.Key Responsibilities:Design and implement reliable systems and processes to...
-
Site Reliability Engineer
4 weeks ago
Reston, Virginia, United States InterEx Group Full timeSenior Site Reliability EngineerKey ResponsibilitiesEnhance the reliability of critical solutions, applications, and platforms to ensure high uptime and performance.Develop software for enterprises, focusing on scalability, maintainability, and efficiency.Continuously identify and implement improvements to processes and systems.Manage risks and resolve...
-
Principal Site Reliability Engineer
2 weeks ago
Reston, Virginia, United States Palo Alto Networks Full timeJob DescriptionPalo Alto Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining scalable and reliable infrastructure for our FedRAMP SASE product portfolio.Key Responsibilities:Design and implement scalable and reliable infrastructure...
-
Senior Site Reliability Engineering Manager
3 weeks ago
Reston, Virginia, United States Microsoft Full timeJob Title: Senior Site Reliability Engineering ManagerMicrosoft is seeking a highly skilled and experienced Senior Site Reliability Engineering Manager to join our team. As a key member of our Site Reliability Engineering team, you will be responsible for providing technical leadership and direction to a team of engineers, building and running critical...
-
Site Reliability Engineer
3 weeks ago
Reston, Virginia, United States Red Gate Group Full timeRed Gate Group is seeking a skilled Site Reliability Administrator to support the Defense Threat Reduction Agency (DTRA) in Reston, VA. In this vital role, you will ensure mission-critical back-end infrastructure remains operational, secure, and compliant. You will help maintain and scale clustered environments, drive innovation, and lead efforts in...
-
Senior Site Reliability Engineering Manager
2 weeks ago
Reston, Virginia, United States Microsoft Full timeAbout the RoleWe are seeking a highly skilled Senior Site Reliability Engineering Manager to join our team at Microsoft. As a key member of our Site Reliability Engineering team, you will be responsible for providing technical leadership to a team of highly passionate and skilled engineers.Key Responsibilities:Recruit, on-board, and grow a team of Software...
-
Senior Site Reliability Engineering Manager
4 weeks ago
Reston, Virginia, United States Microsoft Corporation Full timeAbout the RoleMicrosoft is seeking a highly skilled and experienced Senior Site Reliability Engineering Manager to lead our team of engineers in delivering critical public-sector service environments. As a key member of our Site Reliability Engineering team, you will be responsible for providing deep technical leadership, recruiting and growing a team of...
-
Site Reliability Engineer
1 month ago
Reston, Virginia, United States Red Gate Group Full timeJob Title: Site Reliability AdministratorRed Gate Group is seeking a skilled Site Reliability Administrator to support the Defense Threat Reduction Agency (DTRA) in Reston, VA. In this vital role, you will ensure mission-critical back-end infrastructure remains operational, secure, and compliant.Key Responsibilities:Manage and secure Red Hat Enterprise Linux...
-
Principal Site Reliability Engineer
2 weeks ago
Reston, Virginia, United States Palo Alto Networks Full timeJob DescriptionPalo Alto Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, maintaining, and scaling production services and server farms within our FedRAMP SASE product portfolio.You will work closely with our development teams to ensure...
-
Site Reliability Engineer
1 month ago
Reston, Virginia, United States Red Gate Group Full timeJob Title: Site Reliability AdministratorRed Gate Group is seeking a skilled Site Reliability Administrator to support the Defense Threat Reduction Agency (DTRA) in Reston, VA. In this vital role, you will ensure mission-critical back-end infrastructure remains operational, secure, and compliant.Key Responsibilities:Manage and secure Red Hat Enterprise Linux...
-
Site Reliability Engineer
4 weeks ago
Reston, Virginia, United States Booz Allen Hamilton Full timeJob SummaryWe are seeking a highly skilled Site Reliability Administrator, Senior to join our team. As a key member of our operations team, you will play a vital role in ensuring the reliability and scalability of our back-end infrastructure.Key ResponsibilitiesManage and maintain Red Hat Enterprise Linux (RHEL) systems in a multi-tenant enterprise network...