Senior Site Reliability Engineer

3 months ago


Reston, United States Mindlance Full time

Position Title: Senior Site Reliability Engineer

Location: Reston, VA / Fully Remote

Duration: Long Term Assignment

Must Have Skills:

APIs and Microservices.

Atlassian Suite of products (Jira, Confluence, Bitbucket, Crowd).

AWS CodePipeline.

Bachelor's Degree - CS or Engineering.

DevOps.

Git and Bitbucket, including branching workflows.

Infrastructure as code using AWS CDK, CloudFormation, or similar scripting techniques.

IP Networking, VPCs, DNS, Load Balancing, and firewalls.

Linux-based systems administration.

Node.js/JavaScript programming language and its frameworks and design patterns.

On call rotations.

Scalable production environment.

Scripting experience in a Cloud-based environment.

Nice To Have:

Monitoring suites (e.g., New Relic, Splunk, Sumo Logic).

ESSENTIAL FUNCTIONS/RESPONSIBILITIES:

Design, develop, and implement automated solutions, based on a set of standards and processes that establish consistency across the enterprise, to reduce risk and promote efficiencies in support of the organization's goals and objectives.

Take responsibility for the quality of your work: develop and implement quality criteria and associated validation methods so that every deliverable meets the quality levels our customers expect; use quality management standards and metrics to maintain those levels; seek new approaches and techniques to improve quality; and analyze the impact of quality control and quality assurance on project performance.

Actively review custom and COTS Observability products and implement improvements seen within the industry to drive continuous improvement of their efficiency, scalability, and quality.

Manage and resolve incidents, conduct incident reviews, and manage problems with a focus on proactivity.

Incident management - Act in key response roles during major incidents. Participate in an on-call rotation with other team members. Participate in the post-mortem review of incidents for Root Cause Analysis (RCA).

Participate in system design consulting, AWS platform management, and capacity planning.

Provide support (coaching and mentoring) for teammates' work activities on a regular basis.

Use product SLAs and enterprise standards/metrics to ensure product availability and user experience quality levels are maintained; seek innovative approaches and techniques to improve quality levels; and analyze the impact of product changes on application performance and availability.

Design and develop tools and processes to aid in improving infrastructure reliability and allow for monitoring and reporting.

Write complex code, build infrastructure as code, work with serverless cloud environments, and build the automated toolsets needed to support the continuous metric collection pipeline.

Integrate COTS products across the continuous delivery pipeline to provide a comprehensive automated system covering epic definition, development, testing, and deployment of CB applications within our data center and Amazon.

Be a hands-on engineer who leads by doing. Take responsibility for creating design specifications, unit testing, and preparing technical documentation. Develop solutions from business initiation through operational integrity.

Support the development of Observability standards by creating templates that make Observability capabilities easier to use and increase their adoption.

Foster and build a community of practice for collective learning of the Observability tools and systems across all development teams.

Join an on-call rotation to respond to incidents that impact the Client's availability, and support Development team engineers with customer-related incidents.

Use your on-call experience to analyze incidents and prevent future ones from happening.

Qualifications needed for the role:

A bachelor's degree, preferably in Computer Science, Engineering, or MIS.

5-8 years of experience in software systems, programming, and infrastructure development and administration.

Preferred skills and attributes for the role:

Strong, proven experience as a DevOps engineer in a scalable production environment administering one or more of the following: Atlassian Suite of products (Jira, Confluence, Bitbucket, Crowd).

Ability to operate in a high-pressure environment, quickly troubleshoot complex issues, and successfully handle multiple priorities.

Strong practical Linux-based systems administration skills and scripting experience in a Cloud-based environment.

Experience with Node.js/JavaScript programming language and its frameworks and design patterns.

Experience working with APIs and Microservices.

Working knowledge of IP Networking, VPCs, DNS, Load Balancing, and Firewalls.

Experience building infrastructure as code using AWS CDK, CloudFormation, or similar scripting techniques.

Experience managing releases into production using AWS CodePipeline.

Expertise with Git and Bitbucket, including branching workflows.

Experience with monitoring suites (e.g., New Relic, Splunk, Sumo Logic) is a plus.

Excellent interpersonal and collaboration skills with the ability to work with a diverse set of colleagues.

Strong decision-making, problem-solving, critical-thinking, and testing skills.

Self-starter with the ability to set priorities, work independently, and attain goals.

An ethos of continuous improvement and an interest in learning new things.

Strong ability to understand and internalize the big picture and broader implications.
