Senior Site Reliability Engineer

4 weeks ago


Herndon, United States LanceSoft Full time

JOB DESCRIPTION
Position: Senior Site Reliability Engineer
Mode: long term contract (work order are extended 2x year)
Remote: YES
Team: Engineering/Infrastructure - there are 5 on this team
Profile: This is a senior level, consultative role that will serve as the SME for solutions around observability, incident management and underlying infrastructure. Seeking a person who has implemented solutions that support business functionality as well as the underlying infrastructure required to run and deploy those solutions. Someone that has leadership and communication skills to mentor others. Must have critical thinking, solid communications, technology based with python scripting, API integrations and full scope of how systems/applications work together, etc.
**Position is weekdays, but there is an oncall rotation for nights/weekends**
Rate: position is in line with traditional, senior SW engineer roles we fill
Process: Remote Interview Protocol - .5 with Donna/Chip followed by 1.0 Tech Panel (TBD) - Verification - Remote onboarding

Imagine a dynamic environment where you are surrounded by brilliant minds every day, dedicated to building technologies that accelerate the college opportunity pathway for millions of students around the globe. Envision an organization where machine learning, distributed systems, artificial intelligence, networking, security, optimization, UX, and UI come together to create ingenious solutions, and the possibilities for their evolution are endless.
Are you a passionate, high energy, technology hungry, disciplined and committed engineer? Every day, we look forward to meeting with our work friends to deliver our passion of technology.
Named by Fast Company as one of the most innovative education companies, the *** is a mission-focused organization focused on improving educational opportunities and outcomes, particularly for disadvantaged students, in the context of a competitive business environment.
As a Senior Site Reliability Engineer, you will research, design, and implement solutions to attain high quality process automation within the Information Technology division and across the College Career Access business units.
You have designed, developed, and implemented solutions that support business functionality as well as the underlying infrastructure required to run and deploy those solutions. You must possess hands-on technical skills and experience with Amazon Web Services and continuous delivery systems.
As an engineer, you must have excellent written and oral communication skills and be adaptive to the changing needs of the department and the organization. You must have experience with building and maintaining highly effective relationships with team members and multiple stakeholders across multiple projects. This is the position in which you will exercise all the knowledge gained when you were receiving your Computer Science, Electrical Engineering, or any other related engineering field degree.
ESSENTIAL FUNCTIONS/RESPONSIBILITIES
•Design, develop and implement automated solutions, based on a set of standards and processes which establish consistency across the enterprise, to reduce risk and promote efficiencies in support of the organization's goals and objectives.
•Responsible for the quality of your work; will develop and implement a set of quality criteria and the associated validation methods to ensure that any deliverable meets the expected quality levels of our customers, use quality management standards/metrics to ensure quality levels are maintained, seek new approaches and techniques to improve quality levels and analyze the impact of quality control and quality assurance on project performance.
•Actively review Observability custom and COTS products and implement improvements seen within the industry to drive continuous improvement of the Observability products' efficiency, scalability, and quality.
•Managing and resolving incidents, conducting incident reviews, and managing problems with a focus on proactivity
•Incident management - Act in key response roles during major incidents. Participate in an on-call rotation with other team members. Participate in the post-mortem review of incidents for Root Cause Analysis (RCA)
•Participate in system design consulting, AWS platform management, and capacity planning
•Provide support (coaching and mentoring) for teammate's work activities on a regular basis
•Use product SLAs, enterprise standards/metrics to ensure product availability and user experience quality levels are maintained, seek innovative approaches and techniques to improve quality levels and analyze the impact of the product changes on application performance and availability.
•Design and develop tools and processes to aid in improving infrastructure reliability and allow for monitoring and reporting.
•Write complex code, building infrastructure as code, work with serverless based cloud environments and build the supporting automated toolsets necessary to support the continuous metric collection pipeline.
•Integrate COTS products across the continuous delivery pipeline to provide a comprehensive automated system from epic definition, development, test and deploy of CB applications within our data center and Amazon.
•A hands-on engineer who leads by doing. Take responsibility for creating design specifications, unit testing, and preparing technical documentation. Develop solutions from business initiation through operational integrity.
•Support the development of Observability standards by creating templates for ease of use and increase of Observability capabilities' adoption
•Foster and build a community of practice for collective learning of the Observability tools and systems across all development teams.
•Be in an on-call rotation to respond to incidents that impact *** availability and provide support for Development team engineers with customer related incidents.
•Use your on-call experiences to analyze and prevent incidents from ever happening.
Qualifications needed for the role:
•A bachelor's degree preferably in Computer Science, Engineering or MIS.
•5-8 years of experience in software systems, programming, and infrastructure development and administration.
Preferred skills and attributes for the role:
•Strong, proven experience as a DevOps engineer in a scalable production environment administrating one or more of the following: Atlassian Suite of products (Jira, Confluence, Bitbucket, Crowd).
•Ability to operate in a high-pressure environment, quickly troubleshoot complex issues and successfully handle multiple priorities
•Strong practical Linux-based systems administration skills and scripting experience in a Cloud-based environment.
•Experience with Node.js/JavaScript programming language and it's frameworks and design patterns.
•Experience working with APIs and Microservices.
•Working knowledge of IP Networking, VPCs, DNS, Load Balancing, and Firewalls.
•Experience building infrastructure as code using AWS CDK, Cloud Formation, or similar scripting techniques.
•Experience managing releases into production using AWS Code Pipeline.
•Expertise with Git and Bitbucket, including branching workflows.
•Experience with monitoring suites (Ex: New Relic, Splunk, Sumo Logic)is a plus.
•Excellent interpersonal and collaboration skills with the ability to work with a diverse set of colleagues.
•Strong decision-making, problem-solving skills, critical thinking, and testing skills.
•Self-starter with the ability to set priorities, work independently, and attain goals.
•The ethos of continuous improvement and interest in learning new things.
•Strong ability to understand and internalize the big picture and broader implications.



  • Herndon, United States The Swift Group Full time

    Job DescriptionJob DescriptionThe Swift Group is looking for a Site Reliability Engineer to fill a position overseeing a mission critical system. Candidate will work hand in hand with DevOpS engineers and Developers to complex Systems Software work risk free and continuously in a production environment. Our Site Reliability Engineer is a hybrid software...


  • Herndon, Virginia, United States General Dynamics Information Technology Full time

    Req ID: RQ170481Type of Requisition: RegularClearance Level Must Be Able to Obtain: NoneJob Family: Systems EngineeringSkills:AWS Devops,Docker (Software),Java,Kubernetes,Python (Programming Language)Experience:10 + years of related experienceUS Citizenship Required:YesJob Description:Transform technology into opportunity as a Lead Site Reliability Engineer...


  • Herndon, United States Design Force Full time

    Now Hiring: Senior Site/Civil Engineer Would you like to leave your mark on some of Northern Virginia’s most community centered projects while receiving hands on mentorship in a collaborative, and energetic environment? Our client is currently seeking a Senior Site/Civil Engineer to join their office in Herndon, VA . Our client is a dynamic,...


  • Herndon, United States Design Force Full time

    Now Hiring: Senior Site/Civil Engineer Would you like to leave your mark on some of Northern Virginias most community centered projects while receiving hands on mentorship in a collaborative, and energetic environment? Our client is currently seeking a Senior Site/Civil Engineer to join their office in Herndon, VA. Our client is a dynamic,...


  • Herndon, United States DesignForce Full time

    Now Hiring: Senior Site/Civil Engineer Would you like to leave your mark on some of Northern Virginia’s most community centered projects while receiving hands on mentorship in a collaborative, and energetic environment? Our client is currently seeking a Senior Site/Civil Engineer to join their office in Herndon, VA.Our client is a dynamic,...


  • Herndon, United States DesignForce Full time

    Now Hiring: Senior Site/Civil Engineer Would you like to leave your mark on some of Northern Virginia’s most community centered projects while receiving hands on mentorship in a collaborative, and energetic environment? Our client is currently seeking a Senior Site/Civil Engineer to join their office in Herndon, VA.Our client is a dynamic,...


  • Herndon, United States DesignForce Full time

    Now Hiring: Senior Site/Civil Engineer Would you like to leave your mark on some of Northern Virginia’s most community centered projects while receiving hands on mentorship in a collaborative, and energetic environment? Our client is currently seeking a Senior Site/Civil Engineer to join their office in Herndon, VA.Our client is a dynamic,...


  • Herndon, United States The Swift Group Full time

    Job Description Job Description The Swift Group is looking for an experienced Site Reliability Engineer to join our technology-based program supporting a key customer. This person will provide subject expertise and guidance to IT developers during the software development life cycle. Overseeing the development, testing, and implementation of technical...


  • Herndon, United States The Swift Group Full time

    Job DescriptionJob DescriptionThe Swift Group is looking for an experienced Site Reliability Engineer to join our technology-based program supporting a key customer. This person will provide subject expertise and guidance to IT developers during the software development life cycle. Overseeing the development, testing, and implementation of technical...


  • Herndon, United States iNovex Information Systems Full time

    Job Brief . Job Description HTS (iNovex) was built on the principle that people matter first and foremost.We believe in providing a strong work/life balance by investing in our employees and encouraging professional and personal growth.We do this by offering exceptional benefits, flexible schedules, and the tools necessary to achieve success through paid...


  • Herndon, United States iNovex Information Systems Full time

    Job Brief . Job Description HTS (iNovex) was built on the principle that people matter first and foremost.We believe in providing a strong work/life balance by investing in our employees and encouraging professional and personal growth.We do this by offering exceptional benefits, flexible schedules, and the tools necessary to achieve success through paid...


  • Herndon, Virginia, United States SAP Full time

    We help the world run betterOur company culture is focused on helping our employees enable innovation by building breakthroughs together. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and future-focused work. We offer a highly...


  • Herndon, Virginia, United States General Dynamics Information Technology Full time

    Req ID: RQ170649Type of Requisition: RegularClearance Level Must Be Able to Obtain: NoneJob Family: Systems EngineeringSkills:AWS Devops,C++ Programming Language,Java,Python (Programming Language),Ruby (Programming Language)Experience:10 + years of related experienceUS Citizenship Required:YesJob Description:Transform technology into opportunity as a Systems...


  • Herndon, Virginia, United States Amazon Full time

    BASIC QUALIFICATIONS Bachelors degree in Physics, Mathematics, Electrical, Mechanical or Materials Engineering, or a related field. 6+ years of industry experience in construction, project management, manufacturing, or operations support. 8+ years in a quality management function with direct interaction with suppliers.DESCRIPTIONThe AWS Data Center...


  • Herndon, United States Amazon.com Inc Full time

    AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, were the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, Reliability Engineer, Liability, Reliability, Reliability, Infrastructure, Continuous Improvement,...


  • Herndon, Virginia, United States BAE Systems Full time

    Job Description Job DescriptionBAE Systems, a top-ten prime contractor to the U.S. Department of Defense, enables the U.S. government to transform data into intelligence and provides engineering, integration and sustainment support for critical military platforms and systems. Intelligence & Security provides services and products to the Department of...


  • Herndon, United States Bohler Engineering Full time

    Overview At Bohler, we empower the ambitious to become the accomplished. This greater purpose connects us with like-minded professionals, fosters meaningful relationships, and generates the alignment necessary to produce an unrivaled consulting and employment experience. Our Herndon, VA office is looking for a Senior Design Engineer who embodies this...


  • Herndon, Virginia, United States BAE Systems Full time

    Job Description Job DescriptionBAE Systems, a top-ten prime contractor to the U.S. Department of Defense, enables the U.S. government to transform data into intelligence and provides engineering, integration and sustainment support for critical military platforms and systems. Intelligence & Security provides services and products to the Department of...


  • Herndon, United States Bohler Full time

    Overview: At Bohler, we empower the ambitious to become the accomplished. This greater purpose connects us with like-minded professionals, fosters meaningful relationships, and generates the alignment necessary to produce an unrivaled consulting and employment experience. Our Herndon, VA office is looking for a Senior Design Engineer who embodies this...


  • Herndon, United States Bohler Engineering Full time

    Overview At Bohler, we empower the ambitious to become the accomplished. This greater purpose connects us with like-minded professionals, fosters meaningful relationships, and generates the alignment necessary to produce an unrivaled consulting and employment experience. Our Herndon, VA office is looking for a Senior Design Engineer who embodies this...