Incident Management Specialist
3 weeks ago
In this incident management function, manage incidents to resolution in a 24/7/365 environment using the *** incident management processes, effectively guide incident and triage calls from a technical perspective, share technical details obtained from monitoring tools and dashboards to aid troubleshooting, outline details of resolution activities, recommend and implement improved processes, provide timely status updates to stakeholders, assist with postmortem related activities and support various efforts related to operational improvements. Manage efforts to maintain application in production, including troubleshooting stoppages, repairing bugs, documenting application performance, and coordinating with technology infrastructure management.
KEY JOB FUNCTIONS
Manage IT production incidents to resolution in a 24/7/365 environment using the *** incident management processes and communicate management of incident status, impact and resolution actions.
Hands on experience managing and monitoring applications deployed on Amazon Web Services (AWS).
Troubleshooting and resolving incidents on the AWS cloud infrastructure.
Experience with building tools for monitoring and troubleshooting of system resources in an AWS environment. Ability to triage AWS related incidents using monitoring tools on AWS Cloud.
Experience with performance engineering of AWS Cloud applications.
Hands on experience working with AWS tools like EC2, ELB, RDS, Redshift, DynamoDB, Aurora, Route53, ECS, Lambada, S3, Batch, CloudWatch, CloudTrail, WAF etc.
Hands on experience with transaction level monitoring using Dynatrace and Splunk.
Ability to perform transaction level monitoring and troubleshooting in AWS cloud platform.
Eyes on glass monitoring of the health of applications as well as the underlying infrastructure.
Monitoring experience with tools like Extrahop, SolarWinds, Netcool suite, Catchpoint, MoogSoft.
Ability to analyze dashboards and reporting/monitoring tools to look at trends and patterns in application health and performance.
Proactively looking for hardware, software, and environmental alerts or malfunctions.
Effectively lead and guide Incident triage calls from a technical perspective analyzing different components of the infrastructure and application environment via the use of a variety of monitoring tools and processes.
Troubleshoot the incidents and identify root cause quickly using operations, wire data analytics, application performance management and event correlation monitoring tools.
Perform analysis of data, evaluating multiple application protocols including web, database, storage, and supporting infrastructure such as AWS, UNIX, DNS, LDAP, SSL, SMTP, and FTP.
Influence other technical teams on the calls and articulate troubleshooting steps effectively.
Lead required technical follow-up calls for critical incidents.
Assist with documentation of Root Cause Analysis (RCA) or Correction of Errors (COE) and data quality for all ECC communicated incidents.
Ensure appropriate functional and management escalation takes place as per the standards and procedures.
Follow up on items that could potentially negatively impact production operations, assist with postmortem related activities and support various efforts related to operational improvements.
Based on recommendations from management, implement new and improved processes, change processes, perform new tasks, create reports and address ad-hoc requests.
Participate in on-call rotation. Ability to work on any shifts as needed including weekends and night shifts.
Ability to report incident details and metrics to senior leadership.
EDUCATION
Bachelor's Degree or equivalent required.
MINIMUM EXPERIENCE
6+ years of related experience
SPECIALIZED KNOWLEDGE & SKILLS
6+ years of working experience with different IT Infrastructure components such as Unix/ Linux Servers, Wintel Servers, AWS, networks, firewalls, routers, load balancers, VPN, Apache, web logic, LDAP, Active Directory, Exchange, Oracle/MS SQL databases, SAN, Virtualization, Email systems, Enterprise monitoring and access management solutions for single sign on. Subject matter expertise is not required and experience with at least eight of the above is preferred.
Senior level hands-on working experience with Amazon Web Services (AWS).
Proven methodical approach to problem identification, monitoring, problem solving and resolution.
Ability to analyze different components of the infrastructure and application environments during Incident triage calls.
Aptitude to influence other technical teams on the incident calls and articulate troubleshooting steps effectively.
Experience and confidence working with all levels of management; excellent written and verbal skills.
Able to quickly and concisely communicate with senior management on technical issues in non-technical terms and to run large conference calls during Incident calls with a wide range of personnel and management levels.
Strong relationship management skills and aptitude to multi-task and work well in a high stress environment, both within teams and independently.
AWS Solution Architect Associate or higher certification
Monitoring and observability experience.
Experience with monitoring dashboards for incident detection and alerting.
Perform end-to-end analysis of transactions under an observability environment.
Troubleshoot incidents and identify root cause quickly using wire data analytics, application performance management and event correlation monitoring tools.
Diagnose and resolve incidents by providing factual data from the various monitoring and instrumentation systems.
Monitor applications and infrastructure using tools like Splunk, DynaTrace, OpenTel, Catchpoint, MoogSoft, xMatters, SignalFx, xMatters, SolarWinds, Extrahop etc.
Preferred Qualifications:
Understanding of tools like CloudFormation or Terraform
Management and troubleshooting of Middleware products on UNIX and Linux environments. Knowledge of Service Oriented Architecture (SOA), Java etc.
Understanding of Azure or Google Cloud.
Experience with OpenTel
Prior *** or Financial industry experience.
“Mindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of – Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans.”
-
Incident Management Specialist
4 weeks ago
Reston, Virginia, United States Insight Global Full timeJob Summary:We are seeking a highly skilled Incident Manager to join our team at Insight Global. As an Incident Manager, you will be responsible for leading incident triage, communication, and restoration of critical business services to customers and partners.Key Responsibilities:Drive effective triage leadership for all CBWT related technology and...
-
Incident Response Security Specialist
1 month ago
Reston, Virginia, United States Oracle Full timeJob SummaryOracle is seeking a seasoned security analyst to join our SaaS Cloud Security team. As an Incident Response Security Specialist, you will play a key role in securing our large-scale distributed SaaS environment.Key ResponsibilitiesPerform hands-on activities including network and log analysis, malware analysis, and threat hunting.Assist with the...
-
Incident Management Specialist
2 weeks ago
Reston, VA, United States Mindlance Full timeIn this incident management function, manage incidents to resolution in a 24/7/365 environment using the *** incident management processes, effectively guide incident and triage calls from a technical perspective, share technical details obtained from monitoring tools and dashboards to aid troubleshooting, outline details of resolution activities, recommend...
-
Incident Manager
3 weeks ago
Reston, United States Technology Ventures Full timeIn this incident management function, manage incidents to resolution in a 24/7/365 environment using the incident management processes, effectively guide incident and triage calls from a technical perspective, share technical details obtained from monitoring tools and dashboards to aid troubleshooting, outline details of resolution activities, recommend and...
-
Incident Manager
2 weeks ago
Reston, VA, United States Technology Ventures Full timeIn this incident management function, manage incidents to resolution in a 24/7/365 environment using the incident management processes, effectively guide incident and triage calls from a technical perspective, share technical details obtained from monitoring tools and dashboards to aid troubleshooting, outline details of resolution activities, recommend and...
-
Incident Response Analyst
6 months ago
Reston, United States Oracle Full time*US Citizenship with preference for TS/SCI and FSP Are you interested in securing a large-scale distributed SaaS environment? Oracle's SaaS Cloud Security team is building new technologies that operate at high scale in our broadly distributed multi-tenant cloud environment. The Detections and Response Team plays a key role in enabling Oracle's Security...
-
Operational Technology Security Specialist
4 weeks ago
Reston, Virginia, United States First Quality Full timeJob Title: Operational Technology Security SpecialistJob Summary:We are seeking an experienced Operational Technology (OT) Security Specialist to join our team at First Quality. The ideal candidate will have a strong background in OT security and operations, with a focus on protecting our industrial control systems.Key Responsibilities:- Develop and maintain...
-
Cybersecurity Specialist
4 weeks ago
Reston, Virginia, United States The Maryland General Assembly Full timeJob Summary:The Maryland General Assembly is seeking a highly skilled Cybersecurity Specialist to join our team. As a key member of our Information Technology Office (ITO), you will be responsible for monitoring and preventing cybersecurity events, conducting threat intelligence, and engaging in hunting activities to proactively mitigate risks.Key...
-
Manufacturing Safety Specialist
1 month ago
Reston, Virginia, United States Gulfstream Strategic Placements Full timeJob Title: Manufacturing Safety SpecialistLocation: Not SpecifiedJob OverviewGulfstream Strategic Placements seeks a Manufacturing Safety Specialist to develop, implement, and maintain comprehensive Employee Health and Safety (EH&S) programs. The ideal candidate will lead initiatives to prevent injuries, mitigate workplace hazards, and ensure adherence to...
-
Senior Health and Safety Specialist
4 weeks ago
Reston, Virginia, United States Addison Group Full timeHSE Specialist Job DescriptionWe are seeking a highly skilled HSE Specialist to join our team at Addison Group.Responsibilities:Conduct regular safety inspections and site visits to ensure adherence to HSE regulations and company policies.Develop, implement, and update HSE policies, procedures, and programs in collaboration with various departments.Manage...
-
Asset Protection Specialist
2 months ago
Reston, United States Home Depot Full timeHome Depot - JobID: F02E64F5118D4151A34815113CCED625 [Loss Prevention / Security] As an Asset Protection Specialist at Home Depot, you'll: Prevent financial loss caused by theft and fraud; Support safety and environmental program compliance in your assigned store/multiple stores; Identify incidents of theft and fraud, review CCTV and exception reports,...
-
Cybersecurity Specialist
4 weeks ago
Reston, Virginia, United States The Maryland General Assembly Full timeThe Maryland General Assembly is seeking an experienced Cybersecurity Specialist to join our team.About the Role:The successful candidate will be responsible for monitoring and preventing cybersecurity events, conducting threat intelligence, engaging in hunting activities, and coordinating incident response for cyber incidents and forensic investigations.Key...
-
Infrastructure Security Specialist
4 weeks ago
Reston, Virginia, United States Pomeroy Full timeJob Title: Infrastructure Security SpecialistDescription:Pomeroy is seeking an experienced Infrastructure Security Specialist to act as the infrastructure liaison to our internal security team. The ideal candidate will be responsible for enabling productivity while protecting the organization's mission through maintaining and enhancing security architecture...
-
IT Support Specialist
4 weeks ago
Reston, Virginia, United States Addison Group Full timeAddison Group is seeking a skilled IT Support Specialist to join their team. This is a 6-month contract-to-hire opportunity with a possibility of direct hire conversion after the contract period.Key Responsibilities:• Provide technical support to end-users via phone, email, and ticket portal.• Troubleshoot and resolve hardware and software issues...
-
Portfolio Management Specialist
4 weeks ago
Reston, Virginia, United States Sunrise Affordable Housing Group, Inc. Full timePortfolio Management SpecialistJob OverviewSunrise Affordable Housing Group, Inc. is seeking a skilled Portfolio Management Specialist to support the development and rehabilitation of affordable housing properties. The successful candidate will play a critical role in overseeing various aspects of project development, including predevelopment design,...
-
IT Asset Management Specialist
4 weeks ago
Reston, Virginia, United States System Soft Technologies Full timeJob Title: Technical Support SpecialistLocation: Detroit, MI (Remote)Contract: 12 Months+Rate: $23.00/HRAbout the Role:This is a 12-month contract position for a Technical Support Specialist. The ideal candidate will have experience in asset management and IT deployment, as well as a strong foundation in desktop/computer support.Key Responsibilities:Provide...
-
Network Operations Center Manager
4 weeks ago
Reston, Virginia, United States Innova Solutions Full timeJob Title: Wireless NOC Incident ManagerAbout the Role:Innova Solutions is seeking a highly skilled Wireless NOC Incident Manager to join our team. The successful candidate will be responsible for managing the 24/7 Network Operations Center and leading a team of 5G Network Surveillance & Fault Isolation & Management teams.Manage the day-to-day operations of...
-
Land Acquisition Specialist
4 weeks ago
Reston, Virginia, United States Affinity Management Group Full timeExciting Career OpportunityAt Affinity Management Group, we are seeking an experienced Land Acquisition Specialist to perform right-of-way activities for the negotiation, acquisition, and maintenance of easements necessary to execute pipeline, road, power line, cable, and other related projects.Key Responsibilities:Conduct thorough title searches and...
-
Asset Protection Professional
1 month ago
Reston, Virginia, United States Home Depot Full timeJob SummaryAs an Asset Protection Specialist at Home Depot, you will play a critical role in preventing financial loss caused by theft and fraud. You will support safety and environmental program compliance in your assigned store or multiple stores, identifying incidents of theft and fraud, reviewing CCTV and exception reports, and monitoring the store's...
-
Acquia and Akamai Technology Specialist
3 weeks ago
Reston, Virginia, United States Hamdan Resources Full timeDigital Asset Management SpecialistWe are seeking a skilled digital asset management specialist with subject matter expertise in Acquia and Akamai technologies to support our team. The ideal candidate will be responsible for the implementation, configuration, maintenance, and troubleshooting of content management systems (CMS) and content delivery networks...