Site Reliability Engineer
4 weeks ago
Do not wait to apply after reading this description a high application volume is expected for this opportunity.
RESPONSIBILITIES
Develop software to detect unusual error activity. Implement workflows and processes that are designed to identify and reduce the overall number of application/system errors.
Collaborate with software development as part of the SDLC to design and implement availability, reliability, and error monitoring solutions in their applications.
Take responsibility for removing, isolating, or remediating errors, debugs, warnings or other kinds of messages from existing logs to improve overall log content and usefulness.
Limit system downtime by defining and enforcing standards for incident responses, error tracking, monitoring, and alerting with the goal to improve established reliability metrics.
Effectively respond to escalated site reliability issues any time of the day while on-call.
Conduct regular research on best practices and new technology for monitoring, alerting, error tracking and detection and application performance
Education/Certification:
Bachelors degree in Computer Science, MIS or related field
Experience:
3+ years experience utilizing alerting and telemetry tools such as Grafana, Prometheus, Splunk, Dynatrace and others
2+ years experience with Splunk SPL
2+ years experience with at least one programming language such as PHP, Python, Java, .Net
PREFERRED QUALIFICATIONS
Experience:
1+ years experience with CI/CD
1+ years experience with container and container orchestration such as Docker and Kubernetes
1+ years experience with Prom
1+ years experience with SQL
Skills/Abilities:
Troubleshooting in a large-scale networked environment
Knowledge of Paycoms applications, systems, and database
Paycom is an equal opportunity employer and prohibits discrimination and harassment of any kind. Paycom makes employment decisions on the basis of business needs, job requirements, individual qualifications and merit. Paycom wants to have the best available people in every job. Therefore, Paycom does not permit its employees to harass, discriminate or retaliate against other employees or applicants because of race, color, religion, sex, sexual orientation, gender identity, pregnancy, national origin, military and veteran status, age, physical or mental disability, genetic characteristic, reproductive health decisions, family or parental status or any other consideration made unlawful by applicable laws. Equal employment opportunity will be extended to all persons in all aspects of the employer-employee relationship. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, training, promotion, discipline, compensation benefits, and separation of employment. The Human Resources Department has overall responsibility for this policy and maintains reporting and monitoring procedures. Any questions or concerns should be referred to the Human Resources Department. ****To learn more about Paycom's affirmative action policy, equal employment opportunity, or to request an accommodation - Click on the link to find more information:paycom.com/careers/eeoc
#LI-Hybrid
##
-
Site Reliability Engineer
4 months ago
Oklahoma City, United States Paycom Payroll Llc Full timeSite reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.RESPONSIBILITIESDevelop software to...
-
Site Reliability Engineer
1 month ago
Oklahoma City, United States Paycom Payroll Llc Full timeSite reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.Do not wait to apply after reading...
-
Site Reliability Engineer
4 weeks ago
Oklahoma City, United States Paycom Payroll Llc Full timeSite reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.RESPONSIBILITIESDevelop software to...
-
Site Reliability Engineer
4 months ago
Oklahoma City, OK, United States Paycom Payroll Llc Full timeSite reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.RESPONSIBILITIESDevelop software to...
-
Site Reliability Engineer
4 weeks ago
Oklahoma City, Oklahoma, United States Oracle Full timeJob SummaryOracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, scalability, and performance of our medical AI systems and infrastructure.Key ResponsibilitiesCollaborate with cross-functional teams to optimize our technology operations and...
-
Site Reliability Engineer
4 weeks ago
Foster City, California, United States Omega Solutions Inc Full timeJob Title: Site Reliability EngineerAt Omega Solutions Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our critical platforms and applications.Key Responsibilities:* 8+ years of experience in Site Reliability...
-
Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full timeJob Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Your...
-
Site Reliability Engineer
4 weeks ago
Foster City, California, United States Bayone Full timeJob SummaryAt Bayone, we are seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and performance of our large production service.Key ResponsibilitiesHost OS upgradesDocker image upgradesSSL certificate upgradesRequirementsBachelor's degree in...
-
Site Reliability Engineer
2 days ago
Redwood City, United States 1872 Consulting Full timeSite Reliability Engineer - 100% RemoteRole Summary:Site Reliability Engineers (SREs) are responsible for working with different developer teams to keep our systems running smoothly. They are a blend of pragmatic operators and software craftspeople that apply excellent problem-solving and communication skills to develop or configure tools that will automate,...
-
Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full timeAbout the RoleWe are seeking a talented Site Reliability Engineer to join our SRE Platforms team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Our team is responsible for designing and...
-
AWS Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States Syntricate Technologies Full timeWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly our AWS environment.The ideal candidate will have strong experience with AWS, with a focus on SRE principles...
-
Site Reliability Engineer
4 weeks ago
Kansas City, Missouri, United States Datum Technologies Group Full timeJob SummaryAt Datum Technologies Group, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining efficient technology platforms that meet both internal and external customer needs while effectively managing associated risks.Key...
-
Site Reliability Engineer Position
4 weeks ago
Kansas City, Missouri, United States Infinite Computer Solutions Full timeJob Title: Site Reliability Engineer PositionJob Description: We are seeking a skilled Site Reliability Engineer to join our team at Infinite Computer Solutions.Key Responsibilities:* Strong experience with Ansible, Gitlab, deployment, packages, Linux, Unix, Splunk, and Dynatrace is required.* A minimum of 8 to 10 years of experience in a similar role is...
-
Senior Site Reliability Engineer
4 weeks ago
Kansas City, Missouri, United States Granicus Full timeThe Granicus team is seeking a skilled Senior Site Reliability Engineer to join our cloud infrastructure team. As a key member of our SRE team, you will be responsible for ensuring the availability, scalability, and efficiency of our customer environments.You will work closely with our engineering teams to design and build modern tools for the SRE and other...
-
Reliability and Maintenance Engineer
4 weeks ago
Oklahoma City, Oklahoma, United States Management Business Solutions Full timeManagement Business Solutions is seeking a skilled Reliability and Maintenance Engineer to lead the maintenance and reliability efforts for our plant site. As a key member of our team, you will be responsible for developing and implementing a reliability strategy that aligns with our business needs.Key Responsibilities:Develop and implement a reliability...
-
Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States The Dignify Solutions LLC Full timeJob SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at The Dignify Solutions LLC. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining our cloud-based infrastructure. Your expertise in cloud platforms, automation tools, and security fundamentals will be crucial in...
-
Principal Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States Fidelity Investments Full timeJob Title: Principal Site Reliability EngineerThe Role:As a member of the TechOps SRE team at Fidelity Investments, you will work closely with our engineering partners to enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes environments are best-in-class and central to our enterprise-grade infrastructure...
-
Principal Site Reliability Engineer
2 weeks ago
Redwood City, United States Oracle Full timeThe Background Oracle’s Fusion Applications group is designing and building the next-gen deployment platform for its suite of software products. We focus on transforming how Software Developers and DevOps engineers build cloud applications for enterprise customers. Our team is building new services to improve developer productivity and automate the process...
-
Site Reliability Engineer Leader
4 weeks ago
Jersey City, New Jersey, United States The Dignify Solutions LLC Full timeJob SummaryWe are seeking a highly experienced Site Reliability Engineer Leader to join our team at The Dignify Solutions LLC. The ideal candidate will have a strong background in building and running applications in production with uptime over 99%.Key ResponsibilitiesDesign and implement large-scale Reliability & Observability Programs for complex...
-
Principal Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States Fidelity TalentSource LLC Full timeJob Description:The RoleAs a member of the TechOps SRE team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation.Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments...