Current jobs related to SRE Engineer - Columbia, Maryland - Saxon Global
-
Site Reliability Engineer 2
4 weeks ago
Columbia, Maryland, United States CyberCore Technologies Full timeJob SummaryCyberCore Technologies is seeking a highly skilled Site Reliability Engineer 2 to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the stability, scalability, and security of our cloud-based systems.Key ResponsibilitiesDesign, implement, and maintain scalable and secure cloud infrastructure using...
-
Lead Director – Observability Governance
4 weeks ago
Columbia, South Carolina, United States CVS Health Full timeJob SummaryCVS Health is seeking a highly skilled Lead Director – Observability Governance to lead our strategic planning for the enterprise observability roadmap and establish a governance body and related procedures to set and maintain observability standards.Key ResponsibilitiesCollaborate with observability engineering and operations teams, service...
SRE Engineer
2 months ago
Location - Maryland (Remote)
Hire Type - Contract
Job DescriptionWe are seeking a highly skilled SRE Engineer to join our team at Saxon Global. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and performance of our cloud-based systems.
- UNIX and Linux Expertise: Strong expertise in UNIX and Linux operating systems, including system configuration, performance tuning, and troubleshooting.
- Site Reliability Engineering: Relevant experience of 6 years in Site Reliability Engineering and DevOps implementation approaches and related technologies.
- Monitoring and Observability: Strong expertise in monitoring and observability tools, particularly DataDog.
- Additional Tools: Experience with other tools such as New Relic, Prometheus, Grafana, or Splunk is a plus.
- SRE Practices: Strong understanding and experience with SRE practices, including incident response, blameless postmortems, error budgeting, and SLOs.
- eCommerce Support: Knowledge of eCommerce support and managing and troubleshooting SFCC applications, understanding the architecture, and diagnosing and resolving issues.
- DevOps Practices: Strong knowledge of DevOps practices, CI/CD pipelines, infrastructure automation, and configuration management using tools like Git, Jenkins, Ansible, or Terraform.
- Cloud Platforms: Familiarity with cloud platforms like AWS or Azure, including infrastructure provisioning, monitoring, and management.
- Scripting and Programming: Experience with scripting and programming languages like Python, Bash, or PowerShell.
- Web Technologies: Solid understanding of web technologies, protocols, and frameworks, including HTTP, SSL, REST, and SOAP.
- Containerization and Orchestration: Experience with containerization and orchestration technologies like Docker and Kubernetes is a plus.
- Chaos Engineering: Experience with Chaos Engineering implementation will be a plus, including designing experiments and automating monitoring system behavior using tools like Chaos Mesh, Chaos Monkey etc.
- Problem-Solving and Communication: Strong analytical and problem-solving skills, with the ability to work in a fast-paced, dynamic environment. Excellent communication and collaboration skills to effectively work with cross-functional teams and external stakeholders.