Site Reliability Engineer
3 weeks ago
Will define enterprise-wide applications development strategies to ensure that applications and
infrastructure that are brought to market meet customer requirements and are stable, reliable, and
production ready.
Duties include:
1. Design, code, test and deliver automation tools for production applications deployment and
maintenance;
2. Automate infrastructure creation and provisioning processes in AWS, Azure and OCI cloud
platforms using CloudFormation, Terraform, Ansible, Python and Shell scripting;
3. Develop applications using Java, Spring Boot, Datadog, Elasticsearch, ReactJS and Docker
Compose technologies for monitoring and alerting systems;
4. Enhance automation around configuration management, tooling and striving towards continuous
delivery of software;
5. Migrate docker compose and docker swarm style applications to Kubernetes using Helm Charts
involving working with Kafka, Postgres, RabbiMQ, ELK stack and Redis technologies;
6. Migrate applications between cloud platforms - AWS, Azure, OCI and GCP;
7. Deploy new versions of software releases to production environments and also work on
configuration change requests for production environments; and
8. Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure
of incidents on both public cloud and on premise environments.
Position Requirements:Master's Degree in Computer Science, Computer Engineering, Information Quality or a related field plus
1 year of experience in software development roles.
1 year of experience to include the following skills (all skills gained concurrently):
1) Experience with Java, Python and bash scripting languages;
2) Experience with Containerization technologies including Docker, Kubernetes and Swarm;
3) Experience with Terraform, CloudFormation, Ansible and Helm tools for automation;
4) Experience with creating CI/CD pipelines and jobs using Teamcity and Jenkins;
5) Experience with ELK stack, Kafka, RabbitMQ, SNS and SQS messaging technologies;
6) Experience with Postgres, MySQL, MongoDB, Redis and Memcached persistence technologies;
7) Experience with with gradle, maven, Github, Datadog, kibana, splunk, Swagger, Kong
dashboard, Postman and JIRA;
8) Experience with VPC, EC2, ELB, EKS, RDS, IAM, Kinesis, S3, Lambda, CloudFront and
CloudWatch services;
9) Experience with at least one of AWS, Azure or OCI cloud platforms; and
10) Experience with gathering & analyzing metrics using Datadog and to create monitors, synthetics
and dashboards for production applications and systems maintenance.
-
Site Reliability Engineer III
1 month ago
Dallas, Texas, United States JPMorganChase Full timeJob Description There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Enterprise technology, Infrastructure platforms team, you will solve...
-
Senior Lead Site Reliability Engineer
2 months ago
Dallas, Texas, United States JPMorganChase Full timeJob Description Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the Corporate Sector, Infrastructure Platforms organization, you work with your fellow stakeholders...
-
Site Reliability Engineer III
2 months ago
Dallas, Texas, United States JPMorganChase Full timeJob Description There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Infrastructure Platform, Web Hosting team, you will solve complex...
-
Lead Site Reliability Engineer
2 months ago
Dallas, Texas, United States JPMorganChase Full timeJob Description Lead and conduct resiliency design reviews, break up complex problems, and act as a technical lead for medium to large sized critical products.As a Lead Site Reliability Engineer at JPMorgan Chase within the Enterprise Technology Infrastructure Platforms, you hold a leadership role in your team, demonstrate strong knowledge across multiple...
-
Site Reliability Engineer
3 weeks ago
Dallas, Texas, United States PMG Full timePMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to deliver value, innovation, and business transformation.WHO WE AREAgile. Authentic....
-
Site Reliability Engineering Manager
14 hours ago
Dallas, Texas, United States Apple Full timeJob SummaryApple is seeking a highly skilled Site Reliability Engineering Manager to lead a team responsible for providing a platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to flourish.Key ResponsibilitiesEstablish and maintain SRE practices for a private cloud service to...
-
Dallas, Texas, United States Wise Skulls llc Full timeJob OverviewPosition: Site Reliability Engineer (Python)Location: Dallas, TX (On-site presence required)Contract Duration: 12 monthsPartnering Company: Wise Skulls LLCClient: ConfidentialKey Responsibilities:Minimum of 5 years of relevant experience in the field.Proficient in Python programming and familiar with frameworks such as Django or Flask.Mandatory...
-
Senior Site Reliability Engineer
4 days ago
Dallas, Texas, United States Cognizant Full timeSenior Site Reliability Engineer (Hybrid) Cognizant stands as a prominent global entity delivering IT solutions, encompassing digital transformation, technology services, consulting, and operational support. At Cognizant, we embrace innovative thinking and explore new concepts daily. Our mission is to assist leading enterprises in reimagining their...
-
Dallas, Texas, United States American Airlines Full timeIntroductionAre you ready to embark on a journey filled with opportunities, both professionally and personally? Become a part of the American Airlines family, where you can explore the globe, enhance your skills, and evolve into your best self. As you begin this exciting chapter, you will face challenges with adaptability and poise, acquiring new...
-
Reliability Engineering Specialist
16 hours ago
Dallas, Texas, United States Diverse Lynx Full timeJob Title: Site Reliability EngineerLocation: RemoteDuration: Full TimeJob Description:Key Responsibilities:Ensure the reliability and uptime of systems, minimizing downtime and meeting service-level objectives (SLOs).Develop, automate, and implement tools to streamline processes, deploy applications, and manage infrastructure.Set up and maintain monitoring...
-
Reliability Engineer
14 hours ago
Dallas, Texas, United States Dice Full timeAbout the Role:Dice is a leading career destination for tech experts at every stage of their careers. We're seeking a skilled Reliability Engineer to join our team.Job Summary:We're looking for a highly motivated and experienced Reliability Engineer to join our team. As a Reliability Engineer, you will be responsible for ensuring that our applications are...
-
On-Site Engineering Specialist
1 week ago
Dallas, Texas, United States Jobot Full timeJoin Our Innovative TeamAbout Jobot:At Jobot, we pride ourselves on making a significant impact in the engineering sector. We are currently in search of a passionate and skilled Permanent Field Engineer to enhance our team.Why Choose Us? Comprehensive Benefits Package 401(K) Plan Generous Paid Time Off Supportive Team Environment that Promotes Career...
-
Lead Security Reliability Engineer
1 week ago
Dallas, Texas, United States Aurora Innovation Full timeAbout UsAurora Innovation is at the forefront of revolutionizing transportation through self-driving technology. Our mission is to enhance safety, accessibility, and efficiency in transportation systems. The Aurora Driver is a sophisticated self-driving platform that caters to various vehicle types, including freight and passenger transport, forming the...
-
Reliability Engineering Specialist
1 week ago
Dallas, Texas, United States Siri InfoSolutions Inc Full timePosition OverviewGreetings,We are currently seeking a Reliability Engineering Specialist to join our dynamic team.This role involves ensuring the reliability and performance of our systems through effective monitoring and automation.Key Responsibilities:Implementing automation solutions to enhance system stability and performance.Conducting regular health...
-
Infrastructure Reliability Specialist
1 week ago
Dallas, Texas, United States STIAOS Technologies Full timeAbout STIAOS Technologies: STIAOS Technologies is a prominent player in the tech industry, looking for a talented Infrastructure Reliability Specialist to enhance their team. The successful applicant will possess a strong background in Java Spring Boot, Kubernetes, and the eCommerce sector.Core Responsibilities:Work collaboratively with cross-functional...
-
On-Site Safety Engineer
1 week ago
Dallas, Texas, United States Brady Full timePosition: Field Service Engineer - LOTO Requisition ID:: 3581 About Us: Brady is a global leader in safety, identification, and compliance solutions, dedicated to enhancing workplace safety and productivity. Our extensive expertise spans various industries, ensuring our products are trusted worldwide. From manufacturing to healthcare, our solutions are...
-
Reliability Engineer
16 hours ago
Dallas, Texas, United States Hearst Full timeAbout UsHomecare Homebase, a subsidiary of Hearst Corporation, is a leading provider of healthcare software solutions. Our mission is to deliver innovative, cloud-based technologies that improve clinical, operational, and financial outcomes for homecare and hospice agencies across the United States.Our CultureWe value a culture of caring, action, respect,...
-
Electrical Reliability Engineer III
1 week ago
Dallas, Texas, United States Gaf Full timePosition Overview At GAF, we prioritize more than just structures; we prioritize our people. Here, you will have access to the necessary tools and resources to advance your career. You will immerse yourself in our exceptional culture and be empowered to assist your colleagues, clients, and, most importantly, your community. Within our organization, we...
-
Site Project Engineer
1 week ago
Dallas, Texas, United States Tutor PeriniParsons JV Full timeTutor Perini/Parsons JV is currently seeking a Project Engineer to enhance their team.About Tutor Perini/Parsons JVThis company excels in the planning, execution, and upkeep of electrical systems, structured cabling solutions, integrated security technologies, and advanced building systems. Their extensive portfolio encompasses projects across the United...
-
Lead Engineer for PRA Site Operations
4 days ago
Dallas, Texas, United States Westinghouse Electric Company, LLC Full timeAre you eager to contribute to a forward-thinking team dedicated to advancing Westinghouse's mission of delivering sustainable energy solutions? At Westinghouse, we value our workforce as our greatest asset and strive to attract and recruit top-tier talent while fostering a diverse and inclusive workplace.If you envision thriving in such an environment, we...