Site Reliability Engineer

Found in: Appcast Linkedin GBL C2 - 3 weeks ago


Austin, United States Pinnacle Group, Inc. Full time

Job Details:4


  • Excellent Python, bash, and scripting fundamentals
  • Expertise with AWS/GCP cloud platform
  • Expertise with infrastructure-as-code tools such as Terraform, Cloud Deployment Manager, Ansible, or Chef.
  • Experience in building systems where observability is a first-class concern using protocols and tools that cover the space of log aggregation, analytics, monitoring, distributed systems tracing and alerting
  • Experience with containerization and cluster management technologies like Docker, Kubernetes and EKS.
  • Familiarity with microservices architecture and container orchestration with Kubernetes
  • Experience in designing and managing a predictive alerting platform using monitoring tools such as Prometheus, Grafana, Cloud monitoring, Splunk.
  • Proficient at Linux system administration
  • Experience with modern web services architectures
  • Experience with Git
  • Expertise in Kubernetes – probably certified.
  • Able to build tools from scratch when needed.
  • Ability to quickly learn new and existing technologies.
  • Strong problem solving skills


Responsibilities:


Implement features that enable customer engineers to easily enable and configure Kubernetes and Infrastructure capabilities.


For example:


* Adding a `datadog: true` flag to our `cluster.yaml` configuration file. When enabled, this ensures that Datadog agents are installed and configured properly for this cluster.

* Modifying an image build process from x86-only to also support ARM

* Building a tool to provision AWS Managed Prometheus


A stellar candidate will be able to share experience:


* Managing infrastructure in production with Terraform, Pulumi, or a similar tool

* Developing Kubernetes components in Golang, like Operators or Admission Controllers

* Independently troubleshooting and resolving issues related to Kubernetes and Cloud infrastructure (AWS, GCP, etc).


Pay Range: $75-95

The specific compensation for this position will be determined by a number of factors, including the scope, complexity and location of the role as well as the cost of labor in the market; the skills, education, training, credentials and experience of the candidate; and other conditions of employment. Our full-time consultants have access to benefits including medical, dental, vision as well as 401K contributions.



  • Austin, United States JobRialto Full time

    Description: The Client Site Reliability team is responsible for the operations and infrastructure of all consumer-facing production systems and developer-facing systems at Client Games, including NBA Client game services, customer-facing account services, and websites. This team handles systems and services spanning multiple datacenters both terrestrial and...


  • Austin, United States Procore Technologies Full time

    Job Description What if you could use your technology skills to develop a product that impacts the way communities’ hospitals, homes, sports stadiums, and schools across the world are built? Construction impacts the lives of nearly everyone in the world, and yet it’s also one of the world’s least digitized industries. That’s why we’re looking for...


  • Austin, United States ClickHouse Full time

    We are committed to providing our customers with reliable and secure services so we are building out our newly formed Site Reliability Engineering team. As one of the first joiners to our Reliability Engineering Team at ClickHouse, you will be responsible for building and leading processes to ensure the reliability, availability, scalability, and performance...

  • Site Reliability Engineer II

    Found in: Resume Library US A2 - 1 week ago


    Austin, Texas, United States Procore Technologies Full time

    Job Description What if you could use your technology skills to develop a product that impacts the way communities’ hospitals, homes, sports stadiums, and schools across the world are built? Construction impacts the lives of nearly everyone in the world, and yet it’s also one of the world’s least digitized industries. That’s why we’re looking for...

  • Senior Site Reliability Engineer

    Found in: Resume Library US A2 - 2 weeks ago


    Austin, Texas, United States Visa Full time

    Job Description As a part of the Product Reliability Engineering (PRE) Organization of VISA , you will be responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. In this role, your time will be split between operations/on-call duties and developing systems and software that...

  • Software Engineer

    4 days ago


    Austin, United States Apple Full time

    Software Engineer (Site Reliability) Operations Lead, Enterprise Systems Conversational Engineering develops next generation communications, AI, and NLP solutions to support Apple Customers. Our mission is to maintain a comprehensive and effective support, sales & payment experience for customers around the globe. Our conversational engineering platform is...

  • Sr. Site Reliability Engineer

    Found in: Resume Library US A2 - 3 weeks ago


    Austin, Texas, United States Visa Full time

    Job Description Cybersource Production Support is responsible for supporting the CyberSource applications for enterprise-level. This team responds to all reports of application problems in production and staging environments, and works as quickly as possible to mitigate impacts, provide RCAs and recommendations, and to generate reports and analytics on...

  • Sr. Site Reliability Engineer

    Found in: Resume Library US A2 - 2 weeks ago


    Austin, Texas, United States Visa Full time

    Job Description Visa has a great toolbox of leading technologies including Cybersource and Authorize.net. Together, we are building leading edge full-service Payment Management solutions combining global payment processing, fraud management and payment security systems. We are looking for talented, technical, proactive, energetic, and passionate...

  • Senior Site Reliability Engineer

    Found in: Resume Library US A2 - 2 weeks ago


    Austin, Texas, United States Visa Full time

    Job Description The Product Reliability Engineering (PRE) group prides itself in keeping the applications and systems of Visa up and running to cater to the 24*7 needs of the business. Essential Functions: Support critical applications and ensure the stability of the applications by performing proactive maintenance activities, engage in automation...

  • Senior Database Reliability Engineer

    Found in: beBee jobs US - 3 weeks ago


    Austin, Texas, United States NinjaOne Full time

    Senior Database Reliability Engineer (DBRE) About the Role At NinjaOne we are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Senior Database Reliability Engineer (DBRE) to join our SRE team in the Platform Engineering organization and help us scale our products to millions of...

  • Senior Database Reliability Engineer

    Found in: beBee S US - 3 weeks ago


    Austin, United States NinjaOne Full time

    Senior Database Reliability Engineer (DBRE) About the Role At NinjaOne we are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Senior Database Reliability Engineer (DBRE) to join our SRE team in the Platform Engineering organization and help us scale our products to millions of...

  • Site Reliability Developer Join OCI-Ns2 with Security Clearance

    Found in: Dice One Red US C2 - 2 weeks ago


    Austin, United States Oracle Corporation Full time

    Job Description Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the critically important stack,...

  • Site Reliability Developer Join OCI-Ns2 with Security Clearance

    Found in: Dice One Red US C2 - 1 day ago


    Austin, United States Oracle Corporation Full time

    Job Description Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the critically important stack,...

  • Project Engineer

    2 weeks ago


    Austin, United States Precision Recruiters Full time

    Overview: Our client is seeking a highly skilled Civil Engineer with a focus on land development and/or site development projects. In this role, the selected candidate will apply engineering principles to design infrastructure for commercial land development projects. Additionally, they will actively engage with professional associations and community...


  • Austin, United States Netspend Full time

    About the Company: Ouro is dedicated to delivering financial empowerment to millions of Americans, leveraging a proprietary payments technology platform that fuels its fintech product innovations. From prepaid, credit and debit account solutions, to digital account and money movement services, Ouro has a broad suite of products and technologies that deliver...

  • Senior Site Reliability Engineer

    Found in: Resume Library US A2 - 2 weeks ago


    Austin, Texas, United States Visa Full time

    Job Description The Work itself:   Loyalty and Benefits Product Reliability Engineering team is responsible to maintain a stable and robust production system for all Applications which are handing a huge number of enrollments, campaigns, Online Access queries, Client interfaces for VISA. They are responsible for second level problem identification,...

  • DevOps Engineer

    Found in: Lensa US P 2 C2 - 32 minutes ago


    Austin, United States eTeam Full time

    Site Reliability Engineer Job Summary Hardware Engineering is seeking a Site Reliability Engineer to support multiple internal applications. From brainstorming through implementation, the Site Reliability Engineer will work with engineers of several internal tools to build performant and fault tolerant infrastructure in a way that is maintainable, scalable,...

  • Project Engineer

    7 days ago


    Austin, United States CareerBuilder Full time

    HBK Engineering is seeking a Project Engineer to support our growing portfolio of land development projects, including electric vehicle charging stations, commercial and utility-scale solar, battery energy storage sites and utility-related civil site work. HBK is transforming essential infrastructure to achieve a sustainable future and empower the...

  • Senior Engineer

    2 weeks ago


    Austin, United States HBK Engineering, LLC Full time

    Job DescriptionJob DescriptionHBK Engineering is a seeking Licensed Professional Civil Site Engineer to support our growing portfolio of land development projects, including electric vehicle charging stations, commercial and utility-scale solar, battery energy storage sites and utility-related civil site work.  HBK is transforming essential infrastructure...

  • Senior Engineer

    5 days ago


    Austin, United States HBK Engineering, LLC Full time

    Job DescriptionJob DescriptionHBK Engineering is a seeking Licensed Professional Civil Site Engineer to support our growing portfolio of land development projects, including electric vehicle charging stations, commercial and utility-scale solar, battery energy storage sites and utility-related civil site work.  HBK is transforming essential infrastructure...