Senior Site Reliability Engineer

1 week ago


Austin, United States Cognite Full time
About Cognite

Embark on a transformative journey with Cognite, a global SaaS forerunner in leveraging data to unravel complex business challenges through our cutting-edge Cognite Data Fusion (CDF) platform. We were awarded the 2022 Technology Innovation Leader for Global Digital Industrial Platforms & Cognite was recognized as 2024 Microsoft Energy and Resources Partner of the Year. In the realm of industrial digitalization, we stand at the forefront, reshaping the future of Oil & Gas, Manufacturing and Energy sectors. Join us in this venture where data meets ingenuity, and together, we forge the path to a smarter, more connected industrial future.

Learn more about Cognite here

Cognite Product Tour 2024

Cognite Product Tour 2023

Data Contextualization Masterclass 2023

Our values

Impact: Cogniters strive to make an impact in all that they do. We are result-oriented, always asking ourselves.

Ownership: Cogniters embrace a culture of ownership. We go beyond our comfort zones to contribute to the greater good, fostering inclusivity and sharing responsibilities for challenges and success.

Relentless: Cogniters are relentless in their pursuit of innovation. We are determined and deliverable (never ruthless or reckless), facing challenges head-on and viewing setbacks as opportunities for growth.

We're seeking a highly skilled and versatile SRE to work on the SRE team in the infrastructure department. This role is designed for a proactive and knowledgeable individual who is comfortable working across all parts of the tech stack. This includes infrastructure, frontend, and backend. The ideal candidate will excel in a collaborative environment, demonstrating a strong ability to proactively reach out to various teams within the company to collaborate on optimizing service reliability and up time.

A strong candidate will enjoy getting their hands dirty writing code in multiple languages, as well as figuring out optimized infrastructure setups and consumption patterns.

You will be a technical nomad. Optimizing, learning and teaching across the organization.

Key Role & Responsibilities include
    • Collaborate with product teams across the organization to understand cloud usage patterns and identify optimization opportunities; both technical and functional.
    • Develop and implement tools and processes for monitoring, analyzing cloud performance across GCP, AWS, and Azure.
    • Work closely with development teams to optimize application performance and increase up time without impacting service quality.
    • Participate in the design and deployment of scalable and efficient cloud infrastructure solutions.
    • Lead initiatives to educate teams on best practices for reliability management and efficiency.
    • Provide regular reports and insights on cloud performance, optimization efforts, and other reliability metrics.
Key Skills, Knowledge & Abilities Include
    • Experience with one or multiple of the major cloud providers, GCP, AWS, Azure
    • Fluent in one or more backend oriented languages like Go, Kotlin, Python, Rust (and an eagerness to learn whatever is needed to get the job done)
    • SQL databases from a major vendor such as Postgres, Oracle, ...
    • Building and maintaining dashboards with tools such as Grafana, PowerBI or similar


A snapshot of our many perks and benefits as a Cogniter

* Competitive Compensation including base plus bonus

* 401(k) with 4% employer matching

* Health, Dental, Vision & Disability Coverages with premiums fully covered for employees and all dependents

* Unlimited PTO + flexibility to enjoy it

* 18 Company Holidays including the week between Christmas & New Years

* Paid Parental Leave Program

* Employee Stock Purchase Program (ESPP)

* Employee Referral Program

* Company Paid Friday Lunch via DoorDash + Fully Stocked Fridges in the offices

* Join a team of 70 different nationalities with Diversity, Equality and Inclusion (DEI) in focus .

* A highly modern and fun working environment with sublime culture across the organization, follow us on Instagram @cognitedata to know more

* Opportunity to work with and learn from some of the best people on some of the most ambitious projects found anywhere, across industries

* Join our HUB to be part of the conversation directly with Cogniters and our partners.

* Paid mobile phone and WiFI

*A pet lover? Get the chance to meet Spot

Why choose Cognite?

* Join us in making a real and lasting impact in one of the most exciting and fastest-growing new software companies in the world.

* We have repeatedly demonstrated that digital transformation, when anchored on strong DataOps, drives business value and sustainability for clients and allows front-line workers, as well as domain experts, to make better decisions every single day.

* Cognite Earns 2023 Microsoft Partner of the Year Award; Recognized as a Global Leader in Energy & Resources and Industrials & Manufacturing

* Frost & Sullivan named Cognite a Technology Innovation Leader

* Built In 2024 Best Places to Work in Austin, TX and Houston, TX

* Cognite Recognized as 2024 Microsoft Energy and Resources Partner of the Year

* Most recently Cognite Data Fusion® Achieved Industry First DNV Compliance for Digital Twins

Apply today

If you're excited about the opportunity to work at Cognite and make a difference in the tech industry, we encourage you to apply today We welcome candidates of all backgrounds and identities to join our team.

We encourage you to follow us on Cognite LinkedIn; we post all our openings there.

  • Austin, United States Talent Groups Full time

    Lead infrastructure provisioning, environment buildouts, and infrastructure maintenance.Develop and manage Ansible playbooks and Terraform execution plans.Leverage Kubernetes, Docker, and AWS to automate infrastructure and manage scalable cloud environments.Implement API automation to provision and teardown infrastructure components across tech stacks.Write...


  • Austin, United States Talent Groups Full time

    Lead infrastructure provisioning, environment buildouts, and infrastructure maintenance.Develop and manage Ansible playbooks and Terraform execution plans.Leverage Kubernetes, Docker, and AWS to automate infrastructure and manage scalable cloud environments.Implement API automation to provision and teardown infrastructure components across tech stacks.Write...


  • Austin, Texas, United States AutoRABIT Holding Inc. Full time

    About AutoRABITAutoRABIT is a hyper-growth SaaS software company and the leading provider of Salesforce DevSecOps platform for regulated industries such as financial institutions, insurance, and healthcare.About the RoleAs a Senior Site Reliability/DevOps Engineer at AutoRABIT, you will play a critical role in developing, scaling, and operating our cloud...


  • Austin, Texas, United States Unreal Gigs Full time

    Job DescriptionWe are seeking a skilled Senior Manager of DevOps and Site Reliability to join our team at Unreal Gigs. This role is responsible for leading the development, maintenance, and enhancement of our user-facing application and internal tools.About UsWe are a fully remote engineering team that values collaboration, innovation, and continuous...


  • austin, United States Talent Groups Full time

    Lead infrastructure provisioning, environment buildouts, and infrastructure maintenance.Develop and manage Ansible playbooks and Terraform execution plans.Leverage Kubernetes, Docker, and AWS to automate infrastructure and manage scalable cloud environments.Implement API automation to provision and teardown infrastructure components across tech stacks.Write...


  • Austin, Texas, United States AutoRABIT Holding Inc. Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability/DevOps Engineer to help us develop, scale, and operate our cloud services at AutoRABIT Holding Inc.In this role, you will work closely with our development teams to ensure applications are designed for reliability and performance. You will also collaborate with internal and customer-facing...


  • Austin, United States AutoRABIT Holding Inc. Full time

    About AutoRABIT: AutoRABIT is a hyper-growth SaaS software company and the leading provider of Salesforce DevSecOps platform for regulated industries such financial institutions, insurance, and healthcare. AutoRABIT solutions enable developers to automate their daily tasks to be more productive and increase the release velocity for their development team,...


  • Austin, United States Farm Credit Bank of Texas Full time

    Job DescriptionWho we are: Farm Credit Bank of Texas is a $38.2 billion wholesale bank that has been financing agriculture and rural America for over 100 years. Headquartered in Austin, Texas, we provide funding and services to rural lending associations in five states, and we are active in the nation's capital markets. While you may not be familiar with...


  • Austin, Texas, United States Electric Reliability Council of Texas Full time

    Job SummaryWe are seeking a highly skilled Reliability & Compliance Solutions Engineer to join our team at the Electric Reliability Council of Texas. This role will play a critical part in ensuring the reliability and compliance of our operations, working closely with subject matter experts to meet or exceed performance requirements.Main...


  • Austin, United States CV Library Full time

    Job DescriptionAs a part of the Product Reliability Engineering (PRE) Organization of VISA , you will be responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. In this role, your time will be split between operations/on-call duties and developing systems and software that help...


  • Austin, United States Visa Full time

    Company DescriptionVisa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure...


  • Austin, Texas, United States Electric Reliability Council of Texas Full time

    Job OverviewAt ERCOT, we strive to create a dynamic work environment that fosters innovation and collaboration. Our goal is to develop world-class solutions for today's energy challenges while promoting diversity and inclusion at all levels of our company.About the RoleWe are seeking a highly skilled Reliability & Compliance Engineer to join our team. This...


  • Austin, Texas, United States Jabil Full time

    About the RoleJabil is seeking an experienced Site Reliability Engineering Lead to contribute to the transformative growth within our Intelligent Infrastructure division. The Site Reliability Lead Engineer plays a vital role in ensuring the quality and reliability of the test network infrastructure of the Intelligent Infrastructures factories on a global...


  • Austin, Texas, United States The Electric Reliability Council of Texas (ERCOT) Full time

    We are seeking a talented Grid Reliability and Compliance Engineer to join our team at The Electric Reliability Council of Texas (ERCOT). As a key member of our team, you will be responsible for ensuring that ERCOT ISO meets or exceeds its reliability performance requirements.Your primary responsibilities will include monitoring and reporting ERCOT ISO and...


  • Austin, United States Terminal Industries Full time

    About Us Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last...


  • Austin, United States CV Library Full time

    Job DescriptionWe’re looking for a Staff Site Reliability Engineer to join Procore’s Project Execution Group. In this role, you’ll lead, collaborate, partner and develop solutions to maintain the health of the core platform. The goal is to ensure the chosen design and architecture is highly available, performant and reliable as this team is directly...


  • Austin, Texas, United States AutoRABIT Holding Inc. Full time

    Job OverviewWe are looking for an experienced Senior Site Reliability/DevOps Engineer to help us develop, scale, and operate our cloud services at AutoRABIT Holding Inc.This is a unique opportunity to join a hyper-growth SaaS software company that provides a Salesforce DevSecOps platform for regulated industries. As a Senior Site Reliability/DevOps Engineer,...


  • Austin, United States Charles Schwab Full time

    Position Type: RegularYour opportunityAt Schwab, you are empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us “challenge the status quo” and transform the finance industry together.  As a Principal Site Reliability Engineer for Schwab's Technology Solutions organization, you will be responsible...


  • Austin, United States Nomi Health Full time

    We are seeking a Site Reliability Engineer (SRE) to join our team in Austin, TX. You will play a pivotal role in ensuring the reliability, performance, and scalability of our services. You will collaborate with cross-functional teams to design, implement, and manage infrastructure that is robust and resilient. Your focus will be on developing and refining...

  • Cloud Engineer

    2 weeks ago


    Austin, Texas, United States AutoRABIT Holding Inc. Full time

    Job OverviewA challenging role has become available for a Senior Site Reliability/DevOps Engineer to join our cloud services team at AutoRABIT Holding Inc.We are looking for an experienced professional with a strong background in site reliability engineering, DevOps, and automation. The ideal candidate will have a proven track record of implementing...