Senior Site Reliability Engineer

3 weeks ago


Austin, Texas, United States Cognite INC. Full time

About Cognite

Embark on a transformative journey with Cognite, a global SaaS forerunner in leveraging data to unravel complex business challenges through our cutting-edge Cognite Data Fusion (CDF) platform. We were awarded the 2022 Technology Innovation Leader for Global Digital Industrial Platforms. In the realm of industrial digitalization, we stand at the forefront, reshaping the future of Oil & Gas, Manufacturing and Energy sectors. As part of the esteemed Aker family, Cognite brings forth a legacy of innovation and excellence. Excitingly, we are set to extend our footprint to the vibrant landscapes of India, which opens the door for you to be part of the expansion from the beginning. Join us in this venture where data meets ingenuity, and together, we forge the path to a smarter, more connected industrial future.

Our teams are building the next-generation industrial data platform and applications across many industries, such as Oil & Gas, Manufacturing, and Power & Utility. This includes enabling all types of workflows, including analytics, to support better decisions. We are a good mix of engineers, product managers, and designers. What we have in common is that we care deeply about the user experience and create products that users really want to use.

Our work environment is not just exciting and dynamic, but also intensely collaborative and supportive. You will work with the best domain and industry experts: designers, product managers, software developers, ML engineers, AI engineers and business leaders. We support one another, ask good questions, and give each other constructive feedback. Our goal is to leverage our diverse set of strengths and backgrounds to build innovative products, to think outside the ordinary, and grind through and nurture a great.

Learn more about Cognite here

Cognite Product Tour 2024

Cognite Product Tour 2023

Data Contextualization Masterclass 2023

Our values

Impact: Cogniters strive to make an impact in all that they do. We are result-oriented, always asking ourselves.

Ownership: Cogniters embrace a culture of ownership. We go beyond our comfort zones to contribute to the greater good, fostering inclusivity and sharing responsibilities for challenges and success.

Relentless: Cogniters are relentless in their pursuit of innovation. We are determined and deliberate (never ruthless or reckless), facing challenges head-on and viewing setbacks as opportunities for growth.

We're seeking a highly skilled and versatile SRE to join the SRE team in the infrastructure department. This role is designed for a proactive and knowledgeable individual who is comfortable working across all parts of the tech stack: infrastructure, frontend, and backend. The ideal candidate will excel in a collaborative environment and will proactively reach out to teams across the company to optimize service reliability and uptime.

A strong candidate will enjoy getting their hands dirty writing code in multiple languages, as well as figuring out optimized infrastructure setups and consumption patterns.

You will be a technical nomad: optimizing, learning, and teaching across the organization.

Key Role & Responsibilities include

  • Collaborate with product teams across the organization to understand cloud usage patterns and identify optimization opportunities, both technical and functional.
  • Develop and implement tools and processes for monitoring and analyzing cloud performance across GCP, AWS, and Azure.
  • Work closely with development teams to optimize application performance and increase uptime without impacting service quality.
  • Participate in the design and deployment of scalable and efficient cloud infrastructure solutions.
  • Lead initiatives to educate teams on best practices for reliability management and efficiency.
  • Provide regular reports and insights on cloud performance, optimization efforts, and other reliability metrics.

Key Skills, Knowledge & Abilities Include

  • Experience with one or more of the major cloud providers (GCP, AWS, Azure)
  • Fluent in one or more backend-oriented languages like Go, Kotlin, Python, Rust (and an eagerness to learn whatever is needed to get the job done)
  • SQL databases from a major vendor such as Postgres, Oracle, ...
  • Building and maintaining dashboards with tools such as Grafana, PowerBI or similar

A snapshot of our many perks and benefits as a Cogniter

Competitive Compensation including base plus bonus

401(k) with 4% employer matching

Health, Dental, Vision & Disability Coverages with premiums fully covered for employees and all dependents

Unlimited PTO + flexibility to enjoy it

18 Company Holidays including the week between Christmas & New Year's

Paid Parental Leave Program

Employee Stock Purchase Program (ESPP)

Employee Referral Program

Company Paid Friday Lunch via DoorDash + Fully Stocked Fridges in the offices

Join a team of 70 different nationalities with Diversity, Equality and Inclusion (DEI) in focus.

A highly modern and fun working environment with sublime culture across the organization; follow us on Instagram @cognitedata to learn more

Opportunity to work with and learn from some of the best people on some of the most ambitious projects found anywhere, across industries

Join our HUB to be part of the conversation directly with Cogniters and our partners.

Paid mobile phone and WiFi

*A pet lover? Get the chance to meet Spot

Why choose Cognite?

Join us in making a real and lasting impact in one of the most exciting and fastest-growing new software companies in the world. We have repeatedly demonstrated that digital transformation, when anchored on strong DataOps, drives business value and sustainability for clients and allows front-line workers, as well as domain experts, to make better decisions every single day. We were recognized as one of CNBC's top global enterprise technology startups powering digital transformation. Just recently, Frost & Sullivan named Cognite a Technology Innovation Leader. Most recently, Cognite Data Fusion achieved industry-first DNV compliance for digital twins.

Apply today

If you're excited about the opportunity to work at Cognite and make a difference in the tech industry, we encourage you to apply today. We welcome candidates of all backgrounds and identities to join our team. Please do not hesitate to contact our Talent Acquisition team with any questions -

We encourage you to follow us on Cognite LinkedIn; we post all our openings there.


