Senior Site Reliability Engineer

2 weeks ago


Minneapolis Minnesota, United States Novon Consulting Full time
Job Title: Senior Site Reliability Engineer

We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Novon Consulting. As a key member of our engineering team, you will be responsible for establishing and driving best practices in system reliability, performance optimization, and observability.

Key Responsibilities:
  • Collaborate with development and operations teams to design, implement, and maintain observability frameworks that provide deep insights into system performance.
  • Lead the establishment of Service Level Objectives (SLOs) and Service Level Indicators (SLIs), ensuring they align with business goals and drive continuous performance improvements.
  • Partner with stakeholders to understand system performance requirements and translate them into actionable performance engineering strategies.
  • Proactively identify performance bottlenecks and collaborate with teams to implement solutions that enhance system scalability and reliability.
  • Design and execute performance regression test suites, focusing on data-intensive and ML workloads, to ensure continuous performance optimization.
  • Own the reliability and performance metrics of our systems, driving a culture of performance excellence and proactive issue resolution.
  • Collaborate with subject matter experts to gain a deep understanding of domain-specific performance challenges, particularly in data and ML pipelines.
  • Utilize tools like Datadog, Jira, and GitHub to monitor system performance, manage projects, and track issues, with a strong emphasis on performance-related metrics.
  • Define and monitor success metrics, ensuring our systems consistently meet or exceed performance and reliability targets.
  • Actively contribute to the continuous improvement of performance engineering practices across the team, fostering a culture of excellence in observability and system performance.
Requirements:
  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • Five years of experience in a site-reliability-focused role responsible for establishing reliability standards in a cloud-native environment.
  • Strong expertise in establishing SLOs/SLIs and building observability frameworks for complex systems.
  • Proficiency with cloud services, particularly AWS, and experience in designing scalable and reliable architectures.
  • Hands-on experience with performance monitoring and observability tools like Datadog.
  • Proficiency in version control systems like Git/GitHub and infrastructure as code tools like Terraform.
  • Strong interpersonal skills and excellent communication abilities, with a focus on driving performance improvements across teams.
Preferred Qualifications:
  • Proficiency in Java programming and hands-on experience with REST, Spring, and microservices development.
  • Proficiency in RDBMS schema design and index utilization.

We are an equal opportunities employer and welcome applications from diverse candidates. If you are a motivated and experienced Senior Site Reliability Engineer looking to join a dynamic team, please submit your application.



  • Minneapolis, Minnesota, United States Novon Consulting Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Novon Consulting. As a key member of our engineering team, you will be responsible for establishing and driving best practices in system reliability, performance optimization, and observability.Key Responsibilities:Collaborate with...


  • Minneapolis, Minnesota, United States Insight Global Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Insight Global. As a key member of our engineering team, you will be responsible for ensuring the reliability and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud...


  • Minneapolis, Minnesota, United States Insight Global Full time

    Job Title: Senior Site Reliability EngineerAt Insight Global, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud-based systems.Key Responsibilities:Develop and maintain Ansible playbooks...


  • Minneapolis, Minnesota, United States Novon Consulting Full time

    Job OverviewWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Novon Consulting. As a key member of our infrastructure operations team, you will be responsible for establishing and driving best practices in system reliability, performance optimization, and observability.Key ResponsibilitiesCollaborate with development and...


  • Minneapolis, United States Novon Consulting Full time

    Job DescriptionJob DescriptionContract to hire rolemust reside in the Minneapolis areaW2 onlyWe are seeking a Senior Site Reliability Engineer that will be at the forefront of establishing and driving best practices in system reliability, performance optimization, and observability. With over five years of experience, you bring deep expertise in software...


  • Minneapolis, Minnesota, United States Novon Consulting Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Novon Consulting. As a key member of our technical team, you will be responsible for establishing and driving best practices in system reliability, performance optimization, and observability.Key ResponsibilitiesDesign and Implement Observability Frameworks:...


  • Minneapolis, United States Novon Consulting Full time

    Contract to hire role must reside in the Minneapolis area W2 only We are seeking a Senior Site Reliability Engineer that will be at the forefront of establishing and driving best practices in system reliability performance optimization and observability. With over five years of experience you bring deep expertise in software development and infrastructure...


  • Minneapolis, Minnesota, United States Trane Technologies Full time

    Unlock Your Potential as a Senior Reliability EngineerAt Trane Technologies, we're committed to creating innovative climate solutions for a sustainable world. As a Senior Reliability Engineer, you'll play a crucial role in enhancing the reliability and durability of our products, ensuring they meet the highest standards of quality and performance.Key...


  • Minneapolis, Minnesota, United States Trane Technologies Full time

    About the RoleWe are seeking a highly skilled Senior Reliability Engineer to join our team at Trane Technologies. As a key member of our engineering team, you will play a critical role in enhancing the reliability and durability of our products.Key Responsibilities:Conduct Failure Mode and Effects Analysis (FMEA) to identify potential failure modes and...


  • Minneapolis, Minnesota, United States SmartThings Full time

    About the RoleWe're SmartThings, a leading IoT ecosystem, creating effortless smart home experiences. As a wholly owned subsidiary of Samsung, our corporate offices are based in Minneapolis and the Bay Area. With over 270 million users worldwide, we deliver simple, powerful experiences across Samsung's portfolio of phones, TVs, and appliances. As a founding...


  • Minneapolis, Minnesota, United States CliftonLarsonAllen Full time

    CliftonLarsonAllen (CLA) is a prominent national professional services firm committed to creating opportunities for our clients, our employees, and our communities through a range of industry-focused services including wealth advisory, digital solutions, audit, tax, consulting, and outsourcing. With a workforce exceeding 8,500 and numerous locations across...


  • Minneapolis, Minnesota, United States DSJ Global Full time

    Position: Reliability EngineerSector: Food & BeverageLocation: Nebraska, USDSJ Global is collaborating with a leading Fortune 500 manufacturing organization in search of a skilled Reliability Engineer. In this role, you will oversee the formulation, management, and daily implementation of the site's predictive reliability initiatives.Key...


  • Minneapolis, United States SmartThings Full time

    We’re SmartThings, one of the leading IoT ecosystems in the world, creating the most effortless way for anyone to create a smart home. As a wholly owned subsidiary of Samsung, our corporate offices are based in Minneapolis and the Bay Area.More than 270 million people worldwide use SmartThings to control and manage their connected life. SmartThings...


  • Minneapolis, Minnesota, United States Trane Technologies Full time

    Job DescriptionAt Trane Technologies, we're committed to creating innovative climate solutions for a sustainable world. As a Senior Reliability Engineer, you'll play a crucial role in enhancing the reliability and durability of our products.Key ResponsibilitiesConduct Failure Mode and Effects Analysis (FMEA) to identify potential failure modes and develop...


  • Minneapolis, Minnesota, United States Futran Tech Solutions Pvt. Ltd. Full time

    Job Title: Senior Site Reliability EngineerFutran Tech Solutions Pvt. Ltd. is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and reliable infrastructure solutions to support our FinTech payments platform.Key...


  • Minneapolis, United States Entegee Full time

    Job DescriptionJob DescriptionSummary: Seeking a Senior Reliability Engineer with experience in post-market quality reliability engineering, product management, and medical devices.Job Requirements:Bachelor's degree in a relevant field and at least 4 years of experience.Experience in post-market quality reliability engineering.Product management...


  • Minneapolis, Minnesota, United States JLL Full time

    Reliability EngineerJLL is seeking a highly skilled Reliability Engineer to join our team. As a key member of our Engineering Services Reliability & Asset Management platform, you will play a critical role in implementing a strategic asset management plan to integrate clients' existing systems, including building automation, energy management, maintenance...

  • Reliability Engineer

    1 month ago


    Minneapolis, United States Apollo Inc Full time

    We're not your average airline. We're agile, resilient, and full of uncommon opportunity. Here, you can grow as part of an ambitious team that safely and collectively supports each other, our travelers, and our community. Together, we're making travel more attainable. With more than 40 years of Minnesota roots, we're a unique hybrid low-cost carrier offering...


  • Minneapolis, Minnesota, United States Cretex Medical Component and Device Technologies Full time

    Overview:About Cretex Medical Component and Device Technologies Cretex Medical is a prominent contract manufacturer specializing in precision components and assemblies for the medical device sector. Our clients regard us as a reliable partner in the realms of injection molding, laser processing, metal stamping, and device assembly. Learn more at .Position...


  • Minneapolis, Minnesota, United States Xcel Energy Full time

    Reliability Engineer Internship OpportunityXcel Energy is seeking a highly motivated and detail-oriented individual to join our reliability engineering team as an intern. This is a unique opportunity to gain hands-on experience in power plant operations and develop skills in reliability engineering.Key Responsibilities:Support the reliability engineering...