Sr Site Reliability Engineer

3 months ago


Bloomington, United States SAS Full time

Job Locations: US-MN-Bloomington
Requisition ID: 20060806
Category: IDeaS (a SAS company)
Position Type: Contractor

Passionate people. Loyal clients. Leading solutions.

With a rich culture of creative collaboration and professional growth, IDeaS’ team members build successful careers with us.

IDeaS is proud to be a global powerhouse of innovation and excellence; challenge and reward. No matter where we’re working, our teams come together to create leading revenue management solutions that accelerate our clients’ growth through revenue optimization.

Now we just need you

We are seeking a Senior Site Reliability Engineer at IDeaS, a SAS Company. You will play a pivotal role in ensuring the reliability, scalability, and performance of our revenue science software solutions. With a minimum of eight years of experience, you bring a wealth of knowledge and expertise in software development and infrastructure operations. You will serve as a go-to expert in ensuring the stability and efficiency of our systems, collaborating closely with cross-functional teams to address complex challenges. Your strong communication skills will be instrumental as you proactively build relationships and streamline processes to enhance system reliability and performance. You are persistent in the face of roadblocks: you clear them efficiently, pull in others when necessary, and take the initiative to keep the production environments stable while building a backlog that reduces the recurrence of issues and supports long-term scalability. Our systems are data-intensive and require a strong focus on data and machine-learning pipelines.

What you’ll be doing...

  • Collaborate closely with our development and operations teams to design, implement, and maintain highly available, scalable, and resilient software solutions, with a particular focus on data and ML pipelines.
  • Utilize your expertise in cloud computing and microservices architecture to enhance the reliability and performance of our data-intensive systems.
  • Engage with stakeholders to understand system requirements and ensure that our solutions meet rigorous reliability and performance standards, especially in the context of data processing and machine learning.
  • Actively participate in project scoping, scheduling, and task tracking, identifying potential reliability issues and implementing solutions to address them within our data-centric environment.
  • Implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure and monitor the reliability and performance of our data and ML pipelines, ensuring that they meet agreed-upon targets.
  • Collaborate with the performance engineering team to design and implement performance regression test suites tailored to data and ML workloads, ensuring that system performance is continuously monitored and optimized in these critical areas.
  • Take ownership of the reliability and performance of our codebase, providing support to internal and external users as needed, particularly in the context of data processing and ML applications.
  • Collaborate closely with subject matter experts to gain domain-specific insights into data and ML pipelines and document system designs and configurations accordingly.
  • Utilize tools like Jira, Datadog, and GitHub to manage projects, track issues, and collaborate effectively with team members, with a focus on supporting data-intensive workflows.
  • Define success metrics and monitor system performance to ensure that our solutions meet or exceed reliability and performance targets, especially in the context of data processing and ML applications.
  • Proactively identify and address potential reliability issues before they impact system performance, with a particular emphasis on maintaining the integrity and efficiency of our data and ML pipelines.
  • Perform other duties, as assigned.

What you’ll bring to us…

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • Minimum of eight years of experience in software development and/or infrastructure operations.
  • Strong interpersonal skills and excellent communication abilities, with a focus on proactive relationship-building.
  • Proficiency with cloud services and architectures, particularly AWS.
  • Hands-on experience with relational databases such as SQL Server, PostgreSQL, and MySQL.
  • Understanding of web technologies and frameworks, with experience in Angular being a plus.
  • Experience with performance monitoring and optimization tools like Datadog.
  • Proficiency in version control systems like Git/GitHub.
  • Experience with infrastructure as code tools like Terraform.
  • Knowledge of agile methodologies and best practices in software development and operations.

We Support Who You Are…

As a global company, we strive to create an inclusive environment where diverse perspectives spark innovation and meet the challenges of an evolving world. Whether you’re launching a new career or expanding your current one, IDeaS is a company where you can balance great work with all other aspects of your life. 

At IDeaS, we also aspire to live our values each day by being Accountable, Curious, Passionate and Authentic. And we continue our quest to build a more inclusive environment that attracts, represents and provides a place for diverse ideas, unique perspectives, and authentic voices.


