Site Reliability Engineer

2 weeks ago


Houston, Texas, United States Imubit Full time
About Imubit

Imubit is a pioneering company that leverages breakthrough machine learning technologies to disrupt the refining and chemical industries. Our innovative approach directly controls and optimizes refineries and chemical plants, adding millions of dollars to the plant bottom line while ensuring safe operating limits, energy efficiency, and sustainability objectives.

Our Mission

We are committed to empowering our customers to achieve real-time optimization and profitability through our patented Closed Loop Neural Network platform. Our solution is currently optimizing the manufacturing facilities of Fortune-500 companies, and we are backed by tier-1 venture capital firms such as Insight Partners.

Job Description

We are seeking a top-notch Site Reliability Engineer to design and support Imubit's cloud infrastructure. As part of this role, you will work to optimize deployment processes and ensure systems run smoothly. You will collaborate with software developers, DevOps engineers, and other stakeholders to implement robust solutions and drive continuous improvement.

Key Responsibilities
  • Design, deploy, and maintain Imubit's cloud infrastructure to provide high uptime, scalability, and security.
  • Leverage public cloud services and tools to improve efficiency and reliability of our services and workflows.
  • Architect and manage cross-cloud network infrastructure, including subnets, routing tables, IPSec VPNs, Transit Gateways, and firewall rules.
  • Engage in and improve the whole lifecycle of services, from inception and design, through deployment, operation, and refinement.
  • Participate in infrastructure on-call rotation and respond in a timely manner.
  • Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
Requirements
  • 5 years of experience maintaining production-level cloud infrastructure, including public cloud services (e.g., AWS, GCP).
  • Preferred BA/B.Sc. in Computer Science or equivalent.
  • Experience with a programming language such as Python or Go.
  • Experience deploying and supporting services in Kubernetes, including GitOps management tools such as ArgoCD.
  • Familiarity with software development principles and concepts (e.g., version control, software development lifecycle).
  • Experience implementing and utilizing monitoring tools (e.g., New Relic, Splunk, Grafana, Prometheus).
  • Experience managing production databases (e.g., PostgreSQL), including managed services (e.g., AWS RDS).
  • Experience with Infrastructure-as-code concepts and tools (e.g., Terraform, Ansible).
  • Experience with secrets management tools (e.g., HashiCorp Vault, AWS Secrets Manager).
  • Interest in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug and optimize code and automate routine tasks.
  • Systematic problem-solving approach, coupled with effective communication skills and a sense of ownership and drive.
What We Offer

Imubit provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability, or genetics. We comply with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities.



  • Houston, Texas, United States Syntricate Technologies Full time

    Job Opportunity: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies Inc. in Jacksonville, FL, Cary, NC, or New York, NY. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our cloud-based systems.Key Responsibilities:Design...


  • Houston, Texas, United States Syntricate Technologies Full time

    Job Opportunity: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies Inc. in Jacksonville, FL, Cary, NC, or New York, NY. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our cloud-based systems.Key Responsibilities:Design...


  • Houston, Texas, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and improve...


  • Houston, Texas, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL...


  • Houston, Texas, United States Cognizant North America Full time

    Job Title: Site Reliability EngineerJob Summary:Cognizant North America is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems. You will work closely with cross-functional teams to design, implement, and...


  • Houston, Texas, United States Infinity Consulting Solutions Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Infinity Consulting Solutions. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Houston, Texas, United States Cognizant North America Full time

    Job Title: Site Reliability EngineerCognizant North America is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design, implement, and maintain cloud-based infrastructure using...


  • Houston, Texas, United States Franklin Fitch Full time

    Site Reliability Engineer OpportunityWe're working with a growing Managed Service Provider based out of Houston who are looking for a Site Reliability Engineer to join the team following their recent growth, to help service their new and existing clients across Texas.About the RoleThis is a chance to join a growing company and quickly progress into a...


  • Houston, Texas, United States Diverse Lynx Full time

    Role Overview We are seeking a Site Reliability Engineer (SRE) Support Analyst to manage a team responsible for building, deploying, operating, sustaining, and growing software systems that scale, monitor, secure, manage, and automate Frontier's systems. Key Responsibilities * Manage a team of engineers responsible for building, deploying, and operating...


  • Houston, Texas, United States Franklin Fitch Full time

    Site Reliability Engineer OpportunityWe're working with a growing Managed Service Provider based out of Houston who are looking for a Site Reliability Engineer to join the team following their recent growth, to help service their new and existing clients across Texas.This is a chance to join a growing company and quickly progress into a position of...


  • Houston, Texas, United States TekWissen ® Full time

    Job Title: Site Reliability EngineerWork Location: RemoteJob Type: ContractWork Type: OnsiteDuration: 5+ MonthsPay Rate: $60-60/hr.Job Description/Responsibilities:Cloud RunBigQueryPub/SubGoogle Cloud LoggingGoogle Cloud Monitoring & Metrics (including Custom Metrics)Google Cloud AlertingExperience creating SLI/SLO/SLA'sExperience identifying bottlenecks and...


  • Houston, Texas, United States Ampcus Incorporated Full time

    Job Title: Site Reliability EngineerJob Summary:As a Site Reliability Engineer at Ampcus Incorporated, you will be responsible for designing, developing, testing, and implementing software applications with expertise in Linux, Windows, Java, HTML, CSS, JavaScript, and React. With 5+ years of experience in the IT industry, you will be skilled in...


  • Houston, Texas, United States Fintex Holdings Inc Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Fintex Holdings Inc. As a key member of our engineering team, you will be responsible for ensuring the performance, scalability, and reliability of our platform. Your expertise in software development and engineering will be crucial in monitoring and investigating...


  • Houston, Texas, United States TekWissen LLC Full time

    Job OverviewTekWissen Group is a leading workforce management provider with a global presence. Our client is a renowned American multinational information technology services and consulting company, dedicated to helping top companies build stronger businesses.Job Title: Site Reliability (SRE) EngineerLocation: Houston, TXJob Type: ContractWork Type:...


  • Houston, Texas, United States Cloudious LLC Full time

    Job DescriptionCloudious LLC is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our cloud infrastructure team, you will be responsible for designing, developing, and implementing software applications that meet the highest standards of quality and reliability.Key Responsibilities:Design and implement software...


  • Houston, Texas, United States VDart Inc Full time

    Job DescriptionVDart Inc is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our IT infrastructure team, you will be responsible for designing, developing, testing, and implementing software applications.Key Responsibilities:Design and implement software applications using Linux, Windows, Java, HTML, CSS, and...


  • Houston, Texas, United States Imubit Full time

    About ImubitImubit is a pioneering company that leverages breakthrough machine learning technologies to disrupt the refining and chemical industries. Our innovative approach directly controls and optimizes refineries and chemical plants, adding millions of dollars to the plant bottom line while ensuring safe operating limits, energy efficiency, and...


  • Houston, Texas, United States Imubit Full time

    About ImubitImubit is a pioneering company that leverages breakthrough machine learning technologies to disrupt the refining and chemical industries. Our innovative approach directly controls and optimizes refineries and chemical plants, adding millions of dollars to the plant bottom line while ensuring safe operating limits, energy efficiency, and...


  • Houston, Texas, United States Imubit Full time

    About ImubitImubit is a pioneering company that leverages breakthrough machine learning technologies to disrupt the refining and chemical industries. Our innovative approach directly controls and optimizes refineries and chemical plants, adding millions of dollars to the plant bottom line while ensuring safe operating limits, energy efficiency, and...


  • Houston, Texas, United States MRI Technologies Full time

    Job Title: AppDat Site Reliability EngineerMRI Technologies is seeking an experienced Site Reliability Engineer (SRE) to support our AppDat Platform at NASA. As a key member of our team, you will play a crucial role in ensuring the reliability, scalability, and security of our cloud infrastructure.Key Responsibilities:Design, implement, and manage robust...