Senior Site Reliability Engineer

2 days ago


Austin, Texas, United States Terminal Industries Full time
About Us

Terminal Industries is a pioneering company that leverages cutting-edge machine learning to digitize, index, and automate the yard. Our platform empowers warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel. These fundamental operating assets of commerce represent the last great frontier of untapped data.

Overview

Our world-class vision engineering team has developed an engine that can process the movement of trucks and containers in real-time. We are now poised to unlock the potential of that engine by building SaaS applications that leverage the vision engine to transform the logistics industry. As part of Terminal's Site Reliability Engineering team, you will play a pivotal role in architecting and developing cutting-edge solutions.

Responsibilities
  • Oversee the deployment, management, and maintenance of IoT devices, including camera systems and sensors, to ensure seamless integration and configuration within the network.
  • Manage firmware updates and patches for IoT devices, ensuring that all devices are up-to-date and secure, and develop strategies for efficient deployment of updates.
  • Implement mechanisms for collecting and processing data from IoT devices, ensuring data integrity, availability, and confidentiality.
  • Troubleshoot and resolve connectivity issues related to IoT devices, and manage integration between IoT devices and cloud infrastructure to ensure seamless data flow and system interoperability.
  • Design and implement solutions to scale IoT deployments effectively, monitoring device performance and system health to ensure high reliability and availability.
  • Design, build, and operate infrastructure using Infrastructure as Code (IaC) tools like Terraform and Ansible, developing and maintaining infrastructure automation to ensure scalability and reliability.
  • Define and implement best practices for continuous deployment of software and services using CI/CD tools such as GitHub Actions, automating deployment processes to streamline operations.
  • Lead incident response efforts, including diagnosis, resolution, and post-mortem analysis, and implement robust monitoring and alerting systems to ensure quick detection and resolution of issues.
  • Ensure that systems adhere to security best practices and regulatory compliance requirements, implementing security measures and conducting regular audits to safeguard production environments.
Requirements
  • Minimum of 12 years of experience in Site Reliability Engineering or a related role, with a proven track record of managing complex production environments.
  • Strong background in operating systems, networking, distributed systems, and database management, with expertise in AWS cloud services and infrastructure management.
  • Hands-on experience with deploying, managing, and maintaining IoT devices and sensor systems, knowledge of IoT protocols (e.g., MQTT, CoAP), and device integration practices.
  • Experience in managing firmware updates and ensuring the security and functionality of IoT devices.
  • Proficiency in managing and troubleshooting connectivity issues in IoT environments, including wireless and wired communication protocols.
  • Experience with data collection and processing from IoT devices, including ensuring data quality and managing large volumes of data.
  • Demonstrated experience in incident response, production monitoring, and capacity planning, with the ability to handle high-pressure situations and ensure system reliability.


  • Austin, Texas, United States Expedia Group Full time

    Senior Software Development Engineer - Site ReliabilityWe are seeking a highly skilled and experienced Senior Software Development Engineer (SRE) to join our team. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our services and systems. You will work closely with development and operations teams to...


  • Austin, Texas, United States Apex Systems Full time

    Job DescriptionPosition: Site Reliability EngineerLocation: RemoteDuration: 1 yearRate: $67/hr W-2We are seeking a highly skilled Site Reliability Engineer to join our team at Apex Systems. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key...


  • Austin, Texas, United States Visa Full time

    Company OverviewVisa stands as a global frontrunner in digital payment solutions, orchestrating over 215 billion transactions annually across a vast network of consumers, merchants, financial institutions, and governmental bodies in more than 200 nations. Our vision is to unite the globe through the most advanced, convenient, reliable, and secure payment...


  • Austin, Texas, United States Weedmaps Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Weedmaps. As a key member of our engineering team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key ResponsibilitiesCollaborate with Cross-Functional Teams: Work closely with our...


  • Austin, Texas, United States Cape Henry Associates, Acquired by JANUS Research Group Full time

    Janus is looking for a seasoned Site Reliability Engineer / DevSecOps Developer to help grow our capability with our DoD clients.Develop Infrastructure as Code (IaC) designing, implementing, and maintaining infrastructure using IaC technologies(e.g. terraform or similar) ensuring scalable, reliable, and efficient platformsCollaborate with data and other...


  • Austin, Texas, United States Expedia Group Full time

    Principal Site Reliability EngineerWe are looking for a highly qualified and seasoned Principal Site Reliability Engineer (SRE) to enhance our operations. The successful candidate will play a crucial role in guaranteeing the stability, scalability, and efficiency of our systems and services. You will collaborate closely with both development and operational...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and services.Key ResponsibilitiesDesign, build, and maintain robust infrastructure and automation solutionsWork closely with...


  • Austin, Texas, United States Expedia Group Full time

    Principal Software Development Engineer - Site ReliabilityWe are looking for a highly proficient and seasoned Principal Software Development Engineer (SRE) to enhance our team. The successful candidate will be accountable for maintaining the reliability, scalability, and performance of our systems and services. You will collaborate closely with both...


  • Austin, Texas, United States JobRialto Full time

    About the RoleWe are seeking a highly motivated and experienced Systems and Platform Operations Expert to join our Site Reliability Engineering & Production Services team. As a member of this team, you will work closely with other technology professionals to support Asset Management Technology - Cloud Platform solutions.Key ResponsibilitiesProvide level 2...

  • Cloud Engineer

    3 days ago


    Austin, Texas, United States Weedmaps Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Weedmaps. As a key member of our engineering team, you will play a critical role in ensuring the performance, reliability, and scalability of our cloud-based services.Key ResponsibilitiesCollaborate with Cross-Functional Teams: Work closely with our...


  • Austin, Texas, United States Expedia Group Full time

    Principal Software Development Engineer - Site ReliabilityWe are in search of a highly qualified and seasoned Principal Software Development Engineer (SRE) to enhance our operations. The ideal candidate will be tasked with ensuring the dependability, scalability, and efficiency of our services and systems. You will collaborate closely with both development...


  • Austin, Texas, United States Electric Reliability Council of Texas Full time

    Job SummaryWe are seeking a highly skilled Senior Power System Engineer to join our team at the Electric Reliability Council of Texas (ERCOT). As a key member of our operations team, you will be responsible for ensuring the reliable operation of the electric power grid in compliance with NERC Standards, ERCOT Protocols, and Market Guides.Key...


  • Austin, Texas, United States Infosys Full time

    Position Overview:Infosys is in search of a Lead Engineer for Site Reliability. This role's primary focus will be to oversee a team of Site Reliability Engineers (SREs) to proactively guarantee the stability, resilience, and scalability of our services through automation, testing, and engineering practices.Key Responsibilities:The successful candidate will...


  • Austin, Texas, United States Apple Full time

    Role SummaryApple is seeking a talented Site Reliability Engineer to ensure the reliability, scalability, and performance of our systems and services. As an SRE, you will work closely with our engineering and operations teams to design, build, and maintain robust infrastructure and automation solutions.Key ResponsibilitiesDesign and implement scalable...


  • Austin, Texas, United States ProCore CPA Full time

    About the RoleWe are seeking a highly skilled Staff Site Reliability Engineer to join our Cloud Infrastructure team at Procore CPA. As a key member of our team, you will be responsible for leading the development and implementation of cloud-based solutions to ensure the reliability and scalability of our services.Key ResponsibilitiesLead Cloud Infrastructure...


  • Austin, Texas, United States Visa Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer - Cloud Infrastructure Expert to join our team at Visa. As a key member of our cloud infrastructure team, you will be responsible for ensuring the security, availability, and performance of our cloud-based systems.Key ResponsibilitiesDesign, implement, and maintain scalable and...


  • Austin, Texas, United States Thales Full time

    About the RoleThales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and security of our cloud-based services.Key ResponsibilitiesCollaborate with project managers and service delivery managers to analyze traffic trends and capacity...


  • Austin, Texas, United States Electric Reliability Council of Texas Full time

    Job OverviewAt the Electric Reliability Council of Texas (ERCOT), we foster a vibrant and inclusive work culture that empowers our employees to collaborate and innovate for the future of the Texas power grid and wholesale market. Our commitment to diversity and inclusion is fundamental to our corporate values, which include accountability, leadership,...


  • Austin, Texas, United States Amazon Full time

    As a Senior Reliability Engineer, you will play a pivotal role in ensuring the operational excellence of Amazon's data centers globally. Your expertise will be essential in conducting thorough evaluations and providing insightful feedback on the design aspects across various engineering disciplines. In addition to your design responsibilities, you will...


  • Austin, Texas, United States Electric Reliability Council of Texas Full time

    Job OverviewAt the Electric Reliability Council of Texas (ERCOT), we pride ourselves on our inclusive and diverse work culture that empowers employees to collaborate and innovate for the future of the Texas power grid and wholesale market. We invite you to become part of our skilled and dedicated team, focused on developing exceptional solutions to address...