Senior Site Reliability Engineer

3 weeks ago


Austin, United States Terminal Industries Full time
About Us

Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last great frontier of untapped data. In the process, Terminal will address many industry-wide pain points, including compliance, manual processes, equipment location, phantom costs, and labor inefficiencies. Ultimately, Terminal will become the central nervous system for the yard, seamlessly connecting all data sources to support an extensive range of essential functions.

Overview

Our world class vision engineering team has built an engine that can process the movement of trucks and containers in real-time. It's now time to unlock the potential of that engine by building SaaS applications that leverage the vision engine to transform the logistics industry. As part of Terminal's Site Reliability Engineering team you will help build out the network and IoT infrastructure required to deploy and operate our camera technology at scale.

We are seeking an experienced Senior Site Reliability Engineer with a minimum of 8 years of relevant experience to join our team. As a founding member of our Engineering team, you will play a pivotal role in architecting and developing cutting-edge solutions. The ideal candidate possesses expertise in AWS, proficiency in operations, and running software at scale. They will have a deep understanding of event-driven technologies, hands-on experience with modern data stores, and a commitment to implementing observability and a passion for operational excellence. Taking ownership of production quality, reliability and security.

Responsibilities
  • Oversee the deployment, management, and maintenance of IoT devices, including camera systems and sensors. Ensure devices are properly integrated, configured, and secured within the network.
  • Manage firmware updates and patches for IoT devices, ensuring that all devices are up-to-date and secure. Develop and implement strategies for efficient deployment of updates.
  • Implement mechanisms for collecting and processing data from IoT devices. Ensure data integrity, availability, and confidentiality.
  • Troubleshoot and resolve connectivity issues related to IoT devices. Manage integration between IoT devices and cloud infrastructure, ensuring seamless data flow and system interoperability.
  • Design and implement solutions to scale IoT deployments effectively. Monitor device performance and system health to ensure high reliability and availability.
  • Design, build, and operate infrastructure using Infrastructure as Code (IaC) tools like Terraform and Ansible. Develop and maintain infrastructure automation to ensure scalability and reliability.
  • Define and implement best practices for continuous deployment of software and services using CI/CD tools such as GitHub Actions. Automate deployment processes to streamline operations.
  • Lead incident response efforts, including diagnosis, resolution, and post-mortem analysis. Implement robust monitoring and alerting systems to ensure quick detection and resolution of issues.
  • Ensure that systems adhere to security best practices and regulatory compliance requirements. Implement security measures and conduct regular audits to safeguard production environments.
Requirements
  • Minimum of 8 years of experience in Site Reliability Engineering or a related role, with a proven track record of managing complex production environments.
  • Strong background in operating systems, networking, distributed systems, and database management. Expertise in AWS cloud services and infrastructure management.
  • Hands-on experience with deploying, managing, and maintaining IoT devices and sensor systems. Knowledge of IoT protocols (e.g., MQTT, CoAP) and device integration practices.
  • Experience in managing firmware updates and ensuring the security and functionality of IoT devices.
  • Proficiency in managing and troubleshooting connectivity issues in IoT environments, including wireless and wired communication protocols.
  • Experience with data collection and processing from IoT devices, including ensuring data quality and managing large volumes of data.
  • Demonstrated experience in incident response, production monitoring, and capacity planning. Ability to handle high-pressure situations and ensure system reliability.


What We Offer

Joining the Terminal team means being part of a dynamic, innovative environment where your work directly impacts the future of logistics and the global supply chain. You will work closely with a team of experts passionate about operational excellence and technological innovation. We offer competitive salaries, a comprehensive benefits package, and opportunities for professional growth.

  • Austin, United States Expedia Group Full time

    Senior Software Development Engineer - Site Reliability  We are seeking a highly skilled and experienced Senior Software Development Engineer (SRE) to join our team. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our services and systems. You will work closely with development and operations teams to...


  • Austin, United States Terminal Industries Full time

    About Us Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last...


  • Austin, Texas, United States Apex Systems Full time

    Job DescriptionPosition: Site Reliability EngineerLocation: RemoteDuration: 1 yearRate: $67/hr W-2We are seeking a highly skilled Site Reliability Engineer to join our team at Apex Systems. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key...


  • Austin, Texas, United States Visa Full time

    Company OverviewVisa stands as a global frontrunner in digital payment solutions, orchestrating over 215 billion transactions annually across a vast network of consumers, merchants, financial institutions, and governmental bodies in more than 200 nations. Our vision is to unite the globe through the most advanced, convenient, reliable, and secure payment...


  • Austin, Texas, United States Visa Full time

    About the RoleAs a Senior Site Reliability Engineer at Visa, you will play a critical role in ensuring the security and availability of our systems and applications. Our systems handle transactions worth over a trillion dollars, and we are committed to providing a secure and reliable environment for our customers.Key ResponsibilitiesEnsure the security and...


  • Austin, Texas, United States Cape Henry Associates, Acquired by JANUS Research Group Full time

    Janus is looking for a seasoned Site Reliability Engineer / DevSecOps Developer to help grow our capability with our DoD clients.Develop Infrastructure as Code (IaC) designing, implementing, and maintaining infrastructure using IaC technologies(e.g. terraform or similar) ensuring scalable, reliable, and efficient platformsCollaborate with data and other...


  • Austin, Texas, United States Expedia Group Full time

    Principal Site Reliability EngineerWe are looking for a highly qualified and seasoned Principal Site Reliability Engineer (SRE) to enhance our operations. The successful candidate will play a crucial role in guaranteeing the stability, scalability, and efficiency of our systems and services. You will collaborate closely with both development and operational...


  • Austin, United States Terminal Industries Full time

    About Us Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last...


  • Austin, United States JobRialto Full time

    Skills: 6+ years of experience in systems and platform operations and technology Experience with On Prem and Public Cloud - AWS, EKS Scripting languages like Python Linux Administration and Cloud, DevOps experience would be a plus Team As a member of the Site Reliability Engineering & Production Services team, you will work with other technology...


  • Austin, United States GE Renewable Energy Power and Aviation Full time

    Job Description SummaryThe Site Reliability Engineering team is responsible for the reliability and performance of tools worldwide. We obsess over availability by building tools and engineering new systems to automate our platform. We are software engineers with full visibility and influenceacross the entire stack.Job DescriptionRoles and ResponsibilitiesIn...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and services.Key ResponsibilitiesDesign, build, and maintain robust infrastructure and automation solutionsWork closely with...


  • Austin, Texas, United States Expedia Group Full time

    Principal Software Development Engineer - Site ReliabilityWe are looking for a highly proficient and seasoned Principal Software Development Engineer (SRE) to enhance our team. The successful candidate will be accountable for maintaining the reliability, scalability, and performance of our systems and services. You will collaborate closely with both...


  • Austin, United States GE Aviation Full time

    Job Description SummaryThe Site Reliability Engineering team is responsible for the reliability and performance of tools worldwide. We obsess over availability by building tools and engineering new systems to automate our platform. We are software engineers with full visibility and influenceacross the entire stack.Job DescriptionRoles and ResponsibilitiesIn...


  • Austin, United States GE Aviation Full time

    Job Description SummaryThe Site Reliability Engineering team is responsible for the reliability and performance of tools worldwide. We obsess over availability by building tools and engineering new systems to automate our platform. We are software engineers with full visibility and influenceacross the entire stack.Job DescriptionRoles and ResponsibilitiesIn...


  • Austin, United States GE Aerospace Full time

    In this role, you will: - Work extensively with Microsoft technologies, including Azure, Azure DevOps, Windows, ASP.NET Core and Powershell. - Build automation to deploy and maintain PaaS and IaaS resources in Microsoft Azure, using Terraform, Ansibl Reliability Engineer, Liability, Engineer, Reliability, Reliability, Microsoft, Technology


  • austin, United States Expedia Partner Solutions Full time

    If you need assistance during the recruiting process due to a disability, please reach out to our Recruiting Accommodations Team through the Accommodation Request form . This form is used only by individuals with disabilities who require assistance or adjustments in applying and interviewing for a job. This form is not for inquiring about a position or the...


  • Austin, United States Expedia Partner Solutions Full time

    If you need assistance during the recruiting process due to a disability, please reach out to our Recruiting Accommodations Team through the Accommodation Request form . This form is used only by individuals with disabilities who require assistance or adjustments in applying and interviewing for a job. This form is not for inquiring about a position or the...


  • austin, United States Expedia Partner Solutions Full time

    If you need assistance during the recruiting process due to a disability, please reach out to our Recruiting Accommodations Team through the Accommodation Request form . This form is used only by individuals with disabilities who require assistance or adjustments in applying and interviewing for a job. This form is not for inquiring about a position or the...


  • Austin, Texas, United States Expedia Group Full time

    Principal Software Development Engineer - Site ReliabilityWe are in search of a highly qualified and seasoned Principal Software Development Engineer (SRE) to enhance our operations. The ideal candidate will be tasked with ensuring the dependability, scalability, and efficiency of our services and systems. You will collaborate closely with both development...


  • Austin, United States Expedia Partner Solutions Full time

    If you need assistance during the recruiting process due to a disability, please reach out to our Recruiting Accommodations Team through the Accommodation Request form. This form is used only by individuals with disabilities who require assistance or adjustments in applying and interviewing for a job. This form is not for inquiring about a position or the...