Site Reliability Engineer

5 days ago


Austin, Texas, United States Terminal Industries Full time
About Us
Terminal Industries builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning.

Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel.

These are the fundamental operating assets of commerce - and represent the last great frontier of untapped data.

In the process, Terminal Industries will address many industry-wide pain points, including compliance, manual processes, equipment location, phantom costs, and labor inefficiencies.

Ultimately, Terminal Industries will become the central nervous system for the yard, seamlessly connecting all data sources to support an extensive range of essential functions.

Our world-class vision engineering team has built an engine that can process the movement of trucks and containers in real-time.

It's now time to unlock the potential of that engine by building SaaS applications that leverage the vision engine to transform the logistics industry.

As part of Terminal Industries' Site Reliability Engineering team, you will help build out the network and IoT infrastructure required to deploy and operate our camera technology at scale.

We are seeking an experienced Site Reliability Engineer with a minimum of 5 years of relevant experience to join our team.

As a founding member of our Engineering team, you will play a pivotal role in architecting and developing cutting-edge solutions.

The ideal candidate possesses expertise in AWS, proficiency in operations, and running software at scale.

They will have a deep understanding of event-driven technologies, hands-on experience with modern data stores, and a commitment to implementing observability and a passion for operational excellence.

Taking ownership of production quality, reliability, and security.
Responsibilities
Oversee the deployment, management, and maintenance of IoT devices, including camera systems and sensors. Ensure devices are properly integrated, configured, and secured within the network.
Manage firmware updates and patches for IoT devices, ensuring that all devices are up-to-date and secure. Develop and implement strategies for efficient deployment of updates.
Implement mechanisms for collecting and processing data from IoT devices. Ensure data integrity, availability, and confidentiality.
Troubleshoot and resolve connectivity issues related to IoT devices. Manage integration between IoT devices and cloud infrastructure, ensuring seamless data flow and system interoperability.
Design and implement solutions to scale IoT deployments effectively. Monitor device performance and system health to ensure high reliability and availability.
Design, build, and operate infrastructure using Infrastructure as Code (IaC) tools like Terraform and Ansible. Develop and maintain infrastructure automation to ensure scalability and reliability.
Define and implement best practices for continuous deployment of software and services using CI/CD tools such as GitHub Actions. Automate deployment processes to streamline operations.
Lead incident response efforts, including diagnosis, resolution, and post-mortem analysis. Implement robust monitoring and alerting systems to ensure quick detection and resolution of issues.
Ensure that systems adhere to security best practices and regulatory compliance requirements. Implement security measures and conduct regular audits to safeguard production environments.
Requirements

Minimum of 5 years of experience in Site Reliability Engineering or a related role, with a proven track record of managing complex production environments.

Strong background in operating systems, networking, distributed systems, and database management. Expertise in AWS cloud services and infrastructure management.
Hands-on experience with deploying, managing, and maintaining IoT devices and sensor systems. Knowledge of IoT protocols (e.g., MQTT, CoAP) and device integration practices.
Experience in managing firmware updates and ensuring the security and functionality of IoT devices.
Proficiency in managing and troubleshooting connectivity issues in IoT environments, including wireless and wired communication protocols.
Experience with data collection and processing from IoT devices, including ensuring data quality and managing large volumes of data.
Demonstrated experience in incident response, production monitoring, and capacity planning. Ability to handle high-pressure situations and ensure system reliability.


  • Austin, Texas, United States Apple Full time

    Job Title: Site Reliability EngineerJob Summary:At Apple, we are seeking a highly skilled Site Reliability Engineer to join our Ad Platforms team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our ad-tech systems.Key Responsibilities:Implement and improve our infrastructure and...


  • Austin, Texas, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based services.Key ResponsibilitiesDesign, develop, and deploy automation tools to improve the efficiency and reliability of our cloud...


  • Austin, Texas, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based services.Key ResponsibilitiesDesign, develop, and deploy software to improve the availability, scalability, and efficiency of Oracle...


  • Austin, Texas, United States Unreal Gigs Full time

    Job Summary:At Unreal Gigs, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you'll play a critical role in ensuring the high availability, scalability, and performance of our complex distributed systems. You'll be responsible for building and maintaining highly reliable systems, automating infrastructure...


  • Austin, Texas, United States Cisco Full time

    About the RoleCisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement automated solutions to improve the reliability and...


  • Austin, Texas, United States Thales Full time

    Job Title: Site Reliability EngineerThales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and security of our cloud-based services.Key Responsibilities:Collaborate with project managers and service delivery managers to analyze traffic...


  • Austin, Texas, United States Unreal Gigs Full time

    Job Summary:At Unreal Gigs, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you'll play a critical role in ensuring the high availability, scalability, and performance of our complex distributed systems. You'll be responsible for designing, implementing, and maintaining reliable systems, automating...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking an innovative Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will design, build, and maintain our core infrastructure, enabling thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple customers.Key ResponsibilitiesCollaborate with...


  • Austin, Texas, United States Cisco Full time

    About the RoleCisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement automated solutions to improve infrastructure stability and scalabilityCollaborate with...


  • Austin, Texas, United States Terminal Industries Full time

    About UsTerminal Industries is a cutting-edge technology company that's revolutionizing the logistics industry with its innovative software solutions.Our platform leverages machine learning and IoT technology to digitize, index, and automate warehouse operations, providing warehouse operators with the intelligence needed to optimize their usage of trucks,...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking an innovative Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will design, build, and maintain our core infrastructure, enabling thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple customers.Key ResponsibilitiesCollaborate with...


  • Austin, Texas, United States ORACLE AMERICA Full time

    Job Summary:Oracle America is seeking a skilled Site Reliability Developer 3 to join our team in Austin, TX. As a Site Reliability Developer, you will be responsible for solving complex problems related to infrastructure and cloud services, and building automation to prevent problem recurrence.Key Responsibilities:Solve complex problems related to...


  • Austin, Texas, United States Liquibase Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Liquibase. As a key member of our DevOps team, you will be responsible for designing, implementing, and maintaining highly resilient and secure infrastructure for our SaaS platform using AWS services.Key Responsibilities:Design and implement secure and scalable...


  • Austin, Texas, United States Apple Full time

    Job Title: Site Reliability Engineering ManagerAbout the Role:Apple is seeking a highly skilled Site Reliability Engineering Manager to lead our cloud services team. As a Site Reliability Engineering Manager, you will be responsible for establishing SRE practices for our private cloud service to accelerate our ability to reliably and consistently deliver...


  • Austin, Texas, United States Oxford Knight Full time

    Database Site Reliability EngineerOxford Knight is seeking an experienced Database Site Reliability Engineer to join our Trading Systems Infrastructure team. As a key member of our team, you will be responsible for designing, building, and maintaining our diverse production database infrastructure, focusing on bare metal performance, scalability, and...


  • Austin, Texas, United States Info Way Solutions Full time

    Splunk Administration and SRE ExpertiseWe are seeking a highly skilled Splunk administrator with strong expertise in Site Reliability Engineering (SRE) and DevOps to join our team at Info Way Solutions.Key Responsibilities:Administer and optimize Splunk infrastructure for maximum performance and efficiencyDevelop and implement SRE practices to ensure high...


  • Austin, Texas, United States Futran Tech Solutions Pvt. Ltd. Full time

    Job Title: Site Reliability Engineer/Infrastructure SpecialistLocation: RemoteJob Type: Full-timeAbout the Role:We are seeking a highly skilled Site Reliability Engineer/Infrastructure Specialist to join our team at Futran Tech Solutions Pvt. Ltd. The ideal candidate will have experience supporting internet-facing production services and distributed systems,...


  • Austin, Texas, United States Oracle Full time

    Job Title: Site Reliability DeveloperOracle is seeking a highly skilled Site Reliability Developer to join our team. As a Site Reliability Developer, you will be responsible for designing, building, and deploying software to improve the availability, scalability, and efficiency of Oracle products and services.Key Responsibilities:Design and develop software...


  • Austin, Texas, United States Terminal Industries Full time

    About UsTerminal Industries is a software company that leverages machine learning to digitize, index, and automate the yard. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel.OverviewOur world-class vision engineering team has built an engine that can process...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a Site Reliability Engineer, you will play a vital role in designing, building, and maintaining our core infrastructure.This infrastructure enables thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple...