Senior Site Reliability Engineer
2 days ago
Terminal Industries is a pioneering company that leverages cutting-edge machine learning to digitize, index, and automate the yard. Our platform empowers warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel. These fundamental operating assets of commerce represent the last great frontier of untapped data.
Overview
Our world-class vision engineering team has developed an engine that can process the movement of trucks and containers in real-time. We are now poised to unlock the potential of that engine by building SaaS applications that leverage the vision engine to transform the logistics industry. As part of Terminal's Site Reliability Engineering team, you will play a pivotal role in architecting and developing cutting-edge solutions.
Responsibilities
- Oversee the deployment, management, and maintenance of IoT devices, including camera systems and sensors, to ensure seamless integration and configuration within the network.
- Manage firmware updates and patches for IoT devices, ensuring that all devices are up to date and secure, and develop strategies for efficient deployment of updates.
- Implement mechanisms for collecting and processing data from IoT devices, ensuring data integrity, availability, and confidentiality.
- Troubleshoot and resolve connectivity issues related to IoT devices, and manage integration between IoT devices and cloud infrastructure to ensure seamless data flow and system interoperability.
- Design and implement solutions to scale IoT deployments effectively, monitoring device performance and system health to ensure high reliability and availability.
- Design, build, and operate infrastructure using Infrastructure as Code (IaC) tools like Terraform and Ansible, developing and maintaining infrastructure automation to ensure scalability and reliability.
- Define and implement best practices for continuous deployment of software and services using CI/CD tools such as GitHub Actions, automating deployment processes to streamline operations.
- Lead incident response efforts, including diagnosis, resolution, and post-mortem analysis, and implement robust monitoring and alerting systems to ensure quick detection and resolution of issues.
- Ensure that systems adhere to security best practices and regulatory compliance requirements, implementing security measures and conducting regular audits to safeguard production environments.
- Minimum of 12 years of experience in Site Reliability Engineering or a related role, with a proven track record of managing complex production environments.
- Strong background in operating systems, networking, distributed systems, and database management, with expertise in AWS cloud services and infrastructure management.
- Hands-on experience with deploying, managing, and maintaining IoT devices and sensor systems, knowledge of IoT protocols (e.g., MQTT, CoAP), and device integration practices.
- Experience in managing firmware updates and ensuring the security and functionality of IoT devices.
- Proficiency in managing and troubleshooting connectivity issues in IoT environments, including wireless and wired communication protocols.
- Experience with data collection and processing from IoT devices, including ensuring data quality and managing large volumes of data.
- Demonstrated experience in incident response, production monitoring, and capacity planning, with the ability to handle high-pressure situations and ensure system reliability.
-
Senior Site Reliability Engineer
3 days ago
Austin, Texas, United States Expedia Group Full time
Senior Software Development Engineer - Site Reliability
We are seeking a highly skilled and experienced Senior Software Development Engineer (SRE) to join our team. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our services and systems. You will work closely with development and operations teams to...
-
Site Reliability Engineer
1 week ago
Austin, Texas, United States Apex Systems Full time
Job Description
Position: Site Reliability Engineer
Location: Remote
Duration: 1 year
Rate: $67/hr W-2
We are seeking a highly skilled Site Reliability Engineer to join our team at Apex Systems. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.
Key...
-
Senior Site Reliability Engineer
3 weeks ago
Austin, Texas, United States Visa Full time
Company Overview
Visa stands as a global frontrunner in digital payment solutions, orchestrating over 215 billion transactions annually across a vast network of consumers, merchants, financial institutions, and governmental bodies in more than 200 nations. Our vision is to unite the globe through the most advanced, convenient, reliable, and secure payment...
-
Senior Site Reliability Engineer
3 days ago
Austin, Texas, United States Weedmaps Full time
About the Role
We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Weedmaps. As a key member of our engineering team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.
Key Responsibilities
- Collaborate with Cross-Functional Teams: Work closely with our...
-
Site Reliability Engineer
2 months ago
Austin, Texas, United States Cape Henry Associates, Acquired by JANUS Research Group Full time
Janus is looking for a seasoned Site Reliability Engineer / DevSecOps Developer to help grow our capability with our DoD clients.
- Develop Infrastructure as Code (IaC): designing, implementing, and maintaining infrastructure using IaC technologies (e.g., Terraform or similar), ensuring scalable, reliable, and efficient platforms
- Collaborate with data and other...
-
Lead Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Expedia Group Full time
Principal Site Reliability Engineer
We are looking for a highly qualified and seasoned Principal Site Reliability Engineer (SRE) to enhance our operations. The successful candidate will play a crucial role in guaranteeing the stability, scalability, and efficiency of our systems and services. You will collaborate closely with both development and operational...
-
Site Reliability Engineer
6 days ago
Austin, Texas, United States Apple Full time
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and services.
Key Responsibilities
- Design, build, and maintain robust infrastructure and automation solutions
- Work closely with...
-
Lead Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Expedia Group Full time
Principal Software Development Engineer - Site Reliability
We are looking for a highly proficient and seasoned Principal Software Development Engineer (SRE) to enhance our team. The successful candidate will be accountable for maintaining the reliability, scalability, and performance of our systems and services. You will collaborate closely with both...
-
Site Reliability Engineer
19 hours ago
Austin, Texas, United States JobRialto Full time
About the Role
We are seeking a highly motivated and experienced Systems and Platform Operations Expert to join our Site Reliability Engineering & Production Services team. As a member of this team, you will work closely with other technology professionals to support Asset Management Technology - Cloud Platform solutions.
Key Responsibilities
- Provide level 2...
-
Cloud Engineer
3 days ago
Austin, Texas, United States Weedmaps Full time
About the Role
We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Weedmaps. As a key member of our engineering team, you will play a critical role in ensuring the performance, reliability, and scalability of our cloud-based services.
Key Responsibilities
- Collaborate with Cross-Functional Teams: Work closely with our...
-
Lead Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Expedia Group Full time
Principal Software Development Engineer - Site Reliability
We are in search of a highly qualified and seasoned Principal Software Development Engineer (SRE) to enhance our operations. The ideal candidate will be tasked with ensuring the dependability, scalability, and efficiency of our services and systems. You will collaborate closely with both development...
-
Senior Power System Engineer
1 day ago
Austin, Texas, United States Electric Reliability Council of Texas Full time
Job Summary
We are seeking a highly skilled Senior Power System Engineer to join our team at the Electric Reliability Council of Texas (ERCOT). As a key member of our operations team, you will be responsible for ensuring the reliable operation of the electric power grid in compliance with NERC Standards, ERCOT Protocols, and Market Guides.
Key...
-
Lead Engineer for Site Reliability
3 weeks ago
Austin, Texas, United States Infosys Full time
Position Overview:
Infosys is in search of a Lead Engineer for Site Reliability. This role's primary focus will be to oversee a team of Site Reliability Engineers (SREs) to proactively guarantee the stability, resilience, and scalability of our services through automation, testing, and engineering practices.
Key Responsibilities:
The successful candidate will...
-
Site Reliability Engineer
3 days ago
Austin, Texas, United States Apple Full time
Role Summary
Apple is seeking a talented Site Reliability Engineer to ensure the reliability, scalability, and performance of our systems and services. As an SRE, you will work closely with our engineering and operations teams to design, build, and maintain robust infrastructure and automation solutions.
Key Responsibilities
- Design and implement scalable...
-
Senior Site Reliability Engineer
2 weeks ago
Austin, Texas, United States ProCore CPA Full time
About the Role
We are seeking a highly skilled Staff Site Reliability Engineer to join our Cloud Infrastructure team at Procore CPA. As a key member of our team, you will be responsible for leading the development and implementation of cloud-based solutions to ensure the reliability and scalability of our services.
Key Responsibilities
- Lead Cloud Infrastructure...
-
Senior Site Reliability Engineer
6 days ago
Austin, Texas, United States Visa Full time
About the Role
We are seeking a highly skilled Senior Site Reliability Engineer - Cloud Infrastructure Expert to join our team at Visa. As a key member of our cloud infrastructure team, you will be responsible for ensuring the security, availability, and performance of our cloud-based systems.
Key Responsibilities
- Design, implement, and maintain scalable and...
-
Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Thales Full time
About the Role
Thales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and security of our cloud-based services.
Key Responsibilities
- Collaborate with project managers and service delivery managers to analyze traffic trends and capacity...
-
Senior System Planning Engineer
3 weeks ago
Austin, Texas, United States Electric Reliability Council of Texas Full time
Job Overview
At the Electric Reliability Council of Texas (ERCOT), we foster a vibrant and inclusive work culture that empowers our employees to collaborate and innovate for the future of the Texas power grid and wholesale market. Our commitment to diversity and inclusion is fundamental to our corporate values, which include accountability, leadership,...
-
Senior Reliability Engineer
3 weeks ago
Austin, Texas, United States Amazon Full time
As a Senior Reliability Engineer, you will play a pivotal role in ensuring the operational excellence of Amazon's data centers globally. Your expertise will be essential in conducting thorough evaluations and providing insightful feedback on the design aspects across various engineering disciplines. In addition to your design responsibilities, you will...
-
Senior System Planning Engineer
3 weeks ago
Austin, Texas, United States Electric Reliability Council of Texas Full time
Job Overview
At the Electric Reliability Council of Texas (ERCOT), we pride ourselves on our inclusive and diverse work culture that empowers employees to collaborate and innovate for the future of the Texas power grid and wholesale market. We invite you to become part of our skilled and dedicated team, focused on developing exceptional solutions to address...