Site Reliability Engineer
2 weeks ago
Oracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based services.
Key Responsibilities- Design, develop, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services.
- Collaborate with cross-functional teams to identify and resolve complex problems related to infrastructure, cloud services, and automation.
- Develop and maintain monitoring and alerting systems to ensure timely detection and resolution of issues.
- Partner with development teams to onboard new services into our cloud environments and ensure seamless integration with existing systems.
- Build and maintain tools to address hard operational problems, such as automation, provisioning, security, scaling, availability, and resiliency.
- Drive root cause analysis and implement corrective actions to prevent future issues.
- Develop and maintain technical documentation to ensure knowledge sharing and onboarding of new team members.
- BS or MS in Computer Science or related technical field.
- 5+ years of experience in running large-scale customer-facing web services.
- Proficient in writing services/task automation in Python, Bash, Ruby, Perl, JavaScript, or Java.
- Deep knowledge of Linux internals and host-based networking.
- Expert Linux/Unix performance and stability troubleshooting skills.
- Familiarity with configuration management solutions such as Chef, Puppet, etc.
- Experience with devising, managing, and extending monitoring solutions for large-scale environments.
- Experience in database management (Oracle DB, MySQL, Postgres).
- Experience in shared file systems (Gluster, ZFS, etc.).
- Systematic problem-solving approach, strong communication skills, and a sense of ownership and drive.
- Proficient in coding complex, distributed systems using Python, Ruby, Java, or C/C++.
- Deep knowledge of Networking (TCP, UDP, DNS, DHCP, IPSec).
- Deep focus on building secure Internet-facing systems and services in hostile environments.
- 3+ years of experience in production software development with Agile methodologies.
- 3+ years managing host, network, or storage virtualization technologies.
- Expert troubleshooting skills.
- Expert fleet automation and management solutions.
- A highly competitive salary range of $79,000 to $158,200 per annum.
- A comprehensive benefits package, including medical, dental, and vision insurance, short-term disability, long-term disability, life insurance, and AD&D.
- A 401(k) Savings and Investment Plan with company match.
- Paid time off, including flexible vacation, 11 paid holidays, and paid sick leave.
- Paid parental leave and adoption assistance.
- Employee Stock Purchase Plan and financial planning and group legal services.
Oracle is an Equal Employment Opportunity Employer and welcomes applications from diverse candidates. We are committed to creating an inclusive workplace that values diversity and promotes equal opportunities for all employees.
-
Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Unreal Gigs Full timeJob Title: Site Reliability EngineerAt Unreal Gigs, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the high availability, scalability, and performance of our complex distributed systems.Key Responsibilities:Design and implement monitoring, logging, and alerting...
-
Site Reliability Engineer
6 days ago
Austin, Texas, United States Oracle Full timeJob DescriptionOracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based services.Key ResponsibilitiesDesign, develop, and deploy automation tools to improve the efficiency and reliability of our cloud...
-
Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Cisco Full timeAbout the RoleCisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement automated solutions to improve the reliability and...
-
Site Reliability Engineer
5 days ago
Austin, Texas, United States Thales Full timeJob Title: Site Reliability EngineerThales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and security of our cloud-based services.Key Responsibilities:Collaborate with project managers and service delivery managers to analyze traffic...
-
Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Thales Full timeJob Title: Site Reliability EngineerThales is seeking an experienced Site Reliability Engineer to join our team in Austin, TX. As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining our CTE product line and solutions for deployment in various environments, including on-premises, multiple clouds, and big data and...
-
Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Thales Full timeJob Title: Site Reliability EngineerThales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining our cloud-based infrastructure and applications.Key Responsibilities:Collaborate with project managers and service delivery managers to analyze...
-
Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Apple Full timeAbout the RoleWe are seeking an innovative Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will design, build, and maintain our core infrastructure, enabling thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple customers.Key ResponsibilitiesCollaborate with...
-
Site Reliability Engineer
6 days ago
Austin, Texas, United States Cisco Full timeAbout the RoleCisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement automated solutions to improve infrastructure stability and scalabilityCollaborate with...
-
Site Reliability Engineer
6 days ago
Austin, Texas, United States Apple Full timeAbout the RoleWe are seeking an innovative Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will design, build, and maintain our core infrastructure, enabling thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple customers.Key ResponsibilitiesCollaborate with...
-
Site Reliability Engineer
4 weeks ago
Austin, Texas, United States JobRialto Full timeAbout the RoleWe are seeking a highly motivated and experienced Systems and Platform Operations Expert to join our Site Reliability Engineering & Production Services team. As a member of this team, you will work closely with other technology professionals to support Asset Management Technology - Cloud Platform solutions.Key ResponsibilitiesProvide level 2...
-
Site Reliability Engineering Manager
2 weeks ago
Austin, Texas, United States Apple Full timeSite Reliability Engineering ManagerAt Apple, we're committed to delivering exceptional customer experiences through innovative products and services. As a Site Reliability Engineering Manager, you'll play a critical role in ensuring the reliability and scalability of our cloud services.Key ResponsibilitiesLead a team of SRE engineers in establishing and...
-
Principal Site Reliability Engineer
3 weeks ago
Austin, Texas, United States Terminal Industries Full timeAbout UsTerminal Industries is a pioneering company that leverages cutting-edge machine learning to digitize, index, and automate the yard. Our platform empowers warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel. These fundamental operating assets of commerce represent the last...
-
Site Reliability Engineer
1 week ago
Austin, Texas, United States Info Way Solutions Full timeSplunk Administration and SRE ExpertiseWe are seeking a highly skilled Splunk administrator with strong expertise in Site Reliability Engineering (SRE) and DevOps to join our team at Info Way Solutions.Key Responsibilities:Administer and optimize Splunk infrastructure for maximum performance and efficiencyDevelop and implement SRE practices to ensure high...
-
Principal Site Reliability Engineer
4 weeks ago
Austin, Texas, United States Terminal Industries Full timeAbout UsTerminal Industries is a software company that leverages machine learning to digitize, index, and automate the yard. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel.OverviewOur world-class vision engineering team has built an engine that can process...
-
Senior Site Reliability Engineer
2 weeks ago
Austin, Texas, United States Publishing Full timeJob DescriptionAt Publishing, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud infrastructure to support our growing business.ResponsibilitiesDesign and implement scalable cloud...
-
Site Reliability Engineer
4 weeks ago
Austin, Texas, United States Apple Full timeRole SummaryApple is seeking a talented Site Reliability Engineer to ensure the reliability, scalability, and performance of our systems and services. As an SRE, you will work closely with our engineering and operations teams to design, build, and maintain robust infrastructure and automation solutions.Key ResponsibilitiesDesign and implement scalable...
-
Senior Site Reliability Engineer
4 weeks ago
Austin, Texas, United States Weedmaps Full timeAbout the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Weedmaps. As a key member of our infrastructure team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based services.Key ResponsibilitiesLeverage your engineering expertise to build, monitor, and improve our...
-
Staff Site Reliability Engineer
6 days ago
Austin, Texas, United States H-E-B Full timeJob Title: Staff Site Reliability EngineerH-E-B Digital is seeking a highly skilled Staff Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and implementing fault-tolerant architectures, influencing code architecture, and establishing reliability standards across...
-
Senior Site Reliability Engineer
3 weeks ago
Austin, Texas, United States Terminal Industries Full timeAbout UsTerminal Industries is revolutionizing the logistics industry by digitizing, indexing, and automating yard operations. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel. These are the fundamental operating assets of commerce, and represent the last...
-
Senior Site Reliability Engineer
4 weeks ago
Austin, Texas, United States Terminal Industries Full timeAbout UsTerminal Industries is a pioneering company that leverages cutting-edge machine learning to digitize, index, and automate the yard. Our platform empowers warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel. These fundamental operating assets of commerce represent the last...