Site Reliability Engineer

3 weeks ago


Palo Alto, United States TEKsystems Full time

Description: Role: Site Reliability Engineer (SRE for Cloud) Location: Remote Project - MUST live in Pacific coast time zone Duration: 1 year with possible extension Number of positions: 1 We urgently looking for 1 Site Reliability Engineer (SRE for Cloud), mid level, who are available asap with the following skills: Role: Site Reliability Engineer (SRE): Global Payments team and Data Platforms team Duration: 1 year with possible extension Number of positions: 1 Also, needs to live in Seattle or CA or NYC. Responsibility: -Engage in and improve the whole lifecycle of services from inception and design, throughout development, capacity planning, and launch reviews, to deployment, operation, and refinement. -Design and implement software platforms and monitor frameworks for efficient, automated, and intelligent service-oriented architecture (SOA) governance. -Scale systems sustainably through mechanisms such as automation; evolve systems reliability, efficiency, and velocity by pushing for changes. -Practice sustainable user support, incident response, and blameless postmortems. -Participate in On-Call rotations. Requirement: -Bachelor's degree or above, majoring in Computer Science, or related fields, with at least 2 years of related work experience. -Experience in SRE of large-scale systems deployment with high reliability and scalability. -Familiar with system operation skills in Linux and network. -Experience programming in at least one of the following languages: Python, Perl, Go, or C/C++; -Experience in designing, analyzing and troubleshooting large-scale distributed systems. -Familiar with popular CI/CD procedures and environments. -Effective communication skills and a sense of ownership and drive. Specific Skill: -Big data tools and concepts (Hadoop, Spark, Hive) -Video storage and playback architecture -SQL, NoSQL, Data modeling, Data lake (such as S3) -Responsibilities: • Data monitoring and alerting, data quality assurance and anomaly detection. • Document team processes and policies, including methods of engagement and SLOs. • Analyze, design, and implement solutions at the system level to remove bottlenecks and improve edge service performance. • Implement monitoring and alerting to improve issue detection and response. • Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues. • Participate in on-call rotations, responsible for resolving or escalating incoming events • Maintain and operate a Linux and Kubernetes environment. Qualifications: • 3+ years experience working with Unix Linux systems from kernel to shell and beyond with • experience working with system libraries, fi le systems, and client-server protocols. • Experience reading python scripts for platform operations. • Experience in networking technologies such TCP/IP, BGP, DNS, etc. in a carrier-grade environment. • Experience in developing and operating one or more of following systems: OpenStack, Kubernetes, Nginx, ipvs, ELK stack, Hadoop, etc. • Bachelor's degree or above, majoring in Computer Science or related fi elds, with at least 2 years of related work experience. Skills: SRE, monitoring, deploy, big data, OCI, key value storage, SQL, linux, ETL, Data Lake, data processing, data modeling Top Skills Details: SRE,monitoring,deploy,big data,OCI,key value storage,SQL,linux Additional Skills & Qualifications: SRE tasks include deployments, upgrades, on-call, monitoring, automating processes and dealing with systems Experience Level: Expert Level About TEKsystems: We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company. The company is an equal opportunity employer and will consider all applications without regards to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.



  • Palo Alto, California, United States TEKsystems Full time

    :Role: Site Reliability Engineer (SRE for Cloud)Location: Remote Project - MUST live in Pacific coast time zoneDuration: 1 year with possible extensionNumber of positions: 1We urgently looking for 1 Site Reliability Engineer (SRE for Cloud), mid level, who are available asap with the following skills:Role: Site Reliability Engineer (SRE): Global Payments...


  • Palo Alto, California, United States SHEIN Technology LLC Full time

    About the jobJob Title: Senior Site Reliability Engineer IReports to: Senior Manager of Site Reliability EngineeringJob Location: Palo Alto, CA, USAJob Status: Exempt, FT About SHEIN SHEIN is a global online fashion and lifestyle retailer, offering SHEIN branded apparel and products from a global network of vendors, all at affordable prices. Headquartered in...


  • Palo Alto, California, United States SHEIN Technology LLC Full time

    About the jobJob Title: Senior Site Reliability Engineer IReports to: Senior Manager of Site Reliability EngineeringJob Location: Palo Alto, CA, USAJob Status: Exempt, FT About SHEIN SHEIN is a global online fashion and lifestyle retailer, offering SHEIN branded apparel and products from a global network of vendors, all at affordable prices. Headquartered in...


  • Palo Alto, United States Aptos Full time

    Aptos is a people-first blockchain on a mission to help billions of people achieve universal and fair access to decentralized assets in a safe and scalable way. Founded by some of the original creators and maintainers that researched, designed, and built the Diem blockchain to serve this purpose, we have dedicated several years toward this mission. We...


  • Palo Alto, United States Aptos Full time

    Aptos is a people-first blockchain on a mission to help billions of people achieve universal and fair access to decentralized assets in a safe and scalable way. Founded by some of the original creators and maintainers that researched, designed, and built the Diem blockchain to serve this purpose, we have dedicated several years toward this mission. We...


  • Palo Alto, United States Aptos Full time

    Aptos is a people-first blockchain on a mission to help billions of people achieve universal and fair access to decentralized assets in a safe and scalable way. Founded by some of the original creators and maintainers that researched, designed, and built the Diem blockchain to serve this purpose, we have dedicated several years toward this mission. We...


  • Palo Alto, United States Aptos Full time

    Aptos is a people-first blockchain on a mission to help billions of people achieve universal and fair access to decentralized assets in a safe and scalable way. Founded by some of the original creators and maintainers that researched, designed, and built the Diem blockchain to serve this purpose, we have dedicated several years toward this mission. We...


  • Palo Alto, United States TEKsystems Full time

    Description: Role: Site Reliability Engineer (SRE for Cloud) Location: Remote Project - MUST live in Pacific coast time zone Duration: 1 year with possible extension Number of positions: 1 We urgently looking for 1 Site Reliability Engineer (SRE for Cloud), mid level, who are available asap with the following skills: Role: Site Reliability...


  • Palo Alto, United States Mediaocean Full time

    Mediaocean is powering the future of the advertising ecosystem with technology that empowers brands and agencies to deliver impactful omnichannel marketing experiences. With over $200 billion in annualized ad spend running through its software products, Mediaocean deploys AI and automation to optimize investments and outcomes. The company's advertising...


  • Palo Alto, United States ASSURED Full time

    Job Description Job Description Assured is on a mission to modernize insurance. Claims processing (i.e. should we pay this claim?), while often overlooked, is the foundation of the entire industry. It’s currently highly manual, involving phone calls, faxes, and gut instinct—costing tens of billions of dollars a year. We can do better. At Assured, we...


  • Palo Alto, United States Assured Full time

    Job DescriptionJob DescriptionAssured is on a mission to modernize insurance. Claims processing (i.e. should we pay this claim?), while often overlooked, is the foundation of the entire industry. It’s currently highly manual, involving phone calls, faxes, and gut instinct—costing tens of billions of dollars a year. We can do better.At Assured, we provide...


  • Palo Alto, United States Assured Full time

    Job DescriptionJob DescriptionAssured is on a mission to modernize insurance. Claims processing (i.e. should we pay this claim?), while often overlooked, is the foundation of the entire industry. It’s currently highly manual, involving phone calls, faxes, and gut instinct—costing tens of billions of dollars a year. We can do better.At Assured, we provide...


  • Palo Alto, United States Assured Full time

    Job DescriptionJob DescriptionAssured is on a mission to modernize insurance. Claims processing (i.e. should we pay this claim?), while often overlooked, is the foundation of the entire industry. It’s currently highly manual, involving phone calls, faxes, and gut instinct—costing tens of billions of dollars a year. We can do better.At Assured, we provide...


  • Palo Alto, United States MongoDB Full time

    The worldwide data management software market is massive (According to IDC, the worldwide database software market, which it refers to as the database management systems software market, was forecasted to be approximately $82 billion in 2023 growing to approximately $137 billion in 2027. This represents a 14% compound annual growth rate). At MongoDB we are...


  • Palo Alto, United States Quartzenterprises Full time

    We are looking for a Systems Engineer to join our growing Corporate IT Team. This is an exciting role and includes a wide set of responsibilities, with the day to day focus on the ongoing improvement and maintenance of systems and services within the Organization. This role will report into the Corp-IT System Engineers Manager. The candidate will have...


  • Palo Alto, United States Quartzenterprises Full time

    We are looking for a Systems Engineer to join our growing Corporate IT Team. This is an exciting role and includes a wide set of responsibilities, with the day to day focus on the ongoing improvement and maintenance of systems and services within the Organization. This role will report into the Corp-IT System Engineers Manager. The candidate will have...


  • Palo Alto, United States Audubon Companies Full time

    External Description Senior Mechanical Reliability Engineer Direct Hire Laplace, LA Immediate Need PTO, Benefits, and 401k Long term position This position is not open to international candidates, and does not offer relocation or per diem Audubon is currently seeking a Senior Mechanical Reliability Engineer to be part of a project team working onsite at a...


  • Palo Alto, United States Wing Inflatables Inc Full time

    About Wing: Wing offers drone delivery as a safe, fast, and sustainable solution for last mile logistics. Consumer appetites for on-demand services are increasing, but current delivery methods are inefficient, costly, and contribute to road accidents and air pollution. Wing’s fleet of highly automated delivery drones can transport small packages directly...


  • Palo Alto, United States Pivotal Full time

    Pivotal is the leader in the emerging market of electric Vertical Takeoff and Landing (eVTOL) aircraft. We design, develop, and manufacture light eVTOL aircraft and are renowned for the BlackFly, the first light eVTOL to fly manned missions and enter the consumer market. Efficient, compact, and simple, Pivotal vehicles are designed for a wide range of...


  • Palo Alto, United States MongoDB Full time

    The worldwide data management software market is massive (According to IDC, the worldwide database software market, which it refers to as the database management systems software market, was forecasted to be approximately $82 billion in 2023 growing to approximately $137 billion in 2027. This represents a 14% compound annual growth rate). At MongoDB we are...