Site Reliability Engineer

5 days ago


San Jose, United States Cisco Full time

Application window is expected to close on July 8th, 2024 Who We Are Cisco Spaces is an industry leading indoor location as a service solution to gain insights into the behavior of end user devices and network- connected objects in any place with wireless connectivity, allowing customers to make informed business decisions, optimize operations, and improve experiences. Cisco Spaces brings together multiple location-based services capabilities in a unified platform and is architected as a zero touch SaaS Cloud product. Who You'll Work With You will closely work with TechOps team in Bangalore, Cisco Spaces Dev team and QA teams (both Bangalore and SJC). Who You Are As a Site Reliability Engineer (SRE), you are a crucial member of the operations and engineering teams, responsible for ensuring the reliability, availability, and performance of our services. You have a deep understanding of both software engineering and system administration, enabling you to bridge the gap between development and operations. Your expertise lies in designing, building, and maintaining scalable and fault-tolerant systems, as well as automating repetitive tasks to improve efficiency and reduce human error. What You’ll Do You will use an array of tools and integrations to deliver suite of foundational services pivotal to Cisco's essential business functions through GitOps. We are looking for an engineer who is proficient in DevOps and GitOps to helm a flexible and adaptable team of multi-skilled engineers responsible for maintaining Cisco Spaces FedRAMP environment. You will be responsible for operating, maintaining and improving all aspects of the FedRAMP Cloud and acting as point of contact for the Bangalore TechOps team. The FedRAMP standards adhere to NIST 800-53 control policies and procedures, with regular self- and external audits to verify compliance. Ready to be a part of on-call team on rotational basis for maximum coverage. Minimum Qualifications: Bachelor's degree (or above) in engineering/computer science with an overall work experience of 4 - 8 years. Extensive experience in container orchestration tools such as AWS ECS and Kubernetes Hands on experience with automation tools such as Terraform, Ansible, Git, Jenkins, GitLab and Monitoring tools like Datadog, CloudWatch, Grafana, Prometheus, PagerDuty. Deep understanding of public cloud environment - AWS Cloud as well as Cloud Security and Compliance Hands-on exposure to automation with proficiency in at least one programming/scripting language like Bash, Python. Preferred Qualifications: Good working knowledge of http/https and configuring load balancing specifically AWS ELB HAProxy, domain and SSL certificate management. Possess a deep understanding of Linux OS internals such as kernel modules, file systems, and network stack to optimize system performance and troubleshoot complex issues. Exposure to streaming solutions such as Kafka (AWS MSK) Experience in databases like GraphDB, Postgres, Cassandra and InfluxDB Why Cisco? #WeAreCisco. We are all unique, but collectively we bring our talents to work as a team, to develop innovative technology and power a more inclusive, digital future for everyone. How do we do it? Well, for starters - with people like you Nearly every internet connection around the world touches Cisco. We’re the Internet’s optimists. Our technology makes sure the data travelling at light speed across connections does so securely, yet it’s not what we make but what we make happen which marks us out. We’re helping those who work in the health service to connect with patients and each other; schools, colleges and universities to teach in even the most challenging of times. We’re helping businesses of all shapes and size to connect with their employees and customers in new ways, providing people with access to the digital skills they need and connecting the most remote parts of the world - whether through 5G, or otherwise. We tackle whatever challenges come our way. We have each other’s backs, we recognize our accomplishments, and we grow together. We celebrate and support one another - from big and small things in life to big career moments. And giving back is in our DNA (we get 10 days off each year to do just that). We know that powering an inclusive future starts with us. Because without diversity and a dedication to equality, there is no moving forward. Our 30 Inclusive Communities, that bring people together around commonalities or passions, are leading the way. Together we’re committed to learning, listening, caring for our communities, whilst supporting the most vulnerable with a collective effort to make this world a better place either with technology, or through our actions. So, you have colorful hair? Don’t care. Tattoos? Show off your ink. Like polka dots? That’s cool. Pop culture geek? Many of us are. Passion for technology and world changing? Be you, with us #WeAreCisco Application window is expected to close on July 8th, 2024 Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis. Cisco will consider for employment, on a case by case basis, qualified applicants with arrest and conviction records.



  • San Jose, United States IBM Full time

    ENGINEERING Site Reliability Engineer, IBM Corporation, San Jose, CA (Up to 40% telecommuting permitted): Work with development teams to enable a continuous integration environment that sustains high productivity levels and emphasizes defect prevention techniques. Manage delivery pipeline....


  • San Jose, United States TEKsystems Full time

    Description: Adobe is looking for an experienced Site Reliability Engineer to join the internal tooling team support, configure, integrate, upgrade, and automate the use of enterprise tools used across their large Engineering organization. Role will be focused on user interaction, troubleshooting tickets, and maintaining servers. Skills: Linux,...


  • San Jose, California, United States Zoom Video Communications Full time

    Sponsorship is not available for this position What you can expect. As a senior level Product Resilience SRE, you will define, scope, plan, and schedule Disaster Recovery Testing at Zoom. You will document any gaps identified by our testing, and Reliability Engineer, Liability, Engineer, Product, Reliability, Reliability, Technology


  • San Jose, United States IBM Full time

    ENGINEERING To be considered for an interview, please make sure your application is full in line with the job specs as found below. Site Reliability Engineer, IBM Corporation, San Jose, CA (Up to 40% telecommuting permitted): Work with development teams to enable a continuous integration environment that sustains high productivity levels and emphasizes...


  • San Jose, United States OKX Full time

    Who We Are OKX is revolutionising world systems through our cutting-edge digital asset exchange, Web3 portal and blockchain ecosystems.We are deeply committed to shaping a fairer, more transparent and accessible society through blockchain technology and to date, we have 50+ million users, 3000+ employees and 180+ countries believing in the same vision as us....


  • San Jose, United States International Business Machines Corporation - IBM Full time

    At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought Reliability Engineer, Liability, Development Engineer, Reliability, Reliability, Engineer, Technology


  • San Diego, United States ObjectWin Technology Full time

    Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIE's CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly reliable,...


  • San Diego, California, United States PEAK Technical Staffing USA Full time

    Hiring Senior Site Reliability Engineer; primary responsibilities will include contributing to the implementation and delivery of the end-to-end automation platform, to support continuous integration and continuous delivery (CI/CD), with a focus on developer self-service capabilities.NOTE:Must have build out experience with Kubernetes. This position requires...


  • San Jose, United States HCLTech Full time

    About HCLTech: HCLTech is a global technology company, home to 221,000+ people across 60 countries, delivering industry-leading capabilities centered around digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Engineering Services,...


  • San Diego, United States ObjectWin Technology Full time

    Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly...


  • San Diego, United States ObjectWin Technology Full time

    Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIE's CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly reliable,...


  • San Jose, United States Hireio, Inc. Full time

    Job DescriptionJob DescriptionJob DescriptionPosition Description:Location: Usa/Usa/California/Sf Bay Area, SeattleBase Salary: 187K - 280KSponsor Visa? YesLanguage Requirements: English, Mandarin (Preferred)Our Team:Site Reliability Engineering(SRE) team combines software and systems engineering to build and run large-scale, massively distributed, and...


  • San Jose, United States The Dignify Solutions LLC Full time

    AWS Infra SRE/DevOps engineer with proven work experience ensuring reliability, availability and performance of cloud infra and platform. - Specialist on Cisco Cloud run-on for infrastructure management, who can install, run, and maintain software like docker, and containers. - Responsible for upkeep, improvements, and configurations & migration of cloud...


  • San Jose, United States The Dignify Solutions LLC Full time

    AWS Infra SRE/DevOps engineer with proven work experience ensuring reliability, availability and performance of cloud infra and platform. - Specialist on Cisco Cloud run-on for infrastructure management, who can install, run, and maintain software like docker, and containers. - Responsible for upkeep, improvements, and configurations & migration of cloud...


  • San Jose, United States HCLTech Full time

    About HCLTech:HCLTech is a global technology company, home to 221,000+ people across 60 countries, delivering industry-leading capabilities centered around digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Engineering Services,...


  • San Jose, United States HCLTech Full time

    About HCLTech:HCLTech is a global technology company, home to 221,000+ people across 60 countries, delivering industry-leading capabilities centered around digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Engineering Services,...


  • San Ramon, California, United States LaSalle Network Full time

    LaSalle Network has partnered with a well-established software provider that's based in San Ramon, CA, who's in need of a well-rounded, Site Reliability Engineer (SRE) – Grafana Observability – with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the transition...


  • San Jose, United States Equifax Full time

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. SREs in our team take...


  • San Jose, United States IBM Computing Full time

    IBM Site Reliability Engineering Professional in San Jose , California Introduction At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to...


  • San Jose, United States Graphiant Full time

    Enterprise wide-area networking is primed for a new paradigm with the introduction of software defined networking architecture to deliver agility, performance, services and software innovations. Graphiant is changing the networking industry and you will be part of the charge to drive evolution. You will collaborate with industry leading engineers to build a...