Site Reliability Engineer
1 day ago
Who You'll Work With
Arista Networks is looking for a skilled professional for our Engineering Productivity team to help maintain and support our rapidly expanding infrastructure and internal user base. The ideal candidate is someone who can wear many hats, can be versatile and is enthusiastic about learning new technologies.
As a part of the software engineering team, you will work with other team members to design, build and administer secure, scalable and fault-tolerant tools and infrastructure in a hybrid cloud environment.
What You'll Do
Building, integrating and maintaining tools and infrastructure facilitating internal development and testing. Improve maintainability of build system Evaluate new tools Improve speed of information back to the development team within the build systems and processes Troubleshoot and resolve systems and network issues. Adherence to infrastructure-as-code principles. Proactively ensure the highest levels of systems and infrastructure availability. Participate in the design and implementation of new systems and infrastructure projects.Qualifications
Essential Skills
Minimum 4+ years commercial experience in this space as a DevOps / SRE Engineer Solid experience with Jenkins and GitHub, ideally with a background/understanding of the Atlassian stack of products (Confluence/Jira/Bamboo/Bitbucket) UNIX / Linux systems administration (preferably RedHat/CentOS). Scripting with Python or Bash or experience at least one high level language such as Go, C++, etc.. Experience with containerization and container orchestration ( Docker, Kubernetes). Experience with (CI/CD) orchestration and software configuration management tools ( Ansible, Puppet, Salt, Chef). Ability to work in a fast paced and agile development environment. Excellent communication and documentation skills. Working knowledge/experience with Makefile/makeDesired Skills
BS/MS degree in Computer Science or a relevant experience subject. Experience with monitoring systems ( Zabbix, Nagios, Prometheus, DataDog). Experience with relational databases ( MySQL, PostgreSQL) Experience with virtualization technologies ( VMware, XenServer, RHEV, QEMU/KVM). Experience with any of the following: Elasticsearch, InfluxDB, Grafana, Artifactory. Exposure to FPGA build projects Exposure or experience with Vivado (Xilinx)#LI-SZ1
-
Site Reliability Engineer
6 days ago
Remote, Oregon, United States hireVouch Full timeSenior Site Reliability EngineerPosition OverviewWe are a mid-size entertainment company delivering captivating digital experiences to millions of customers worldwide. Our IT organization powers the infrastructure and systems behind our cutting-edge payroll and accounting applications. We are seeking a Senior Site Reliability Engineer (SRE) to enhance the...
-
Senior Site Reliability Engineer
1 week ago
Remote, Oregon, United States Grafana Labs Full timeSenior Site Reliability Engineer - DatabasesThis is a remote position and we're considering candidates in the USA & Canada.About the role:We are looking for a Senior SRE to help us support our highest value Grafana Cloud customers by increasing the reliability of our Cloud databases that are based on Mimir, Loki, Tempo, and Pyroscope. We provide these...
-
Site Reliability Engineer
2 days ago
Remote, Oregon, United States Epam Full timeDescription DESCRIPTION Join our dynamic team as a Site Reliability Engineer and lead the way in optimizing and automating our Linux-based infrastructure. With 3 to 5 years of experience in Site Reliability Engineering, DevOps, or Infrastructure, you will play a crucial role in elevating our capabilities and ensuring high-impact,...
-
Site Reliability Expert
2 days ago
Remote, Oregon, United States Credit Acceptance Full timeCredit Acceptance is a leading provider of used car finance solutions, recognized for its world-class culture and innovative approach to the industry. As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our software systems.Key ResponsibilitiesCollaborate with cross-functional...
-
Remote, Oregon, United States Credit Acceptance Full timeCredit Acceptance is seeking a Senior Site Reliability Engineer to join our innovative team. As a Senior Site Reliability Engineer, you will play a crucial role in ensuring our software systems' reliability, availability, and performance.We're looking for a candidate with a strong background in software development, system architecture, and a passion for...
-
Site Reliability Engineer Intern
1 week ago
Remote, Oregon, United States Actian Full timeCompany At Actian we believe data should be a competitive advantage. Through the deployment of data technology, underpinned by a relentless and trusted service commitment, we help business critical systems transact and integrate at their very best. As a trusted leader in data management, integration, and analytics, our mission is to helping businesses...
-
Senior Site Reliability Engineer
2 weeks ago
Remote, Oregon, United States Rackspace Full timeAbout the Role We are seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a strong background in managing large-scale, data-intensive production-grade systems and infrastructure, with deep experience in cloud observability, automation, and reliability engineering at scale. A...
-
Cloud Site Reliability Engineer
1 week ago
Remote, Oregon, United States Epam Full timeDescription DESCRIPTION Are you a skilled Cloud Site Reliability Engineer with experience in AWS or GCP? Do you have a passion for maintaining CI/CD frameworks, integrating observatory stacks, and supporting Cloud applications? If so, we have an exciting opportunity for you We're currently seeking a Cloud Site Reliability Engineer to join...
-
Senior Site Reliability Engineer
1 week ago
Remote, Oregon, United States Credit Acceptance Full timeCredit Acceptance is proud to be an award-winning company with local and national workplace recognition in multiple categories Our world-class culture is shaped by dedicated Team Members who share a drive to succeed as professionals and together as a company. A great product, amazing people and our stable financial history have made us one of the largest...
-
Site Reliability Engineer
5 days ago
Remote, Oregon, United States Epam Full timeDescription We are seeking a Site Reliability Engineer (Azure) to join our team. #Not found Responsibilities As a Lead Azure SRE, you will be responsible for driving the reliability, performance, and scalability of cloud-based applications and services. Your expertise in Kubernetes, scripting, troubleshooting, and observability will be...
-
Reliability Engineer
30 minutes ago
Remote, Oregon, United States beBee Careers Full timeJob Summary:We are seeking a highly skilled Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the reliability, scalability, and security of our platform.The ideal candidate will have a strong background in infrastructure management and Site Reliability Engineering practices. You will play...
-
Senior Site Reliability Engineer
2 weeks ago
Remote, Oregon, United States Epam Full timeDescription DESCRIPTION Are you a seasoned professional with a passion for site reliability engineering and a knack for leading strategic initiatives? Join our dynamic team at EPAM, a leading global provider of digital platform engineering and software development services. We are seeking a Senior Site Reliability Engineer who can make a...
-
Senior Site Reliability Engineer
3 days ago
Remote, Oregon, United States Webflow Full timeAt Webflow, our mission is to bring development superpowers to everyone. Webflow is the leading visual development platform for building powerful websites without writing code. By combining modern web development technologies into one platform, Webflow enables people to build websites visually, saving engineering time, while clean code seamlessly generates...
-
Azure DevOps Site Reliability Engineer
2 days ago
Remote, Oregon, United States Epam Full timeDescription DESCRIPTION Are you a skilled Azure DevOps Site Reliability Engineer with a passion for ensuring business continuity and helping businesses always be near their clients? Do you have experience in optimizing and supporting OSDU deployment, performing monitoring including incidents resolution, and suggesting improvements? If so, we...
-
Staff Site Reliability Engineer
1 week ago
Remote, Oregon, United States Crisis Text Line Full timeCrisis Text Line provides free, 24/7, high-quality text-based mental health support and crisis intervention by empowering a community of trained volunteers to support people in their moments of need.Our mission is at the intersection of empathy and innovation — we promote mental well-being for people wherever they are.Our vision is an empathetic world...
-
Reliability Engineer
6 hours ago
Remote, Oregon, United States beBee Careers Full timeJob OverviewThe role of Site Reliability Engineer combines software and systems engineering to develop high-performance, massively distributed, robust systems.We are looking for individuals who can optimize system capacity and performance at all times.Main ResponsibilitiesEngage in the whole lifecycle of services—from inception and design, deployment,...
-
Director, Site Reliability Engineering
2 days ago
Remote, Oregon, United States ZINC Zillow, Inc. Full timeAbout the team Our goal in the Technology Engineering & Operations organization is to enable our customers to seamlessly leverage our tools, platforms, and solutions while delivering robust infrastructure and site availability. Our mission is twofold - to empower and support our internal customers (development teams) in building and delivering...
-
Senior Site Reliability Engineer
1 week ago
Remote, Oregon, United States Epam Full timeDescription DESCRIPTION Step into the future of cloud technology as a Senior Site Reliability Engineer specializing in Azure Data DevOps at our innovative IT company. This pivotal role offers the opportunity to design and manage cutting-edge Azure cloud infrastructure, ensuring the seamless performance and reliability of data-intensive...
-
Remote, Oregon, United States Credit Acceptance Full timeAt Credit Acceptance, we're proud to be an award-winning company with a reputation for excellence in multiple categories. Our world-class culture is built on the dedication of our team members who share a drive to succeed as professionals and together as a company. A great product, amazing people, and our stable financial history have made us one of the...
-
Cloud Reliability Engineer
7 hours ago
Remote, Oregon, United States beBee Careers Full timeJob DescriptionWe are seeking a highly skilled Chief AWS Site Reliability Engineer to join our global engineering team. The successful candidate will be responsible for ensuring the reliability and availability of our fleet services under the SRE model.This is an exciting opportunity for a passionate individual who wants to contribute to innovative projects...