Site Reliability Engineer
2 weeks ago
Description
We are hiring a
Site Reliability Engineer
to support, configure, and build our
SaaS offerings.
You will be troubleshooting and administering multiple environments including performance and quality of disaster recovery. You will support all aspects of the technical infrastructure by troubleshooting system configuration, installation, and other technical issues. This involves review of technical designs/information, automating processes through scripting, installation and configuration of software, and validation of technical environments. This position will also be responsible for the documentation of new and existing environments and validation of all key technical components.
This position is expected to interact with Servers, Databases, SaaS products, Security, and various groups within Information Technology. Candidate must have strong server, network, communication, and analytical skills.
This is an ideal role for a self-motivated professional with passion for technology and creative problem solving.
Responsibilities:
Deploy, operate, and support cloud infrastructure primarily utilizing GCP but also AWS
Responsible for the ongoing maintenance, security, and availability of several applications based on business requirements and adhering to tight operations, security, and procedural models
Ensure production level systems are running at all times and have multiple levels of redundancy to meet committed SLAs.
Applies professional-level technical skill and judgement to provide non-routine technical support for production operations to drive optimal performance, reliability, redundancy, and scale.
Develop and maintain a working knowledge of Zenoss products and services.
Document environment topology and installation details
Automation of tasks using scripting and configuration management systems
Communicates highly technical information to both technical and non-technical personnel
Work with customers to troubleshoot and resolve technical issues.
Troubleshoot network performance issues, perform intrusion monitoring, and maintain a disaster recovery procedures.
Plan for, and recommend, expansion of capacity and upgrades, patches, and new applications and equipment when necessary.
Participation in the development of information technology and infrastructure projects
Document and thoroughly understand the application architecture and system configuration across platforms
Determine the root cause of an outage, duration, and recommendations or steps to resolve issues
Provide 24x7 support for all network and server systems that are pivotal to production.
Required Experience / Skills:
Bachelor's degree in Computer Science/Engineering or equivalent relevant experience
3-6 years of professional hands-on experience with Cloud production environments
Strong scripting skills and demonstrated ability to automate tasks. (SaltStack and Python preferred)
Strong understanding of networking, firewalls, load balancers, and databases
Experience using and supporting Google Cloud Platform and Amazon Web Services.
Experience with database (MySQL) and web server technology (Apache, Tomcat, IIS, etc.) a plus
Strong verbal and written communication skills
Project and task oriented with a focus on details
Ability to proactively communicate detailed status to customer and project team
Strong organization skills
Ability to work both within a team and independently
Ability to make sound decisions based on customer needs and technical knowledge
Self-motivated and able to work under pressure to deliver high-quality solutions
Detail oriented with excellent analytical skills.
Ability to work after hours including weekends and night when required with occasional travel
Be eligible to work in the United States
More about Zenoss:
Individually Unique. Better Together.
When we come together, we accomplish amazing things. Zenoss is an established company with a start-up, entrepreneurial environment. We have a collaborative culture that is focused around making our customers successful. One thing we're not is a new-kid-on-the-street startup. Founded in 2005, we're far removed from a few folks in a garage with one great idea. We are a midsize company filled with people who have proven work experience, are smart, nimble and capable. We have credibility: Zenoss helps world-renowned enterprise customers run their IT infrastructure. Some of the most critical aspects of business rely on Zenoss. It's exciting to be part of growing and servicing these type of customers.
#J-18808-Ljbffr
-
Site Reliability Engineer
4 days ago
Austin, United States Virtu Financial Full timeVirtu is a leading financial firm that leverages cutting edge technology to deliver liquidity to the global markets and innovative, transparent trading solutions to our clients. As a market maker, Virtu provides deep liquidity that helps to create more efficient markets around the world. Our market structure expertise, broad diversification, and execution...
-
Site Reliability Engineer
3 weeks ago
Austin, United States Virtu Financial Full timeVirtu is a leading financial firm that leverages cutting edge technology to deliver liquidity to the global markets and innovative, transparent trading solutions to our clients. As a market maker, Virtu provides deep liquidity that helps to create more efficient markets around the world. Our market structure expertise, broad diversification, and execution...
-
site reliability engineer
5 days ago
Austin, United States Thales Full timeLocation: Austin, United States of America Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become...
-
site reliability engineer
6 days ago
Austin, United States Thales Full timeLocation: Austin, United States of America Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become...
-
Director - Site Reliability Engineering
4 days ago
Austin, United States Iodine Software Full timeDirector - Site Reliability Engineering Join us. Let's make a direct impact in healthcare. Being an Iodine employee means becoming part of something bigger: using clinical AI echnology to drive smarter healthcare processes and positively impact patient care. Who we are: Iodine is an enterprise AI company that is championing a radical rethink of how to...
-
Director - Site Reliability Engineering
5 days ago
Austin, Texas, United States Iodine Software Full timeDirector - Site Reliability Engineering Join us. Let's make a direct impact in healthcare. Being an Iodine employee means becoming part of something bigger: using clinical AI echnology to drive smarter healthcare processes and positively impact patient care. Who we are: Iodine is an enterprise AI company that is championing a radical rethink of how to create...
-
Lead Site Reliability Engineer
4 days ago
Austin, United States OBSERVE, LLC Full timeAbout Us Observe.AI is the fastest way to boost contact center performance with live conversation intelligence. Built on the most accurate AI engine in the industry, Observe.AI uncovers insights from 100% of customer interactions and maximizes frontline team performance through coaching and end-to-end workflow automation. With Observe.AI , companies can act...
-
Austin, United States Texas Reliability Entity Full timeExperienced Energy Reliability Engineer/Analyst Texas Reliability Entity, Inc. (Texas RE) is hiring!The Texas power grid is changing rapidly as economics, technology, and customer demands push the power industry to new limits. At the same time, what used to be low-probability events, such as extreme weather and cybersecurity breaches, are now occurring at a...
-
Senior Engineer Site Reliability
5 days ago
Austin, United States Hispanic Technology Executive Council Full timeSenior Engineer Site Reliability Dell Technologies customers rely on our products and services to drive progress. So, we take the service we provide extremely seriously. Service Delivery is all about making sure our technical solutions help clients fulfil their priorities, challenges and initiatives. As trusted advisors, we build in-depth knowledge of what...
-
Senior Site Reliability Engineer- Remote
1 month ago
Austin, United States ClickHouse Full timeWe are committed to providing our customers with reliable and secure services so we are building out our newly formed Site Reliability Engineering team. As one of the first joiners to our Reliability Engineering Team at ClickHouse, you will be responsible for building and leading processes to ensure the reliability, availability, scalability, and performance...
-
Site Reliability Engineer
5 days ago
Austin, United States SonarSource Full timeSonar solves the trillion-dollar challenge of bad code. Sonar equips organizations to achieve and sustain a Clean Code state by empowering developers to write consistent, intentional, adaptable, and responsible code. Clean Code produces software that is maintainable, reliable, and secure, allowing development teams to spend less time fixing issues and more...
-
Site Reliability Engineer
6 days ago
Austin, United States SureCo Inc Full timeJob Type Full-time Description Job Title: Site Reliability Engineer (SRE) Location: Remote (comfortable working in the Pacific Time Zone) SureCo is changing how people in the US take care of their health - in 2020, new regulations went into effect, allowing employers to offer more choice at lower cost for employee health benefits, and SureCo is at the...
-
Site Reliability Engineer
5 days ago
Austin, United States Apple Full timeSite Reliability Engineer - Ad Platforms Austin,Texas,United States Software and Services At Apple, we work every day to build products that enrich peoples lives. Our Advertising Platforms group makes it possible for people around the world to easily access informative and imaginative content on their devices while helping publishers and developers promote...
-
Site Reliability Engineer
5 days ago
Austin, United States SONAR Full timeSonar solves the trillion-dollar challenge of bad code. Sonar equips organizations to achieve and sustain a Clean Code state by empowering developers to write consistent, intentional, adaptable, and responsible code. Clean Code produces software that is maintainable, reliable, and secure, allowing development teams to spend less time fixing issues and more...
-
Site Reliability Engineer
2 weeks ago
Austin, United States Frontline Education Full timePosting Details Job Details Description Location Requirements: This role is Hybrid to one of our offices: Austin, Naperville or Wayne. Overview : We are looking for an outgoing and dynamic Site Reliability Engineer to manage the successful operation and support of Frontline application environments. This position is responsible...
-
Site Reliability Engineer
3 weeks ago
Austin, United States Apple Inc. Full timeImagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish! Join the Apple Service Engineering team as a Site Reliability Engineering (SRE) Manager to help support and scale cloud...
-
Austin, United States Visa Full timeCompany Description Visa is a world leader in digital payments, facilitating more than 215 billion payments transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable and secure...
-
Site Reliability Engineer
7 days ago
Austin, United States Pinnacle Group Full timeResponsibilities We are looking for an operations engineer to join the Crypto Services SRE team. The Crypto Services SRE team is responsible for systems and services that support a vast number of both Apples internal services as well as services that users directly use. As an Operations Engineer, you will play a crucial role in helping ensure our systems...
-
Staff Site Reliability Engineer
6 days ago
Austin, United States DuckDuckGo Full timeJob Description: Hi, we’re DuckDuckGo, the Internet privacy company for everyone who wants to take back their privacy now. For over a decade, we've been building our all-in-one product, developing new privacy technology, and working with policymakers to make online privacy simple and accessible for all. Our browsers and extensions have been downloaded over...
-
Site Reliability Engineer
1 week ago
Austin, United States Pinnacle Group, Inc. Full timeResponsibilitiesWe are looking for an operations engineer to join the Crypto Services SRE team. The Crypto Services SRE team is responsible for systems and services that support a vast number of both Apple’s internal services as well as services that users directly use. As an Operations Engineer, you will play a crucial role in helping ensure our systems...