Site Reliability Engineer

2 weeks ago


Austin, United States Zenoss Careers Full time

Description: We are hiring a Site Reliability Engineer to support, configure, and build our SaaS offerings.

You will troubleshoot and administer multiple environments, including their performance and the quality of their disaster recovery. You will support all aspects of the technical infrastructure by troubleshooting system configuration, installation, and other technical issues. This involves reviewing technical designs and information, automating processes through scripting, installing and configuring software, and validating technical environments. This position is also responsible for documenting new and existing environments and validating all key technical components.

This position is expected to interact with Servers, Databases, SaaS products, Security, and various groups within Information Technology. Candidate must have strong server, network, communication, and analytical skills.

This is an ideal role for a self-motivated professional with passion for technology and creative problem solving.

Responsibilities:

  • Deploy, operate, and support cloud infrastructure, primarily on GCP but also on AWS.
  • Take responsibility for the ongoing maintenance, security, and availability of several applications based on business requirements, adhering to tight operations, security, and procedural models.
  • Ensure production-level systems are running at all times and have multiple levels of redundancy to meet committed SLAs.
  • Apply professional-level technical skill and judgment to provide non-routine technical support for production operations to drive optimal performance, reliability, redundancy, and scale.
  • Develop and maintain a working knowledge of Zenoss products and services.
  • Document environment topology and installation details.
  • Automate tasks using scripting and configuration management systems.
  • Communicate highly technical information to both technical and non-technical personnel.
  • Work with customers to troubleshoot and resolve technical issues.
  • Troubleshoot network performance issues, perform intrusion monitoring, and maintain disaster recovery procedures.
  • Plan for, and recommend, expansion of capacity and upgrades, patches, and new applications and equipment when necessary.
  • Participate in the development of information technology and infrastructure projects.
  • Document and thoroughly understand the application architecture and system configuration across platforms.
  • Determine the root cause and duration of an outage, and recommend steps to resolve issues.
  • Provide 24x7 support for all network and server systems that are pivotal to production.

Required Experience / Skills:

  • Bachelor's degree in Computer Science/Engineering or equivalent relevant experience
  • 3-6 years of professional hands-on experience with cloud production environments
  • Strong scripting skills and demonstrated ability to automate tasks (SaltStack and Python preferred)
  • Strong understanding of networking, firewalls, load balancers, and databases
  • Experience using and supporting Google Cloud Platform and Amazon Web Services
  • Experience with database (MySQL) and web server technology (Apache, Tomcat, IIS, etc.) a plus
  • Strong verbal and written communication skills
  • Project and task oriented with a focus on details
  • Ability to proactively communicate detailed status to customer and project team
  • Strong organization skills
  • Ability to work both within a team and independently
  • Ability to make sound decisions based on customer needs and technical knowledge
  • Self-motivated and able to work under pressure to deliver high-quality solutions
  • Detail oriented with excellent analytical skills
  • Ability to work after hours, including weekends and nights when required, with occasional travel
  • Eligibility to work in the United States

More about Zenoss:

Individually Unique. Better Together.

When we come together, we accomplish amazing things. Zenoss is an established company with a start-up, entrepreneurial environment. We have a collaborative culture that is focused on making our customers successful. One thing we're not is a new-kid-on-the-street startup. Founded in 2005, we're far removed from a few folks in a garage with one great idea. We are a midsize company filled with people who have proven work experience and are smart, nimble, and capable. We have credibility: Zenoss helps world-renowned enterprise customers run their IT infrastructure. Some of the most critical aspects of business rely on Zenoss. It's exciting to be part of growing and servicing these types of customers.




  • Austin, United States Virtu Financial Full time

    Virtu is a leading financial firm that leverages cutting edge technology to deliver liquidity to the global markets and innovative, transparent trading solutions to our clients. As a market maker, Virtu provides deep liquidity that helps to create more efficient markets around the world. Our market structure expertise, broad diversification, and execution...


  • Austin, United States Thales Full time

    Location: Austin, United States of America Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billions of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become...


  • Austin, United States Iodine Software Full time

    Director - Site Reliability Engineering Join us. Let's make a direct impact in healthcare. Being an Iodine employee means becoming part of something bigger: using clinical AI technology to drive smarter healthcare processes and positively impact patient care. Who we are: Iodine is an enterprise AI company that is championing a radical rethink of how to...


  • Austin, Texas, United States Iodine Software Full time

    Director - Site Reliability Engineering Join us. Let's make a direct impact in healthcare. Being an Iodine employee means becoming part of something bigger: using clinical AI technology to drive smarter healthcare processes and positively impact patient care. Who we are: Iodine is an enterprise AI company that is championing a radical rethink of how to create...


  • Austin, United States OBSERVE, LLC Full time

    About Us Observe.AI is the fastest way to boost contact center performance with live conversation intelligence. Built on the most accurate AI engine in the industry, Observe.AI uncovers insights from 100% of customer interactions and maximizes frontline team performance through coaching and end-to-end workflow automation. With Observe.AI, companies can act...


  • Austin, United States Texas Reliability Entity Full time

    Experienced Energy Reliability Engineer/Analyst Texas Reliability Entity, Inc. (Texas RE) is hiring! The Texas power grid is changing rapidly as economics, technology, and customer demands push the power industry to new limits. At the same time, what used to be low-probability events, such as extreme weather and cybersecurity breaches, are now occurring at a...


  • Austin, United States Hispanic Technology Executive Council Full time

    Senior Engineer Site Reliability Dell Technologies customers rely on our products and services to drive progress. So, we take the service we provide extremely seriously. Service Delivery is all about making sure our technical solutions help clients fulfil their priorities, challenges and initiatives. As trusted advisors, we build in-depth knowledge of what...


  • Austin, United States ClickHouse Full time

    We are committed to providing our customers with reliable and secure services so we are building out our newly formed Site Reliability Engineering team. As one of the first joiners to our Reliability Engineering Team at ClickHouse, you will be responsible for building and leading processes to ensure the reliability, availability, scalability, and performance...


  • Austin, United States SonarSource Full time

    Sonar solves the trillion-dollar challenge of bad code. Sonar equips organizations to achieve and sustain a Clean Code state by empowering developers to write consistent, intentional, adaptable, and responsible code. Clean Code produces software that is maintainable, reliable, and secure, allowing development teams to spend less time fixing issues and more...


  • Austin, United States SureCo Inc Full time

    Job Type Full-time Description Job Title: Site Reliability Engineer (SRE) Location: Remote (comfortable working in the Pacific Time Zone) SureCo is changing how people in the US take care of their health - in 2020, new regulations went into effect, allowing employers to offer more choice at lower cost for employee health benefits, and SureCo is at the...


  • Austin, United States Apple Full time

    Site Reliability Engineer - Ad Platforms Austin, Texas, United States Software and Services At Apple, we work every day to build products that enrich people's lives. Our Advertising Platforms group makes it possible for people around the world to easily access informative and imaginative content on their devices while helping publishers and developers promote...


  • Austin, United States SONAR Full time

    Sonar solves the trillion-dollar challenge of bad code. Sonar equips organizations to achieve and sustain a Clean Code state by empowering developers to write consistent, intentional, adaptable, and responsible code. Clean Code produces software that is maintainable, reliable, and secure, allowing development teams to spend less time fixing issues and more...


  • Austin, United States Frontline Education Full time

    Posting Details Job Details Description Location Requirements: This role is Hybrid to one of our offices: Austin, Naperville or Wayne. Overview: We are looking for an outgoing and dynamic Site Reliability Engineer to manage the successful operation and support of Frontline application environments. This position is responsible...


  • Austin, United States Apple Inc. Full time

    Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish! Join the Apple Service Engineering team as a Site Reliability Engineering (SRE) Manager to help support and scale cloud...


  • Austin, United States Visa Full time

    Company Description Visa is a world leader in digital payments, facilitating more than 215 billion payments transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable and secure...


  • Austin, United States Pinnacle Group Full time

    Responsibilities We are looking for an operations engineer to join the Crypto Services SRE team. The Crypto Services SRE team is responsible for systems and services that support a vast number of both Apple's internal services as well as services that users directly use. As an Operations Engineer, you will play a crucial role in helping ensure our systems...


  • Austin, United States DuckDuckGo Full time

    Job Description: Hi, we’re DuckDuckGo, the Internet privacy company for everyone who wants to take back their privacy now. For over a decade, we've been building our all-in-one product, developing new privacy technology, and working with policymakers to make online privacy simple and accessible for all. Our browsers and extensions have been downloaded over...


  • Austin, United States Pinnacle Group, Inc. Full time

    Responsibilities We are looking for an operations engineer to join the Crypto Services SRE team. The Crypto Services SRE team is responsible for systems and services that support a vast number of both Apple’s internal services as well as services that users directly use. As an Operations Engineer, you will play a crucial role in helping ensure our systems...