Staff Site Reliability Engineer
3 weeks ago
Visa is a world leader in digital payments, facilitating more than 215 billion payments transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable and secure payments network, enabling individuals, businesses and economies to thrive.
When you join Visa, you join a culture of purpose and belonging – where your growth is priority, your identity is embraced, and the work you do matters. We believe that economies that include everyone everywhere, uplift everyone everywhere. Your work will have a direct impact on billions of people around the world – helping unlock financial access to enable the future of money movement.
Join Visa: A Network Working for Everyone.
Job Description
Product Reliability Engineering (PRE) is part of Visa's technology organization. The division is responsible for maintaining and supporting Visa's data assets and provides support for value-added products and services to drive innovation for our partners and clients, within Visa and globally. The Product Reliability Engineering Big Data Platform Team is part of PRE and supports the Open-source Big Data stack and Big Data Services at Visa.
As a Staff Site Reliability Engineer, you will be responsible for monitoring, troubleshooting, automating and continuously developing software products and tools to improve the availability and resiliency of Open-source Platforms at Visa.
Essential Functions:
- Perform administration and engineering activities on Open-source Hadoop, Open-source Spark, Airflow, and the machine learning platform running on Open-source Kubernetes clusters.
- Strong Troubleshooting and debugging skills.
- Collaborate across teams: build and maintain relationships with customer teams, the user community, architects, and engineering teams, and jointly work on key deliverables to ensure production scalability and stability.
- Perform effective root-cause analysis of major production incidents and develop learning documentation.
- Plan and perform capacity expansion and upgrades in a timely manner, avoiding scaling issues and bugs.
- Automate repetitive tasks to reduce manual effort and avoid human error.
- Tune alerting and set up observability to proactively identify issues and performance problems.
- Leverage DevOps tools, disciplines (incident, problem and change management) and standards in day-to-day operations.
- Implement automation and self-healing as required.
- Lead and participate in determining root causes for Kubernetes application service failures and support escalation.
- Ensure the Kubernetes platform services can effectively meet performance and SLA requirements.
- Harden and secure the Kubernetes cluster with monitoring and auditing dashboards.
This is a hybrid position. Hybrid employees can alternate time between remote and office work. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.
Qualifications
Basic Qualifications:
- 5+ years of relevant work experience with a Bachelor’s Degree or at least 2 years of work experience with an Advanced degree (e.g. Masters, MBA, JD, MD) or 0 years of work experience with a PhD, OR 8+ years of relevant work experience.
Preferred Qualifications:
- 6 or more years of work experience with a Bachelor’s Degree or 4 or more years of relevant experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or up to 3 years of relevant experience with a PhD.
- At least 3 years of hands-on experience with on-prem container infrastructure; OpenShift or open-source Kubernetes preferred.
- Knowledge of infrastructure operations and production support of container technologies and orchestration platforms is a plus.
- Knowledge of Docker/Kubernetes deployment, configuration, scaling, and management of containerized applications is a plus.
- Experience in managing and tuning performance of Hadoop platforms.
- Extensive knowledge of the Hadoop ecosystem, such as HDFS, YARN, Hive and Spark.
- Excellent Shell and Python programming skills for automating repetitive DevOps tasks.
- Understanding of security tools like Kerberos and Ranger.
- Must have strong knowledge of and experience in Unix/Linux systems administration in relevant technologies.
- Experience with configuration management tools like Chef or Ansible is a plus.
- Working knowledge of monitoring and logging tools such as Prometheus and Grafana is a plus.
- Excellent verbal and written communication and presentation skills, and analytical and problem-solving skills.
- Self-driven, with the ability to work independently.
Work Hours: Varies upon the needs of the department.
Travel Requirements: This position requires travel 5-10% of the time.
Mental/Physical Requirements: This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, frequently operate standard office equipment, such as telephones and computers.
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.
Visa will consider for employment qualified applicants with criminal histories in a manner consistent with applicable local law, including the requirements of Article 49 of the San Francisco Police Code.
U.S. APPLICANTS ONLY: The estimated salary range for a new hire into this position is 119,100.00 to 154,800.00 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for bonus and equity. Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401(k), FSA/HSA, Life Insurance, Paid Time Off, and Wellness Program.