Site Reliability Engineer

2 weeks ago


Palo Alto, United States CareerBuilder Full time

Velocity Global seeks a Site Reliability Engineer (SRE) with observability experience. In this role, you will drive the automation and support of our cloud infrastructure and identify strategies to improve our full-stack telemetry and monitoring capabilities.
SREs at Velocity Global work cross-functionally with DevOps and Engineering teams, combining operations work with software engineering principles to enable high availability of production systems. You will serve as a partner to our Engineering organization to help make their services more performant, scalable, observable, and reliable. Every engineering team at Velocity Global should be responsible for the software they build. SREs are critical in providing the tools, practices, and expertise to make that happen.
This individual will report to the Manager, Site Reliability Engineering.
RESPONSIBILITIES
Automating observability and alerting across an ever-changing landscape of microservices.
Automating Service Reliability scorecards and Production Readiness Standards.
Running Chaos Engineering and Game Day simulations to discover and test fixes for weak spots that would otherwise go unidentified until a real production incident occurs.
Software engineering project work, proposed and driven by individual SRE team members, to remove operational bottlenecks and increase velocity in ways we've never considered before.
Expand and improve our observability and monitoring footprint.
Collaborate with Engineering and DevOps to create architectural plans, define project requirements, and establish technical standards.
Address common operational challenges by building tools and automation scripts.
Serve on the Incident Response Team to help debug and drive resolution of production reliability issues, contribute to the postmortem, and work to prevent recurrence.
Participate in design and production reviews for new features, products, or infrastructure.
Audit and tune the configuration of systems owned by other engineering teams.
Plan for the growth of infrastructure and infrastructure reliability/resiliency.
Designing and implementing High Availability architecture underlying our platform.
Creating Disaster Recovery solutions, including backups, redundant systems, and emergency response processes.
Identify problems, propose solutions, and then implement them, from submitting a merge request on another team's repository to scoping out a new reliability project.
QUALIFICATIONS
2+ years working in a relevant role, including administering observability stacks, either managed or self-hosted (e.g., Datadog, New Relic, Prometheus, Elastic Stack/ELK, AppDynamics).
Solid experience and understanding of AWS cloud services.
Experience operating containerized microservices running on public cloud, asynchronous event processing, and databases.
Strong understanding of Linux, GitLab, and CI/CD pipelines.
Experience with on-call support of highly available production systems.
Experience designing and building new tools to automate repetitive tasks, prevent incidents, or improve TTR using an object-oriented programming language such as Python.
Experience with Infrastructure as Code using tools like Terraform, Terragrunt, or CloudFormation.
Understanding of how application components interact, and the ability to contribute to architectural discussions.
Ability to identify problems, propose solutions, and implement them, from submitting a merge request on another team's repository to scoping out a new reliability project.
Our job titles may span more than one career level. The base pay depends upon many factors, such as training, transferable skills, work experience, business needs, and market demands. The base pay range is subject to change and may be modified. This role is eligible for annual performance-based bonuses, flexible time off, health care benefits, retirement savings, and employee incentive plans.
Pay Range
$116,000 to $155,000 USD
GO FARTHER WITH VELOCITY
At Velocity Global, we're building a dream team made up of the world's best talent. We're looking for people like you to join us as we make opportunity borderless for people everywhere.
ABOUT VELOCITY GLOBAL
At Velocity Global, our values represent who we are and the company we want to be. We harness the power of our values to bring our unique talents together in pursuit of our common goals. In partnership with our customers and ourselves, we are better together, and together, we win.

We are dedicated to fostering diversity and inclusion across our organization, embracing the rich tapestry of cultures, backgrounds, and perspectives that our global team brings together in offices around the world. Velocity Global is an Equal Opportunity Employer committed to empowering individuals from all walks of life to achieve their professional goals with us, regardless of race, religion, gender, gender identity, pregnancy, disability, sexual orientation, age, national origin, citizenship status, or genetic information. We actively seek and encourage applications from diverse candidates, including those with disabilities, and offer accommodations throughout the selection process upon request.
Please refer to our present benefits offering here.



