Senior Site Reliability Engineer

1 week ago


San Francisco, United States Tampa Gardens Senior Living Full time

Team Culture

Great things happen when people can bring their authentic selves to work. We empower all of our employees to share their perspectives, passions, and experiences because collectively we make a better, stronger team. Our team members collaborate closely with peers and cross-functional stakeholders throughout the business, with clients at the forefront of digital transformation, and with thought leaders at the cutting edge of digital manufacturing.

We take pride in our self-starter culture where employees are enabled and encouraged to achieve their professional goals through leadership guidance, learning, and development. Our philosophy is that careers are continuous journeys, and we dedicate time and offer resources so that employees can reach their full potential.

Benefits + Perks

We value you both at work and outside of it, and we know your loved ones are important. Our benefits are designed to support you and your family’s health through life’s expected and unexpected events.

Our Benefits Include:

  • Competitive Salary + Stock Options
  • Health Care Coverage + Life Insurance + Health Savings Account + Flexible Spending Account (includes spouse + children)
  • Flexible Vacation Policy
  • Adaptable Working Schedule and Environment

Our Perks Include:

  • Casual Dress Attire
  • Hybrid work flexibility
  • Catered Lunches, Snacks, and Beverages
  • Commuter Savings Program
  • Company Outings
  • Designated Volunteering Hours + Group Volunteer Events

Sight Machine is proud to be an equal opportunity employer and considers candidates regardless of age, color, ancestry, religion, sex, national origin, sexual orientation, citizenship, marital status, disability, gender identity, or Veteran status. Sight Machine also considers qualified applicants regardless of criminal history, consistent with legal requirements.

About Sight Machine, Inc.

Sight Machine strengthens manufacturers by providing the industry’s only standard data model and system-level visualization capabilities. By integrating all crucial data into a single innovative platform, everyone involved in the fabrication process can visualize, contextualize, and examine data in one intuitive interface.

Sight Machine is mission-driven and committed to improving lives, strengthening communities, and making the world cleaner by continuously re-envisioning manufacturing processes, making them more efficient, sustainable, and absolute. Founded in Michigan in 2011 and expanded to San Francisco in 2012, Sight Machine blends the spirit of technology innovation and the down-to-earth style of Detroit manufacturing. Our team includes early leadership from Yahoo, Tesla Motors, and Oracle. Together, we share wide industry knowledge and a commitment to advance manufacturing to a more sustainable future.

At Sight Machine, you will work with manufacturing leaders in the automotive, medical device, apparel, construction, and pharmaceutical industries. You will have access to and work with massive amounts of factory floor data to help uncover insights into how customers make products and to develop solutions to pressing business problems. The platform solves problems in areas such as Extract, Transform, Load (ETL), information retrieval, data aggregation and analytics, factory automation, distributed computing, and security.

We place great value on professional, technical, and personal growth in an inclusive, collaborative environment. The ideal candidate will have a passion for technology and a strong can-do attitude.

What you'll do

In this role, you will join our Site Reliability and Infrastructure Team in deploying, managing, optimizing, and upgrading the systems that run Sight Machine software. You must love learning new technology, problem-solving, and building automation in the Infrastructure as Code paradigm.

Success will take a blend of technical expertise, experience with deployment technology frameworks, customer-centric focus, and a team-spirited approach to solve architectural challenges supporting your peers in Application Engineering.

Responsibilities
  • Employing DevOps principles, provide technical operational support for comprehensive cloud infrastructure operations for all customers, internal and external.
  • Troubleshoot and resolve complex systems problems that cross multiple layers of the systems stack from networking to operating systems to cloud resources to databases.
  • Instrument and respond to Monitoring and Alerting infrastructure for critical services.
  • Participate in our on-call support schedule.
  • Proactively pursue opportunities for operational innovation to improve the stability, reliability, and availability of all platform components, optimize efficiency, and propagate a security-first culture.
  • Create, revise, and test operational runbooks and automation for maintaining Sight Machine Infrastructure.
The Role

In this role, you will join the Cloud Infrastructure Team and take on tasks that include a focus on automation, tools, deployment, monitoring, managing, and optimizing the systems that run Sight Machine software. You must love learning new technology, have excellent problem-solving skills, and embrace the Infrastructure as Code paradigm.

Success will take a blend of technical expertise, experience with deployment technology frameworks, customer-centric focus, and a team-spirited approach to solve architectural challenges supporting your peers in Development Engineering.

Requirements
  • Embody a Quality-first & Security-first culture in all that you do.
  • 5+ years of experience with Kubernetes / Docker in at least one of the top-tier cloud providers (Azure, GCP, AWS, etc.).
  • 5+ years of experience coding in languages such as Python, Java, Go, Terraform, etc.
  • 5+ years of experience using IaC and CI/CD tools like FluxCD (or similar), Jenkins, Terraform, GitHub, etc.
  • Strong experience with the Linux OS.
  • Strong working knowledge of Networking (TCP/IP and Application).
  • A willingness to author technical documentation for design, workflows, processes, best practices, etc.
  • A willingness to mentor other team members and engineers.
  • Strong bias for action vs. endless planning; you’re hands-on, have made mistakes, learned from them, and can balance risk vs. impact to customers.
  • You value clear communication and you're empathetic and respectful of others.
  • Operational experience with monitoring/alerting systems such as Sentry, Opsgenie, Prometheus.
  • Deep understanding of cloud performance, including how to diagnose and resolve bottlenecks and keep performance at optimal levels.
Nice to haves
  • Experience with elements of our current tech stack is a plus: Kubernetes, FluxCD, Terraform, Helm Charts, Prometheus, Elasticsearch, Python, Java, Kafka, Postgres, and Jenkins.
  • Previous experience or a keen interest in industrial IoT, analytics, or manufacturing is a plus.
  • Coding experience in any of Python, Bash, Java, Go.
