Principal Site Reliability Engineer

2 weeks ago


Remote, Oregon, United States Blue River Technology Full time $166,000 - $293,000 per year

We're Blue River, a team of innovators driven to create intelligent machinery that solves monumental problems for our customers. We empower our customers – farmers, construction crews, and foresters - to implement safer and more sustainable solutions, driving increased profitability with less reliance on scarce labor. We believe that focusing on the small stuff – pixel-by-pixel and task-by-task - leads to big gains.

Blue River Technology aligns with John Deere's vision to "innovate on behalf of humanity" by quickly identifying and solving high-value, high-uncertainty challenges in AI, machine learning, computer vision, and robotics. BRT acts as a research and development flywheel, building not only new products but also new platforms that reliably create value for both Deere and its customers. From fully autonomous machines to highly precise farming equipment, BRT and Deere are partnering to create technical breakthroughs in industries like agriculture and construction.

Summary

We are looking for a Principal Site Reliability Engineer to join the CVML Platform team at Blue River Technology. You will work to create a hybrid infrastructure, integrating edge devices, on-premises, and cloud resources to a cohesive CVML & Robotics foundation. You will work on cost effectiveness, transparency, and security aspects of the platform, focusing on speed and quality of solutions and services provided. You will work with both your peers and stakeholders from other teams to achieve alignment on the platform's vision and technologies. You must show initiative and the ability to organize your work schedule, and be comfortable with supporting the application needs of multiple teams, systems, and products.

  • Employment Type: Full-Time
  • Work Location: Remote in the United States
  • Visa sponsorship is available for this position on a case-by-case basis.

Job Responsibilities

A combination, not necessarily all-inclusive, of the following:

  • System Design: Architect and implement various cloud and on-premise applications, systems, and infrastructure.
  • Hybrid system integration: Integrate extremely diverse systems, configure stable integration, uptime, and monitoring.
  • Edge device integration: work with edge devices of various formats and integrate them with on-prem and cloud workflows, including networking, low-level OS, and electrical/control integration.
  • Low-level performance optimization: optimize the performance and throughput of the system at the filesystem, networking, and software levels.
  • High-level optimisation of cost and stability: optimize cost, operational stability, and supportability of highly diverse platforms and tech stack.
  • Product Mindset: Collaborate with cross-functional teams to design, develop, and maintain robust, scalable, and user-friendly web and mobile data-intensive applications.
  • System Integration: Build tools that enable users to easily move between different applications and platforms to utilize the strengths of each in a coherent ecosystem.
  • Collaboration: Work closely with cross-functional teams, including data scientists, analysts, software engineers, and product managers, to understand data requirements and deliver data solutions that align with business goals.
  • Documentation: Create and maintain technical documentation, including data flow diagrams, architecture designs, and standard operating procedures.
  • Technology Evaluation: Stay up-to-date with industry trends and emerging technologies related to data engineering, recommending and implementing new tools and frameworks as appropriate.

Required Experience and Skills

  • 8+ years of experience building infrastructure with K8S, AWS, and bare metal.
  • 8+ years of experience working with Python and Go (with production experience).
  • 8+ years of experience working with infra automation tools: Terraform / Terragrunt (or Pulumi / CDK).
  • 8+ experience with Linux-based systems and networks, and a deep understanding of internal components, networking, and security aspects.
  • Has a track record of building and maintaining scalable systems in production environments.
  • Experience in building CI/CD pipelines using GitHub Actions (or GitLab / Jenkins) for application release and deployment.
  • Experience in using AWS ECS, EKS, IAM, EC2, and RDS at production scale.
  • Deep understanding of Kubernetes and its internals (kubelet, CRDs, etc) and experience with building and extending clusters from scratch.
  • Strong problem-solving skills and ability to troubleshoot complex infrastructure and networking issues.
  • Excellent communication skills to collaborate effectively with technical and non-technical stakeholders.
  • Attention to detail and commitment to producing high-quality, well-documented code.

Preferred Experience and Skills

  • Experience with standard SQL, NoSQL, and MPP databases.
  • Experience with writing production Kubernetes operators.
  • Airflow, Kubeflow, or other orchestration system experience.
  • Can understand some C++ and/or Rust, or talk with people who do.
  • Prior experience in the autonomy and robotics space is a huge plus.

Only individual applicants will be considered. We do not work with unsolicited third-party agencies or proxy interview services.

At Blue River, we're passionate about creating an inclusive workplace that promotes and values diversity. While we have more work to do to advance diversity and inclusion, we're investing in our programs, including recruiting, mentorship, career development, and learning & development to ensure they support our Diversity, Equity, and Inclusion goals. We support each employee in living a full life, enabling a thriving career, and accomplishing a meaningful, challenging mission while collaborating with incredible people. We are dedicated to building a diverse and inclusive workplace, so if you're excited about this role but your experience doesn't align completely with the job description, we encourage you to apply anyway.

We are an equal-opportunity employer and do not discriminate based on race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment. Please contact us to request an accommodation.

The US annual base salary range for this position is $166,000 - $293,000, along with eligibility for Blue River's bonus and benefit programs.

Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your location during the hiring process. During the recruitment process, we may identify an alternative role or level to which you are more suited. If your ideal role at Blue River differs from the advertised position, we will provide an updated pay range as soon as possible during the hiring process.

LI-AN1

  • Remote, Oregon, United States 2Prod Technologies Corp. Full time $145,000 - $210,000 per year

    About 2Prod2Prod Technologies Corp. supports the federal government in delivering secure, scalable cloud solutions that advance critical national missions.Position Summary2Prod Technologies Corp. is seeking a Site Reliability Engineer (SRE) with strong GitLab expertise to support and enhance enterprise platforms. This role will focus primarily on GitLab...


  • Remote, Oregon, United States Maxihost Full time $120,000 - $180,000 per year

    About 's global computing platform was launched in 2019, enabling businesses to programmatically deploy single-tenant Bare Metal instances in different parts of the world. We are a team of passionate individuals about hardware, software, and network infrastructure looking to build the fastest, easiest-to-use, developer-centric single-tenant Cloud...


  • Remote, Oregon, United States Shutterfly Full time $106,000 - $151,000 per year

    At Shutterfly, we make life's experiences unforgettable. We believe there is extraordinary power in the self-expression. That's why our family of brands helps customers create products and capture moments that reflect who they uniquely are.Shutterfly is looking for a Senior Site Reliability Engineer to join our team. Shutterfly is undergoing a comprehensive...


  • Remote, Oregon, United States BABYLIST Full time $199,200 - $239,040 per year

    Who We AreBabylist is the leading registry, e-commerce, and content platform for growing families. More than 9 million people shop with Babylist every year, making it the go-to destination for seamless purchasing, trusted guidance, and expert product recommendations for new parents and the people who love them. What began as a universal registry has grown...


  • Remote, Oregon, United States D-Wave Full time $124,545 per year

    D-Wave (NYSE: QBTS), D-Wave is a leader in the development and delivery of quantum computing systems, software, and services. We are the world's first commercial supplier of quantum computers, and the only company building both annealing and gate-model quantum computers. Our mission is to help customers realize the value of quantum, today. Our quantum...


  • Remote, Oregon, United States Jellyvision Full time $145,000 - $175,000 per year

    Senior Site Reliability EngineerWho we areJellyvision ALEX, is on a mission to improve lives by helping people choose and use their benefits. We are raising the bar—for benefits and the employee experience (for our employees and those of the customers we serve) – by scaling personalization, compassion and an earnest intent to be helpful in all that we...

  • Reliability Engineer

    18 hours ago


    Remote, Oregon, United States Prolim global corporation Full time $98,000 - $118,304 per year

    Reliability Engineer (Steel Manufacturing) – Remote / Lewisville, OHLocation: Lewisville, Ohio, USA (Remote option available)Experience: 7–10 yearsAbout the RoleWe are seeking an experienced Reliability Engineer with a strong background in Steel Manufacturing to join our team. The ideal candidate will lead reliability initiatives, perform risk-based...


  • Remote, Oregon, United States Dropbox Full time

    PLEASE READ: Zones are based on your zip code. If you're within 100 miles of a listed metro area (straight-line radius), you're included in that Zone. For this role, we are hiring in Zones 2 and 3. Check your Zone here before applying.Role DescriptionDropbox is seeking a Principal Engineer to define the long-term technical vision and execution strategy for...


  • Remote, Oregon, United States Fifth Third Bank Full time $94,200 per year

    Make banking a Fifth Third betterWe connect great people to great opportunities. Are you ready to take the next step? Discover a career in banking at Fifth Third Bank.GENERAL FUNCTION:The Principal Information Security Engineer is responsible for defining, architecting, and supporting enterprise security tools in partnership with Information Security and IT...


  • Remote, Oregon, United States Lynx Full time $180,000 - $225,000 per year

    Job Title: Principal Software Architect – Networking & RTOSLocation: RemoteSalary Range: $180,000 - $225,000 + Bonus EligibleReports to: Director, RTOS & ToolsWho we are: Lynx delivers modular, open standards-based software solutions that redefine the economics of developing, deploying, and maintaining high assurance, mission critical edge platforms. These...