Platform Reliability Engineer

4 weeks ago


San Luis Obispo, California, United States The Forward Thinking Company Full time
Job Title: Platform Reliability & DevOps Engineer

Location: San Luis Obispo, CA

Job Type: Full-time

Compensation: $100,000 - $180,000 (depending on experience and qualifications)

Company Overview:

The Forward Thinking Company is a software company that is focused on revolutionizing the way enterprises build, use, and share software. Our mission is to create intuitive, user-friendly software solutions that empower our clients to transform industries and achieve success through powerful digital experiences.

Job Summary:

This unique role combines the responsibilities of a Platform Reliability Engineer and a DevOps Engineer. The ideal candidate will have the expertise to manage and support our production enterprise SaaS platform while also enhancing developer productivity through advanced tooling and environment management. This position is crucial for ensuring the stability, scalability, and efficiency of our platform and development processes.

Key Responsibilities:

  1. Design, implement, and manage scalable and reliable cloud infrastructure for our SaaS platform.
  2. Ensure the availability, performance, and security of the production environment.
  3. Develop and maintain monitoring, alerting, and incident response systems.
  4. Troubleshoot and resolve infrastructure-related issues in a timely manner.
  5. Implement and manage infrastructure as code (IaC) using Terraform.
  6. Perform regular system audits and vulnerability assessments to ensure compliance and security.

DevOps Engineer Responsibilities:

  1. Develop and maintain CI/CD pipelines to automate the deployment process.
  2. Enhance developer productivity by providing robust tooling and development environments.
  3. Collaborate with development and QA teams to streamline the software development lifecycle.
  4. Implement and manage version control systems and continuous integration tools.
  5. Automate routine tasks to improve operational efficiency and reduce manual intervention.
  6. Provide support and guidance to developers on best practices for code deployment and environment management.

Qualifications:

  1. Bachelor's degree in Computer Science, Information Technology, or a related field.
  2. 3+ years of experience in a Platform Reliability Engineer or DevOps Engineer role.
  3. Proficiency in scripting languages such as Python or Bash.
  4. Hands-on experience running scalable architectures and systems.
  5. Experience with CI/CD tools like GitHub Actions.
  6. Experience supporting end-user mobile and web applications.
  7. Strong understanding of networking concepts and security best practices.
  8. Excellent problem-solving skills and attention to detail.
  9. Ability to work independently and as part of a team.
  10. Strong communication and collaboration skills.

Preferred Qualifications:

  1. Certifications in cloud platforms (Google Cloud Certified Professional).
  2. Strong experience with Firebase and Google Cloud Platform.
  3. Strong experience with Flutter.
  4. Experience with modern application performance monitoring tools such as New Relic, Datadog, or Dynatrace.
  5. Familiarity with Agile/Scrum methodologies.
  6. Experience with serverless architectures and microservices.
  7. Knowledge of developer productivity tools and practices.

Benefits:

  1. Competitive salary and performance bonuses.
  2. Comprehensive insurance options, including health, dental, vision, with a variety of plans to meet the needs of our team members and their families.
  3. Professional development opportunities in a rapidly growing software company.


  • San Luis Obispo, California, United States The Forward Thinking Company Full time

    Job Title: Platform Reliability & DevOps EngineerLocation: San Luis Obispo, CAJob Type: Full-timeCompensation: $100,000 - $180,000 (depending on experience and qualifications)Company Overview:The Forward Thinking Company is a software company that is revolutionizing the way enterprises build, use, and share software. Our mission is to create intuitive,...


  • San Diego, California, United States Platform Science Full time

    About UsAt Platform Science, we're revolutionizing the way businesses connect and interact with the world around them. Our open IoT platform empowers innovative fleets, application developers, and equipment providers to deliver cutting-edge solutions to supply chain professionals globally.The RoleWe're seeking a highly skilled Senior Site Reliability...


  • San Diego, California, United States Platform Science Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team in San Diego, CA (or remote). As a key member of our SRE team, you will be responsible for ensuring the reliability and performance of our cloud-based platform.Key ResponsibilitiesDevelop and enhance CI/CD pipelines to streamline application deployment and...


  • San Jose, California, United States Tik Tok Full time

    Job Title: Site Reliability Engineer, Data PlatformTikTok is a leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. Our platform is built to help imaginations thrive, and we're looking for a Site Reliability Engineer to join our Data Platform team.Responsibilities:Ensure the reliability of all TikTok's...


  • San Diego, California, United States Talent Software Services Full time

    Job Title: Site Reliability Engineer - Platform SupportJoin Talent Software Services as a Site Reliability Engineer - Platform Support and be part of a tight-knit team that operates and supports the core infrastructure foundation of our platform.About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Platform Experience Group. As...


  • San Jose, California, United States Adobe Full time

    About the RoleWe are seeking an exceptional Site Reliability Engineering Manager to lead our team in driving reliability for Adobe's AI Inference Platform, Adobe Firefly. As a key member of our Engineering organization, you will be responsible for developing a team of Site Reliability Engineers who will work closely with our Engineering teams to build,...


  • San Jose, California, United States Adobe Full time

    About the RoleWe're seeking an exceptional Site Reliability Engineering Manager to lead our AI Platform Inference Infrastructure team at Adobe. As a key member of our organization, you'll be responsible for driving reliability, scalability, and security for our AI Inference Platform, Adobe Firefly.Key ResponsibilitiesDevelop and execute the technical vision...


  • San Jose, California, United States Adobe Full time

    About the RoleWe are seeking an exceptional Site Reliability Engineering Manager to lead our team in driving reliability for Adobe's AI Inference Platform, Adobe Firefly. As a key member of our Engineering organization, you will be responsible for developing a team of Site Reliability Engineers who will work closely with our Engineering teams to build,...


  • San Luis Obispo, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Software Development Engineer II to join our Routing Platform Team at Amazon. As a key member of our team, you will be responsible for designing, building, and maintaining low-latency, high-availability, and resilient solutions.Key ResponsibilitiesDesign and implement scalable, efficient, and reliable software...


  • San Jose, California, United States Tik Tok Full time

    Job Title: Site Reliability Engineer, Cloud Native PlatformTikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to users worldwide. Our mission is to connect people across the globe, and our infrastructure team is seeking experienced site reliability engineers to build a globally distributed edge platform for...


  • San Jose, California, United States Adobe Full time

    About the RoleWe are seeking an exceptional Site Reliability Engineer to join our team at Adobe, working on the AI Training Platform, Adobe Firefly. As a key member of our team, you will collaborate closely with Engineering teams to build, scale, and secure the AI Platform, enabling Firefly product teams to easily manage and deploy Machine Learning...


  • San Francisco, California, United States Wasmer Full time

    About the RoleWe are seeking an exceptional Site Reliability Engineer to join our team at Wasmer. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and reliable infrastructure solutions for our Edge computing platform.Key ResponsibilitiesDesign and implement scalable and reliable infrastructure...


  • San Leandro, California, United States Omni Inclusive Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Omni Inclusive. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our digital platforms.Key Responsibilities:Design, implement, and maintain scalable and reliable...


  • San Francisco, California, United States Wasmer Full time

    About the RoleWe are seeking an exceptional Site Reliability Engineer to join our team at Wasmer. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our Edge computing platform.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable infrastructure solutions for our Edge computing...


  • San Francisco, California, United States Xero Full time

    About the RoleXero is a leading cloud-based accounting platform that empowers small businesses and their advisors to thrive. As a Site Reliability Engineer on our Reliability Enablement team, you'll play a critical role in ensuring the reliability and performance of our systems.Key ResponsibilitiesInvestigate operational surprises and support teams in...


  • San Jose, California, United States Trianz Full time

    About TrianzTrianz is a leading-edge technology platforms and services company that accelerates digital transformations at Fortune 100 and emerging companies worldwide in data & analytics, digital experiences, cloud infrastructure, and security.Our VisionWe believe that companies around the world face three challenges in their digital transformation journeys...

  • Platform Engineer

    2 weeks ago


    San Jose, California, United States Piper Companies Full time

    Piper Companies is seeking a skilled Platform Engineer to join their team in San Jose, CA. As a key member of the team, the successful candidate will play a crucial role in ensuring the reliability and performance of hardware devices in a lab environment.Key Responsibilities:Develop and analyze software diagnostics for hardware products in a lab...


  • San Francisco, California, United States Instabase Full time

    About InstabaseInstabase is a cutting-edge AI innovation company that empowers organizations to solve complex unstructured data problems. With a global presence and a customer-centric approach, we deliver top-tier solutions that provide unmatched advantages for everyday business operations.Job Title: Site Reliability EngineerWe are seeking a highly skilled...


  • San Francisco, California, United States Cervin Full time

    About the Role:Cervin is seeking a highly skilled Senior Platform Engineer to join our team. As a key member of our engineering organization, you will play a critical role in extending and scaling our infrastructure and product capabilities.You will be responsible for building and maintaining platform tools, leveraging innovative cloud technologies to...


  • San Jose, California, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...