Current jobs related to Site Reliability Engineer, Cloud Expert - San Francisco, California - Aircon Engineering Inc


  • San Diego, California, United States Insight Global Full time

    Job Title: Site Reliability Engineer - Cloud ExpertAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will be responsible for ensuring the high availability and performance of our cloud-based systems.Key Responsibilities:- Design and implement scalable and highly...


  • San Francisco, California, United States AEG Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our DevSecOps and Infrastructure team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.ResponsibilitiesDesign, implement, and maintain scalable and highly available cloud...


  • San Jose, California, United States Adobe Full time

    Job Title: Site Reliability EngineerAt Adobe, we're looking for a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based services.Key Responsibilities:Design, develop, and deploy cloud-based services and...


  • San Francisco, California, United States SpeedCast Full time

    Job Summary:Speedcast is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the reliability and scalability of our cloud-based infrastructure. You will work closely with our development team to ensure that our systems are highly available,...


  • San Francisco, California, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer with expertise in Java to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and...


  • San Diego, California, United States BAE Systems USA Full time

    Job DescriptionBAE Systems USA is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our cloud-based systems and infrastructure. You will work closely with our development teams to design, implement, and maintain scalable and reliable systems.Key...


  • San Jose, California, United States Adobe Full time

    Job Title: Site Reliability EngineerAt Adobe, we're looking for a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the highest level of uptime and Quality of Service (QoS) to Adobe's customers through operational excellence.Key Responsibilities:Define service level objectives...


  • San Diego, California, United States BAE Systems USA Full time

    Job DescriptionBAE Systems USA is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our cloud infrastructure, identifying and resolving performance issues, and implementing automation solutions to improve efficiency and scalability.Key...


  • San Jose, California, United States ApTask Full time

    ApTask is a leading global provider of workforce solutions and talent acquisition services, dedicated to shaping the future of work.As an African American-owned and Veteran-certified company, ApTask offers a comprehensive suite of services, including staffing and recruitment solutions, managed services, IT consulting, and project management.With a focus on...


  • San Francisco, California, United States Instabase Full time

    About InstabaseInstabase is a cutting-edge technology company that specializes in democratizing access to AI innovation. Our mission is to empower organizations to solve complex unstructured data problems and unlock new business opportunities.Our TeamWe are a team of passionate and innovative professionals who are dedicated to building scalable and reliable...


  • San Francisco, California, United States Zilliz Full time

    Job Title: Cloud Platform Staff Site Reliability EngineerWe are seeking a highly skilled Cloud Platform Staff Site Reliability Engineer to join our team at Zilliz. As a key member of our SRE team, you will be responsible for ensuring the reliability, availability, and performance of our distributed database systems.Key Responsibilities:Design and build tools...


  • San Francisco, California, United States SingleStore Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at SingleStore. As a key member of our engineering team, you will be responsible for designing, building, and running elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments.Key Responsibilities:Help drive...


  • San Francisco, California, United States Atika Technologies Full time

    Job Summary:Atika Technologies is seeking a highly skilled Cloud Engineer and Site Reliability Specialist to support our Corporate engineering requirements. The ideal candidate will have a strong background in DevOps (80%) and SRE (20%) with expertise in AWS and Kubernetes.Key Responsibilities:⁠ ⁠Support Corporate engineering...


  • San Francisco, California, United States Apollo Solutions Full time

    Site Reliability EngineerApollo Solutions has partnered with a pioneering artificial intelligence business that is revolutionizing the use of AI/ML in gaming and security.The company is working closely with government contracts and gaming console companies and is seeking a Site Reliability Engineer to join their growing team.The Site Reliability Engineer...


  • San Francisco, California, United States Diverse Lynx Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a key member of our technical team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly available cloud...


  • San Francisco, California, United States Instabase Full time

    About InstabaseInstabase is a global company with offices in San Francisco, New York, London, and Bengaluru. We're a people-first organization that values experimentation, curiosity, and customer obsession.Job SummaryWe're seeking a Site Reliability Engineer to join our Site Reliability and Platform Engineering team. As a key member of our team, you'll be...


  • San Francisco, California, United States Cisco Full time

    About Cisco ThousandEyesCisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver seamless digital experiences across every network.Our platform is powered by AI and an unmatched set of cloud, internet, and enterprise network telemetry data, enabling IT teams to proactively detect, diagnose, and remediate issues...


  • San Francisco, California, United States Withorb Full time

    About UsOrb is a cutting-edge technology company on a mission to revolutionize the way businesses approach revenue growth. Our team is passionate about building a robust infrastructure that enables our customers to unlock their full potential.Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our...


  • San Francisco, California, United States SpeedCast Full time

    Job Title: Site Reliability EngineerAt Speedcast, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based communication solutions.Key Responsibilities:Analyze and design continuous...


  • San Francisco, California, United States PicnicHealth Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at PicnicHealth. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, efficiency, and scalability of our cloud infrastructure. You will work closely with our development team to identify and resolve infrastructure issues, and collaborate...

Site Reliability Engineer, Cloud Expert

2 months ago


San Francisco, California, United States Aircon Engineering Inc Full time

About Aircon Engineering Inc.

We're a leading engineering company, inspiring innovation in the industry with cutting-edge technology and expertise. Leading with our core values, we help companies across every sector blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good – you've come to the right place.

Aircon Engineering Inc. is looking for a Site Reliability Engineer to build and run multi-substrate Kubernetes and microservices platforms which power our core engineering systems and a growing set of applications across the company. This platform provides the ability to develop and deploy microservices quickly and efficiently, accelerating their path to production.

You are responsible for the high availability of a large fleet of clusters running various technologies like Kubernetes, Docker, software load balancers, Zookeeper, service mesh, Istio, and so on. You'll gain valuable experience solving real production issues, expanding your knowledge of the architecture of K8s ecosystem services and internals.

  1. You will contribute code wherever possible to drive improvement.
  2. You will automate efforts in Python/Golang/Puppet/Jenkins to eliminate manual work with day-to-day operations.
  3. You will create improvements to CI/CD pipelines built on Terraform, Spinnaker, and Argo.
  4. You will help improve the platform's visibility by implementing vital monitoring and metrics with Prometheus, Grafana, and other monitoring frameworks.
  5. You'll implement self-healing mechanisms to fix issues to reduce manual labor proactively.
  6. You will get to improve your communication and collaboration skills working with various other teams across Aircon Engineering Inc.
  7. You will interact with a highly innovative and creative team of developers and architects.
  8. You will evaluate new technologies to tackle problems as needed.

You are the ideal candidate if you are passionate about live site service ownership. You have proven a strong ability to manage large distributed systems. You are comfortable with troubleshooting complex production issues that span multiple disciplines. You bring a solid understanding of how infrastructure software components work. You are able to automate tasks using a modern high-level language. You have good written and spoken communication skills.

Job Requirements:

  1. 8+ years of experience in SRE/DevOps/Systems Engineering roles.
  2. Experience operating large scale distributed systems in cloud environments.
  3. Excellent troubleshooting skills with the ability to learn new technologies in complex distributed systems.
  4. Strong working experience with Linux Systems Administration and knowledge of Linux internals.
  5. Experience with Kubernetes, Docker, or Service Mesh.
  6. Scripting/programming languages: Python, GoLang, etc.
  7. Good knowledge of Networking protocols and components: TCP/IP Stack, Switches, Routers, Load Balancers.
  8. Experience in configuration management tools, Puppet, Chef, Ansible or other DevOps tools.
  9. Experience in any of the monitoring tools like Nagios, Grafana, Zabbix, etc.
  10. Experience with AWS, Terraform, Spinnaker.
  11. A continuous learner and a critical thinker.
  12. A great teammate with excellent communication skills.

Accommodations:

If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.

Posting Statement:

Aircon Engineering Inc. is an Equal Employment Opportunity and Affirmative Action Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Aircon Engineering Inc. does not accept unsolicited headhunter and agency resumes. Aircon Engineering Inc. will not pay any third-party agency or company that does not have a signed agreement with Aircon Engineering Inc.

Aircon Engineering Inc. welcomes all.

Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Aircon Engineering Inc. will consider for employment qualified applicants with arrest and conviction records. For California-based roles, the base salary hiring range for this position is $165,600 to $323,400. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, benefits. More details about our company benefits can be found at the following link: