Site Reliability Engineer

1 day ago


Chicago IL United States Tbwa ChiatDay Inc Full time
Site Reliability Engineer (SRE) - Mandarin Speaking

Location: Chicago-HQ/Hybrid

Chowbus is a SaaS (Software as a Service) company that began as an online platform for food ordering, payment, and delivery. The company has since shifted its focus to providing an all-in-one POS (point-of-sale) system tailored to the evolving needs of the restaurant industry. Headquartered in Chicago, Illinois, Chowbus serves over 2,000 restaurants with partners across 20 major U.S. cities. Our mission is to build the most comprehensive ecosystem to empower restaurants.

We are seeking a highly motivated Site Reliability Engineer (SRE) with 2-3 years of hands-on experience in managing and scaling infrastructure. The ideal candidate will have a strong background in Computer Science or Electrical Engineering and be fluent in both English and Mandarin. This role provides an exciting opportunity to work on cutting-edge infrastructure projects, cloud-based solutions, and automation while contributing to the long-term strategy of IT management.

Responsibilities

  • Infrastructure Management: Maintain and optimize cloud-based infrastructure to ensure high availability, reliability, and scalability of services.
  • Site Reliability Engineering (SRE): Implement SRE best practices, focusing on monitoring, automation, and system performance to improve the reliability and efficiency of the infrastructure. Proactively monitor system health and respond to incidents, troubleshoot issues, and implement solutions to prevent future occurrences.
  • Automation: Automate operational tasks such as provisioning, configuration, and monitoring using Infrastructure as Code (IaC) tools like Terraform and Ansible.
  • Collaboration: Work closely with development teams to ensure seamless integration of new software and infrastructure solutions.
  • IT Management: Build highly automated IT management workflows and integrate AI to automate routine tasks.

Qualifications

  • Education: Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.
  • Experience: 2-3 years of experience in Infrastructure, SRE, or DevOps roles, with hands-on experience in cloud environments (AWS, GCP, or Azure), containers (Docker, Kubernetes), and automation tools.
  • Languages: Fluent in both English and Mandarin (written and spoken) for effective communication with global teams.

Technical Skills:

  • Proficient in Linux/Unix systems.
  • Experience with cloud platforms (AWS, Google Cloud, Azure).
  • Familiarity with Infrastructure as Code (Terraform, Ansible, etc.).
  • Experience with CI/CD tools (Jenkins, GitHub Actions, CircleCI, etc.).
  • Strong knowledge of scripting languages (Python, Bash, etc.).
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK Stack).
  • Understanding of networking, security, and system performance.

Preferred Qualifications:

  • Experience with AWS ECS and container orchestration tools.
  • Exposure to IT management principles or a strong interest in progressing into IT leadership roles.
  • Knowledge of cybersecurity best practices and compliance regulations.
  • Familiarity with Agile and DevOps methodologies.

Soft Skills:

  • Strong problem-solving skills and ability to troubleshoot complex issues.
  • Excellent communication and teamwork skills to collaborate effectively with cross-functional teams.
  • Self-motivated with a passion for learning new technologies and optimizing systems.
  • High level of responsibility and accountability.

What We Offer:

  • Medical, dental, and vision insurance.
  • 401(k).
  • 100% employer-paid Short-Term Disability (STD).
  • 100% employer-paid Life Insurance and option for additional employee-paid Life Insurance.
  • 100% employer-paid Accidental Death and Dismemberment (AD&D) Insurance and option for additional employee-paid AD&D Insurance.

  • Chicago, IL, United States WEX, Inc. Full time

    The WEX Site Reliability Engineering (SRE) team is seeking an entry-level Site Reliability Engineer Level 1 who is passionate about learning and growing in the field of software development and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits...


  • Chicago, IL, United States Nextpoint Full time

    Join the team designing and developing innovative software solutions to meet client needs while providing expert technical support. Who we are and what we offer at Nextpoint: Nextpoint delivers transformative software and services for all law-kind. Our award-winning team is 100% focused on making it simple, fluid, and affordable for law firms of all...


  • Chicago, IL, United States WEX Inc. Full time

    Senior Staff Site Reliability Engineer. Apply to locations: Chicago, IL; Bay Area, CA; San Francisco, CA. About the Role: The WEX Site Reliability Engineering (SRE) team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and...


  • Chicago, IL, United States Datamaxis Full time

    Location: Chicago, IL. Position Type: Full-time (3 days a week onsite (Tue, Wed & Thu), or more if needed). Salary: $125,000 to $140,000 (10% yearly bonus). Responsibilities: Manage and monitor systems and infrastructure hosted on-premises and in the cloud. Good understanding of different layers of an application and system design - networking concepts, cloud...


  • Chicago, United States Matlen Silver Full time

    Compensation: $70 - $75/Hour. Hybrid: 2 Days Onsite, Chicago, Illinois. Domain: Retail/Supply Chain. Job Title: Site Reliability Engineer. Position Summary: As a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...


  • Sunnyvale, CA, United States Natcast, Inc. Full time

    Natcast (short for The National Center for the Advancement of Semiconductor Technology) is a new, purpose-built, non-profit entity created to operate the National Semiconductor Technology Center (NSTC) consortium, established by the CHIPS Act of the U.S. government. Working at Natcast represents an opportunity to help extend America’s leadership in...


  • Chicago, United States Algo Capital Group Full time

    Linux Site Reliability Engineer – Linux Systems Engineering Team. Our client, an industry-leading proprietary trading firm and liquidity provider, is looking for a Linux Site Reliability Engineer to join their expanding Linux Systems Engineering Team in Chicago. The firm prides itself on its collaborative environment and usage of mostly in-home tools and...


  • Annapolis Junction, MD, United States Maximus Full time

    General information. Job Posting Title: Site Reliability Engineer. Date: Wednesday, October 16, 2024. City: Annapolis Junction. State: MD. Country: United States. Working time: Full-time. Description & Requirements: Maximus is seeking a Site Reliability Engineer to provide expertise to a federal client in support of their mission critical systems in defense of our...


  • Duluth, GA, United States BlueSky Resource Solutions Full time

    Job Title: Site Reliability Engineer – Observability. Overview: We are seeking a Site Reliability Engineer III to develop and maintain our observability platform. This role focuses on ensuring the reliability, performance, and scalability of microservices, Kubernetes clusters, and cloud infrastructure. You'll collaborate with cross-functional teams to deliver...


  • Miami, FL, United States Royal Caribbean Group Full time

    Site Reliability Engineer. Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group. We are proud to offer a competitive compensation and benefits package, and excellent career development opportunities, each offering unique ways to explore the world. We are proud to be the...


  • Fairfax, VA, United States Apex Systems Full time

    We are seeking talented professionals to join our successful and growing team in building the next-generation Continuous Diagnostics and Mitigation (CDM) Cyber data solution. The CDM Program is the Cybersecurity and Infrastructure Security Agency’s (CISA) dynamic approach to strengthening the cybersecurity of Federal networks and systems through better...


  • Chicago, United States Info Way Solutions Full time

    Site Reliability Engineer in Wealth Management. Chicago (IL) / Tempe (AZ), Onsite. Job Role: This role is responsible for application observability, maintenance, and support; for proactively identifying and implementing preventive measures; and for evaluating and recommending techniques, practices, or technologies that would enhance business needs. As an SRE...


  • Redwood City, CA, United States C3 AI Full time

    We are looking for an Associate Site Reliability Engineer / Site Reliability Engineer to join our team at our HQ in Redwood City, CA. Responsibilities: Maximize system uptime and availability, ensuring functional and performance SLAs. Establish end-to-end monitoring and alerting on all critical aspects. Solve complex problems for critical services...


  • Chicago, United States Saxon Global Full time

    Northern Trust Site Reliability Engineer (Azure). Location: Downtown Chicago - Onsite 2 days/week - 181 W Madison St. Duration: 12+ month contract w/extension/conversion. Overview: The Goals Driven Wealth Management platform is a showcase product for Northern Trust's Wealth Management business, and we must demonstrate our ability to deliver and support innovative...


  • Chicago, United States Enova Full time

    We are interested in every qualified candidate who is eligible to work in the United States. However, we are not able to sponsor visas or take over sponsorship at this time. Reports to: Technology Manager II - Tech Ops. About the Role: As a Site Reliability Engineer, you will help maintain the reliability of our consumer business from a...


  • Newton, MA, United States Intelliswift Software Full time

    Title: Site Reliability Engineer. Location: Newton, MA (Hybrid). Duration: 6 Months. Pay rate: $38.73 per hour on W2. We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and...