Senior Site Reliability Engineer- Cassandra DB

4 weeks ago


Chicago, Illinois, United States Grubhub Full time

About The Opportunity
We're all about connecting hungry diners with our network of over 300,000 restaurants nationwide. Innovative technology, user-friendly platforms and streamlined delivery capabilities set us apart and make us an industry leader in the world of online food ordering. When you join our team, you become part of a community that works together to innovate, solve problems, grow, work hard and have a ton of fun in the process
Why Work For Us
Grubhub is a place where authentically fun culture meets innovation and teamwork. We believe in empowering people and opening doors for new opportunities. If you're looking for a place that values strong relationships, embraces diverse ideas-all while having fun together-Grubhub is the place for you
We are looking for a Senior Site Reliability Engineer to join our Database Engineering organization. At Grubhub, the Database Engineering organization owns the top-level reliability, observability, and availability of the Datastore platforms, including but not limited to Cassandra, ElasticSearch and Kafka. This team contributes to projects, services, designs, and processes with the aim to steward good architecture and provide tools and services to enable software engineering teams to measure and meet reliability agreements.
The Impact You Will Make

  • Manage large critical Cassandra and Elasticsearch clusters supporting millions of transactions per day
  • Build systems to automate all build and maintenance tasks using Ansible and python
  • Develop self-service tools to allow engineers to manage and provision resources with GrubHub best practices
  • Monitor cluster availability, read/ write latencies, and other important performance metrics to proactively identify SLO misses and help mitigate issues
  • Evaluate new technologies and software versions. Test and develop roadmaps
  • Tune Cassandra and ES databases for optimizing throughput and read /write latencies
  • 24X7 on-call rotation support with rest of team for rapid incident response
  • Implement DR strategies, including backups and recovery techniques with minimal downtime.
  • Work with other engineers to manage our data persistence integration and performance with the Grubhub platform.
  • Monitor and scale Elasticsearch/Cassandra clusters to handle growth in traffic


What You Bring To The Table

  • Experience developing backend applications in Python or Java
  • Experience managing, working or developing large Elasticsearch clusters in highly available 24x7 production environments
  • Experience automating the maintenance of infrastructure using Python and Ansible or similar tools.
  • Experience managing automated cloud infrastructures on AWS or other major cloud providers.
  • Experience managing large Cassandra clusters in production is a strong plus.
  • Experience working with docker is a plus
  • Ability to quickly learn new concepts and technologies and adapt to changing needs


About Our Tech

  • Most of our internal tooling is written in Python.
  • Most of our microservices are written in Java
  • Observability tools we use: Datadog, Splunk, Lightstep.
  • Our primary persistence store is Cassandra
  • We operate in 3 Amazon regions (hot+hot+hot)
  • We primarily rely on AWS and its services: EC2, S3, SNS/SQS, ElastiCache, Lambda, etc.


And Of Course, Perks

  • Flexible PTO. Grubhub employees enjoy a generous amount of time to recharge.
  • Health and Wellness. Excellent medical, dental and vision benefits, 401k matching, employee network groups and paid parental leave are just a few of our programs to support your overall well-being.
  • Compensation. You'll receive a highly-competitive compensation package with eligibility for generous incentives, bonuses, commission, and RSUs.
  • Free Meals. Our employees get a weekly Grubhub credit to enjoy and support local restaurants.
  • Social Impact. We believe in giving back through programs like the Grubhub Community Relief Fund, and provide our employees opportunities to support causes that are important to them.


Grubhub is an equal opportunity employer. We welcome diversity and encourage a workplace that is just as diverse as the customers we serve. We evaluate qualified applicants without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, and other legally protected characteristics. If you're applying for a job in the U.S. and need a reasonable accommodation for any part of the employment process, please send an email to and let us know the nature of your request and contact information. Please note that only those inquiries concerning a request for reasonable accommodation will be responded to from this email address.
If you are a resident of the State of California and would like a copy of our CA privacy notice, please email



  • Chicago, Illinois, United States Balyasny Asset Management Full time

    We are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up.As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure. Your role will entail driving standards and fostering adoption across our technology teams, whilst...


  • Chicago, Illinois, United States Motion Recruitment Full time

    A financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...


  • Chicago, Illinois, United States Adyen Full time

    This is AdyenAdyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...


  • Chicago, Illinois, United States Spectraforce Technologies Full time

    Title: Senior Associate Software Engineer/Senior Lead Software EngineerLocation: Chicago, IL Onsite 3 days per weekDuration: 6 Month Contract to Hire Must Haves:5-8+ years of overall software engineering experience 4-6+ years in Site Reliability Engineering Experience developing, supporting, and managing cloud technologies Experience working with...


  • Chicago, Illinois, United States Spectraforce Technologies Full time

    Title :Senior Associate Software Engineer/Senior Lead Software Engineer Location :Chicago, IL Onsite 3 days per week Duration : 6 Month Contract to Hire Must Haves: 5-8+ years of overall software engineering experience 4-6+ years in Site Reliability Engineering Experience developing, supporting, and managing cloud technologies Experience working with...


  • Chicago, Illinois, United States iManage Full time

    We offer a flexible working policy that supports the health and well-being of our iManage employees. As an organization, we value collaborating and learning from our peers in person, while providing the necessary flexibility for our employees to have a meaningful work-life balance. Please reach out to learn more.Being a Senior Site Reliability Engineerat...


  • Chicago, Illinois, United States New York Technology Partners Full time

    Job Title: Site Reliability EngineerLocation: Chicago, IL (Hybrid)Position: ContractAbout the Job:Are you a tech savvy problem solver? We are looking for a skilled Site Reliability Engineer to join our team in Chicago, IL (Hybrid). If you have experience in SRE with expertise in Kubernetes and OCP, this could be the perfect opportunity for you.Key...


  • Chicago, Illinois, United States Grindr Full time

    This is a hybrid role based in our Chicago office and will require you to be in office Tuesdays and Thursdays. What's so interesting about this role? As we enter our second year as a public company, Grindr is building on the success we've had over our 15-year history in connecting, supporting, and improving the lives of the LGBTQ+ community globally.We are...


  • Chicago, Illinois, United States NinjaTrader Full time

    NinjaTrader is an investor-backed, growth-stage FinTech company with an award-winning platform and over 1 million users. We are building products and services which empower active traders to easily analyze and react to data from the world's leading financial markets. Located in Chicago, our unique employee-centric company culture is one that our team finds...


  • Chicago, Illinois, United States McDonald's Global Technology Full time

    Job DescriptionCompany Description:McDonald's new growth strategy, Accelerating the Arches, encompasses all aspects of our business as the leading global omni-channel restaurant brand. As the consumer landscape shifts we are using our competitive advantages to further strengthen our brand. One of our core growth strategies is to Double Down on the 3Ds...


  • Chicago, Illinois, United States iManage Full time

    We offer a flexible working policy that supports the health and well-being of our iManage employees. As an organization, we value collaborating and learning from our peers in person, while providing the necessary flexibility for our employees to have a meaningful work-life balance. Please reach out to learn more. Being a Site Reliability Engineer at iManage...


  • Chicago, Illinois, United States Ahold Delhaize USA Full time

    Address: USA-IL-Chicago-300 South Riverside Plaza Store Code: Exec_Development What's Our Dish Announced in May 2018, Peapod Digital Labs (PDL) is an Ahold Delhaize USA company that powers the eCommerce and digital strategies for the Great Local Brands of Ahold Delhaize USA. Accelerating growth in digital and personalization capabilities, PDL is an...


  • Chicago, Illinois, United States Spectraforce Technologies Full time

    Role:Site Reliability/DevOps Engineer Location: Chicago, IL - 3 days hybid Duration:11+ Months Description:The Goals Driven Wealth Management (GDWM) platform is a showcase product for business providing holistic advice on wealth management to high net worth and ultra high net worth clients.Skills: Overall 8- 12 year exp Technical :SRE Tools / Technologies :...


  • Chicago, Illinois, United States Chicago Mercantile Exchange Inc. Full time

    This role is hybrid requires to be 2 days on site in our Chicago office. This role does not allow to work outside of Illinois state. Position Overview:Data System Reliability Engineer (dSRE) CME Group: Where Futures Are Made CME Group is the world's leading and most diverse derivatives marketplace. But who we are goes deeper than that, here you can impact...

  • Senior Developers

    2 weeks ago


    Chicago, Illinois, United States TransUnion Full time

    Senior Developers at TransUnion, LLC At TransUnion, we are seeking Senior Developers for various and exciting projects at multiple locations throughout the U.S. Our headquarters are located in Chicago, IL. Join our team to be involved in designing cutting-edge software applications, deploying designs into production, and monitoring application...

  • Reliability Engineer

    2 weeks ago


    Chicago, Illinois, United States Metropolitan Water District Full time

    Apply engineering knowledge and skills for moderately complex engineering projects focused on developing, measuring, analyzing, and implementing plans and procedures which ensure reliability of manufacturing components, equipment, and processes.Day-to-Day Role:Follow and commit to meet Key Performance Indicators (KPI's) for safety, quality, production,...


  • Chicago, Illinois, United States American Express Global Business Travel Full time

    Amex GBT is a place where colleagues find inspiration in travel as a force for good and - through their work - can make an impact on our industry. We're here to help our colleagues achieve success and offer an inclusive and collaborative culture where your voice is valued.Ready to explore a career path? Start your journey.J-65816 Platform Engineering LeadIn...

  • Senior Engineer

    4 weeks ago


    Chicago, Illinois, United States Bank of America Full time

    Job Description:We are seeking a highly skilled and experienced Senior Engineer Fusion Center Technology to join our dynamic team. As a Senior Engineer, you will be responsible for providing technical leadership, strategic direction, and hands-on expertise in developing and implementing innovative technology solutions. This role requires a deep understanding...


  • Chicago, Illinois, United States TEKsystems Full time

    :The Operational Intelligence team received additional funding from Global Banking for a net new work. Global Banking needs a Solution Architect to design solutions for the monitoring tools the group is currently using (Splunk & Dynatrace).The Operational Intelligence Team seeks a strong architect to develop, document and drive the monitoring strategy and...


  • Chicago, Illinois, United States Capital One Financial Corp Full time

    Center , United States of America, McLean, Virginia Senior Manager, Data Engineering (Spark, AWS) Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive, and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers...