See more Collapse

Site Reliability Engineer

2 months ago


San Francisco, United States GRNET S.A. Full time

About GRNET

GRNET - National Infrastructures for Research and Technology, is an entity of the Greek Government, operating under the Ministry of Digital Governance. It provides advanced network and cloud computing services to academic and research institutions, educational entities at all levels, as well as to public, broader public, and private sector agencies. GRNET has a wide service portfolio, covering a number of sectors. A brief list is provided below:

GOV.GR: Unified Portal for all Government-related Digital Services (Ενιαία Ψηφιακή Πύλη).

RE-Cloud: Cloud Services for Research and Education (~okeanos, ViMa).

Networking: GRNET acts as the ISP for Greece's research and academic institutions.

GRNET Site Reliability Engineering

GRNET maintains its infrastructure across multiple data centers distributed throughout Greece, primarily utilizing Free and Open Source software such as Kubernetes, ArgoCD, GitLab CI, Debian GNU/Linux, OpenStack, Ceph, Ansible and more. GRNET adopts the Site Reliability Engineering approach. Our SRE department is divided into three groups: Services, Platform, and Cloud. As an SRE, you will be assigned to one of these groups, depending on the current needs, your preferences, and expertise.

Here is a summary of the responsibilities of each group:

Cloud:

Design and implement GRNET's new Cloud infrastructure utilizing OpenStack and Kubernetes.

Platform:

Design and manage our Internal Developer Platform, based on Kubernetes.

Services:

Manage services, pipelines and tooling for use on top of our Internal Developer Platform.

Your Role

Whether you have a Systems or Software Engineering background, we are seeking Senior SREs with a strong DevOps mindset that are willing for the following:

Design and implement fault-tolerant, scalable and distributed services.

Bring your technical opinion and vision to the table: It matters.

Handle problems that require under-the-hood investigation, whether it is called legacy infrastructure, technical debt or unfamiliar technical territory.

Lead projects within the team.

Able to collaborate with multiple people and teams based on a policy of openness.

Requirements

Required Qualifications

At least three (3) years of professional working experience as an SRE or Software Engineer with emphasis on infrastructure.

Bachelor's degree in Computer Science or a related field, alternatively, comparable professional experience.

Experience on designing distributed services at scale; whether on-premises or in the cloud.

Experience on running containerized workloads on Kubernetes.

Knowledge of DevOps practices that bridge gaps, promote communication and speed up processes.

Knowledge of Linux internals: Words like cgroups, tcpdump, inode, procfs should sound familiar to you.

Knowledge of at least one (1) programming language.

Working-level Proficiency in Greek. The role involves communication with Greek-speaking stakeholders and teams.

Bonus Qualifications

Experience with Data Center and Linux networking concepts and internals.

Experience with on-premises Cloud infrastructure (OpenStack, OpenNebula, etc).

Related personal projects or contributions to open-source projects.

Benefits

An exciting, dynamic and fast-paced working environment that encourages team spirit, cooperation and continuous learning of state-of-the-art technology. A competitive remuneration package and benefits, international collaborations and an environment that fosters innovation. Training and participation in technical conferences.

GRNET is dedicated to promoting diversity and inclusion in the workplace and is an equal opportunity employer. We welcome applications from individuals of varied backgrounds. Our policy ensures that no applicant is discriminated against based on race, age, color, gender identity and expression, disability, national origin, medical conditions, religion, parental status, or any legally protected characteristics. All applications will be treated with strict confidentiality.

#J-18808-Ljbffr


We have other current jobs related to this field that you can find below


  • San Francisco, United States ViralMoment Full time

    Job DescriptionJob DescriptionJob Title: DevOps/Site Reliability EngineerLocation: RemoteAbout ViralMoment:ViralMoment is an AI social listening platform that analyzes social videos to identify trending topics and provide insights to brands and agencies. Our mission is to help our clients stay ahead of the curve by leveraging cutting-edge AI technology.About...


  • San Francisco, United States Patreon Full time

    Patreon is the best place for creators to build exclusive content and community for their fans. We enable creators (podcasters, writers, musicians, illustrators, etc) to connect with their fans directly and make money from their creative work. Creators can sell one-off items from their own shops or offer recurring monthly memberships with exclusive access to...


  • San Francisco, United States Apollo Solutions Full time

    Principal Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking Fintech start-up backed by top tier venture capital. They are looking to significantly disrupt how we view, store and invest our personal finance and have already made significant waves in the industry. The Principal Site Reliability Engineer will be working closely...


  • San Francisco, United States BaseTen Labs, Inc. Full time

    ABOUT BASETEN We're a growing team of builders backed by top-tier investors including IVP, Spark Capital, and Sarah Guo at Conviction. ML teams at enterprises and category-defining AI-native companies like Descript, Bland, and Patreon use Baseten to power their core production workloads with best in class performance, security, and reliability. While we've...


  • San Francisco, United States Apollo Solutions Full time

    Principal Site Reliability Engineer SRE Apollo Solutions have proudly partnered with a Series E SaaS organization based in San Francisco. They have recently employed a highly respected CEO who has spent his career successfully scaling multiple start-ups with large exit events including a $1 billion+ IPO. We are looking for a Principal SRE based in San...


  • San Francisco, United States DAOmatch Full time

    Aptos is a people-first blockchain on a mission to help billions of people achieve universal and fair access to decentralized assets in a safe and scalable way.Founded by some of the original creators and maintainers that researched, designed, and built the Diem blockchain to serve this purpose, we have dedicated several years toward this mission. We believe...


  • San Francisco, California, United States Academia Full time

    SRE / Site Reliability EngineerSAN FRANCISCO, CA or REMOTE from anywhere in the USAWho we are: has built and is expanding the premier distribution and peer review platform for academic research. Guided by our mission to democratize and accelerate the world's research, Academia aims to make every academic paper ever published available for free online and...


  • San Francisco, United States AutoRABIT Holding, Inc. Full time

    About AutoRABIT: AutoRABIT is a hyper-growth SaaS software company and the leading provider of Salesforce DevSecOps platform for regulated industries such as financial institutions, insurance, and healthcare. AutoRABIT solutions enable developers to automate their daily tasks to be more productive and increase the release velocity for their development team...


  • San Francisco, United States Charter Global Full time

    Title: Site Reliability EngineerLocation: San Francisco, CA or Seattle, WA(5 days onsite)Duration: ContractContract Description:The primary responsibility is to monitor system performance, identify areas for improvement, and implement solutions to enhance reliability and availability.RESPONSIBILITIESGuide architecture and development teams on how to make...


  • San Jose, United States Adobe Full time

    Site Reliability Engineer page is loadedAdobe’s Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest, and easiest way to get contracts signed and filed.You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...


  • San Jose, California, United States Adobe Full time

    Site Reliability Engineer page is loadedAdobe's Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest, and easiest way to get contracts signed and filed.You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...


  • San Francisco, United States Gusto Full time

    About GustoGusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and team management tools. Today, Gusto offices in Denver, San Francisco, and New York serve more than 300,000 businesses nationwide. Our mission is to create a world...


  • San Francisco, United States Gusto Full time

    About Gusto Gusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and team management tools. Today, Gusto offices in Denver, San Francisco, and New York serve more than 300,000 businesses nationwide. Our mission is to create a world...


  • San Francisco, United States Withorb Full time

    Mission Orb is on an ambitious mission to provide every business with the infrastructure to unlock their revenue. Best-in class businesses find ways to effectively align their monetization to product usage—whether that's through seats, consumption, feature limits, or usage-based tiers. Orb brings that opportunity to every software company. We are...


  • San Francisco, United States Charter Global Full time

    Title: Site Reliability Engineer (Need Locals)Location: SFO, CA or Seattle, WADuration: 6-12+ MonthsResponsibilities:Guide architecture and development teams on how to make applications highly available, reliable, and performant at a global scalePartner with architecture teams to ensure operability, measurability, and manageability are accounted for in...


  • San Francisco, United States Charter Global Full time

    Title: Site Reliability Engineer (Need Locals)Location: SFO, CA or Seattle, WADuration: 6-12+ MonthsResponsibilities:Guide architecture and development teams on how to make applications highly available, reliable, and performant at a global scalePartner with architecture teams to ensure operability, measurability, and manageability are accounted for in...


  • San Francisco, United States Outdefine Full time

    Site Reliability Engineer - Outdefine Partner Location: Hybrid – Sunnyvale, CA Visa Status: USC, GC, H4 EAD Contract: Long term contract: 1099 – US Based candidates / No C2C or C2H About the job Overview: Job Overview: Experience: 5+ years as SRE or DevOps Mandatory Skills: Ecommerce industry experience SRE or DevOps experience Java Kubernetes Azure...


  • San Francisco, United States AutoRABIT Holding Inc. Full time

    Job DescriptionJob DescriptionAbout AutoRABIT:AutoRABIT is a hyper-growth SaaS software company and the leading provider of Salesforce DevSecOps platform for regulated industries such financial institutions, insurance, and healthcare. AutoRABIT solutions enable developers to automate their daily tasks to be more productive and increase the release velocity...


  • San Francisco, United States AutoRABIT Holding Inc. Full time

    Job DescriptionJob DescriptionAbout AutoRABIT:AutoRABIT is a hyper-growth SaaS software company and the leading provider of Salesforce DevSecOps platform for regulated industries such financial institutions, insurance, and healthcare. AutoRABIT solutions enable developers to automate their daily tasks to be more productive and increase the release velocity...


  • San Francisco, United States AutoRABIT Holding Inc. Full time

    Job DescriptionJob DescriptionAbout AutoRABIT:AutoRABIT is a hyper-growth SaaS software company and the leading provider of Salesforce DevSecOps platform for regulated industries such financial institutions, insurance, and healthcare. AutoRABIT solutions enable developers to automate their daily tasks to be more productive and increase the release velocity...