Site Reliability Engineer

1 week ago

San Francisco CA, United States Writemed Full time

About Us Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care pathways? WriteMed.AI helps Biopharma and Life Sciences companies reduce time to write medical publications and regulatory paperwork. Want to make an application Make sure your CV is up to date, then read the following job specs carefully before applying. Site Reliability Engineer Location: Atlanta, GA; Miami, FL; Cambridge, MA; San Francisco, CA; Towson, MD Role Overview Our technical team supports our customers’ missions with a spirit of innovation across all technologies, including AI, GenAI, LLM, Compute, Storage, Database, Big Data, Application-level Services, Networking, Serverless, Deployment, Security, and more. This is an opportunity to partner with our principal AI Architects, Data Scientists, and Engineers to maintain a robust and secure technical foundation for our customers, ranging from small Biotech companies to large Pharmaceutical firms. Qualifications Passionate about learning and evolving with current technological trends Engineering degree or related technical discipline, or equivalent work experience Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java) Knowledge of Cloud-based applications & Containerization Technologies Understanding of metric generation, log aggregation, time-series databases, and distributed tracing Experience with industry standards like Terraform, Ansible Fundamentals in Network Design, Cloud architecture, Security, or Computer Science At least 5 years of hands-on experience in Engineering or Cloud Minimum 5 years of experience with cloud platforms (e.g., GCP, AWS, Azure) At least 3 years of experience in configuration and maintenance of applications or systems infrastructure for large-scale customer-facing companies Experience with distributed system design and architecture Responsibilities Develop software solutions to support service delivery processes Build and manage CI/CD pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering, and auto-remediation Innovate relentlessly to ensure a flawless customer experience Engage in the lifecycle of services from conception to EOL, including system design Provide consulting and capacity planning Define and deploy standards related to System Architecture, Service Delivery, metrics, and operational automation Support services, product, and engineering teams with tooling and frameworks to increase availability and incident response Improve system performance and efficiency through automation and process refinement Collaborate xrczosw with engineering teams to deliver reliable systems Increase operational efficiency and quality by treating operational challenges as software engineering problems Mentor junior team members and champion Site Reliability Engineering Participate in incident response, including on-call duties Partner with stakeholders to influence technical and business outcomes Benefits Comprehensive benefits supporting your personal and professional growth, including wellness programs, tuition reimbursement, expense programs, student loan repayment, childcare, and pet insurance Inclusive culture with active employee resource groups and supportive leadership Salary range: $140,300 to $191,550, with variations based on skills, experience, and location Eligibility for short-term and long-term incentives as part of total compensation #J-18808-Ljbffr

Site Reliability Engineer

3 weeks ago

San Francisco, CA, United States P2P Full time

Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers the powerful APIs, SDKs, and tools necessary to build and scale onchain apps and rollups. Candidates should take the time to read all the elements of this job...
Site Reliability Engineer

3 weeks ago

San Francisco, CA, United States Air Apps Full time

Join to apply for the Site Reliability Engineer (SRE) role at Air Apps Join to apply for the Site Reliability Engineer (SRE) role at Air Apps Get AI-powered advice on this job and more exclusive features. About Air Apps Are you ready to apply Make sure you understand all the responsibilities and tasks associated with this role before proceeding. At Air Apps,...
Software Engineering

2 weeks ago

San Francisco, CA, United States Jobright.ai Full time

Join to apply for the Site Reliability Engineer - Inference role at Jobright.ai 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer - Inference role at Jobright.ai Get AI-powered advice on this job and more exclusive features. Jobright is an AI-powered career platform that helps job seekers discover the top...
Site Reliability Engineer

3 weeks ago

San Francisco, CA, United States ConductorOne Full time

ConductorOne is the first AI-native identity security platform that protects every identity: human, non-human, and AI. With powerful automation, platform-level AI, and out-of-the-box connectors, it centralizes access visibility, enforces fine-grained controls, enables just-in-time access, and automates user access reviews across all apps. It’s easy to use,...
Site Reliability Engineer

2 weeks ago

San Francisco, CA, United States SOLANA FOUNDATION Full time

Our Mission Increase your chances of reaching the interview stage by reading the complete job description and applying promptly. Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers the powerful APIs, SDKs, and tools...
Site Reliability Engineer

3 weeks ago

San Francisco, United States Alchemy Full time

Join to apply for the Site Reliability Engineer role at Alchemy Join to apply for the Site Reliability Engineer role at Alchemy Our Mission Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers the powerful APIs, SDKs,...
Site Reliability Engineer

3 weeks ago

San Francisco, United States Rivago Infotech Inc Full time

Staff Site Reliability Engineer (SRE) Job Responsibilities As our Staff SRE, you'll be the primary expert responsible for our entire compute ecosystem. Your key responsibilities will include: Design, implement, and lead large-scale, cross-functional projects to improve the reliability, performance, and efficiency of our core services and infrastructure (10×...
Site Reliability Engineer

3 weeks ago

San Francisco, United States Workos Full time

About WorkOS 🚀WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. We’re a fully distributed team with employees across North American time zones. We’re well-funded, having raised an $80M Series B. Our fast-growing customer base includes hundreds of rapidly...
Site Reliability Engineer

2 weeks ago

San Francisco, CA, United States Alchemy Full time

Join to apply for the Site Reliability Engineer role at Alchemy Join to apply for the Site Reliability Engineer role at Alchemy Our Mission Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers the powerful APIs, SDKs,...
Engineering Manager, Site Reliability

2 weeks ago

San Francisco, United States Reddit Full time

Engineering Manager, Site ReliabilityAs an Engineering Manager for Site Reliability, you will be responsible for ensuring the reliability, performance, efficiency, and resilience of your team's systems and services, as well as working to ensure that the experience of your customers other internal engineering teams steadily improves. This includes...

Americas

Europe

Asia / Oceania

Africa

Site Reliability Engineer