Agile Datapro | Site Reliability Engineer
3 weeks ago
About the Job:-
- Role: SRE or Site Reliability Engineer
- Type of Engagement: Hybrid - 2 days from Mountain View office
- Location: Mountain View
- Employment Type: Full Time with Client (W2)
Job Description/Requirement:
Design, implement, and maintain complex data systems supporting millions of customers with Cloud Native principles and best practices to ensure highly available, secure, performant, and scalable database systems
• Build and maintain CI/CD pipelines in Jenkins
• Build and deploy services in Kubernetes cluster using helm, kustomize, etc
• Contribute to infrastructure changes to AWS with deep understanding of AWS services
• Engage in on-call for pre-production and production systems supporting multi-million users
• Write/Review RCA docs to prevent recurrence of Incidents in future and share the learnings
• Contribute to major system upgrades, deployment automation, monitoring enhancements and Production changes
• Create operational playbooks, contribute to how-to articles, and gain domain knowledge to drive changes in the team
• Participate and contribute in FMEA/Chaos testing, Security remediations, etc
• Share best practices and patterns for operational excellence and cost optimization
• Reduce or eliminate manual steps by automating as much as possible
• Continuously look for opportunities to increase developer velocity and productivity
Qualifications:
• Bachelor’s or master’s degree in computer science or a related technical field. Equivalent experience will be considered
• 4+ years of hands-on development & operational experience with building and maintaining infrastructure in AWS
• Extensive performance monitoring, troubleshooting & tuning experience
• Experience with AWS services and hands-on knowledge of hosting on Cloud
• Experience with scripting languages for DevOps automation
• Experience with any one of the programming languages: Java/Python/Ruby
• Knowledge of Docker & Kubernetes, ArgoCD,
• Experience with monitoring and observability using Splunk, Wavefront, AppDynamics, Prometheus, Tracing, etc
Education:
Bachelor’s degree in computer science, Software Engineering, or a related field.
If you are interested to pursue the opportunity, please send your updated resume to saikat.g@agiledatapro.com along with your rate / salary information
-
Site Reliability Engineer
3 weeks ago
San Francisco Bay Area, United States Agile Datapro Full timeAbout the Job:-Role: SRE or Site Reliability EngineerType of Engagement: Hybrid - 2 days from Mountain View officeLocation: Mountain ViewEmployment Type: Full Time with Client (W2) Job Description/Requirement:Design, implement, and maintain complex data systems supporting millions of customers with Cloud Native principles and best practices to ensure highly...
-
Reliability Architect
3 weeks ago
San Francisco, California, United States Agile Datapro Full timeJob Opportunity:We are seeking a highly skilled Reliability Architect to join our team at Agile Datapro. The successful candidate will have a strong background in designing and implementing complex data systems using Cloud Native principles and best practices.Responsibilities:Design, implement, and maintain database systems supporting millions of...
-
Agile Datapro | Python Developer
3 weeks ago
san francisco bay area, United States Agile Datapro Full timeAbout the JobRole: Python EngineerType of Engagement: FulltimeLocation: RemoteJob Description / Requirement:We are looking for a Python Engineer to advance financial inclusion and digital transformation by adapting our platform to comply with local-market needs. On our team, you will be in the middle of it all – implementing new functionality, architecting...
-
Agile Datapro | Python Developer
1 week ago
san francisco, United States Agile Datapro Full timeAbout the JobRole: Python EngineerType of Engagement: FulltimeLocation: RemoteJob Description / Requirement:We are looking for a Python Engineer to advance financial inclusion and digital transformation by adapting our platform to comply with local-market needs. On our team, you will be in the middle of it all – implementing new functionality, architecting...
-
Agile Datapro | Program Manager | san diego, ca
1 month ago
san diego, United States Agile Datapro Full timeJob Title: Program ManagerLocation – Mountain view OR San Diego Duration: Full TimeAbout The job Role• Working larger tech organization• Managing large team of stake holders• Not exactly TPM, but plus• Core project management competencies• Understand deliverable, dependency, draw out project Gantt chart, help identify risk, proactive, great...
-
Agile Datapro | Program Manager | san diego, ca
1 month ago
san diego, United States Agile Datapro Full timeJob Title: Program ManagerLocation – Mountain view OR San Diego Duration: Full TimeAbout The job Role• Working larger tech organization• Managing large team of stake holders• Not exactly TPM, but plus• Core project management competencies• Understand deliverable, dependency, draw out project Gantt chart, help identify risk, proactive, great...
-
Site Reliability Engineer
2 weeks ago
San Francisco Bay Area, United States Bun Full timeBun is an open-source JavaScript tooling company focused on making programming simpler. We've raised $26 million from top investors in Silicon Valley, are among the top GitHub repositories and have a growing community of 33,000 Discord members.We're hiring an experienced Site Reliability Engineer to scale and maintain the infrastructure that builds and tests...
-
Python Developer
3 weeks ago
San Francisco Bay Area, United States Agile Datapro Full timeAbout the JobRole: Python EngineerType of Engagement: FulltimeLocation: RemoteJob Description / Requirement:We are looking for a Python Engineer to advance financial inclusion and digital transformation by adapting our platform to comply with local-market needs. On our team, you will be in the middle of it all – implementing new functionality, architecting...
-
Staff Site Reliability Engineer
1 month ago
San Francisco, United States Ursus Inc Full timeJOB TITLE: Staff SRE **TOP 3 SKILLS:** GoLang Kubernetes Ruby LOCATION: Remote DURATION: Direct Hire RATE RANGE: $160-180K SUMMARY: We're looking for a driven software engineer who cares deeply about their craft, and who wants to use their skills to bring about positive change in the world while working in a high performing...
-
Staff Site Reliability Engineer
2 months ago
San Francisco, United States CV Library Full timeJOB TITLE: Staff SRETOP 3 SKILLS:GoLangKubernetesRubyLOCATION: RemoteDURATION: Direct HireRATE RANGE: $160-180KSUMMARY:We're looking for a driven software engineer who cares deeply about their craft, and who wants to use their skills to bring about positive change in the world while working in a high performing organization using modern software development...
-
Site Reliability Engineer
1 month ago
San Francisco, United States Bun Full timeBun is an open-source JavaScript tooling company focused on making programming simpler. We've raised $26 million from top investors in Silicon Valley, are among the top GitHub repositories and have a growing community of 33,000 Discord members.We're hiring an experienced Site Reliability Engineer to scale and maintain the infrastructure that builds and tests...
-
Site Reliability Engineer
1 week ago
San Francisco, United States EVONA Full timeSite Reliability Engineer (SRE)Location: San Francisco Bay AreaRole Overview:We are seeking a highly skilled Site Reliability Engineer (SRE) to join a dynamic team at a rapidly growing technology company. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of mission-critical systems, while implementing automation...
-
Site Reliability Engineer
2 months ago
San Francisco, United States Ellation, Inc. Full timeWho We AreWe‘re a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...
-
Site Reliability Engineer
1 month ago
San Francisco, United States Unreal Gigs Full timeAre you passionate about building and maintaining resilient systems that ensure high availability and performance? Do you excel at automating processes, troubleshooting complex issues, and creating systems that scale smoothly? If you're ready to take on the challenge of ensuring reliable, efficient, and secure system operations, our client has the perfect...
-
Site Reliability Engineering Lead
2 hours ago
San Francisco, California, United States Indotronix International Corporation Full timeJob DescriptionWe are seeking a highly experienced Site Reliability Engineering Lead to join our team at Indotronix International Corporation.The ideal candidate will have experience with site reliability engineering, Kubernetes, Docker, CI/CD, and Jenkins, as well as strong production support skills. A background in Splunk or similar logging/observability...
-
Lead Site Reliability Engineer
3 days ago
San Francisco, United States Federal Reserve Bank of San Francisco Full timeCompany: Federal Reserve Bank of San FranciscoWe are the Federal Reserve Bank of San Francisco-public servants with a mission to advance the nation's monetary, financial, and payment systems to build a stronger economy for all Americans. We are a community-engaged bank, and are committed to understanding and serving the vibrant, expansive communities of the...
-
Site Reliability Engineer
2 months ago
San Francisco, United States New York Technology Partners Full timeMust Have's in the order of preference.Typical Java/J2EE experience between 6 and 10 yearsApplication Production Support(SRE - Site Reliability Engineering) with 3+ years - Preferably in e-commerce domainHands-on experience in any of the UI Frameworks(AngularJS, VueJS etc) - 1+ years
-
Site Reliability Engineer
2 months ago
san francisco, United States New York Technology Partners Full timeMust Have's in the order of preference.Typical Java/J2EE experience between 6 and 10 yearsApplication Production Support(SRE - Site Reliability Engineering) with 3+ years - Preferably in e-commerce domainHands-on experience in any of the UI Frameworks(AngularJS, VueJS etc) - 1+ years
-
Site Reliability Engineer
4 days ago
San Francisco, United States Arbitrum Full timeOur mission is to bring blockchain to a billion people. The Alchemy Platform is a world class developer platform designed to make building on the blockchain easy. We've built leading infrastructure in the space, powering over$105billion in transactions for tens of millions of users in 99% of countries worldwide. The Alchemy team draws from decades of deep...
-
Sr Site Reliability Engineer
1 month ago
San Francisco, United States Federal Reserve Bank of San Francisco Full timeCompany: Federal Reserve Bank of San FranciscoJob Description:While the SF Fed is a Reserve Bank, we're not what you might expect. We're unreserved here. That means we seek new and diverse perspectives. We spark conversations and encourage debate. We build opportunity. We pursue careers that are true to ourselves. We are looking for people who want to help...