Site Reliability Engineer
2 weeks ago
Who we are:
In 2019, our founders were working as engineers solving complex cross-domain problems within government organisations.
TwinStream was formed to consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home.
Day Rate: £500 - £600
Location: Remote
Security Clearance: Eligible for DV Clearance
About the role:
Our cross-domain services are used in high-profile government organisations. The demand for these services continues to grow in both scope and scale. We are seeking an experienced Site Reliability Engineer to help satisfy that demand. As an SRE you will be responsible for ensuring the availability, performance and cost effectiveness of these services. You will work with multiple feature development teams and the BAU/Support team to define and evolve our cloud and on-prem infrastructure and delivery pipelines, improve system observability, demonstrate performance and capacity improvements, and proactively identify and mitigate reliability risks.
Key Responsibilities of the Site Reliability Engineer:
- Collaborate with Software Engineers to improve reliability and performance in their subsystems
- Partner with System Administrators in automating toil and eliminating alerts
- Evolve observability and monitoring capabilities to identify and solve problems before they impact the business
- Support development environments to help us achieve our delivery and quality goals
- Research and evaluate technologies, tools and services to influence buy-vs-build decisions
- Develop expertise in diverse technical and business domains
- Expand your knowledge of the technical stacks used
Skills & Experience Required:
- Experience using modern configuration management tools (such as Ansible, Chef or similar)
- Experience working with Terraform
- Experience working with Docker containers & container orchestration tools (such as Kubernetes, OpenShift or Docker Swarm)
- Experience both using and maintaining CI/CD tools (such as Jenkins or similar)
- Experience with monitoring tools such as InfluxDB, Prometheus or Grafana
- Experience of event-driven integration with MQ messaging (RabbitMQ or similar AMQP solution)
- Good understanding of relational databases and SQL
- Linux command line, administration and shell scripting
- Working knowledge of network security protocols
- Experience using, developing with and maintaining cloud hosting services (ideally AWS EC2, RDS, S3, Lambda)
Desirable Skills:
- Industry experience writing well-tested code in one of our platform languages (Java, Go, Python or similar)
- Knowledge of cross-domain principles & technologies
- Experience of working in a service management environment
- Practical experience applying observability patterns in previous systems
- Creating and monitoring system availability metrics and using those to drive work that reduces downtime
- Experience with Azure
Further Information:
To meet the security requirements of certain clients and industries we serve, any job offer will be contingent upon the successful completion of a security screening process.
At TwinStream, we take pride in being an equal opportunity employer. We celebrate diversity and are committed to fostering an inclusive environment where all individuals are valued and respected. We welcome applications from qualified candidates regardless of race, religion, disability, age, sexual orientation, or gender.