Current jobs related to Site Reliability Engineer - Dallas - Appspace


  • Dallas, Texas, United States The Goldman Sachs Group Full time

    Job Title: Site Reliability EngineerAt Goldman Sachs, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability and reliability of our firm's most critical platform services.Key Responsibilities:Develop and implement automation tooling to improve the...


  • Dallas, Texas, United States The Goldman Sachs Group Full time

    Job Title: Site Reliability EngineerAt Goldman Sachs, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability and reliability of our firm's most critical platform services.Key Responsibilities:Develop and implement automation tooling to improve the...


  • Dallas, Texas, United States Glocomms Full time

    Job Title: Site Reliability EngineerGlocomms is seeking a highly skilled Site Reliability Engineer to join their team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the company's cloud infrastructure.Responsibilities:Design and implement scalable and highly available cloud infrastructureDevelop and...


  • Dallas, Texas, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.**Key Responsibilities:*** Design, implement, and maintain scalable and reliable cloud...


  • Dallas, Texas, United States Bayone Full time

    Job Title: Site Reliability EngineerBayone is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining highly available and scalable applications deployed in Azure.Key Responsibilities:Design and implement automation tools and scripts to streamline...


  • Dallas, Texas, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the availability, reliability, and performance of our applications and infrastructure.Key Responsibilities:Design, implement, and maintain scalable and...


  • Dallas, Texas, United States STIAOS Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at STIAOS Technologies in Dallas, TX. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our ecommerce platform.Key Responsibilities:Collaborate with cross-functional teams to identify...


  • Dallas, Texas, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerAt Diverse Lynx LLC, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the availability, reliability, and performance of our applications and infrastructure.Key Responsibilities:Design, implement, and maintain scalable and...


  • Dallas, Texas, United States Motion Recruitment Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Motion Recruitment. As a Site Reliability Engineer, you will be responsible for ensuring the stability, scalability, and performance of our applications.About the RoleThis is a direct hire, hybrid role (3-4 days onsite) in Dallas, Texas. The...


  • Dallas, Texas, United States Motion Recruitment Full time

    Job Title: Site Reliability EngineerWe are seeking a skilled Site Reliability Engineer to join our team at Motion Recruitment. As a Site Reliability Engineer, you will be responsible for ensuring the stability, scalability, and performance of our applications.About the RoleThis is a direct hire, hybrid role (3-4 days onsite) in Dallas, Texas. The ideal...


  • Dallas, Texas, United States Themesoft Inc. Full time

    Site Reliability EngineerAt Themesoft Inc., we're seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Foster a culture of reliability and efficiency by sharing best...


  • Dallas, Texas, United States The Goldman Sachs Group Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for ensuring the availability and reliability of our firm's most critical platform services.Key Responsibilities:Develop and implement incident management processes to ensure...


  • Dallas, Texas, United States STIAOS Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at STIAOS Technologies in Dallas, TX. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our ecommerce systems.Key Responsibilities:Collaborate with cross-functional teams to identify and...


  • Dallas, Texas, United States Tata Consultancy Services Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Tata Consultancy Services. As an SRE Support Analyst, you will play a critical role in ensuring the stability and sustainability of our software systems.Key ResponsibilitiesDrive the stability and sustainability of our next-generation systems and discover innovative...


  • Dallas, Texas, United States Saxon Global Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Saxon Global. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based e-commerce and retail platform.Key ResponsibilitiesDesign, develop, and maintain tools to improve the reliability,...


  • Dallas, Texas, United States The Goldman Sachs Group Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for ensuring the availability and reliability of our firm's most critical platform services.Key ResponsibilitiesDevelop and maintain automation tooling to improve the reliability of our platform and...


  • Dallas, Texas, United States Diverse Lynx Full time

    Job DescriptionRole: Site Reliability Engineer/DevOps EngineerLocation: Dallas, TX (Onsite)Duration: Full-timeJob Description: We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the availability, reliability, and performance of our applications...


  • Dallas, United States Themesoft Inc. Full time

    SRE Engineer/ Dallas, TX Location / FTE / Hybrid Role. Job Description: The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment. The roleMonitor...


  • Dallas, Texas, United States Glow Networks Full time

    Site Reliability Engineer (SRE for Datacenter)At Glow Networks, we are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability and performance of our datacenter infrastructure. Responsibilities:Data monitoring and alerting, data quality assurance, and anomaly...


  • Dallas, Texas, United States Mastech Digital Full time

    About the Role:We are seeking a skilled Site Reliability Engineer to join our team at Mastech Digital. As a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our IT systems and infrastructure.Key Responsibilities:Administration and troubleshooting in Linux and WindowsPatching and basic scripting skills (PowerShell,...

Site Reliability Engineer

2 months ago


Dallas, United States Appspace Full time

Your Role as a Site Reliability Engineer: Our Cloud Operations team seeks a Site Reliability Engineer who is passionate about problem-solving, automating, and maintaining Appspace’s Cloud Platform to support the needs of our Engineering and Customer Care teams. The ideal candidate will see manual work as an opportunity to exercise automation, will understand SRE best practices, have experience automating infrastructure deployments and developing self-healing solutions to infrastructure issues. You will work closely with a global team of cloud, engineering, product, and service professionals to improve our platform’s resiliency and scalability, which directly improves our customers’ experience with Appspace. With this role, you can grow your capabilities as a Site Reliability Engineer given the large-scale size of our cloud platform combined with our smaller-sized Cloud Operations team, which means you will have opportunities to work on all Cloud Infrastructure, end-to-end. This is a mission-critical role for Appspace, therefore while we offer flex time, it should be scheduled ahead of time, otherwise shift engagement is mandatory outside lunch and break times. On-Call coverage will be required weekly during a limited window of US daytime hours over the weekend. This is your opportunity to be part of an awesome company that is rapidly growing and defining the modern workplace experience market A Day in the Life of a Site Reliability Engineer: For this role, you will play a key role in maintaining our cloud platform, which includes an assortment of Kubernetes, Microservices, MongoDB, RabbitMQ, MySQL, Windows Server VM Infrastructure, Orchestration Engines, CI/CD and Monitoring platforms. Your day will consist of: Automating maintenance tasks for our Cloud Platform, therefore strong experience in Python and shell scripting is a must. Deploying new features and releases of our software into Kubernetes via Helm, so strong experience in Kubernetes and Helm is a must. Troubleshooting performance issues or errors thrown by the cloud platform or application, and either resolving the underlying cause, or forwarding your research to Engineering to address in the product. Actioning Request Tickets from other teams in support of their needs to enable and prepare for upcoming releases. Monitoring the application’s performance, uptime, and cloud infrastructure’s performance, looking for improvement opportunities, and proactively taking action to solve any negative trends before they become issues. Lead, Participate, or Execute within the incident management process when alerts fire, and quickly ascertain root cause, resolve the issue, and find new and creative solutions to prevent recurrence. Configure, Monitor, Research, and Evaluate workload performances both on Google Cloud Platform and Microsoft Azure Clouds. Collaborating with our Development and Quality Assurance teams to address issues in the product and platform. Documenting new or updating existing processes and procedures to share knowledge and improve on standardized approaches to solution. What You’ll Need: Must be able to learn new technologies quickly and a desire to be a life-long learner Must communicate well and adapt to working well with others across different countries and cultures. Strong background in Containers, Kubernetes, Helm, Linux, Python coding, and some experience with Windows Server OS and MacOS are a must. Experience with Google Cloud Platform, Google Kubernetes Engine, Google Compute Engine, and Google Storage is highly desired, but comparable experience with AWS or Azure will be considered. Solid troubleshooting experience and the ability to reason through a process workflow to identify a fault or odd behavior (i.e., spending time following log trails) is a must. Experience with administering MySQL & MongoDB preferred. Experience with administering message brokering systems like RabbitMQ preferred. Must be flexible on occasionally attending “off-hour” meetings (we’re a global team supporting a global customer base). Open to quarterly travel up to 5%. Nice to Haves: Experience with Build pipeline tools and the Atlassian suite (JIRA, Confluence, Bitbucket/Git, Bamboo, Octopus). Experience with monitoring and alerting platforms, especially StackDriver. Experience with HashiCorp Terraform. Experience with IIS. The Perks of Working for Appspace: For all our US based team members, we offer a variety of benefits from competitive salaries, medical, dental and vision coverage, disability coverage, employer paid life insurance, mental health resources, 401(k) plan and a fully paid parental leave program. Additional perks include: Generous PTO Flexible work schedules Remote work opportunities Paid company holidays 1/2 Day Fridays Appspace Quiet Fridays (No non-essential internal meetings scheduled) A casual dress work environment #J-18808-Ljbffr