Current jobs related to Reliability Engineer - Tampa, Florida - Data Management Group

  • Reliability Engineer

    4 weeks ago


    Tampa, Florida, United States Software Galaxy Systems, LLC Full time

    Reliability Engineering Role Overview At Software Galaxy Systems, LLC, we are seeking a skilled Reliability Engineer to join our team. Job Summary: As a Reliability Engineer, you will be responsible for reviewing equipment and systems to determine appropriate maintenance strategies to maintain equipment integrity. Key Responsibilities: * Develops...

  • Reliability Engineer

    4 weeks ago


    Tampa, Florida, United States Unicon Pharma Inc Full time

    Job OverviewUnicon Pharma Inc is seeking a skilled Reliability Engineer to join our team. As a key member of our manufacturing team, you will be responsible for ensuring the reliability and integrity of our equipment and systems.Key ResponsibilitiesDevelop and implement maintenance plans to improve equipment uptime and reliabilityConduct failure analysis and...

  • Reliability Engineer

    4 weeks ago


    Tampa, Florida, United States sgsconsulting Full time

    Job SummaryWe are seeking a skilled Reliability Engineer to join our team at Software Galaxy Systems, LLC (SGS). As a key member of our contingent workforce, you will be responsible for reviewing equipment and systems to determine optimal maintenance strategies, ensuring equipment integrity and improving overall system reliability.Your expertise in...


  • Tampa, Florida, United States LMI Full time

    Job Title: Site Reliability EngineerJob Summary:LMI is seeking a skilled Site Reliability Engineer to join our team in Tampa, Florida. As a Site Reliability Engineer, you will be responsible for building and maintaining IT infrastructure resources that serve the Command Digital and Artificial Intelligence Office's (CDAO) data analysis and data management...


  • Tampa, Florida, United States Hays Recruitment Full time

    Job Title: Senior Site Reliability EngineerHays Recruitment is seeking a highly skilled Senior Site Reliability Engineer to join our team in Kissimmee, FL. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud-based systems.Job Summary:We are looking for a talented Senior...


  • Tampa, Florida, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerAt Diverse Lynx LLC, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our Microsoft Dynamics GP system, providing day-to-day technical application support, and implementing and continuing to learn new...


  • Tampa, Florida, United States Hays Recruitment Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Hays Recruitment. As a Senior Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based infrastructure and applications.Key Responsibilities:Design and implement scalable and...


  • Tampa, Florida, United States LMI Full time

    Job OpportunityLMI is seeking a skilled professional to fill the role of Site Reliability Engineer. In this position, you will be responsible for designing, implementing, and maintaining secure systems that support the Command Digital and Artificial Intelligence Office's (CDAO) data analysis and management requirements.Key Responsibilities:Design and...

  • Reliability Engineer

    3 weeks ago


    Tampa, Florida, United States Software Galaxy Systems, LLC Full time

    Job Description:We are seeking a highly skilled Reliability Engineer to join our team at Software Galaxy Systems, LLC. The ideal candidate will have pharma cGMP experience and a strong background in reliability engineering.Key Responsibilities:Develop and implement reliability strategies to improve manufacturing efficiency.Collaborate with cross-functional...


  • Tampa, Florida, United States Randstad Full time

    Job SummaryWe are seeking a highly skilled Sr. Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for developing and implementing observability strategies, utilizing industry-adopted design patterns, statistics, and trends, with visibility toward dependent systems.Responsibilities:Develops...


  • Tampa, Florida, United States Diverse Lynx Full time

    About the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a key member of our technical operations team, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly available...


  • Tampa, Florida, United States Hallmark Global Solutions Ltd Full time

    Job Title: Site Reliability Engineer with Mainframe ExpertiseLocation: Flexible, with options for remote workJob Summary:We are seeking a skilled Site Reliability Engineer with expertise in Mainframe systems to join our team at Hallmark Global Solutions Ltd. As a key member of our technical support team, you will be responsible for providing technical...


  • Tampa, Florida, United States LexisNexis Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at LexisNexis Risk Solutions. As a key member of our SRE team, you will be responsible for driving reliability and toil reduction projects, leveraging your expertise to automate recovery and protect service levels.Key ResponsibilitiesDesign and implement reliability...


  • Tampa, Florida, United States Hallmark Global Solutions Ltd Full time

    Job Title: Site Reliability Engineer with MainframeCompany: Hallmark Global Solutions LtdJob Summary:We are seeking a skilled Site Reliability Engineer with experience in Mainframe to join our team. The successful candidate will be responsible for providing technical support for complex applications, troubleshooting incidents, and implementing corrective...


  • Tampa, Florida, United States Randstad Full time

    Job Title: Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team in Tampa, FL. The successful candidate will be responsible for developing and implementing observability strategies, utilizing industry-adopted design patterns, statistics, and trends to ensure the reliability and resilience of our...


  • Tampa, Florida, United States Richard, Wayne & Roberts Full time

    We are seeking a skilled Mechanical Reliability Specialist to join our team at Richard, Wayne & Roberts in Florida. The ideal candidate will have 3+ years of experience in mechanical reliability and a degree in engineering.The successful candidate will be responsible for ensuring the reliability and efficiency of our mechanical systems. If you have a strong...


  • Tampa, Florida, United States Highlander Consultants Inc Defunct Full time

    Reliability EngineerWe are seeking a skilled Reliability Engineer to join our team at Highlander Consultants Inc Defunct. As a key member of our maintenance team, you will be responsible for analyzing equipment failure data, conducting root cause analysis, and developing maintenance strategies to reduce downtime.Responsibilities:Analyzing Equipment Failure...


  • Tampa, Florida, United States LMI Consulting, LLC Full time

    LMI Consulting, LLC is seeking a skilled Site Reliability Engineer to join our team in Tampa, Florida.The ideal candidate will have experience building and maintaining IT infrastructure resources that serve data analysis and data management requirements.Responsibilities include:Building, operating, and maintaining enterprise Multi Domain Data and Analytics...


  • Tampa, Florida, United States Strive Works Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Striveworks. As a key member of our DevOps team, you will be responsible for ensuring the seamless integration, customization, and configuration of our software solutions.Key ResponsibilitiesDesign, implement, and maintain scalable and secure cloud...


  • Tampa, Florida, United States Randstad Full time

    Job Summary:As a Senior Site Reliability Engineer at Randstad, you will be responsible for evaluating and improving the current footprint of our monitoring and observability tools, primarily in Dynatrace and Splunk. You will also work within GitLab to improve automation, release processes, and other guardrails. Additionally, you will assist with cloud...

Reliability Engineer

2 months ago


Tampa, Florida, United States Data Management Group Full time
Job Title: Reliability Engineer - Cloud Services

We are seeking an experienced Reliability Engineer to join our team and support critical projects for our Technology, Infrastructure & Operations teams.

The ideal candidate will have a strong background in performance engineering and performance testing, with a focus on cloud-based services and applications.

The successful candidate will be responsible for developing and maintaining comprehensive monitoring solutions, configuring monitoring tools and systems, and creating custom monitoring dashboards and reports.

Key Responsibilities:

  • Develop and maintain monitoring solutions for cloud-based services and applications
  • Configure monitoring tools and systems to collect relevant metrics, logs, and traces
  • Create custom monitoring dashboards and reports using Splunk, DataDog, and other tools
  • Continuously monitor cloud infrastructure's performance and capacity, anticipating and addressing potential scalability issues
  • Proactively suggest and implement improvements to enhance system reliability, resilience, and fault tolerance
  • Work on automating tasks to streamline operational processes and reduce manual intervention
  • Collaborate with cross-functional teams to investigate and resolve critical incidents
  • Work with Problem Management team to complete post-mortem analysis of incidents
  • Understand overall architecture of systems to identify gaps in monitoring and troubleshoot issues
  • Configure and maintain custom dashboards and alerts in various monitoring tools
  • Create custom reports and deliver presentations to stakeholders
  • Develop scripts for monitoring using PowerShell, Python, and Shell scripting
  • Develop metrics for business and technical teams to determine system health
  • Provide on-call support as needed
  • Lead and coordinate performance engineering for medium to large initiatives
  • Collect and document expected system performance and operational characteristics
  • Collect and/or prepare test data for test execution
  • Develop and execute performance tests, including load, stress, endurance, fail-over, and interoperability
  • Conduct technical analysis of performance test results and production systems
  • Identify, report, and review defects in assessing system performance and stability
  • Define strategy for enabling performance diagnostics and monitoring using APM tools and diagnostic techniques
  • Collaborate with developers to promote performance engineering during all phases of SDLC
  • Lead peer reviews to ensure completeness of all test assets created
  • Resolve performance and stability issues in performance test environment
  • Develop performance engineering work plan structure and project schedule
  • Review architectural design for performance risks and potential issues
  • Prepare capacity analysis when applicable

Requirements:

  • Minimum of 8 years performance engineering and performance testing experience
  • 5+ years of recent work with Ansible
  • 4+ years of work with DataDog
  • Excellent English Communications skills - Verbal & Written
  • Experience managing performance engineering efforts for applications
  • Knowledge of developing scripts for monitoring using PowerShell, Python, and Shell scripting
  • 5 years' of Splunk programming proficiency
  • 5-6 years' experience using.NET and Java application and Application Monitoring Tools like App Dynamics or Datadog
  • Proficiency in performance tuning
  • Good understanding of UI, Middleware, and backend Databases
  • BA/BS degree in Information Technology, Computer Science, or related field of study

What We Offer:

  • Opportunity to work with a talented team of professionals
  • Chance to develop and maintain comprehensive monitoring solutions
  • Collaborative and dynamic work environment
  • Professional growth and development opportunities

How to Apply:

Please submit your resume and cover letter to [insert contact information].