Current jobs related to Observability Automation - Boston - Iron Mountain


  • Boston, Massachusetts, United States Iron Mountain Inc Full time

    Job SummaryWe are seeking a highly skilled Senior Engineer to join our team as an Observability Automation Engineer. In this critical role, you will be responsible for implementing, managing, and enhancing observability platforms to ensure optimal network and application performance.Key ResponsibilitiesConfiguring alerts and dashboards to ensure timely and...


  • Boston, Massachusetts, United States Iron Mountain Inc Full time

    Job SummaryWe are seeking a highly skilled Observability Automation Engineer to join our dynamic team at Iron Mountain Inc. As a key member of our team, you will be responsible for implementing, managing, and enhancing observability platforms to ensure optimal network and application performance.Key ResponsibilitiesConfiguring alerts and dashboards to ensure...


  • Boston, Massachusetts, United States WHOOP Full time

    Unlock Human Performance with WHOOPAt WHOOP, we're on a mission to empower individuals to perform at their best. As an Observability Analyst, you'll play a crucial role in ensuring the reliability, accuracy, and performance of our data infrastructure. Your expertise will be instrumental in developing and implementing monitoring, alerting, and quality...


  • Boston, Massachusetts, United States Cortex consultants LLC Full time

    Job SummaryCortex Consultants LLC is seeking a highly skilled Senior DevOps Engineer to join our team. As a key member of our DevOps team, you will be responsible for designing and implementing automation solutions to improve operational efficiency and scalability.Key Responsibilities:Design and implement automation solutions using Terraform, Python, and...


  • Boston, Massachusetts, United States Arrowstreet Capital, Limited Partnership Full time

    Job OverviewThe Automation Engineer plays a crucial role in the Quantitative Investment Research group and the Research Systems Group at Arrowstreet Capital, Limited Partnership. This position is responsible for developer tooling, automation, and integration to enable the delivery of large-scale data and HPC solutions in the cloud.The ideal candidate has a...


  • Boston, Massachusetts, United States Arrowstreet Capital Full time

    Job OverviewThe Automation Engineer is responsible for designing and implementing automation solutions for the Quantitative Investment Research group and the Research Systems Group at Arrowstreet Capital.The team provides critical support for the department's DevOps model, enabling the delivery of large-scale data and HPC solutions in the cloud.The role...

  • Research Specialist

    2 months ago


    Boston, United States PerkinElmer Full time

    Job Responsibilities:'First-responder' troubleshooting, error recovery and simple programming of the automated sample management platform.Agilent Bravo, Beckman Echo, and Hamilton Liquid Handler 'first-responder' troubleshooting, error recovery and programming.Development and implementation of a proactive maintenance regime for sample...


  • Boston, Massachusetts, United States Motion Recruitment Full time

    Revolutionize the Automation Space as a Senior Electrical Design EngineerBoston, MassachusettsHybrid Full Time$135k - $170kMotion Recruitment is seeking a highly skilled Senior Electrical Design Engineer to join our client's growing team and company. Our client is revolutionizing the automation space with advanced observability and AI, provided by their...


  • Boston, United States Motion Recruitment Partners, LLC Full time

    Our client is seeking a Senior Electrical Design Engineer to join their growing team and company. They're revolutionizing the automation space with advanced observability, and Ai, that their all-in-one platform provides. The ideal candidate is passionate about revolutionizing the automation space! As a Senior Electrical Design Engineer, you will design...


  • Boston, Massachusetts, United States Eateam Full time

    Key Responsibilities:As a Senior DevOps Engineer at Eateam, you will be responsible for designing and implementing Continuous Integration/Continuous Deployment (CI/CD) pipelines to ensure seamless application building, testing, and deployment processes. You will also manage and administer Linux systems, developing system initialization and configuration...


  • Boston, United States Arrowstreet Capital, Limited Partnership Full time

    Job OverviewThe Automation Engineer is responsible for developer tooling, automation and integration for the Quantitative Investment Research group and the Research Systems Group. The group provides a key function within the department that allows us to achieve the velocity and quality needed inside of our DevOps model, enabling the delivery of large-scale...


  • Boston, United States Arrowstreet Capital, Limited Partnership Full time

    Job OverviewThe Automation Engineer is responsible for developer tooling, automation and integration for the Quantitative Investment Research group and the Research Systems Group. The group provides a key function within the department that allows us to achieve the velocity and quality needed inside of our DevOps model, enabling the delivery of large-scale...


  • Boston, United States Arrowstreet Capital, Limited Partnership Full time

    Job OverviewThe Automation Engineer is responsible for developer tooling, automation and integration for the Quantitative Investment Research group and the Research Systems Group. The group provides a key function within the department that allows us to achieve the velocity and quality needed inside of our DevOps model, enabling the delivery of large-scale...


  • boston, United States Arrowstreet Capital, Limited Partnership Full time

    Job OverviewThe Automation Engineer is responsible for developer tooling, automation and integration for the Quantitative Investment Research group and the Research Systems Group. The group provides a key function within the department that allows us to achieve the velocity and quality needed inside of our DevOps model, enabling the delivery of large-scale...


  • Boston, Massachusetts, United States WEX Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Platform Reliability organization. As a key member of our team, you will be responsible for ensuring the reliability and performance of our internal systems and services.As a Site Reliability Engineer, you will work closely with our development teams to design and implement...


  • Boston, Massachusetts, United States WHOOP Full time

    Unlock Human Performance with WHOOPAt WHOOP, we're on a mission to empower individuals to perform at their best. As an Observability Analyst, you'll play a crucial role in ensuring the reliability and accuracy of our data infrastructure, including customer-facing ML models.Responsibilities:Design and implement monitoring solutions to track data ingestion,...


  • Boston, Massachusetts, United States Parallel Fluidics Full time

    At Parallel Fluidics, we are building infrastructure to power the next generation of life science tools. As a key member of our manufacturing team, you will be responsible for fabricating microfluidic devices using our custom-built equipment and CNC machining centers.Key responsibilities include:Fabricating microfluidic devices using the Parallel...

  • Sr DevOps Engineer

    2 weeks ago


    Boston, United States ALIS Software LLC Full time

    DO NOT APPLY BELOW THEN 14+ YEARS EXPERIENCE Role: Sr DevOps EngineerLocation: Boston, MAModel: HybridType: ContractDuration: Long TermRate: $60/hr on C2CMust have 15+ Years ExperienceMust join project onsite in Boston, MA Must have: Terraform, Python, Migration from AWS to Azure, DevOps, Ansible And CI/CD Roles & Responsibilities · Bachelor's degree in...

  • Sr DevOps Engineer

    1 week ago


    Boston, United States ALIS Software LLC Full time

    DO NOT APPLY BELOW THEN 14+ YEARS EXPERIENCE Role: Sr DevOps EngineerLocation: Boston, MAModel: HybridType: ContractDuration: Long TermRate: $60/hr on C2CMust have 15+ Years ExperienceMust join project onsite in Boston, MA Must have: Terraform, Python, Migration from AWS to Azure, DevOps, Ansible And CI/CD Roles & Responsibilities · Bachelor's degree in...

  • Sr DevOps Engineer

    2 weeks ago


    boston, United States ALIS Software LLC Full time

    DO NOT APPLY BELOW THEN 14+ YEARS EXPERIENCE Role: Sr DevOps EngineerLocation: Boston, MAModel: HybridType: ContractDuration: Long TermRate: $60/hr on C2CMust have 15+ Years ExperienceMust join project onsite in Boston, MA Must have: Terraform, Python, Migration from AWS to Azure, DevOps, Ansible And CI/CD Roles & Responsibilities · Bachelor's degree in...

Observability Automation

2 months ago


Boston, United States Iron Mountain Full time
Observability Automation & Integration Lead Engineer

Boston, Massachusetts

Remote Eligible in Massachusetts - J0077963

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways.

Are you curious about being part of our growth story while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

Job Summary

We are actively seeking a proactive and skilled Senior Engineer specializing in Observability, Monitoring, and Automation to join our dynamic team. In this critical role, you will be responsible for implementing, managing, and enhancing observability platforms to ensure optimal network and application performance. Your responsibilities will include configuring alerts and dashboards, end-to-end automation, and conducting data trend analysis.

This position is ideal for individuals with a strong background in observability platform engineering, network and application performance monitoring, and automation, who are eager to leverage their technical expertise in a collaborative and fast-paced environment. Experience with observability platforms and tools like Datadog and SolarWinds is a plus. If you are passionate about building scalable systems, enhancing observability, and working with cross-functional teams, and are committed to delivering high-quality solutions, this could be the perfect opportunity for you.

Core experience/responsibilities

  1. 10+ years of experience with platforms such as SolarWinds, Datadog, HP Openview, BMC, etc.
  2. 10+ years of experience in network, application performance, and synthetic monitoring.
  3. Expertise in configuring alerts, creating dashboards, and conducting data trend analysis.
  4. Experience in automating the detection of missing assets and configuring them into the monitoring ecosystem via REST API/scripting.
  5. Proficiency in monitoring various end devices including routers, switches, firewalls, storage, virtual, Windows servers, Linux servers, and UNIX servers.
  6. 8+ years of experience automating infrastructure operations using tools like Ansible and Python for event correlation.
  7. Expertise in integrating monitoring data with other platforms such as CMDB/ServiceNow.
  8. Experience configuring monitors using SNMP, SSH, WinRM, WMI, JMX, etc.
  9. Ability to design and implement highly available continuous monitoring platforms for 24x7 operations.

Technical Solutions and Collaboration

  1. Recommend baseline monitoring thresholds, KPIs, and SLAs.
  2. Provide solutions to complex problems and drive process improvements.
  3. Experience with both on-premise and cloud environments.
  4. Expertise in advanced troubleshooting and root cause analysis.
  5. Proficiency with platforms like ServiceNow, Remedy, or Assyst.
  6. Identify automation opportunities and implement proactive monitoring solutions.
  7. Work effectively with Enterprise Architects, OS engineers, and operations support teams to provide training, develop guidelines, and serve as a subject matter expert.

Design and Implementation

  1. Drive enterprise tools and automation implementations while holding stakeholders accountable for their responsibilities and deliverables.
  2. Participate in technical design discussions, considering trade-offs to support business value, scalability, and delivery timelines.
  3. Ensure adherence to architectural governance and security standards.
  4. Contribute to the design and architecture of high-performance, scalable systems, ensuring they meet business requirements and are cost-effective.
  5. Create and maintain detailed design documentation, including diagrams, technical specifications, and architecture blueprints.
  6. Design systems with a focus on performance optimization, ensuring minimal latency and high throughput.
  7. Develop strategies to ensure system scalability, accommodating future growth and changes in workload.
  8. Integrate security best practices into the design and implementation of systems, ensuring robust protection against threats.
  9. Evaluate new technologies and tools, recommending their integration into the development process to enhance productivity and system capabilities.

Process/Operational Experience

  1. Plan and execute system and software installations, upgrades, and changes across the organization.
  2. Understand various methodologies such as Agile, Scrum, and manage project objectives, delivery approaches, and plans.
  3. Identify and mitigate risks throughout projects and tasks, addressing major design flaws.
  4. Experience gathering and organizing large amounts of data for instrumentation into an enterprise monitoring solution.
  5. Share knowledge of monitoring best practices with system owners and administrators to enhance overall monitoring and alerting posture.

Operational requirements

  1. Available for on-call support outside of normal business hours to address critical issues.
  2. Strong communication skills to relate technical details to non-technical leaders and users.
  3. Promote a positive working environment, encourage teamwork, and mentor rising talent.
  4. Excellent time management and organizational skills, with experience establishing guidelines for others.
  5. Ability to notice differences and issues as they arise and escalate them to management.
  6. Facilitate discussions and explore alternative approaches to resolve conflicts.
  7. Take personal accountability for decision-making and collaborating with cross-functional teams.

Education

  • Bachelor's degree in Computer Science, Information Technology, or a related field is required.

Nice to Have

  1. Working expertise in infrastructure/application log aggregation ingested into a security.
  2. Experience with log aggregation tools such as ELK, Logstash, Kibana, Splunk, or QRadar.
  3. Proficiency in Ansible and Python, with the ability to create complex SQL queries for reporting and correlation.
#J-18808-Ljbffr