Senior Observability Platform Engineer

3 hours ago


Manhattan, United States SS&C Technologies Holdings Full time

Job Description

The position offers an exciting opportunity for software engineers passionate about open source software, Linux, Kubernetes, and observability. The monitoring stack will provide comprehensive coverage of system metrics, database performance, network health, and message queues. It will also monitor applications running on diverse cloud platforms, including Kubernetes and ESXi, as well as on bare-metal servers, virtual machines, and containers in the SS&C Private Cloud.

Responsibilities:

  • Design, develop, implement, and maintain our comprehensive observability stack, including tracing, telemetry, logging, health monitoring, visualization, and dashboards. You will play a key role in ensuring the reliability, performance, and operational efficiency of our services.
  • Design and implement a robust observability framework using composable open source solutions such as Prometheus, Alertmanager, OpenTelemetry, Grafana, Alloy, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar tools.
  • Develop and maintain health monitoring and alerting systems for our compute platforms, databases, and network infrastructure, as well as Kubernetes-based platforms, including GPU-supported environments.
  • Create and manage visualization dashboards to monitor system performance, resource utilization, and operational health.
  • Implement scalable, distributed logging and tracing solutions to diagnose, troubleshoot, and resolve system issues effectively.
  • Collaborate with development and operations teams to integrate observability practices into the development lifecycle.
  • Conduct performance analysis and optimization to ensure system reliability and efficiency.
  • Stay up to date with the latest trends and technologies in observability and performance monitoring.
  • Collaborate with cross-functional teams (Cloud Engineering, Network, and DevOps/Solutions Engineering) to troubleshoot and resolve infrastructure issues.
Preferred Qualifications:

  • Proven experience in observability, system and network monitoring, and system performance analysis, particularly in a cloud or data center environment.
  • Expertise in implementing and managing observability tools and technologies, including composable open source solutions such as Prometheus, Alertmanager, OpenTelemetry, Grafana, Alloy, Loki, Promtail, Tempo, Thanos, ELK stack, and Zabbix, and similar commercial solutions.
  • Hands-on experience with Kubernetes.
  • Experience with infrastructure-as-code and configuration management tools such as Consul, GitHub, SaltStack, and Terraform.
  • Proficiency in scripting and automation using languages such as Go, Python, and shell.
  • Excellent problem-solving skills and the ability to work independently or as part of a team.
  • Strong communication skills and the ability to work in a fast-paced, dynamic environment.

Educational Qualifications:

  • Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.




