Principal Platform Engineer
2 weeks ago
- Principal Platform Engineer (Private Cloud)
- New York, NY (Hybrid, 3 days in office)
- Highly competitive compensation package
The Role
We are seeking a deeply experienced Systems Engineer to act as a Tech Lead for key infrastructure initiatives. This is a crucial, hands-on role for a hybrid systems and software engineer who thrives on solving complex distributed systems problems at scale.
You will be the key technical leader responsible for architecting and building the robust, automated platforms that underpin the firm's critical operations. Your primary mandate will be to lead the transition of the firm's dev teams from direct-access bare metal to a secure, managed, and automated container platform. You will act as a force multiplier for the engineering organization by leading high-impact projects, mentoring other engineers, and setting the standard for technical excellence in reliability and performance.
Responsibilities:
- Architect the Private Cloud: Lead the design and execution of high-impact projects for a distributed fleet of thousands of compute servers. You will drive decisions on hardware specifications, OS provisioning, and file system tuning to maximize performance on bare metal.
- Build the Future Platform: Lead the greenfield design and implementation of a Kubernetes-based container platform on bare metal. You will replace manual workflows with a structured, declarative system that empowers researchers while ensuring stability.
- Eliminate Operational Toil: Architect, build, and maintain mission-critical tools and automation in Python or Go. You will move beyond scripting to build resilient APIs, CLI tools, and automation frameworks that eliminate manual operational work at its source.
- Solve Deep Technical Challenges: Serve as a senior escalation point for complex Linux systems issues, diagnosing and resolving deep technical challenges related to kernel-level performance, hardware/OS compatibility, and reliable configuration distribution across multiple data centers.
- Define Observability Strategy: Drive the architecture for a modern observability data pipeline, deciding what to store, where to store it, and how to use it for automated remediation, so that production environments remain performant.
- Technical Leadership: Mentor and guide other engineers, championing best practices in software development, infrastructure management, and site reliability engineering.
Qualifications:
- 7+ years of experience in a senior site reliability, infrastructure, or software engineering role with a track record of success in complex, large-scale environments.
- Deep, hands-on expertise with the Linux operating system. You can explain system calls, file descriptors, memory management, and disk I/O paths at a granular level to debug performance issues.
- Expert-level proficiency in Python or Go, with a proven track record of engineering libraries, tools, or API services (not just scripting).
- Experience designing and building Kubernetes clusters on bare metal (not just using EKS/GKE). You understand the deep architectural trade-offs of CNI networking, CSI storage, and control plane design.
- Demonstrated experience leading technical projects, driving architectural decisions, and mentoring other engineers through complex migrations.
-
Principal Engineer
5 days ago
New York, NY, United States Bank of America Full time Principal Engineer - GenAI Platform New York, New York (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/New-York/Principal-Engineer---GenAI-Platform_25029845) Job...
-
Principal Data Platform Engineer
5 hours ago
New York, NY, United States Blink Health Full time Company Overview: Blink Health is the fastest growing healthcare technology company that builds products to make prescriptions accessible and affordable to everybody. Our two primary products - BlinkRx and Quick Save - remove traditional roadblocks within the current prescription supply chain, resulting in better access to critical medications and improved...
-
Principal Data Platform Engineer
6 days ago
New York, NY, United States Blink Health Full time Company Overview: Blink Health is the fastest growing healthcare technology company that builds products to make prescriptions accessible and affordable to everybody. Our two primary products - BlinkRx and Quick Save - remove traditional roadblocks within the current prescription supply chain, resulting in better access to critical medications and improved...
-
Principal Data Platform Engineer
1 week ago
New York, NY, United States Blink Health Full time Company Overview: Blink Health is the fastest growing healthcare technology company that builds products to make prescriptions accessible and affordable to everybody. Our two primary products - BlinkRx and Quick Save - remove traditional roadblocks within the current prescription supply chain, resulting in better access to critical medications and improved...
-
Principal Data Platform Engineer
2 days ago
New York, NY, United States Blink Health Full time Company Overview: Blink Health is the fastest growing healthcare technology company that builds products to make prescriptions accessible and affordable to everybody. Our two primary products - BlinkRx and Quick Save - remove traditional roadblocks within the current prescription supply chain, resulting in better access to critical medications and improved...
-
Principal Software Engineer
1 week ago
New York, NY, United States Arcesium Full time Company Overview: Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow's challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients...
-
Principal Platform Security Engineer
11 hours ago
New York, NY, United States Gemini Full time About the Company: Gemini is a global crypto and Web3 platform founded by Cameron and Tyler Winklevoss in 2014, offering a wide range of simple, reliable, and secure crypto products and services to individuals and institutions in over 70 countries. Our mission is to unlock the next era of financial, creative, and personal freedom by providing trusted access...
-
Principal Software Engineer, Data Platform
2 weeks ago
New York, NY, United States The New York Times Full time The mission of The New York Times is to seek the truth and help people understand the world. That means independent journalism is at the heart of all we do as a company. It's why we have a world-renowned newsroom that sends journalists to report on the ground from nearly 160 countries. It's why we focus deeply on how our readers will experience our...
-
Platform Principal Engineer
2 weeks ago
New York, NY, United States LSEG (London Stock Exchange Group) Full time Summary of Business Unit/Function: One Policy Engine (OPE) strives to provide standardized, valuable services to improve the developer experience and cloud enablement solutions. Through innovative ways, focused risk management, and a culture of continuous improvement, OPE delivers credible services to our customers. It offers an end-to-end solution to...
-
Platform Principal Engineer
5 days ago
New York, NY, United States LSEG (London Stock Exchange Group) Full time Summary of Business Unit/Function: One Policy Engine (OPE) strives to provide standardized, valuable services to improve the developer experience and cloud enablement solutions. Through innovative ways, focused risk management, and a culture of continuous improvement, OPE delivers credible services to our customers. It offers an end-to-end solution to...