Senior Engineer, Kubernetes Infrastructure

3 weeks ago


New York, United States CoreWeave Full time
Job DescriptionJob Description

CoreWeave is the AI Hyperscaler™, delivering a cloud platform of cutting edge services powering the next wave of AI. The company's technology provides enterprises and leading AI labs with the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe. CoreWeave was ranked as one of the TIME100 most influential companies of 2024.

As the leader in the industry, we thrive in an environment where adaptability and resilience are key. Our culture offers career-defining opportunities for those who excel amid change and challenge. If you're someone who thrives in a dynamic environment, enjoys solving complex problems, and is eager to make a significant impact, CoreWeave is the place for you. Join us, and be part of a team solving some of the most exciting challenges in the industry.

CoreWeave powers the creation and delivery of the intelligence that drives innovation. To learn more about our values, please visit our careers website.

About the role:

An engineering practice is only as healthy as its foundational dependencies and CoreWeave's Kubernetes Infrastructure Team supports the platform and tools that underpin nearly every part of the cloud. Responsible for our internal Kubernetes-on-metal clusters in each datacenter, engineers on this team have the mission to manage and scale Kubernetes in one of one of the fastest growing clouds in the world. The domain of bare-metal day-0+ reliability engineering offers unique and rewarding challenges in orchestration, fleet operations, testing, observability and automation and every team member will have opportunities to develop their skills with Kuberenetes in an environment unique to being a cloud-builder, not just a cloud-consumer.

We are seeking a Senior Engineer to join the Kubernetes Infrastructure team and help us grow our orchestration platforms in scale, reliability, and featureset. This individual will join a team of 4-6 mixed-skill engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. As a member of the Kubernetes Infrastructure Team, you would have the opportunity to:

  • Design and implement solutions to fascinating problems of scale for provisioning and managing (many) bare-metal Kubernetes clusters in a hands-free, growing environment.
  • Develop a toolchain and program for testing and developing against a complex cloud environment at a scale that remains agile.
  • Create custom Kubernetes interfaces, gateways, and orchestrators all managed using Gitops tools such as Argo CD and Helm.
  • Improve the performance, security, and reliability of our internal Kubernetes platforms and participate in the Kubernetes Infrastructure on-call rotation.
  • Build dashboards, alerts, and insights into the customer experience using Grafana and Prometheus ecosystem tools.
  • Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.

Wondering if you're a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match. Here are some qualities we've found compatible with our team. If a portion of this resonates with you, we'd love to talk.

  • You have two or more years of experience in a software or infrastructure engineering industry
  • You have experience operating services in production and at scale.
  • You have some experience using Kubernetes with a conceptual understanding of its major components and/or have administered unmanaged (eg, not EKS/GKE) Kubernetes clusters with some form of automation such as KubeSpray.
  • You're comfortable with the idea of using Go as your primary programming language.
  • You know your way around a Linux distro, shell scripting, and/or the Linux storage and networking stacks.
  • You're interested in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role observability, and fault-tolerant design.
  • You can transform problems in elastic architectures, decompose them into achievable tasks, and socialize both to your teammates.
  • You're excited about being part of a team of diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $185,000 to $200,000 annually. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

Hybrid Workplace

Successful candidates will be expected to attend onboarding training at our NJ Headquarters within their first several weeks of employment, with subsequent quarterly travel requirements of 1 week duration.

If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences.

What We Offer

The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.

In addition to a competitive salary, we offer a variety of benefits to support your needs, including:

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Tuition Reimbursement
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

Our Workplace

At CoreWeave, we are committed to operating as a hybrid workplace, offering employees flexibility in how they structure their time between in-office and remote work. We recognize the significance of fostering connections, collaboration, and creativity within our office culture and its positive impact on our business. Our philosophy operating as a hybrid workplace underscores our dedication to enabling employees to tailor work-life balance to their individual preferences.

For those who do not live within 30 miles of one of our offices, we are open to considering remote work for candidates whose skills and experience strongly align with the role. While we prioritize a hybrid work environment for most roles, we understand the importance of flexibility and are open to remote work for specific positions and specialized skill sets. Onboarding is essential to your success. New employees not based out of an office will be invited to attend onboarding training at one of our hubs within their first month of employment. We continue to foster a collaborative environment by bringing teams together quarterly.

California Consumer Privacy Act - California applicants only

CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.

As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: careers@coreweave.com.


  • Kubernetes Engineer

    1 month ago


    New York, United States Meta Resources Group Full time

    Our Client a large Pharmaceutical company is seeking a Kubernetes Engineer who will assist in the migration/integration of multiple companies into the clients current environment. The role will initially be a 6 month contract with the likely conversion to a fulltime hire. This is a remote position but may require to be onsite occasionally in the NYCSuburb...


  • New York, United States Motion Recruitment Full time

    A leading cloud provider specializing in high-performance computing is seeking a Senior DevSecOps/Infrastructure Security Engineer to join its Infrastructure Security team. This full-time, hybrid role offers competitive compensation and the opportunity to work on cutting-edge Kubernetes security solutions at scale. Required Skills & Experience 3+ years...


  • New York, United States Motion Recruitment Full time

    Our client, a large financial institution, is seeking a versatile and highly skilled Senior Infrastructure Engineer. The right candidate will have expertise in Compute, Storage, Network and cloud technologies. In this role, you will design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, performance reliability,...


  • New York, New York, United States Palantir Technologies Full time

    At Palantir Technologies, we're seeking a talented Senior Software Engineer to join our Network Infrastructure team. As a key member of this team, you will be responsible for developing and maintaining our cloud-based networking infrastructure. Your expertise in Kubernetes and cloud networking will be invaluable in helping us deliver highly scalable and...


  • New York, United States Motion Recruitment Full time

    Infrastructure Engineer Job Description Our client is a series A startup seeking an InfrastructureNetwork Engineer to help build and scale the infrastructure layer for their next-generation financial ecosystem. In this role, you will be responsible for automating and standardizing software deployment, maintaining high-performance data feeds and trading APIs,...


  • New York, United States Arcesium Full time

    Company Overview Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow's challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients...

  • Senior Engineer

    2 weeks ago


    New York, United States Datafielder Full time

    Sr. Kubernetes Engineer DataFielder Inc - New York, NY Tagged: KubernetesEngineer DataFielder is a woman-minority certified (MBE) staffing and consulting services agency. Our mission is to provide organizations with exceptional talent while advocating for diverse and underrepresented groups through data-driven insights. Location: New York City, NY or...


  • New York, United States Airtable Full time

    Airtable is the no-code app platform that empowers people closest to the work to accelerate their most critical business processes. More than 500,000 organizations, including 80% of the Fortune 100, rely on Airtable to transform how work gets done. Airtable’s infrastructure is evolving to meet the needs of our fast growing engineering org. We are looking...


  • New York, United States Dynamo AI Full time

    As a Senior DevOps Engineer at Dynamo AI, you will play a crucial role in ensuring the smooth and efficient operation of our production environments. You will be responsible for building and optimizing our CI/CD pipelines, managing our AWS infrastructure, and overseeing the deployment and maintenance of our Kubernetes clusters. Your role will also involve...


  • New York, United States Abnormal Security Full time

    Job DescriptionJob DescriptionAbout the RoleAbnormal Security is looking for a Senior ML Infra Engineer to join the Detection Team. The Detection Division is focused on building the world's most advanced technology for identifying and stopping email and cloud-based attacks that were previously undetectable and help make the world a safer place. As an ML...


  • New York, New York, United States Vimeo Full time

    Job Title: Senior Data Infrastructure EngineerJob Summary:Vimeo is seeking a highly skilled Senior Data Infrastructure Engineer to manage and optimize our vector databases and data stores, including Elastic Search and AWS OpenSearch. The ideal candidate will have a strong background in cloud platforms and SRE principles, as well as experience leading data...

  • Senior Data Engineer

    16 hours ago


    New York, United States TechnoGen Full time

    Please Note: As of July 22, 2021, our team will require that all candidate submissions include a LinkedIn profile. Please do not submit any candidates that do not have a LinkedIn. Kforce has a client in Brooklyn, NY that is seeking a Senior Data Engineer. Our team is headquartered in Brooklyn but has a remote-first culture and we encourage remote applicants...


  • New York, United States Dynamo AI Full time

    At Dynamo AI, a Senior Software Engineer in the Core Infra team will design, develop, and maintain robust, secure, and scalable infrastructure and applications. You will be instrumental in deploying our advanced machine learning models in diverse production environments, ensuring optimal performance and reliability. You build full set of features in our...


  • new york city, United States Motion Recruitment Full time

    New York, New YorkOnsiteFull Time$150k - $185kOur client, a large financial institution, is seeking a versatile and highly skilled Senior Infrastructure Engineer. The right candidate will have expertise in Compute, Storage, Network and cloud technologies. In this role, you will design, implement, and manage robust infrastructure solutions, ensuring...


  • New York, New York, United States Figma Full time

    About the RoleFigma is a design and collaboration platform that is growing its team of passionate individuals who are dedicated to making design accessible to all. As a senior infrastructure systems engineer, you will play a critical role in building scalable infrastructure systems and services that power Figma's innovative tools for design and...


  • New York, United States Perchwell Full time

    Who We Are Perchwell is the modern data and workflow platform for real estate professionals and consumers. Based on the industry's foundational data, Perchwell builds a modern software suite to empower real estate professionals to do their best work, provide differentiated service to their clients, and grow their businesses. Backed by Lux Capital, Founders...


  • New York, United States Acceler8 Talent Full time

    Founding Lead Infrastructure EngineerWe're searching for a Founding Lead Infrastructure Engineer to join an ambitious, early-stage team driving LLM infrastructure forward. This role is based in New York and reports directly to the CTO. As Founding Lead Infrastructure Engineer, you’ll take ownership of core backend and infrastructure decisions that will set...

  • Senior Engineer, Scala

    5 months ago


    New York, United States Axoni Full time

    The Senior Scala Engineer will be responsible for coordinating technical development of our infrastructure. This hands-on role will be responsible for coordinating tasks across other developers, managing pull requests, as well direct software development. Candidates should have a strong understanding of algorithm development, networking components, and...


  • New York, New York, United States Vatic Labs Full time

    System Development Opportunities at Vatic LabsWe are seeking skilled engineers to build and support our globally distributed trading and research systems. As a Senior Cloud Infrastructure Developer, you will collaborate with software, hardware, and systems engineers to improve system efficiency and operations.Key Responsibilities:Design and implement CI/CD...


  • New York, New York, United States Celonis GmbH Full time

    About the Role:Celonis is seeking a highly skilled Senior Cloud Platform Engineer to join our Platform Infrastructure team. As a key member of our team, you will be responsible for building and operating a highly available Microservices Cloud platform based on Kubernetes.Key Responsibilities:Design, develop, test, and deploy scalable, secure, and highly...