Senior Infrastructure Engineer

2 weeks ago


New York, United States Toptal Full time

Job Summary

We are looking for an experienced Engineer to build and scale services in a cloud environment within our Infrastructure team. Our Infrastructure Engineers work with a high-energy, fast-paced team responsible for supporting initiatives and operations across Toptal.

This is a remote position. We do not offer visa sponsorship or assistance. Resumes and communication must be submitted in English.

Responsibilities:

The following information is intended to describe the general nature and level of work being performed. It is not intended to be an exhaustive list of all duties, responsibilities, or required skills.

  • Toptal services are deployed across hundreds of servers. You will be responsible for designing, building, deploying, and maintaining highly available production systems based on Kubernetes.
  • Collaborate with the development teams and help them streamline their deployment processes, observability, and self-service capabilities. We are embracing DevOps practices, where the Infrastructure team develops systems, automation, tooling, and workflows and consults/mentors developer teams to enable them to own the whole lifecycle of the software they are making.
  • Implement monitoring for automated system health checks, develop procedures, and maintain system troubleshooting and maintenance documentation.
  • Collaborate regularly with Engineering teams to improve the company’s engineering tools, systems, procedures, and data security, not just administer clusters and cloud services.
  • Join daily scrum standups (GMT-3 to GMT+5). Expect pair programming, engaging in peer code reviews, and using collaboration tools like Slack and Zoom.
  • Design, develop, document, analyze, create, test or modify computer or cloud-based systems or programs.

In the first week, expect to:

  • Join our boot camp team and begin onboarding into Toptal.
  • Learn about our team’s processes and get familiar with the code that maintains our infrastructure resources.

In the first month, expect to:

  • Gain insight into our system topology and how the whole system is structured.
  • Understand our monitoring systems, alerting systems, and security.
  • Participate in team meetings and get familiar with the ongoing projects and initiatives.
  • Talk and meet with people from the operations squad.

In the first three months, expect to:

  • Start working on support tasks to familiarize yourself with the core tools, setup, and everyday challenges.
  • Exercising discretion and independent judgment, provide excellent customer service by understanding and addressing the teams’ needs and expectations through effective communication and collaboration while learning about our infrastructure.
  • Deliver internal infrastructure and services such as monitoring, logging, automation, and data services targeted at our internal users.
  • Support the development of CD pipelines and next-generation Kubernetes-based infrastructure platforms.

In the first six months, expect to:

  • Support Infrastructure design, architecture, and implementation.
  • Have opportunities to be involved in systems design, identify new technologies to support the business, and resolve infrastructure compatibility and performance problems as they arise.
  • Participate in the on-call rotation schedule (during business and after hours) to support all infrastructure-related systems.
  • Report any downtime or performance issues the system faces, investigate to determine what caused them, and coordinate with other teams to resolve them.
  • Handle incident resolution if a developer is not needed.
  • Participate in our Disaster Recovery and incident analyses.

In the first year, expect to:

  • Communicate with key partners on project engagements.
  • Partner closely with our Engineering teams to develop infrastructure automation and management solutions that focus on scalability, observability, automation, reliability, security, and quality in Google Cloud Platform.
  • Plan and coordinate testing of changes, upgrades, patches, new releases, and new services.
  • Participate in technology initiatives that enable developers to deliver their services to our customers with minimal friction and high quality.

Qualifications and Job Requirements:

  • 5+ years of experience with Kubernetes environments, including production operations, troubleshooting, debugging, cluster provisioning, and management.
  • Previous experience managing infrastructure configuration and provisioning through code for large, distributed systems on public cloud platforms (AWS, GCP).
  • Solid understanding of Linux debugging, LAN and WAN networking, IP addressing, Load Balancing, VPNs, and routing.
  • A strong understanding of modern systems and service-related security methodologies.
  • Hands-on experience with system and application metric collection and alerting services like Graphite, Grafana, Prometheus, InfluxDB, Sensu, etc. A keen focus on what makes a system observable.
  • Proficient in scripting languages like Python, Bash, Ruby, etc.
  • Experience with continuous integration, deployment patterns, and tools like Jenkins or Argo CD.
  • Proficiency in deploying automation with tools like Ansible, Terraform, and version control.
  • Experience with Docker, Docker Compose, and building optimized Dockerfiles.
  • Experience running RDBMS. PostgreSQL experience is an added advantage.
  • Excellent troubleshooting skills. Experience in resolving complex problems through various troubleshooting protocols and processes.
  • Eagerness to help teammates, share knowledge with them, and learn from them.
  • Outstanding written and verbal communication skills.
  • Ability to work in a fast-paced, rapidly growing company and handle a wide variety of challenges, deadlines, and a diverse array of contacts.

You must be a world-class individual contributor to thrive at Toptal. You will not be here just to tell other people what to do.



  • New York, United States Tech Brains Solutions, Inc. Full time

    Job Description. Role: Senior API Infrastructure Engineer. Location: New York, NY | Long-term role. W2/1099 only. Must be authorised to work in the United States. Duties: 7 years of hands-on experience in designing, developing, and maintaining scalable and reliable API infrastructure to support various internal and external...


  • New York, New York, United States Superstate Full time

    As a Senior Infrastructure Engineer at Superstate you will help lead our team in building open, transparent, frictionless financial blockchain primitives. You will be given the opportunity to design, build, test, and launch products that make DeFi more capital-efficient, accessible, and useful. We work in Rust, TypeScript, and Solidity and are forever...


  • New York, United States SilverSearch, Inc. Full time

    Our client, a leading International Law Firm based in New York City, is seeking an Infrastructure Engineer and Senior Infrastructure Engineer to join their team on a full-time basis. You will be involved in several ongoing upgrade and migration projects and contribute to the smooth running of all corporate IT systems. Their environment is as follows: Azure,...


  • New York, United States Fathom Full time

    Fathom is on a mission to use AI to understand and structure the world's medical data, starting by making sense of the terabytes of clinician notes contained within the electronic health records of the world's largest health systems. Our deep learning engine automates the translation of patient records into the billing codes used for healthcare provider...


  • New York, United States PRI Technology Full time

    Role: Senior Infrastructure Engineer - VP. Full-time/permanent role, with bonus and benefits! Hybrid Remote in New York, NY - 3 days/week onsite. No c2c/3rd party! This is a direct hire! The Sr. Infrastructure Engineer provides overall direction, technical leadership, and expertise supporting cloud and on-premises applications, infrastructure platforms, storage...


  • New York, United States Planet Technology Full time

    **Local and US Citizen candidates only** Sr. Engineer for on-prem infrastructure support with focus on modernizing infrastructure. Additional focuses on security best practices, disaster recovery technologies, SaaS offering support (Applications) and cloud platforms. Day-To-Day and Project Support: Help support and modernize current technology stack –...


  • New York, United States Gotham Technology Group Full time

    Title: Infrastructure Engineer II. Duration: FTE/Permanent. Location: onsite 4 days a week in NYC (1 day remote). Salary: 130-140k. Industry: Non Profit. RESPONSIBILITIES: Support and maintain application portfolios and various technology solutions including, though not limited to, MS Products, O365, AVD, Mimecast, Rubrik, network, Wi-Fi, telecommunications and security...


  • New York, United States Publicis Media Full time

    Job Description Publicis Media Cloud Managed Services (CMS) is a niche group serving agencies within Publicis Groupe. Publicis Media CMS owns, manages, and maintains a cloud infrastructure that meets US and EU regulatory requirements. Providing our customers with high levels of support for their critical data is a key component of Publicis Media...


  • New York, United States StartUs GmbH Full time

    We are looking for an AV Infrastructure Engineer that will join the AV Infrastructure team at Spotify. Spotify is dedicated to creating a platform for creativity that brings artists and fans closer together. As a member of our AV Infrastructure team, you’ll be working on projects that enable Spotify to succeed in this mission daily.  The AV Infrastructure...


  • New York, United States Capital One Financial Corp Full time

    NYC 299 Park Avenue (22957), United States of America, New York, New York. Senior Lead Engineer - Generative AI Infrastructure (Remote-Eligible). Our mission at Capital One is to create trustworthy, reliable and human-in-the-loop AI systems, changing b...


  • New York, United States Ivalua Full time

    Senior Security Engineer (Cloud and Infrastructure Security) - New York, NYC. About Ivalua: A "Magic Quadrant" leader, Ivalua's solutions work in a complex global economy. Our innovative Source-to-Pay solutions include automating customized workflows to source, contract, request, procure, receive, and pay for goods and services across the enterprise, refining...