Current jobs related to Scale-out Engineer - Santa Clara, California - Tenstorrent Inc.
-
AI and HPC Scale-out Systems Architect
1 week ago
Santa Clara, California, United States Intel Full timeJob SummaryWe are seeking an experienced AI and HPC Scale-out Systems architect to join our team at Intel. As a key member of our Data Center and Artificial Intelligence group, you will be responsible for architecting large-scale systems that support breakthrough performance on HPC and AI workloads.Key ResponsibilitiesArchitecting large-scale systems that...
-
Senior Cloud-Scale Analytics Engineer
7 days ago
Santa Clara, California, United States Amazon Development Center U.S., Inc. Full timeCloud-Scale Analytics EngineerAre you passionate about developing innovative cloud-scale analytics and observability solutions? Do you want to revolutionize the way people manage and derive insights from vast volumes of data in the cloud? As a Senior Cloud-Scale Analytics Engineer at Amazon Web Services (AWS), you will design, develop, and support a...
-
Santa Clara, California, United States SoundHound Full timeWe are seeking a highly skilled Data Engineer to join our team at SoundHound. As a Data Engineer, you will be responsible for designing and implementing data pipelines that empower real-time insights. You will leverage massive datasets for modeling, recommendations, and reporting solutions, and build user-facing scalable systems powering ad targeting, push,...
-
Santa Clara, California, United States Amazon Development Center U.S., Inc. Full timeCloud-Scale Data Analytics and Observability SpecialistAre you passionate about developing a next-generation cloud-scale analytics and observability platform at a fast-growing AWS service? We are searching for a Cloud-Scale Data Analytics and Observability Specialist to join the Amazon OpenSearch Observability team. In this role, you will design, develop,...
-
Physical Design Engineer
3 weeks ago
Santa Clara, California, United States Recooty Full timeExciting Opportunity for a Physical Design EngineerSynapse Design is seeking an experienced Physical Design Engineer to join our growing team.We are looking for a skilled engineer with a strong background in physical design engineering.Key Responsibilities:5+ years of industry experience in physical design engineeringExpertise in hierarchical design,...
-
HPC/AI Software Engineer
1 week ago
Santa Clara, California, United States HPE Full timeJob Description:Hewlett Packard Enterprise is seeking a highly skilled Software Engineer to join our HPC and AI organization. As a key member of the Slingshot Ethernet Fabric team, you will play a critical role in expanding HPE's High Performance Ethernet Fabric product growth through Commercial HPC use cases, AI use cases networking, systems, and...
-
Senior Infrastructure Engineer
4 weeks ago
Santa Clara, California, United States Sustainable Talent Full timeJob OverviewSustainable Talent is seeking a highly skilled Senior Infrastructure Engineer to support the NVIDIA Cloud Infrastructure Team. As a key member of our team, you will be responsible for supporting infrastructure team operations, cloud infrastructure system enrollments, deployments, and troubleshooting.Key Responsibilities:Support Infrastructure...
-
Thermal Engineer
4 weeks ago
Santa Clara, California, United States Org_Subtype_BU022_Infrastructure_Solutions_Group Full timeThermal Engineer Job DescriptionWe are seeking a highly skilled Thermal Engineer to join our AI Infrastructure Team in Austin, Texas, Santa Clara, California, or Hopkinton, Massachusetts. As a Thermal Engineer, you will play a critical role in developing next-generation large-scale AI Infrastructure with a focus on leading cooling and thermal...
-
Senior Performance Engineer
1 month ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is seeking a highly skilled Senior Performance Engineer to join our team of ambitious and forward-thinking professionals. As a key member of our AI platform, you will play a critical role in building and optimizing the tools Deep Learning engineers use worldwide to design, develop, and deploy AI applications.We are a diverse team that influences all...
-
Thermal Engineer
3 weeks ago
Santa Clara, California, United States Org_Subtype_BU022_Infrastructure_Solutions_Group Full timeThermal Engineer Job DescriptionWe are seeking a highly skilled Thermal Engineer to join our AI Infrastructure Team in Austin, Texas, Santa Clara, California, or Hopkinton, Massachusetts. As a Thermal Engineer, you will play a critical role in developing next-generation large-scale AI Infrastructure with a focus on leading Cooling and thermal...
-
Staff Data Engineer
3 weeks ago
Santa Clara, California, United States Infoblox Full timeJob Title: Staff Data EngineerAt Infoblox, we are seeking a highly skilled Staff Data Engineer to join our Cloud Engineering team. As a Staff Data Engineer, you will play a key role in designing, developing, and maintaining large-scale data systems and infrastructure to support our cloud-based products and services.Key Responsibilities:Design and develop...
-
Staff Software Engineer
7 days ago
Santa Clara, California, United States Eightfold Full timeAbout the RoleWe are seeking a highly skilled Staff Software Engineer to join our Core Infrastructure Team. As a key member of this team, you will be responsible for designing, developing, and maintaining highly distributed systems that power our products.Key Responsibilities:Design and develop large-scale software platforms that handle millions of users and...
-
Senior Site Reliability Engineer
2 days ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is a leader in AI, machine learning, and datacenter acceleration. Our company is expanding its leadership into datacenter networking with ethernet switches, NICs, and DPUs. We have continuously reinvented ourselves over two decades, with our invention of the GPU in 1999 sparking the growth of the PC gaming market, redefining modern computer graphics,...
-
Senior SQA Engineer PanOS
3 days ago
Santa Clara, California, United States Palo Alto Networks Full timeJob Title: Senior SQA Engineer PanOSJob Summary:As a Senior SQA Engineer for feature testing, you will be responsible for testing the upcoming new features on Palo Alto Networks' next-generation firewall. You will participate in the requirements and design discussions and make a difference in shaping the future direction. The work will involve close...
-
Thermal Engineer
3 weeks ago
Santa Clara, California, United States Dell Full timeThermal Engineer Job DescriptionWe are seeking a highly skilled Thermal Engineer to join our AI Infrastructure Team in Austin, Texas, Santa Clara, California, or Hopkinton, Massachusetts. As a Thermal Engineer, you will play a critical role in developing next-generation large-scale AI Infrastructure with a focus on leading Cooling and thermal...
-
Thermal Engineer
3 weeks ago
Santa Clara, California, United States Org_Subtype_BU022_Infrastructure_Solutions_Group Full timeThermal Engineer Job DescriptionWe are seeking a highly skilled Thermal Engineer to join our AI Infrastructure Team in Austin, Texas, Santa Clara California or Hopkinton Massachusetts.Key Responsibilities:Develop next-generation large-scale AI Infrastructure with a focus on leading Cooling and thermal technologies.Engage with high-profile AI customers to...
-
Performance Optimization Engineer
1 month ago
Santa Clara, California, United States NVIDIA Full timeAbout NVIDIANVIDIA is a leader in the field of artificial intelligence, deep learning, and autonomous vehicles. Our engineering teams are working on cutting-edge technologies that are transforming the world.Job SummaryWe are seeking a highly skilled software engineer to join our team as a Performance Engineer. In this role, you will be responsible for...
-
Senior Performance Engineer
4 weeks ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA Job OpportunityWe are seeking a highly skilled Senior Performance Engineer to join our team at NVIDIA. As a key member of our AI platform, you will be responsible for building and optimizing the tools Deep Learning engineers use to design, develop, and deploy AI applications.Key Responsibilities:Develop and optimize open-source libraries, such as...
-
Senior Production SRE Engineer
4 weeks ago
Santa Clara, California, United States NVIDIA Full timeAbout the RoleNVIDIA is seeking a highly skilled Senior Production SRE Engineer to join our team. As a key member of our SRE team, you will be responsible for designing, implementing, and supporting large-scale storage clusters, including monitoring, logging, and alerting.You will work closely with peers on the team to improve the lifecycle of services –...
-
Senior Performance Engineer
3 days ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is seeking a highly skilled Senior Performance Engineer to join our team. As a key member of our organization, you will play a critical role in building and optimizing the tools Deep Learning engineers use to design, develop, and deploy AI applications.Key Responsibilities:Develop and optimize open-source libraries, such as Transformer Engine, to...
Scale-out Engineer
2 months ago
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
We're seeking a skilled AI Scale-Out Software Engineer to build and optimize our Tenstorrent scale-out fabric (TT-fabric) for distributed inference and training infrastructure. The ideal candidate will have expertise in deep learning, distributed systems, and low-level networking.
This role is hybrid, based out of Santa Clara, CA; Austin, TX; or Toronto, ON.
Responsibilities:
- Design, develop, and maintain TT-fabric, a low-level networking library for Tenstorrent AI processors built on top of Ethernet protocol
- Design and implement efficient distributed training systems for large-scale deep learning models
- Optimize network communication for multi-node AI processor clusters
- Tune system performance for inference and training of key AI models
- Work in the TT-Metalium team and integrate scale-out APIs into the Programming Model
- Work with AI model builder and researchers to improve both the scale out infrastructure and as well as model design
Experience & Qualifications:
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.
- Proven experience in low-level software development.
- Strong proficiency in programming languages such as C / C++.
- Experience with MPI or similar distributed computing frameworks
- Experience with low-level networking libraries (e.g., libfabric, libibverbs)
- Knowledge of networking protocols, especially Ethernet and InfiniBand
- Knowledge of high-performance interconnects
- Familiarity with RDMA programming
- Familiarity with large-scale deep learning frameworks (e.g., PyTorch, TensorFlow)
- Familiarity with network offload engines and SmartNICs
- Strong communication skills and the ability to work effectively with cross-functional teams.
- Passion for technology and a commitment to pushing the boundaries of what is possible in AI.
Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
Due to U.S. Export Control laws and regulations, Tenstorrent is required to ensure compliance with licensing regulations when transferring technology to nationals of certain countries that have been licensing conditions set by the U.S. government.
Our engineering positions and certain engineering support positions require access to information, systems, or technologies that are subject to U.S. Export Control laws and regulations, please note that citizenship/permanent residency, asylee and refugee information and/or documentation will be required and considered as Tenstorrent moves through the employment process.
If a U.S. export license is required, employment will not begin until a license with acceptable conditions is granted by the U.S. government. If a U.S. export license with acceptable conditions is not granted by the U.S. government, then the offer of employment will be rescinded.