Gen-AI Engineer

3 days ago


Raleigh, NC, United States Cisco Systems, Inc. Full time
Who We Are
The Cisco IT team is changing the way we run Cisco's operations by leveraging the power of technology, the best of business processes, and outstanding data insights. We are redefining how Cisco designs and delivers the employee, partner, and customer experience based on a culture that values customer service. We strive for speed and agility in all that we do. Above all else, we are kind to each other. We aspire to be an industry-leading IT team, with a strong focus on AI and security to differentiate us and foster innovation. To achieve simplicity and the best employee and customer experience, we need excellent talent and the right abilities to succeed.
Who You'll Work With
You will work with our amazing Data Infrastructure & Platforms team, part of Infrastructure & Cloud Services. You will build solutions and support them across our portfolio of capabilities, primarily focusing on enabling AI/ML and Generative AI capabilities in a multi-functional team setup. You will collaborate with other engineers, architects, and organizational leadership.
Who You Are
We are looking for a highly skilled Senior GenAI Engineer to lead the deployment and management of on-premises Large Language Models (LLMs), with a focus on Retrieval Augmented Generation (RAG). This role requires expertise in developing and supporting large-scale GenAI and ML platforms, with a strong emphasis on document management, security, vector databases, and workflow orchestration. The successful candidate will have extensive experience in responsible AI practices, including toxicity screening and ensuring regulatory compliance across AI solutions.
What You'll Do
  • Deploy and manage Large Language Models (LLMs) for on-prem environments, focusing on Retrieval Augmented Generation (RAG) and ensuring high-performance infrastructure.
  • Build and optimize secure, scalable AI/ML platforms that support enterprise-level document management and data security protocols.
  • Design, implement, and manage workflows to ensure seamless document processing, ingestion, classification, and retrieval within AI models.
  • Implement and manage vector databases to support efficient document search and retrieval within AI workflows.
  • Ensure compliance with organizational and regulatory data security standards, including encryption, access control, and auditing of sensitive documents used within AI models.
  • Implement and maintain responsible AI practices, including toxicity screening, bias detection, and regulatory compliance to ensure ethical and safe AI usage.
  • Collaborate with cross-functional teams to ensure data privacy and information security requirements are met when processing documents through AI models.
  • Continuously evaluate and improve infrastructure to support evolving AI/ML needs, particularly focusing on document ingestion, classification, and retrieval.
  • Develop and maintain automated pipelines for LLM deployment and secure document processing with real-time monitoring and alerts.
  • Work closely with compliance, legal, and governance teams to ensure AI models are aligned with security and regulatory frameworks.
  • Stay updated on the latest advancements in AI/ML, document security, and responsible AI practices.
Minimum Qualifications
  • Bachelor's in Computer Science, Computer Engineering, Electrical Engineering, or a related STEM field.
  • 7+ years of experience in engineering, with at least 2 years specifically in AI/ML engineering.
  • Proficiency in programming languages such as Python, Java, C++, or similar.
  • Hands-on experience with ML frameworks such as Kubeflow, AI operators, and/or LangChain, with proficiency in Python for AI operations.
  • Experience with MLOps principles, including model deployment, versioning, and/or monitoring in secure environments.

Preferred Qualifications
  • Master's degree.
  • Focus on on-prem LLM deployments with an emphasis on RAG.
  • Expertise in building large-scale, secure AI/ML platforms with an emphasis on document management and security protocols.
  • Experience with vector databases, such as Pinecone, Weaviate, Milvus, etc.
  • Experience with multi-instance GPUs and containerized AI/ML workflows.
  • Proven ability to collaborate with cross-functional teams, including legal, compliance, and security experts.
  • Understanding of document lifecycle management, particularly in the context of AI model ingestion, classification, and retrieval.
Why Cisco
#WeAreCisco, where each person is unique, but we bring our talents to work as a team and make a difference powering an inclusive future for all.
We embrace digital and help our customers implement change in their digital businesses. Some may think we're "old" (39 years strong) and only about hardware, but we're also a software company. And a security company. We even invented an intuitive network that adapts, predicts, learns, and protects. No other company can do what we do - you can't put us in a box.

But "Digital Transformation" is an empty buzz phrase without a culture that allows for innovation, creativity, and yes, even failure (if you learn from it).

Day to day, we focus on the give and take. We give our best, give our egos a break, and give of ourselves (because giving back is built into our DNA). We take accountability, bold steps, and take difference to heart. Because without diversity of thought and a dedication to equality for all, there is no moving forward.

So, you have colorful hair? Don't care. Tattoos? Show off your ink. Like polka dots? That's cool. Pop culture geek? Many of us are. Passion for technology and world changing? Be you, with us.
