Senior AI/ML Engineer
3 days ago
About the Role
TEKHQS is seeking a Senior AI/ML Engineer to design, fine-tune, and deploy production-grade Generative AI and LLM-powered systems. This role is ideal for engineers who have shipped real-world ML systems, understand modern transformer architectures, and can operate across the full ML lifecycle—from data and training to inference and optimization.
You will work on scalable AI platforms, enterprise-grade GenAI solutions, and intelligent systems integrated into Web, ERP, and enterprise workflows. This is a hands-on role with strong ownership and architectural influence.
Key Responsibilities
- Design, fine-tune, and optimize transformer-based models (GPT, LLaMA, Mistral, T5) for production use cases.
- Build and maintain end-to-end GenAI pipelines: data processing, training, evaluation, deployment, and monitoring.
- Implement Retrieval-Augmented Generation (RAG) systems using vector databases and hybrid search.
- Optimize inference for latency, throughput, and cost efficiency.
- Work with multi-modal AI (text, embeddings, images, audio where applicable).
- Integrate AI services into enterprise applications, ERP systems, and SaaS platforms.
- Collaborate with product, backend, and cloud teams to deliver scalable AI solutions.
- Apply best practices in ML governance, security, and responsible AI.
Required Skills & Experience
Core AI / ML
- Strong experience with PyTorch and transformer architectures.
- Hands-on experience with LLMs, embeddings, fine-tuning (LoRA/QLoRA), and prompt engineering.
- Solid understanding of training vs inference tradeoffs, evaluation metrics, and model behavior.
GenAI & Systems
- Experience with RAG pipelines, vector databases (Pinecone, Weaviate, FAISS, Chroma).
- Familiarity with RLHF concepts (DPO, PPO, reward modeling) is a plus.
- Tokenization concepts (BPE, SentencePiece, Tiktoken).
Model Optimization & Deployment
- Quantization and optimization techniques (GPTQ, AWQ, int8, fp16).
- Model serving using vLLM, Triton, HuggingFace TGI, or similar.
- Experience deploying models on AWS, Azure, or GCP.
Data & Infrastructure
- Distributed training or inference using DeepSpeed, FSDP, Accelerate.
- Data pipelines using Parquet, WebDataset, or cloud storage.
- CI/CD for ML workflows.
Software Engineering
- Strong Python engineering practices.
- Docker and Kubernetes for ML workloads.
- Experience with monitoring, logging, and profiling ML systems.
Nice to Have
- Experience with ERP-integrated AI solutions (NetSuite, SAP, Dynamics).
- Exposure to multi-agent systems, orchestration frameworks, or AutoGen/LangGraph.
- Open-source contributions or published technical work.
Qualifications
- Bachelors or Masters degree in Computer Science, AI, Data Science, or related field.
- 4+ years of professional ML experience, with 3+ years in GenAI/LLMs.
- Proven experience deploying AI systems to production.
About TEKHQS
TEKHQS is a global technology solutions provider headquartered in Lake Forest, California, with a delivery team of 300+ professionals across Pakistan and other regions. We specialize in:
- Web & Mobile Development (Web 2.0)
- Blockchain & Web 3.0 Solutions
- AI/ML & Generative AI Systems
- ERP Services as a certified partner of SAP S/4HANA, Oracle NetSuite, and Microsoft Dynamics 365
We deliver enterprise-grade solutions across implementation, integration, customization, training, support, and staff augmentation.
-
Staff Product Manager – AI/ML
2 weeks ago
San Francisco, California, United States Snorkel AI Full timeAbout SnorkelAt Snorkel, we believe meaningful AI doesn't start with the model, it starts with the data.We're on a mission to help enterprises transform expert knowledge into specialized AI at scale. The AI landscape has gone through incredible changes between 2015, when Snorkel started as a research project in the Stanford AI Lab, to the generative AI...
-
Senior AI/ML Engineer
4 days ago
San Francisco, California, United States Sigma Computing Full time $240,000 - $270,000About the RoleAt Sigma, we're not just adding AI—we're building the future of how people work with data. Our platform already lets users explore billions of rows of data in seconds with a spreadsheet-like interface, analyze and present their data in workbooks, and build data apps and workflows. Now we're pushing further, applying AI to reshape how people...
-
Principal AI/ML Engineer
1 day ago
San Francisco, California, United States ExecutivePlacements Full timeResponsibilitiesLead and drive the development of technology and platform for the company's AI/ML engineering needs, ensure the functional richness, reliability, performance, and flexibility of this platformHelp design the architecture and lead the implementation of the AI/ML infrastructure, platform and services.Challenge the status quo and hold a high bar...
-
San Francisco, California, United States Symbolica AI Full timeAbout usSymbolica is an AI research lab pioneering the application of category theory to enable logical reasoning in machines.We're a well-resourced, nimble team of experts on a mission to bridge the gap between theoretical mathematics and cutting-edge technologies, creating symbolic reasoning models that think like humans – precise, logical, and...
-
AI ML engineer
2 weeks ago
San Francisco, California, United States Spheric Full timeMissionAt Navi, artificial intelligence isn't an experiment — it's mission-critical. We're building the AI co-pilot that will analyze flight data, interpret human performance, and deliver feedback that makes pilots safer and more effective. As an AI/ML Engineer, you'll design and deploy the machine learning systems that power everything from student...
-
AI/ML Engineer
2 weeks ago
San Francisco, California, United States Air Apps Full timeAbout Air AppsAt Air Apps, we believe in thinking bigger—and moving faster. We're a family-founded company on a mission to create the world's first AI-powered Personal & Entrepreneurial Resource Planner (PRP), and we need your passion and ambition to help us change how people plan, work, and live. Born in Lisbon, Portugal in 2018—and now with offices in...
-
Audio AI Research Engineer
2 days ago
San Francisco, California, United States David AI Full timeAbout David AIDavid AI is the first audio data research company. We bring an R&D approach to data–developing datasets with the same rigor AI labs bring to models. Our mission is to bring AI into the real world, and we believe audio is the gateway. Speech is versatile, accessible, and human—it fits naturally into everyday life. As audio AI advances and...
-
AI/ML Software Engineer
1 day ago
San Francisco, California, United States Candid Health Full timeAbout CandidAt Candid Health, we're on a mission to revolutionize healthcare by solving one of its most complex and costly problems: the billing and revenue cycle management (RCM) process. The healthcare system has long been burdened by slow, inefficient workflows that waste valuable resources, leaving providers with less time and money to focus on patient...
-
Founding ML Researcher
5 days ago
San Francisco, California, United States Unsiloed AI Full timeWe are hiring a Founding ML Researcher in San Francisco.We are building a small, talent-dense team. This role will define the engineering archetype at Unsiloed AI and set the ceiling for the team. We strongly believe technical DNA compounds (or degrades) with every hire and hence the first few matter disproportionately. You will be expected to operate...
-
AI/ML Software Engineer
1 week ago
San Francisco, California, United States Candid Health Full timeAbout CandidAt Candid Health, we're on a mission to revolutionize healthcare by solving one of its most complex and costly problems: the billing and revenue cycle management (RCM) process. The healthcare system has long been burdened by slow, inefficient workflows that waste valuable resources, leaving providers with less time and money to focus on patient...