Saksham Pathak

Saksham Pathak

About Me

I'm a Generative AI Engineer passionate about building intelligent systems that push the boundaries of what's possible with AI. Currently pursuing my M.Sc in AI/ML at IIIT Lucknow with an SGPI of 8+.

My expertise lies in creating sophisticated RAG systems, voice AI applications, and deploying large language models at scale. I thrive on transforming complex AI concepts into production-ready solutions that deliver real-world impact.

When I'm not training models or fine-tuning embeddings, you'll find me exploring the latest research papers or contributing to open-source AI projects on Hugging Face.

Python JavaScript C++ FastAPI LangChain LlamaIndex PyTorch TensorFlow Transformers Vector DBs RAG Systems Voice AI TTS Diffusion Models LLM Fine-tuning Docker AWS Git

Technical Expertise

Large Language Models

Expert in deploying and fine-tuning LLMs, prompt engineering, and building context-aware AI applications

Voice AI & TTS

Building real-time voice agents, speech synthesis systems, and conversational AI interfaces

RAG Systems

Architecting retrieval-augmented generation pipelines with FAISS, ChromaDB, and Pinecone

Backend Development

FastAPI, Flask, REST APIs, microservices architecture, and scalable AI deployment

ML Frameworks

PyTorch, TensorFlow, Hugging Face, LangChain, LlamaIndex for production ML systems

Generative Models

Diffusion models, GANs, VAEs, and multimodal AI for image and text generation

Featured Projects

FALCON

Advanced fake news detection system using ensemble learning and NLP techniques. Achieves 95% accuracy in identifying misinformation.

Python BERT PyTorch FastAPI

FaceAging-AI

Deep learning model for age progression/regression in facial images using conditional GANs and style transfer.

GANs TensorFlow OpenCV Streamlit

Enterprise RAG System

Production-grade RAG pipeline with multi-document retrieval, semantic search, and LLM orchestration for enterprise knowledge bases.

LangChain FAISS GPT-4 ChromaDB

Real-time Voice AI Agent

Conversational AI system with real-time speech recognition, natural language understanding, and text-to-speech synthesis.

Whisper ElevenLabs LLM WebRTC

Experience & Achievements

M.Sc AI/ML

2023 - Present

Indian Institute of Information Technology, Lucknow
SGPI: 8+ | Specializing in Generative AI and Deep Learning

IIT JAM Qualified

2023

Cleared the prestigious IIT Joint Admission Test for M.Sc programmes with strong percentile

GenAI Research Projects

2023 - 2024

Built multiple production-ready generative AI applications including RAG systems, voice agents, and diffusion models

Open Source Contributions

Ongoing

Active contributor on Hugging Face with multiple AI models and datasets. Building the future of AI, one commit at a time

AI/ML Publications

2024

Research work on novel RAG architectures and voice AI optimization techniques

Get In Touch