AI/ML Engineer

Building intelligentsystems thatmatter

AI/ML Engineer developing scalable AI/ML and intelligent systems that redefine how organizations process and understand information.

Scroll to explore

About

I focus on advancing memory-augmented LLMs and AI agents, currently building project-specific hierarchical memory systems for more coherent, long-term reasoning.

My past work spans vision models, multilingual RAG systems, and large-scale regressors, shaping my interest in scalable intelligence and how models store, update, and use structured knowledge effectively.

LocationPunjab, India
Emailsahil.chambyal@outlook.com
StatusAvailable for opportunities

Featured Projects

Hybrid ViT Model for Crop Diagnosis

Nov 2025

Advanced Crop Disease Detection System

Designed a hybrid ViT + CNN + Custom MLP architecture that processes images in parallel to detect 38 crop disease classes, achieving 85% training accuracy and 99.5% validation accuracy. Developed an IoT-ready inference pipeline for real-time deployment with sensor integration and filed a patent covering the model architecture and end-to-end diagnostic workflow.

Vision Transformers
CNN
Custom MLP
IoT
Key Metrics
99.5% Validation Accuracy
38 Disease Classes
IoT-Ready Pipeline
Patent Filed
Hybrid ViT Model for Crop Diagnosis screenshot 1
Hybrid ViT Model for Crop Diagnosis screenshot 2

ORIN Tri-Sense AI

Oct 2025

Multilingual RAG Chat-Runtime

Built and fine-tuned a multilingual RAG chat-runtime for universities, government, and private orgs (6+ languages). Optimized proprietary RAG (Pinecone) with tri-source retrieval and fine-tuning, reaching 86% relevance and 0.72 MRR. Scaled to 1k sessions / 200 QPS (latency<300 ms), reducing time-to-answer from 96 hours to 10 minutes.

RAG
Pinecone
Multilingual NLP
Fine-tuning
Key Metrics
1k Sessions/200 QPS
<300ms Latency
86% Relevance
0.72 MRR
ORIN Tri-Sense AI screenshot 1
ORIN Tri-Sense AI screenshot 2

Product Price Regressor (DeBERTa-v3)

Oct 2025

Price Prediction System

Designed and iteratively fine-tuned a DeBERTa-v3-base regression model over 30 epochs (15+7+8) on a 75,000-sample dataset for price prediction, ultimately achieving a final best validation metric of SMAPE 21.73% (down from initial 63.17%). Optimized training efficiency and stability on a Tesla P100-PCIE-16GB GPU using Automatic Mixed Precision (FP32-FP16), a 2e-5 learning rate, 0.01 weight decay, and gradient clipping (max_norm=1.0), with final architecture size being 183 Million Parameters.

DeBERTa-v3
Regression
PyTorch
Mixed Precision
Key Metrics
SMAPE 21.73%
75k Samples
183M Parameters
Tesla P100 GPU
Product Price Regressor (DeBERTa-v3) screenshot 1
Product Price Regressor (DeBERTa-v3) screenshot 2

Skills & Technologies

Languages

Python
Java
C++
TypeScript
JavaScript
SQL
C

AI/ML Frameworks

OpenCV
Hugging Face
PyTorch
TensorFlow
LangChain
Pandas
NumPy

Platforms & Tools

AWS
Azure
GCP
MongoDB
Linux
Git
LangSmith
n8n

Education & Certifications

Education

Bachelor of Technology

Computer Science and Engineering

Lovely Professional University • CGPA 8.33 • 2022-2026

Minor: Machine Learning and Deep Learning

Senior Secondary Education

Physics, Chemistry, Math, Computer Science

Sainik School Kapurthala • 2015-2022

Certifications & Achievements

Building Intelligent Troubleshooting Agents

Microsoft • Sep 2025

AI and Machine Learning Algorithms

Microsoft • Sep 2025

Generative AI with Large Language Models

DeepLearning.AI • Apr 2024

Microsoft Founder Hub Member

$20,000 in benefits and cloud credits

NCC A & B Certificates

6 years of service