AI & Machine Learning Engineer — Gwalior, India

SUBHASH
DANGI

Building systems that learn — from RLHF-based LLM training to production-grade computer vision and predictive pipelines.

97%
Face recognition accuracy
95%+
RLHF annotation quality
24K+
Training images processed
10K+
Agricultural records modeled
01
About

Who I Am

I'm a Data Science & Machine Learning professional with hands-on experience in RLHF-based LLM training, deep learning architectures (CNN, LSTM), and building end-to-end ML pipelines from messy raw data to deployed, production-ready systems.

At Ethara.AI, I evaluated large-scale conversational datasets for LLM improvement — sustaining 95%+ annotation accuracy across six quality dimensions including safety, coherence, and hallucination detection across thousands of samples.

My project work includes a biometric authentication system that hit 97% face recognition accuracy on 24,000+ training images, and a multi-module agricultural intelligence system processing over 10,000 crop records for price forecasting and trend analysis.

Currently pursuing B.E. Computer Science at MITS Gwalior (2024–27), actively seeking AI/ML engineering roles where I can build systems that matter.

LocationGwalior, MP, India
Phone+91-6265238648
profile.py
# Subhash Dangi

class MLEngineer:

  name     = "Subhash Dangi"
  location = "Gwalior, India"
  status   = "Open to work"

  expertise = [
    "RLHF / LLM Training",
    "CNN / LSTM",
    "Computer Vision",
    "Predictive Modeling",
    "NLP & Data Annotation",
  ]

  education = [
    "B.E. CSE — MITS (2024–27)",
    "Diploma — SRGP (2021–24)",
  ]

  def goal(self):
    return "Build systems that matter."
02
Skills

Technical Stack

Languages
Python SQL
ML & AI
RLHF LLMs CNN LSTM Regression Classification Cross-Validation
Data Engineering
Pandas NumPy Feature Engineering Data Cleaning Advanced Excel
Visualization
Tableau Power BI Matplotlib Seaborn
Databases
MySQL Spark SQL BigQuery
Tools & Metrics
Git / GitHub OpenCV F1-Score RMSE FAR / FRR Confusion Matrix
Python / ML
Deep Learning
Data Engineering
Visualization
03
Experience

Work History

Data Labeling Specialist (RLHF)
Ethara.AI · Gwalior, MP
Jul 2024 — Jan 2025
  • Annotated and assessed large-scale conversational datasets for LLM training via RLHF, maintaining 95%+ annotation accuracy across 6 quality dimensions: accuracy, relevance, coherence, safety, bias detection, and hallucination identification.
  • Ranked and evaluated 1,000+ AI-generated responses per month, reducing downstream model error rates through systematic quality assurance and validation workflows.
  • Processed thousands of labeled samples over 7 months, maintaining dataset consistency and integrity to support model fine-tuning and reinforcement learning optimization.
04
Projects

Featured Work

01
LiveTransator

A real-time AI translation app deployed live on Hugging Face Spaces. Demonstrates end-to-end deployment of language models — users can translate text instantly through a clean web interface without any setup.

Live on HuggingFace Real-time AI Deployed
Python LLMs Hugging Face Gradio
02
FutureFarm — Agricultural Intelligence

A 3-module predictive system for crop price forecasting, market trend analysis, and seed quality assessment. Processed 10,000+ records end-to-end, improving input quality by ~30%. Models evaluated on Accuracy, Precision, Recall, F1, and RMSE.

10K+ Records ~30% Quality Gain 3 Modules
Python LSTM CNN LLMs Pandas Matplotlib
03
Biometric Authentication System

Multimodal CNN-based system for face and fingerprint recognition. Trained on 24,000+ face images across 12 classes and 2,000+ fingerprint samples. Achieved 97% face recognition accuracy — production-level reliability confirmed via FAR, FRR, and Confusion Matrix analysis.

97% Accuracy 24K+ Images Enterprise-ready
Python CNN OpenCV Kaggle
05
Contact

Get In Touch

Open to full-time roles, internships, and collaborations.
If you have something interesting — let's talk.