AI & Machine Learning Engineer — Gwalior, India

SUBHASH DANGI

Building systems that learn — from RLHF-based LLM training to production-grade computer vision and predictive pipelines.


97%

Face recognition accuracy

95%+

RLHF annotation quality

24K+

Training images processed

10K+

Agricultural records modeled

Who I Am

I'm a Data Science & Machine Learning professional with hands-on experience in RLHF-based LLM training, deep learning architectures (CNN, LSTM), and building end-to-end ML pipelines from messy raw data to deployed, production-ready systems.

At Ethara.AI, I evaluated large-scale conversational datasets for LLM improvement, sustaining 95%+ annotation accuracy over thousands of samples across six quality dimensions, including safety, coherence, and hallucination detection.

My project work includes a biometric authentication system that reached 97% face recognition accuracy after training on 24,000+ images, and a multi-module agricultural intelligence system that processes over 10,000 crop records for price forecasting and trend analysis.

I'm currently pursuing a B.E. in Computer Science at MITS Gwalior (2024–27) and actively seeking AI/ML engineering roles where I can build systems that matter.

Location Gwalior, MP, India

Email st964494@gmail.com

Phone +91-6265238648

GitHub github.com/st732887

LinkedIn subhash-thakur

profile.py

# Subhash Dangi

class MLEngineer:
    """Profile summary expressed as a Python class."""

    name = "Subhash Dangi"
    location = "Gwalior, India"
    status = "Open to work"

    # Core areas of expertise
    expertise = [
        "RLHF / LLM Training",
        "CNN / LSTM",
        "Computer Vision",
        "Predictive Modeling",
        "NLP & Data Annotation",
    ]

    # Degrees, most recent first
    education = [
        "B.E. CSE — MITS (2024–27)",
        "Diploma — SRGP (2021–24)",
    ]

    def goal(self):
        return "Build systems that matter."

Technical Stack

Languages

Python · SQL

ML & AI

RLHF · LLMs · CNN · LSTM · Regression · Classification · Cross-Validation

Data Engineering

Pandas · NumPy · Feature Engineering · Data Cleaning · Advanced Excel

Visualization

Tableau · Power BI · Matplotlib · Seaborn

Databases

MySQL · Spark SQL · BigQuery

Tools & Metrics

Git / GitHub · OpenCV · F1-Score · RMSE · FAR / FRR · Confusion Matrix

Work History

Data Labeling Specialist (RLHF)

Ethara.AI · Gwalior, MP

Jul 2024 — Jan 2025

Annotated and assessed large-scale conversational datasets for LLM training via RLHF, maintaining 95%+ annotation accuracy across 6 quality dimensions: accuracy, relevance, coherence, safety, bias detection, and hallucination identification.
Ranked and evaluated 1,000+ AI-generated responses per month, reducing downstream model error rates through systematic quality assurance and validation workflows.
Processed thousands of labeled samples over 7 months, maintaining dataset consistency and integrity to support model fine-tuning and reinforcement learning optimization.
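
To make the "95%+ annotation accuracy across 6 quality dimensions" figure concrete, here is a minimal sketch of how per-dimension agreement with reference labels could be computed. It is an illustration only, not the tooling used at Ethara.AI: the record layout, the shortened dimension keys, and the `annotation_accuracy` helper are all assumptions.

```python
# The six quality dimensions listed above, as shortened (hypothetical) keys.
DIMENSIONS = ("accuracy", "relevance", "coherence", "safety", "bias", "hallucination")


def annotation_accuracy(annotations, reference):
    """Return the share of samples where the annotator matches the reference label,
    computed separately for each quality dimension.

    Both arguments are equal-length lists of dicts mapping dimension -> label.
    This layout is assumed for illustration.
    """
    if not reference or len(annotations) != len(reference):
        raise ValueError("need two non-empty lists of equal length")

    hits = {dim: 0 for dim in DIMENSIONS}
    for ann, ref in zip(annotations, reference):
        for dim in DIMENSIONS:
            hits[dim] += int(ann[dim] == ref[dim])

    return {dim: hits[dim] / len(reference) for dim in DIMENSIONS}


def meets_target(scores, target=0.95):
    """True only if every dimension reaches the target agreement rate."""
    return all(score >= target for score in scores.values())
```

Reporting agreement per dimension, rather than as one pooled number, keeps a weak dimension such as safety from being hidden by strong scores elsewhere.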

Featured Work

LiveTranslator

A real-time AI translation app deployed live on Hugging Face Spaces. Demonstrates end-to-end deployment of language models — users can translate text instantly through a clean web interface without any setup.

Live on Hugging Face · Real-time AI · Deployed

Python · LLMs · Hugging Face · Gradio

Live Demo ↗

FutureFarm — Agricultural Intelligence

A 3-module predictive system for crop price forecasting, market trend analysis, and seed quality assessment. Processed 10,000+ records end-to-end, improving input quality by ~30%. Models evaluated on Accuracy, Precision, Recall, F1, and RMSE.

10K+ Records · ~30% Quality Gain · 3 Modules

Python · LSTM · CNN · LLMs · Pandas · Matplotlib

Live Demo ↗
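
For readers unfamiliar with the evaluation metrics named for FutureFarm (Accuracy, Precision, Recall, F1, RMSE), the sketch below shows their standard definitions in plain Python. It is not the project's actual evaluation code; the function names and the binary-label convention (`positive=1`) are assumptions made for the example.

```python
import math


def classification_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall, and F1 for a binary classification task."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)  # true positives
    fp = sum(t != positive and p == positive for t, p in pairs)  # false positives
    fn = sum(t == positive and p != positive for t, p in pairs)  # false negatives

    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


def rmse(actual, predicted):
    """Root mean squared error, e.g. for a price-forecasting regression."""
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))
```

The classification metrics would suit a module such as seed quality assessment, while RMSE suits the price-forecasting module, since it is measured in the same units as the predicted price.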

Biometric Authentication System

Multimodal CNN-based system for face and fingerprint recognition. Trained on 24,000+ face images across 12 classes and 2,000+ fingerprint samples. Achieved 97% face recognition accuracy — production-level reliability confirmed via FAR, FRR, and Confusion Matrix analysis.

97% Accuracy · 24K+ Images · Enterprise-ready

Python · CNN · OpenCV · Kaggle
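
FAR (false acceptance rate) and FRR (false rejection rate) are the standard reliability measures for a biometric matcher. The sketch below shows how they could be computed from match scores at a given decision threshold. It is an illustration rather than the project's code: the `far_frr` helper, the score ranges, and the "accept when score >= threshold" rule are assumptions.

```python
def far_frr(genuine_scores, impostor_scores, threshold):
    """Return (FAR, FRR) for a matcher that accepts when score >= threshold.

    genuine_scores:  match scores for attempts by the enrolled person
    impostor_scores: match scores for attempts by someone else
    """
    if not genuine_scores or not impostor_scores:
        raise ValueError("both score lists must be non-empty")

    false_accepts = sum(score >= threshold for score in impostor_scores)
    false_rejects = sum(score < threshold for score in genuine_scores)

    far = false_accepts / len(impostor_scores)  # impostors wrongly let in
    frr = false_rejects / len(genuine_scores)   # genuine users wrongly turned away
    return far, frr


# Hypothetical scores, for illustration only.
genuine = [0.90, 0.80, 0.40, 0.95]
impostor = [0.10, 0.60, 0.30, 0.20]
far, frr = far_frr(genuine, impostor, threshold=0.5)
```

Raising the threshold lowers FAR at the cost of a higher FRR, so the two are normally reported together for the chosen operating point.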

Get In Touch

Open to full-time roles, internships, and collaborations.
If you have something interesting — let's talk.

Email st964494@gmail.com

GitHub github.com/st732887

LinkedIn subhash-thakur-7ab548335

Hugging Face huggingface.co/subhashdangi

Phone +91-6265238648
