DEV Community

Suman Nath profile picture

Suman Nath

I learn ML by building it, breaking it, and writing down what broke. Currently fine-tuning LLMs (full FT, LoRA, QLoRA) — the walls tutorials skip.

Location Bengaluru, India Joined Joined on  Personal website https://nathlabs.com/ github website
A Better LLM Judge? The Rubric Made My Small Model Worse

A Better LLM Judge? The Rubric Made My Small Model Worse

Comments
5 min read

Want to connect with Suman Nath?

Create an account to connect with Suman Nath. You can also sign in below to proceed if you already have an account.

Already have an account? Sign in
LLM-as-a-Judge: I Built One From Scratch, Then Checked It Against Humans

LLM-as-a-Judge: I Built One From Scratch, Then Checked It Against Humans

Comments
4 min read
Breaking down the accuracy number: Building an LLM Eval Harness From Scratch

Breaking down the accuracy number: Building an LLM Eval Harness From Scratch

Comments 1
4 min read
If a 270M Model Already Worked, Why Did I Fine-Tune a 7B One?

If a 270M Model Already Worked, Why Did I Fine-Tune a 7B One?

Comments
3 min read
QLoRA: Fine-Tuning a 7B Model on a 16GB GPU (It Shrank to 5.4GB in Front of Me)

QLoRA: Fine-Tuning a 7B Model on a 16GB GPU (It Shrank to 5.4GB in Front of Me)

Comments
3 min read
LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune

LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune

Comments
3 min read
I Fine-Tuned a 270M Model on My Laptop (Full Fine-Tuning, From Scratch)

I Fine-Tuned a 270M Model on My Laptop (Full Fine-Tuning, From Scratch)

Comments
2 min read
loading...