llm-sft-loss
Description
SFT Loss Function Design
Research Question
Design a novel loss function for supervised fine-tuning (SFT) of large language models that improves downstream task performance compared to standard cross-entropy.
Background
Standard SFT uses cross-entropy loss to train models on instruction-response pairs. However, cross-entropy treats all tokens equally and does not account for varying token difficulty or the distribution of model confidence across the vocabulary. Recent work has explored alternative loss functions that can improve SFT quality:
- Focal Loss: Down-weights easy (high-confidence) tokens so the model focuses on harder tokens, controlled by a focusing parameter gamma.
- Label Smoothing: Regularizes by distributing a small probability mass across all vocabulary tokens, preventing overconfident predictions.
- GEM (Generalized Entropy Minimization): Uses an auxiliary softmax distribution at a different temperature to create a contrastive weighting scheme that emphasizes hard examples.
- DFT (Distribution-aware Fine-Tuning): Reweights per-token losses by the model's own prediction confidence.
- Entropy Regularization: Adds a term encouraging higher entropy in the output distribution, which can improve exploration and generalization.
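Two of the simpler options above, focal loss and label smoothing, can be sketched at the per-token level as follows. This is a minimal illustration, not the task's reference implementation; the function names and the gamma/eps defaults are illustrative choices.

```python
import torch
import torch.nn.functional as F

def focal_loss(logits: torch.Tensor, targets: torch.Tensor, gamma: float = 2.0) -> torch.Tensor:
    """Per-token focal loss: down-weights easy (high-confidence) tokens.

    logits: (N, V), targets: (N,). gamma=0 recovers plain cross-entropy.
    """
    log_probs = F.log_softmax(logits, dim=-1)
    nll = -log_probs.gather(1, targets.unsqueeze(1)).squeeze(1)  # per-token CE
    pt = nll.neg().exp()  # model probability of the correct token
    return ((1.0 - pt) ** gamma * nll).mean()

def smoothed_ce(logits: torch.Tensor, targets: torch.Tensor, eps: float = 0.1) -> torch.Tensor:
    """Cross-entropy with label smoothing: (1 - eps) mass on the target,
    eps spread uniformly over the vocabulary (built into F.cross_entropy)."""
    return F.cross_entropy(logits, targets, label_smoothing=eps)
```

With gamma = 0 the focal term collapses to standard cross-entropy, which makes it easy to sanity-check an implementation before tuning gamma upward.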
Task
Implement custom_loss_func in custom_sft_loss.py. Your loss function receives:
- outputs: model output dict (use outputs.get("logits") for logits)
- labels: target token IDs (NOT pre-shifted; apply causal shift yourself)
- num_items_in_batch: optional normalization scalar for gradient accumulation
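A baseline matching this signature might look like the sketch below, assuming standard Hugging Face causal-LM conventions (logits at position t predict the token at position t+1, IGNORE_INDEX = -100 masks prompt tokens). It reproduces plain cross-entropy; a custom design would replace the weighting inside.

```python
from typing import Optional

import torch
import torch.nn.functional as F

IGNORE_INDEX = -100

def custom_loss_func(outputs, labels: torch.Tensor,
                     num_items_in_batch: Optional[int] = None) -> torch.Tensor:
    """Cross-entropy baseline with the causal shift applied here, not upstream."""
    logits = outputs.get("logits")
    # Shift so logits[t] is scored against labels[t + 1].
    shift_logits = logits[..., :-1, :].contiguous()
    shift_labels = labels[..., 1:].contiguous()
    loss = F.cross_entropy(
        shift_logits.view(-1, shift_logits.size(-1)),
        shift_labels.view(-1),
        ignore_index=IGNORE_INDEX,
        # Sum then divide when an accumulation-aware denominator is supplied.
        reduction="sum" if num_items_in_batch is not None else "mean",
    )
    if num_items_in_batch is not None:
        loss = loss / num_items_in_batch
    return loss
```

The num_items_in_batch branch follows the common Trainer pattern of summing token losses and normalizing once per accumulated batch, so gradient accumulation does not change the effective loss scale.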
The model is fine-tuned on MetaMathQA (20K samples) with Qwen3-1.7B, then evaluated on hellaswag, arc_challenge, and piqa using lm-evaluation-harness. Your goal is to maximize downstream evaluation accuracy.
Code
"""Custom SFT loss function for LLaMA-Factory supervised fine-tuning.

Replace the default cross-entropy loss with a custom loss function.
This module is imported by the SFT trainer and used as compute_loss_func.
"""

from typing import Optional

import torch
import torch.nn.functional as F

IGNORE_INDEX = -100

# ===== EDITABLE SECTION START =====
# Lines below (until EDITABLE SECTION END) are the region the agent may modify.
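One way the editable section could be filled in, using the entropy-regularization idea from Background: subtract a small entropy bonus from per-token cross-entropy on supervised positions. The beta coefficient and masking details are assumptions for illustration, not values prescribed by the task.

```python
import torch
import torch.nn.functional as F

IGNORE_INDEX = -100

def custom_loss_func(outputs, labels, num_items_in_batch=None):
    """Cross-entropy minus an entropy bonus (beta is an untuned guess)."""
    logits = outputs.get("logits")
    shift_logits = logits[..., :-1, :].contiguous().float()
    shift_labels = labels[..., 1:].contiguous()
    flat_logits = shift_logits.view(-1, shift_logits.size(-1))
    flat_labels = shift_labels.view(-1)
    mask = flat_labels != IGNORE_INDEX  # supervised (response) positions only

    ce = F.cross_entropy(flat_logits, flat_labels,
                         ignore_index=IGNORE_INDEX, reduction="sum")
    # Encourage higher output entropy: H = -sum p log p per supervised token.
    log_probs = F.log_softmax(flat_logits[mask], dim=-1)
    entropy = -(log_probs.exp() * log_probs).sum(-1)

    beta = 0.1  # hypothetical regularization strength
    loss = ce - beta * entropy.sum()
    denom = num_items_in_batch if num_items_in_batch is not None else mask.sum().clamp(min=1)
    return loss / denom
```

Because the entropy term is non-negative, this loss is bounded above by plain cross-entropy; whether it actually helps on hellaswag, arc_challenge, and piqa is an empirical question the task is asking you to answer.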
Additional context files (read-only):
LLaMA-Factory/src/llamafactory/train/trainer_utils.py
Results
No results available yet.