TinyLoRA – Learning to Reason in 13 Parameters

📡hackernews

TinyLoRA – Learning to Reason in 13 Parameters

Recent research has shown that language models can learn to \textit{reason}, often via reinforcement learning. Some work even trains low-rank parameterizations for reasoning, but conventional LoRA cannot scale below the model dimension. We question whether even rank=1 LoRA is necessary for learning

This summary may be AI-generated and could contain inaccuracies. Read the original article for full details.

March 27, 2026·14d

hackernews trending

Read Original

0

Recent research has shown that language models can learn to \textit{reason}, often via reinforcement learning. Some work even trains low-rank parameterizations for reasoning, but conventional LoRA cannot scale below the model dimension. We question whether even rank=1 LoRA is necessary for learning to reason and propose TinyLoRA, a method for scaling low-rank adapters to sizes as small as one parameter. Within our new parameterization, we are able to train the 8B parameter size of Qwen2.5 to 91\

Article

TinyLoRA – Learning to Reason in 13 Parameters