Trending Model:#1Unlimited-OCRbaidu⬇630kTrending Model:#2Qwythos-9B-Claude-Mythos-5-1M-GGUFempero-ai⬇1114kTrending Model:#3GLM-5.2zai-org⬇160kTrending Model:#4Ornith-1.0-35B-GGUFdeepreinforce-ai⬇234kTrending Model:#5Ornith-1.0-9B-GGUFdeepreinforce-ai⬇191kTrending Model:#6gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUFyuxinlu1⬇289kTrending Model:#7Ornith-1.0-9Bdeepreinforce-ai⬇47kTrending Model:#8Qwen-AgentWorld-35B-A3BQwen⬇34kTrending Model:#9Ornith-1.0-35Bdeepreinforce-ai⬇135kTrending Model:#10Qwythos-9B-Claude-Mythos-5-1Mempero-ai⬇114kTrending Model:#1Unlimited-OCRbaidu⬇630kTrending Model:#2Qwythos-9B-Claude-Mythos-5-1M-GGUFempero-ai⬇1114kTrending Model:#3GLM-5.2zai-org⬇160kTrending Model:#4Ornith-1.0-35B-GGUFdeepreinforce-ai⬇234kTrending Model:#5Ornith-1.0-9B-GGUFdeepreinforce-ai⬇191kTrending Model:#6gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUFyuxinlu1⬇289kTrending Model:#7Ornith-1.0-9Bdeepreinforce-ai⬇47kTrending Model:#8Qwen-AgentWorld-35B-A3BQwen⬇34kTrending Model:#9Ornith-1.0-35Bdeepreinforce-ai⬇135kTrending Model:#10Qwythos-9B-Claude-Mythos-5-1Mempero-ai⬇114k

Supra-50M-Reasoning Spills Its Digital Thinking On Tiny Rigs

A small glowing brain neuron is made up of a transparent glass material with contrasting to bright neons.

Supra-50M-Reasoning is an experimental open-source language model that produces a step-by-step thinking process before giving its final response. Designed as the reasoning version of Supra-50M-Instruct, it adds a chain-of-thought segment to every answer. The model is part of SupraLabs’ Project Chimera, a collection of tiny AI models built for learning and tinkering.

SupraLabs created this model by fine-tuning their Supra-50M-Base with a small synthetic dataset of 500 examples, generated by the larger Qwen3 1.7B model. They trained it for six epochs using supervised fine-tuning to embed reasoning patterns. The result is a fully open, if error-prone, reasoning model that the team is sharing with the community.

Chain-of-thought reasoning on tiny hardware

Key features of Supra-50M-Reasoning
  • Generates a full thinking chain before answering.
  • Fine-tuned on 500 synthetic reasoning samples.
  • Trained for 6 epochs with supervised learning.
  • Open-source and part of Project Chimera.
  • Runs on consumer GPUs with bfloat16 precision.
  • Experimental with known hallucination issues.

If you run AI models on a home computer with a modest graphics card, this model gives you a safe way to observe chain-of-thought reasoning without needing cloud services. Researchers and students can study how a small model attempts to break down problems, though the thinking often includes errors. Hobbyists who enjoy tinkering with the smallest language models will find it a fun, educational tool.

Training details and known limitations

The model card reveals that only 500 handcrafted reasoning examples were used for fine-tuning, which heavily limits its accuracy and leads to frequent hallucinations. It also uses a fixed answer structure with `<|begin_of_thought|>` and `<|end_of_thought|>` tags that apps must parse. SupraLabs plans to scale up the experiment with Supra-124M and Supra-350M variants that will include chat, reasoning, and coding skills.

"It's experimental, it hallucinates, and it's fully open." — Source:Reddit