Skip to content

Commit a412d3a

Browse files
committed
Update "Tiny Recursive Model" project page with new links to the research paper and GitHub repository. Revise subtitle for clarity and enhance content with an overview of the TRM architecture, including key components and their functions.
1 parent 80fcf5f commit a412d3a

File tree

4 files changed

+51
-3
lines changed

4 files changed

+51
-3
lines changed

app/blog/tiny-recursive-model/page.tsx

Lines changed: 32 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -184,6 +184,38 @@ export default function TinyRecursiveModelProject() {
184184
))}
185185
</div>
186186
)}
187+
188+
{/* Links to Paper and GitHub */}
189+
<div className="flex flex-wrap items-center justify-center gap-4 mb-8">
190+
<a
191+
href="https://arxiv.org/pdf/2510.04871v1"
192+
target="_blank"
193+
rel="noopener noreferrer"
194+
className="group flex items-center gap-2 px-6 py-3 bg-gradient-to-r from-blue-600/20 to-purple-600/20 hover:from-blue-600/30 hover:to-purple-600/30 border border-blue-500/30 hover:border-blue-400/50 text-blue-300 hover:text-blue-200 font-medium rounded-xl transition-all duration-300 shadow-lg hover:shadow-blue-500/25"
195+
>
196+
<svg className="w-5 h-5" fill="currentColor" viewBox="0 0 24 24">
197+
<path d="M9 12h6m-6 4h6m2 5H7a2 2 0 01-2-2V5a2 2 0 012-2h5.586a1 1 0 01.707.293l5.414 5.414a1 1 0 01.293.707V19a2 2 0 01-2 2z" />
198+
</svg>
199+
<span>Read Paper</span>
200+
<svg className="w-4 h-4 group-hover:translate-x-1 transition-transform" fill="none" stroke="currentColor" viewBox="0 0 24 24">
201+
<path strokeLinecap="round" strokeLinejoin="round" strokeWidth={2} d="M10 6H6a2 2 0 00-2 2v10a2 2 0 002 2h10a2 2 0 002-2v-4M14 4h6m0 0v6m0-6L10 14" />
202+
</svg>
203+
</a>
204+
<a
205+
href="https://github.com/vukrosic/learn-tiny-recursive-models"
206+
target="_blank"
207+
rel="noopener noreferrer"
208+
className="group flex items-center gap-2 px-6 py-3 bg-gradient-to-r from-purple-600/20 to-pink-600/20 hover:from-purple-600/30 hover:to-pink-600/30 border border-purple-500/30 hover:border-purple-400/50 text-purple-300 hover:text-purple-200 font-medium rounded-xl transition-all duration-300 shadow-lg hover:shadow-purple-500/25"
209+
>
210+
<svg className="w-5 h-5" fill="currentColor" viewBox="0 0 24 24">
211+
<path fillRule="evenodd" d="M12 2C6.477 2 2 6.484 2 12.017c0 4.425 2.865 8.18 6.839 9.504.5.092.682-.217.682-.483 0-.237-.008-.868-.013-1.703-2.782.605-3.369-1.343-3.369-1.343-.454-1.158-1.11-1.466-1.11-1.466-.908-.62.069-.608.069-.608 1.003.07 1.531 1.032 1.531 1.032.892 1.53 2.341 1.088 2.91.832.092-.647.35-1.088.636-1.338-2.22-.253-4.555-1.113-4.555-4.951 0-1.093.39-1.988 1.029-2.688-.103-.253-.446-1.272.098-2.65 0 0 .84-.27 2.75 1.026A9.564 9.564 0 0112 6.844c.85.004 1.705.115 2.504.337 1.909-1.296 2.747-1.027 2.747-1.027.546 1.379.202 2.398.1 2.651.64.7 1.028 1.595 1.028 2.688 0 3.848-2.339 4.695-4.566 4.943.359.309.678.92.678 1.855 0 1.338-.012 2.419-.012 2.747 0 .268.18.58.688.482A10.019 10.019 0 0022 12.017C22 6.484 17.522 2 12 2z" clipRule="evenodd" />
212+
</svg>
213+
<span>View Code</span>
214+
<svg className="w-4 h-4 group-hover:translate-x-1 transition-transform" fill="none" stroke="currentColor" viewBox="0 0 24 24">
215+
<path strokeLinecap="round" strokeLinejoin="round" strokeWidth={2} d="M10 6H6a2 2 0 00-2 2v10a2 2 0 002 2h10a2 2 0 002-2v-4M14 4h6m0 0v6m0-6L10 14" />
216+
</svg>
217+
</a>
218+
</div>
187219

188220
{/* Glow effect for the title */}
189221
<div className="absolute inset-0 text-4xl md:text-5xl lg:text-6xl font-medium leading-tight blur-sm">

app/page.tsx

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -247,15 +247,15 @@ export default function Home() {
247247
<span className="bg-slate-600/50 text-slate-300 text-xs px-2 py-1 rounded-md">Research</span>
248248
</div>
249249
<div className="absolute top-4 right-4">
250-
<span className="bg-purple-500/20 text-purple-400 text-xs px-2 py-1 rounded-md">New</span>
250+
<span className="bg-purple-500/20 text-purple-400 text-xs px-2 py-1 rounded-md">Latest</span>
251251
</div>
252252

253253
<div className="mt-8">
254254
<h4 className="text-xl font-bold mb-3 group-hover:text-purple-400 transition-colors">
255255
Tiny Recursive Model
256256
</h4>
257257
<p className="text-gray-400 text-sm mb-4 leading-relaxed">
258-
Exploring recursive architectures for efficient AI models
258+
How a 7M parameter model beats 100x bigger models at Sudoku, Mazes, and ARC-AGI using recursive reasoning with a 2-layer transformer
259259
</p>
260260
<div className="flex items-center justify-between">
261261
<span className="text-xs text-gray-500">AI Research</span>
162 KB
Loading

public/content/tiny-recursive-model/tiny-recursive-model-content.md

Lines changed: 17 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
hero:
33
title: "Tiny Recursive Model"
4-
subtitle: "Exploring recursive architectures for efficient AI models"
4+
subtitle: "New recursive reasoning AI architecture"
55
tags:
66
- "⏱️ Technical Deep Dive"
77
- "📄 Research Article"
@@ -15,6 +15,22 @@ hero:
1515

1616
It beats 100x bigger models in Sudoku-Extreme, Mazes, ARC-AGI and more.
1717

18+
In this tutorial we will learn how TRM works and do our own experiments.
19+
20+
---
21+
22+
## TRM Architecture Overview
23+
24+
![Tiny Recursive Model Architecture](/content/tiny-recursive-model/images/tiny-recursive-model-architecture.png)
25+
*Figure: The Tiny Recursive Model architecture showing the main processing block (4x transformer layers), input combination of question (x), answer (y), and reasoning (z), output processing for loss calculation, and the recursive update mechanism that iteratively refines the reasoning and prediction over up to 16 steps.*
26+
27+
The diagram above illustrates the complete TRM architecture. The model processes three key components:
28+
- **Input (x)**: The question or problem to solve (e.g., maze layout)
29+
- **Prediction (y)**: The model's current answer attempt
30+
- **Latent (z)**: The model's internal reasoning state
31+
32+
These are combined and processed through a 4-layer transformer stack, with the output used to compute cross-entropy loss. The key innovation is the recursive update mechanism at the bottom, which iteratively refines both the reasoning (z) and prediction (y) over multiple steps to progressively improve the solution.
33+
1834
---
1935

2036
## How TRM Works

0 commit comments

Comments
 (0)