diff --git a/README.md b/README.md
index 64459cfe..c6548687 100644
--- a/README.md
+++ b/README.md
@@ -31,7 +31,7 @@ V-JEPA 2 is a self-supervised approach to training video encoders, using interne
 **(Top)** The encoder and predictor are pre-trained through self-supervised learning from video using a masked latent feature prediction objective, leveraging abundant natural videos to bootstrap physical world understanding and prediction. **(Bottom)** Performance of V-JEPA 2 on downstream understanding and prediction tasks.
-
+
Benchmark