DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

*** DINO-World Model ***

New work from @gaoyuezhou, @gary_phkkk, @lerrelpinto and @yannlecun.

A system capable of planning complex action sequences using an action-conditioned world model trained from off-line data.

Pure planning (no reinforcement learning whatsoever).

It uses a JEPA architecture in which the image encoder is DINOv2, and the action-conditioned predictor trained from off-line data.

1 comment

skool.com/data-alchemy

Your Community to Master the Fundamentals of Working with Data and AI — by Datalumina®

Leaderboard (30-day)

+203

+74

+33

+32

+28