DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning
*** DINO-World Model ***
New work from @gaoyuezhou, @gary_phkkk, @lerrelpinto and @yannlecun.
A system capable of planning complex action sequences using an action-conditioned world model trained from off-line data.
Pure planning (no reinforcement learning whatsoever).
It uses a JEPA architecture in which the image encoder is DINOv2, and the action-conditioned predictor trained from off-line data.
2
1 comment
Marcio Pacheco
7
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning
Data Alchemy
skool.com/data-alchemy
Your Community to Master the Fundamentals of Working with Data and AI — by Datalumina®
Leaderboard (30-day)
Powered by