Energy-Based Transformers are Scalable Learners and Thinkers by Filipa Lino
This presentation summarizes the paper “Energy-Based Transformers are Scalable Learners and Thinkers”, which combines energy-based models with standard Transformers to let neural networks iteratively refine and self-verify their predictions.

