Pretrained Transformers as Universal Computation Engines

A study of frozen pretrained transformers as reusable computation engines across non-language sequence tasks.

paper · queued
Kevin Lu et al.
arXiv:2103.05247

This paper bridges two views of transformers: as language models and as general-purpose computational substrates. The interesting claim is not just that pretraining transfers, but that a language-pretrained transformer whose self-attention and feedforward layers stay frozen can support numerical, vision, and protein tasks. Only a small set of task-specific parameters is finetuned: the input and output layers, the positional embeddings, and the layer norm parameters.
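To get a feel for how small "relatively small" is, here is a rough parameter count for a frozen-transformer setup. It is a sketch under stated assumptions, not the paper's own accounting: the `Config` values are hypothetical, chosen to resemble a GPT-2-small-sized model with a 16-dimensional input token (for example a flattened 4x4 image patch) and 10 output classes, and the split between frozen and trainable parts follows the description above.

```python
from dataclasses import dataclass


@dataclass
class Config:
    # Hypothetical values, chosen to resemble a GPT-2-small-sized model.
    d_model: int = 768
    n_layers: int = 12
    n_positions: int = 1024
    d_in: int = 16       # assumed input token dimension (e.g. a flattened 4x4 patch)
    n_classes: int = 10  # assumed number of output classes


def linear(n_in: int, n_out: int) -> int:
    """Parameter count of a dense layer with bias."""
    return n_in * n_out + n_out


def count_parameters(cfg: Config) -> dict:
    d = cfg.d_model

    # Frozen: self-attention (fused QKV projection + output projection) per block.
    attention = linear(d, 3 * d) + linear(d, d)
    # Frozen: feedforward block with a 4x hidden expansion.
    feedforward = linear(d, 4 * d) + linear(4 * d, d)
    frozen = cfg.n_layers * (attention + feedforward)

    # Trainable: two layer norms per block plus a final one (scale and shift each).
    layer_norms = (2 * cfg.n_layers + 1) * 2 * d
    # Trainable: learned positional embeddings.
    positional = cfg.n_positions * d
    # Trainable: new task-specific input and output layers.
    input_layer = linear(cfg.d_in, d)
    output_layer = linear(d, cfg.n_classes)
    trainable = layer_norms + positional + input_layer + output_layer

    return {
        "frozen": frozen,
        "trainable": trainable,
        "trainable_fraction": trainable / (frozen + trainable),
    }


if __name__ == "__main__":
    counts = count_parameters(Config())
    print(f"frozen:    {counts['frozen']:,}")
    print(f"trainable: {counts['trainable']:,}")
    print(f"fraction:  {counts['trainable_fraction']:.2%}")
```

Under these assumed sizes, the trainable part comes out to roughly one percent of the total, most of it positional embeddings. The exact figure depends on the task's input dimension, sequence length, and output size, so treat it as an order-of-magnitude illustration rather than a number reported in the paper.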

Related

Language Models are Few-Shot Learners
Attention Is All You Need
AI systems engineering
Full Stack Artificial Intelligence
Full-Stack Artificial Intelligence
The API