Whisper: Web-Scale Supervised Pretraining for Speech Recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Latent Diffusion (Stable Diffusion)
High-Resolution Image Synthesis with Latent Diffusion Models
InstructGPT
Training language models to follow instructions with human feedback
CLIP: Contrastive Language-Image Pre-training
Learning Transferable Visual Models From Natural Language Supervision
DDIM
Denoising Diffusion Implicit Models
GPT-3
Language Models are Few-Shot Learners
DDPM
Denoising Diffusion Probabilistic Models
T5: Text-to-Text Transfer Transformer
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
GPT-2
Language Models are Unsupervised Multitask Learners
BERT: Bidirectional Encoder Representations from Transformers
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding