• Customizing LLMs:
  - Supervised fine-tuning on your tasks
  - Self-supervised learning (SSL) on your text
  - RL w/ your reward model (RM)
  - Filter high-temp outputs w/ RM
  - Conditional SSL on RM-scored text
  - Prompt w/ context
  - Give it access to your tools
  - Train (soft) parts of prompts
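
As one illustration, the "filter high-temp outputs w/ RM" item can be sketched as rejection sampling: draw several completions at a high temperature, score each with a reward model, and keep only those above a threshold. Everything named below is an assumption made for the sketch: `CANDIDATES`, `LOGITS`, `sample_with_temperature`, `toy_reward`, and `filter_with_rm` are hypothetical stand-ins, not a real LLM or reward model API.

```python
import math
import random
from typing import List, Tuple

# Hypothetical toy "LLM": fixed logits over a few canned completions.
CANDIDATES = [
    "I don't know.",
    "Paris.",
    "The capital of France is Paris.",
    "France is in Europe.",
]
LOGITS = [2.0, 1.0, 0.5, 0.0]


def sample_with_temperature(rng: random.Random, temperature: float) -> str:
    """Sample one completion from softmax(logits / temperature)."""
    scaled = [logit / temperature for logit in LOGITS]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    return rng.choices(CANDIDATES, weights=weights, k=1)[0]


def toy_reward(text: str) -> float:
    """Hypothetical stand-in for a reward model (RM)."""
    score = 0.0
    if "Paris" in text:
        score += 1.0  # contains the answer we want
    if len(text.split()) >= 4:
        score += 0.5  # reads like a full sentence
    return score


def filter_with_rm(
    n: int, temperature: float, threshold: float, seed: int = 0
) -> List[Tuple[float, str]]:
    """Draw n samples, keep the distinct ones with reward >= threshold."""
    rng = random.Random(seed)
    samples = [sample_with_temperature(rng, temperature) for _ in range(n)]
    scored = {(toy_reward(s), s) for s in samples}  # set removes duplicates
    kept = [(r, s) for r, s in scored if r >= threshold]
    return sorted(kept, reverse=True)  # highest reward first


if __name__ == "__main__":
    for reward, text in filter_with_rm(n=50, temperature=2.0, threshold=1.0):
        print(f"{reward:.1f}  {text}")
```

A higher temperature flattens the sampling distribution, so lower-probability completions show up more often and the RM has more variety to choose from. In a real setup the two toy functions would be replaced by calls to an actual model and a trained reward model.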