Pedro Silvestre
Pedro Silvestre
Home
Blog
Publications
Contact
Light
Dark
Automatic
"Large Language Models"
Systems Opportunities for LLM Fine-Tuning using Reinforcement Learning
Reinforcement learning-based fine-tuning (RLFT) has emerged as a crucial workload for enhancing large language models (LLMs). RLFT workflows are challenging, involving nested loops, multiple models, dynamically shaped tensors and interleaving …
Cite
×