"Reinforcement Learning"

Systems Opportunities for LLM Fine-Tuning using Reinforcement Learning

Reinforcement learning-based fine-tuning (RLFT) has emerged as a crucial workload for enhancing large language models (LLMs). RLFT workflows are challenging, involving nested loops, multiple models, dynamically shaped tensors and interleaving …

Tempo: Compiled Dynamic Deep Learning with Symbolic Dependence Graphs

Deep learning (DL) algorithms are often defined in terms of temporal relationships: a tensor at one timestep may depend on tensors from earlier or later timesteps. Such dynamic dependencies (and corresponding dynamic tensor shapes) are difficult to …