2025/08/21 20:00～2025/08/21 20:50

School of IT seminar slot (compulsory if scheduled)

Title: Meta-Learning the Intrinsic Reward Weighting in Curiosity-Driven RL

Abstract: Reinforcement learning agents must find a balance between exploitation and exploration to maximise the cumulative sum of extrinsic rewards. However, it is particularly challenging for agents to explore effectively in sparse reward environments where feedback is rare. Curiosity-driven algorithms can be used to encourage effective exploration. These algorithms generate an additional reward called the intrinsic reward that encourages agents to seek novel situations. The intrinsic rewards are combined with extrinsic rewards using a weighted sum, where λ is the weighting for the intrinsic reward. However, λ is often fine-tuned for each new environment, even when environments are similar. We propose a meta-learning approach that replaces the fixed λ parameter with a neural network that outputs λ values at each time step. This network is trained using evolutionary strategies and can generalise across similar environments without retraining. Our approach highlights the potential for reducing the need for fine-tuning λ across similar tasks.

Biography: Batsi is a Master's student at the University of Cape Town with interests in curiosity-driven reinforcement learning and meta-reinforcement learning. His research focuses on improving the sample efficiency of reinforcement learning algorithms.

📍 CS2A

🔖 CSHonoursTimetable
