
Переходьте в офлайн за допомогою програми Player FM !
Stefano Albrecht on Multi-Agent RL @ RLDM 2025
Manage episode 495946308 series 2536330
Stefano V. Albrecht was previously Associate Professor at the University of Edinburgh, and is currently serving as Director of AI at startup Deepflow. He is a Program Chair of RLDM 2025 and is co-author of the MIT Press textbook "Multi-Agent Reinforcement Learning: Foundations and Modern Approaches".
Featured References
Multi-Agent Reinforcement Learning: Foundations and Modern Approaches
Stefano V. Albrecht, Filippos Christianos, Lukas Schäfer
MIT Press, 2024
RLDM 2025: Reinforcement Learning and Decision Making Conference
Dublin, Ireland
EPyMARL: Extended Python MARL framework
https://github.com/uoe-agents/epymarl
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis and Filippos Christianos and Lukas Schäfer and Stefano V. Albrecht
73 епізодів
Manage episode 495946308 series 2536330
Stefano V. Albrecht was previously Associate Professor at the University of Edinburgh, and is currently serving as Director of AI at startup Deepflow. He is a Program Chair of RLDM 2025 and is co-author of the MIT Press textbook "Multi-Agent Reinforcement Learning: Foundations and Modern Approaches".
Featured References
Multi-Agent Reinforcement Learning: Foundations and Modern Approaches
Stefano V. Albrecht, Filippos Christianos, Lukas Schäfer
MIT Press, 2024
RLDM 2025: Reinforcement Learning and Decision Making Conference
Dublin, Ireland
EPyMARL: Extended Python MARL framework
https://github.com/uoe-agents/epymarl
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis and Filippos Christianos and Lukas Schäfer and Stefano V. Albrecht
73 епізодів
כל הפרקים
×Ласкаво просимо до Player FM!
Player FM сканує Інтернет для отримання високоякісних подкастів, щоб ви могли насолоджуватися ними зараз. Це найкращий додаток для подкастів, який працює на Android, iPhone і веб-сторінці. Реєстрація для синхронізації підписок між пристроями.