卡耐基梅隆大学(CMU)元学习和元强化学习课程 | Elements of Meta-Learning

时间:2020-05-15 09:17:45   收藏:0   阅读:79

技术图片
Goals for the lecture:

Introduction & overview of the key methods and developments.
[Good starting point for you to start reading and understanding papers!]

原文链接:


技术图片
@

Probabilistic Graphical Models | Elements of Meta-Learning

01 Intro to Meta-Learning

技术图片

Motivation and some examples

When is standard machine learning not enough?
Standard ML finally works for well-defined, stationary tasks.
技术图片
But how about the complex dynamic world, heterogeneous data from people and the interactive robotic systems?
技术图片

General formulation and probabilistic view

What is meta-learning?
Standard learning: Given a distribution over examples (single task), learn a function that minimizes the loss:
技术图片
Learning-to-learn: Given a distribution over tasks, output an adaptation rule that can be used at test time to generalize from a task description
技术图片

A Toy Example: Few-shot Image Classification
技术图片
技术图片

Other (practical) Examples of Few-shot Learning
技术图片
技术图片
技术图片
技术图片

Gradient-based and other types of meta-learning

Model-agnostic Meta-learning (MAML) 与模型无关的元学习

Does MAML Work?
技术图片

MAML from a Probabilistic Standpoint
Training points: 技术图片
testing points:技术图片
MAML with log-likelihood loss对数似然损失:
技术图片
技术图片

One More Example: One-shot Imitation Learning 模仿学习
技术图片

Prototype-based Meta-learning
技术图片
Prototypes:
技术图片
Predictive distribution:
技术图片
Does Prototype-based Meta-learning Work?
技术图片

Rapid Learning or Feature Reuse 特征重用
技术图片
技术图片
技术图片
技术图片

Neural processes and relation of meta-learning to GPs

Drawing parallels between meta-learning and GPs
In few-shot learning:

Recall Gaussian Processes (GPs): 高斯过程

技术图片

Conditional Neural Processes 条件神经过程
技术图片
技术图片
技术图片
技术图片

On software packages for meta-learning
A lot of research code releases (code is fragile and sometimes broken)
A few notable libraries that implement a few specific methods:

技术图片
Takeaways

02 Elements of Meta-RL

What is meta-RL and why does it make sense?

Recall the definition of learning-to-learn
Standard learning: Given a distribution over examples (single task), learn a function that minimizes the loss:
技术图片
Learning-to-learn: Given a distribution over tasks, output an adaptation rule that can be used at test time to generalize from a task description
技术图片
Meta reinforcement learning (RL): Given a distribution over environments, train a policy update rule that can solve new environments given only limited or no initial experience.
技术图片

Meta-learning for RL
技术图片

On-policy and off-policy meta-RL

On-policy RL: Quick Recap 符合策略的RL:快速回顾
技术图片
REINFORCE algorithm:
技术图片

On-policy Meta-RL: MAML (again!)

Key points:

Adaptation in nonstationary environments 不稳定环境
Classical few-shot learning setup:

Continuous adaptation setup:

Continuous adaptation

Treat policy parameters, tasks, and all trajectories as random variables
技术图片

RoboSumo: a multiagent competitive env
an agent competes vs. an opponent, the opponent’s behavior changes over time
技术图片

Takeaways

评论(0
© 2014 mamicode.com 版权所有 京ICP备13008772号-2  联系我们:gaon5@hotmail.com
迷上了代码!