Week 6. Feb. 14: Reinforcement Learning - Possibilities

Pose a question about one of the following articles:

“[Human-level control through deep reinforcement learning](https://www.nature.com/articles/nature14236)” 2015. V. Mnih...D. Hassabis.

“Direct Preference Optimization: Your Language Model is Secretly a Reward Model” (2023).

“[Learning ‘What-if’ Explanations for Sequential Decision-Making](https://arxiv.org/abs/2007.13531)” (2021).

“[Improved protein structure prediction using potentials from deep learning](https://www.nature.com/articles/s41586-019-1923-7)” (2020).

“[Machine Theory of Mind](https://arxiv.org/abs/1802.07740)” (2018)

“[Explainability in deep reinforcement learning](https://www.webofscience.com/wos/woscc/full-record/WOS:000618603300002)” (2021)

Week 6. Feb. 14: Reinforcement Learning - Possibilities #15

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Week 6. Feb. 14: Reinforcement Learning - Possibilities #15

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions