Transformer Models Timeline Details

Viktor Garske @vemgar, Last update: Tue Dec 26 15:23:35 2023

← Back to the full graph

Helpful and Harmless

This graph is clickable!

Type

Dataset

Paper name

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Paper authors

Bai et al.

Paper link

https://arxiv.org/abs/2204.05862

Publish date

2022-04-12

Affiliation

Anthropic