AgentStack
Back to directory

RLHF Reward Modeling

Free
1.5k GitHub stars
Learning ResourceAgnosticFile System

Overview

This repository provides recipes for training reward models specifically designed for Reinforcement Learning from Human Feedback (RLHF). It is ideal for researchers and practitioners looking to implement or enhance reward modeling techniques in their AI projects.

Visit resource