The New England RLHF Hackers (NERH) group, primarily composed of EleutherAI collaborators, held their third hackathon focused on Reinforcement Learning from Human Feedback (RLHF). Projects explored training models with Inverse Learning from Q-learning, aligning LLMs with idealized reward models instead of raw human preferences, and visualizing reward model behavior with techniques like QDAIF (Quality-Diversity through AI Feedback). Another project used Sparse Autoencoders to identify features within reward models that influence their scoring, surfacing potential biases against certain topics such as politics or pregnancy. The group also discussed methods for evaluating reward models directly, independent of the full RLHF training process.
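The last point, evaluating reward models without running the full RLHF loop, can be illustrated with a minimal sketch: score labeled preference pairs with the reward model and measure how often the preferred completion outscores the rejected one. The function and toy scores below are hypothetical illustrations, not taken from any hackathon project.

```python
# Hypothetical sketch: direct reward model evaluation on preference pairs.
# The scores stand in for a real reward model's scalar outputs.

def preference_accuracy(pairs):
    """Fraction of pairs where the chosen response outscores the rejected one.

    pairs: iterable of (reward_chosen, reward_rejected) floats.
    """
    pairs = list(pairs)
    if not pairs:
        raise ValueError("no preference pairs given")
    correct = sum(1 for chosen, rejected in pairs if chosen > rejected)
    return correct / len(pairs)

# Toy scores a reward model might assign to (chosen, rejected) completions.
scores = [(1.3, -0.2), (0.8, 1.1), (2.0, 0.5), (-0.1, -0.9)]
print(preference_accuracy(scores))  # 3 of 4 pairs ranked correctly -> 0.75
```

An accuracy near 0.5 on held-out pairs would suggest the reward model is no better than chance at reproducing the human preference labels, which is one reason such direct checks are useful before spending compute on RLHF training.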
Summary written from 3 sources.