The open-source repository for PAL: Sample-Efficient Personalized Reward Modeling for Pluralistic Alignment.
-
Updated
Aug 28, 2025 - Python
The open-source repository for PAL: Sample-Efficient Personalized Reward Modeling for Pluralistic Alignment.
A curated collection of papers, benchmarks, datasets, and tools on human values in LLMs and pluralistic alignment.
Code for the MTPA framework [EMNLP Findings 2025]
Treating the AI-labeling "ground truth" as a distribution, not a fact — an argument, runnable diagnostics + soft-label/governance export, and four foundational results: social choice · spin glass · topology · information geometry.
Add a description, image, and links to the pluralistic-alignment topic page so that developers can more easily learn about it.
To associate your repository with the pluralistic-alignment topic, visit your repo's landing page and select "manage topics."