Resource

Collective Constitutional AI: Aligning a Language Model with Public Input

This resource describes Anthropic’s partnership with the Collective Intelligence Project, in which approximately 1,000 Americans collectively drafted principles to guide AI behavior through online deliberation on the Polis platform. Participants submitted 1,127 statements and cast 38,252 votes, and the team converted the public statements into Constitutional AI principles. Compared with a model trained on Anthropic’s own constitution, the model trained on the public principles performed equivalently on knowledge tasks but showed notably reduced bias across nine social dimensions, particularly regarding disability representation.
