Resource

Collective Constitutional AI: Aligning a Language Model with Public Input

Anthropic’s partnership with the Collective Intelligence Project where approximately 1,000 Americans collectively drafted principles to guide AI behavior through online deliberation using the Polis platform. Participants submitted 1,127 statements and cast 38,252 votes, with the team converting public statements into Constitutional AI principles. The public-aligned model performed equivalently on knowledge tasks but showed notably reduced bias across nine social dimensions, particularly regarding disability representation.

Experimental Practice

Link https://www.anthropic.com/research/collective-constitutional-ai-aligning-a-language-model-with-public-input

Creators Anthropic

Year 2023

Related Capabilities

Routing and synthesizing

Informedness

Ability to route and synthesize data, revealing critical information, e.g. identifying common ground, high-potential ideas, thoughtful perspectives, insightful experiences, cruxes, forecasts, while helping to minimize the time required to do tasks.

Collective Constitutional AI: Aligning a Language Model with Public Input

Related Capabilities

Routing and synthesizing

Related Research Questions