Skip to content

Commit

Permalink
update readme
Browse files Browse the repository at this point in the history
  • Loading branch information
rdnfn committed Jan 21, 2025
1 parent bfce0e6 commit fdaacf1
Showing 1 changed file with 7 additions and 2 deletions.
9 changes: 7 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,9 +1,14 @@

<p align="center">
<img src="./docs/img/00_logo_v0.png" width="110px" align="center">
</p>

# Inverse Constitutional AI

This repository contains the official implementation of the *Inverse Constitutional AI* (ICAI) algorithm [[paper]](https://arxiv.org/abs/2406.06560). ICAI compresses pairwise preference datasets into a readable list of principles (constitution) that the annotations appear to follow (e.g. "select the friendlier response"). ICAI principles provide an interpretable overview of a feedback dataset, enabling users to discover *problematic annotation biases* or *better understand differences between datasets, user groups or models*.

<p align="center">
<img src="./docs/img/01_basic_overview_v2.png" width="1000px" align="center">
<img src="./docs/img/01_basic_overview_v2.png" width="750px" align="center">
</p>


Expand Down Expand Up @@ -130,6 +135,6 @@ We introduce a first *Inverse Constitutional AI* (ICAI) algorithm that generates
</p>


### License
# License

All code in this repo is licensed under [Apache-2.0](./LICENSE).

0 comments on commit fdaacf1

Please sign in to comment.