Welcome to the AI Alliance Open Trusted Data Initiative (OTDI).
A high quality, trusted, open catalog / distributed repository of datasets for AI LLM pre-training and domain-specific fine-tuning that is amenable to a wide variety of use cases in enterprises, governments, regulated industries, and wherever high trust in the data foundations of AI is essential.
Note
We follow the AI Alliance CONTRIBUTING guidelines (also here). This includes policies for licensing repo content. You will also need to agree with and follow the AI Alliance Code of Conduct (also here).
The documentation for this repo is published using GitHub Pages. See GITHUB_PAGES for details. The docs
folder here contains the website code, but the Makefile
in the root directory has targets for running the site locally, etc. See GITHUB_PAGES.md
for details.
This repo will also be used for implementations, until such time as it makes sense to split work into separate repos. Miscellaneous other documentation, not in the website, is also captured here:
tools-notes
- Notes on potential tool choices.- TBD - code.