add intro from readme

CentreForDigitalHumanities · JeltevanBoheemen · Mar 13, 2024 · Feb 29, 2024 · Feb 29, 2024 · Feb 29, 2024
commit cf496704670678d54161760fe6f5ed3412bfc9f7
diff --git a/docs/index.md b/docs/index.md
@@ -1,3 +1,9 @@
 # I-analyzer Readers documentation
 
-Welcome! This documentation is a work in progress.
+**This documentation is a work in progress.**
+
+`ianalyzer-readers` is a python module to extract data from XML, HTML, CSV or XLSX files.
+
+This module was originally created for [I-analyzer](https://github.com/UUDigitalHumanitieslab/I-analyzer), a web application that extracts data from a variety of datasets, indexes them and presents a search interface. To do this, we wanted a way to extract data from source files without having to write a new script "from scratch" for each dataset, and an API that would work the same regardless of the source file type.
+
+The basic usage is that you will use the utilities in this package to create a `Reader` class tailored to a dataset. You specify what your data looks like, and then call the `documents()` method of the reader to get an iterator of documents - where each document is a flat dictionary of key/value pairs.