From 41ec55207838d4a9b97fe40ed05f36d40a7f7790 Mon Sep 17 00:00:00 2001 From: Mike Ranzinger Date: Wed, 4 Sep 2024 07:23:06 -0600 Subject: [PATCH] Update README.md with Adaptor docs --- README.md | 9 ++++++++- 1 file changed, 8 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 5102080..99f7084 100644 --- a/README.md +++ b/README.md @@ -305,7 +305,7 @@ RADIO allows non-square inputs. In fact, both RADIOv1 and RADIOv2 achieve higher ### Adaptors _(Currently only supported with TorchHub)_ -You may additionally specify model adaptors to achieve extra behaviors. Currently, 'clip' is the only supported adaptor. In this mode, radio will return a dict of tuples: +You may additionally specify model adaptors to achieve extra behaviors. In this mode, radio will return a dict of tuples: ```Python model = torch.hub.load(..., adaptor_names='clip', ...) @@ -318,6 +318,13 @@ clip_summary, clip_features = output['clip'] Refer to `examples/zero_shot_imagenet.py` for example usage. +#### Supported Adaptors: + +- RADIOv2.5: `clip`, `siglip`, `dino_v2`, `sam` +- RADIOv2\[.1\]: `clip`, `dino_v2`, `sam` + +The `clip` and `siglip` adaptors have the additional functionality of supporting tokenization and language encoding. Refer to `examples/zero_shot_imagenet.py` for this use, as well as the [API](https://github.com/NVlabs/RADIO/blob/main/radio/open_clip_adaptor.py#L33-L36). + ### Preprocessing By default, RADIO expects the input images to have normalized values in the `[0, 1]` range. If you already have an existing data pipeline, and you'd like conditioning to occur there instead of within the RADIO model, you can call this function: