
Run a first technical test #102

Open
anneschuth opened this issue Jul 17, 2024 · 5 comments

Comments

@anneschuth
Member

anneschuth commented Jul 17, 2024

For now we assume that a model and data are present, that everything runs locally, and that the model is linked from the system card.

  • The system card points to the model in the model card, which has a URI to a model.
  • The URI is (for now) a file path (file://) to the guusje model.
  • We add a simple example model as a fixture, checked into our code base (use the model from https://github.com/MinBZK/example-data-science-project).
  • The path to the model is hardcoded (for now).
  • The path to the data is hardcoded (for now).
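A minimal sketch of resolving such a model URI to a local path (the fixture path shown is illustrative, not the actual location in the repo):

```python
from pathlib import Path
from urllib.parse import urlparse

def resolve_model_path(uri: str) -> Path:
    """Resolve a file:// URI from the model card to a local file path."""
    parsed = urlparse(uri)
    if parsed.scheme != "file":
        # Only local models are supported for now, per the assumptions above.
        raise ValueError(f"Only file:// URIs are supported for now, got {parsed.scheme!r}")
    return Path(parsed.path)

# Illustrative hardcoded path to the fixture model:
model_path = resolve_model_path("file:///fixtures/guusje/model.pkl")
```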

We add a SHAP technical test to the Instrument Registry. It needs:

  • A URN
  • A description
  • A task of type "technical test"
  • The task should require a model by pointing to the model field in the system card.
  • The task has a URN (used below)
  • A definition of where the results should be written in the system card (in measures)
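A sketch of what such a registry entry could look like. All field names and the URN itself are illustrative assumptions, not a final schema:

```python
# Hypothetical instrument definition for the SHAP technical test.
# The URN, field names, and pointer syntax are illustrative only.
shap_instrument = {
    "urn": "urn:instrument:shap:1.0",  # hypothetical URN
    "description": "Computes SHAP values for the model referenced in the system card.",
    "tasks": [
        {
            "type": "technical test",
            "urn": "urn:instrument:shap:1.0:task",  # hypothetical task URN, used by the CLI
            "requires": "system_card.models[].uri",   # points to the model field
            "writes_to": "system_card.models[].measures",  # where results are written
        }
    ],
}
```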

Implement the technical test that runs the SHAP implementation of AI Verify, using the model pointed to in the system card. We implement this as a CLI (for now; later this becomes a Celery worker), based on the minimal AI Verify repo (https://github.com/MinBZK/aiverify/tree/using_aiverify_as_api).

The implementation:

  • knows which instrument task it implements (it knows the URN of the SHAP technical test task, see above)
  • has access to the instrument definition (by retrieving it using the URN)
  • has access to a system card (receives its path as a CLI argument)
  • calls the AI Verify API
  • writes the result of the SHAP test into the model card at the path defined by the instrument
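The steps above could be sketched as a CLI like this. The helpers `fetch_instrument` and `run_shap` are stubs standing in for the registry lookup and the AI Verify API call, and the card is shown as JSON here only to keep the sketch dependency-free; all field names are assumptions:

```python
import argparse
import json
from pathlib import Path
from urllib.parse import urlparse

# Hypothetical URN of the SHAP technical test task this CLI implements.
TASK_URN = "urn:instrument:shap:1.0:task"

def fetch_instrument(urn: str) -> dict:
    """Stub: retrieve the instrument definition by URN from the registry."""
    return {"urn": urn, "writes_to": "measures"}

def run_shap(model_path: Path) -> dict:
    """Stub: call the AI Verify SHAP implementation on the model."""
    return {"shap_values": []}

def run(system_card_path: str) -> dict:
    """Run the technical test and return the updated system card."""
    system_card = json.loads(Path(system_card_path).read_text())
    instrument = fetch_instrument(TASK_URN)
    model_uri = system_card["models"][0]["uri"]      # field name is illustrative
    model_path = Path(urlparse(model_uri).path)      # file:// URI -> local path
    results = run_shap(model_path)
    # Write the results into the model card at the path the instrument defines.
    system_card["models"][0][instrument["writes_to"]] = [results]
    return system_card

def main() -> None:
    parser = argparse.ArgumentParser(description="Run the SHAP technical test.")
    parser.add_argument("system_card", help="Path to the system card file")
    args = parser.parse_args()
    print(json.dumps(run(args.system_card), indent=2))

if __name__ == "__main__":
    main()
```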

Acceptance Criteria

  • All SHAP values are stored in the system card -> model card -> model index -> results -> measures -> ...
  • Define how to store SHAP values in the model card without loss of data
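One way to satisfy the lossless-storage criterion (a sketch; the serialization choice and field names are assumptions) is to store the full per-sample, per-feature SHAP value matrix as nested lists of floats, rather than an aggregated summary:

```python
# Sketch: serialize a SHAP value matrix (n_samples x n_features) losslessly,
# together with the feature names. Field names are illustrative.
def shap_to_measure(feature_names, shap_values):
    """Convert raw SHAP values into a measure entry for the model card."""
    return {
        "name": "shap",
        "value": {
            "feature_names": list(feature_names),
            # Store the full matrix, not a mean-|SHAP| summary, so no data is lost.
            "shap_values": [[float(v) for v in row] for row in shap_values],
        },
    }

measure = shap_to_measure(["age", "income"], [[0.1, -0.2], [0.3, 0.05]])
```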
@anneschuth anneschuth converted this from a draft issue Jul 17, 2024
@anneschuth anneschuth changed the title Run technical test Run a first technical test Jul 17, 2024

This issue did not have any activity in the last 90 days and will be removed after 30 days

@github-actions github-actions bot added Stale and removed Stale labels Oct 24, 2024

This issue did not have any activity in the last 90 days and will be removed after 30 days

@github-actions github-actions bot added the Stale label Jan 23, 2025

This issue is closed due to inactivity

@github-actions github-actions bot closed this as not planned Feb 23, 2025
@github-project-automation github-project-automation bot moved this from ♻ To Do to ✅ Done in 👾 AI Validation Team Planning Feb 23, 2025
@anneschuth
Member Author

:(

@robbertbos robbertbos removed the Stale label Feb 24, 2025
@robbertbos robbertbos moved this from ✅ Done to ♻ To Do in 👾 AI Validation Team Planning Feb 24, 2025
@robbertbos robbertbos reopened this Feb 24, 2025
@anneschuth
Member Author

❤️
