Analysis/Session Provenance #18

cjsifuen · 2022-12-02T17:01:43Z

Enable users to capture with ease, fidelity, and accuracy the actions/analysis performed on a dataset or sets of datasets.

Information to capture

Files uploaded, data filtering steps and parameters used
Dataset/visualization subsetting and parameters
Selection of visualization types, positions, sizes, etc.

Potential implementations

Capture information in a file that can be used to "rerun" what was done
Capture and save as an "instance", in a more perpetual nature

Considerations

This might look different for datasets of different sizes
This might look different for a hosted vs local version

cjsifuen · 2022-12-12T15:09:33Z

I spoke with the imaging group about learning from napari. Their strategy is different in that they support a local instance only. They capture the commands run, but I'm not sure it's actually they type of provenance we're talking about.

ergonyc · 2023-02-02T18:16:54Z

From what I understand about SODAs current workflow this should be pretty straightforward. The "output" of SODA is either a downloaded dataset or a visualization. So I think there are just 3 states to log or capture:

data origin (file name / path / source?) + metadata
sample filter
feature filter
visualization
- vis type + parameters
- subset selection

I think a generic R logging module can do capture, so that log as metadata just needs to be added to metadata and saved along with the visualization / data.

cjsifuen · 2023-02-03T16:31:45Z

From what I understand about SODAs current workflow this should be pretty straightforward. The "output" of SODA is either a downloaded dataset or a visualization. So I think there are just 3 states to log or capture:

data origin (file name / path / source?) + metadata

sample filter

feature filter

visualization

vis type + parameters

subset selection

I think a generic R logging module can do capture, so that log as metadata just needs to be added to metadata and saved along with the visualization / data.

This would be a light way to implement the first option.

A few more things to flag if this approach was taken:

Could add in a "save" or. "log" button to actively log metadata, but would also want to log changes automatically.
Might want to ensure no additional filtering takes place in the UI
Should check the R logging captures interactive visualizations

A possible way to do more complex logging/debugging could be to use a shiny logger to capture events and interactions -- though perhaps this is unnecessary. Just wanted to add some options that I found here.

cjsifuen added the enhancement New feature or request label Dec 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Analysis/Session Provenance #18

Analysis/Session Provenance #18

cjsifuen commented Dec 2, 2022 •

edited

Loading

cjsifuen commented Dec 12, 2022

ergonyc commented Feb 2, 2023

cjsifuen commented Feb 3, 2023

Analysis/Session Provenance #18

Analysis/Session Provenance #18

Comments

cjsifuen commented Dec 2, 2022 • edited Loading

cjsifuen commented Dec 12, 2022

ergonyc commented Feb 2, 2023

cjsifuen commented Feb 3, 2023

cjsifuen commented Dec 2, 2022 •

edited

Loading