Add tool for analysing images using Bioimage AI models #1391

anuprulez · 2024-03-01T15:57:40Z

Test files:
https://bioimage.io/#/?id=10.5281%2Fzenodo.5764892
https://zenodo.org/api/records/6647674

Remaining tasks:

Support for creating the original predicted image matrix
Add tool tests (imaging models are large (> 50 MB and sometimes as large as 500 MB) and it is unclear how to add sample data for tests)
Take input asTiff files if possible
Test with models of different input/output dimensions (3,4,5,...)
Add support for TensorFlow

bgruening · 2024-03-15T13:25:32Z

tools/bioimaging/bioimage_inference.xml

+    </command>
+    <inputs>
+        <param name="input_imaging_model" type="data" format="zip" label="BioImage model" help="Please load a BioImage model from file uploader"/>
+        <param name="input_image_file" type="data" format="npz" label="Image to be analysed" help="Please provide an image"/>


NPZ is numpy isn't it? How do people create such image? Should we create an NPZ file from a tiff, png as part of this tool?

Yes, NPZ is a numpy zipped file, I think. To show images as PNG or Tiff, we have to reduce the dimensionality of the original multi-dimensional image. Sometimes, these are 3 or 4 dimensional, containing several channels. To display as PNG, I am taking only the first channels where we lose some information from images. But my idea is to save these originally predicted matrices with all dimensions. I also need to think about taking PNG and Tiff as inputs and producing the original matrix along side displayable formats such as PNG, Tiff.

@beatrizserrano do you know how these NPY input/output files are generated to be used with bioimage models (e.g. https://bioimage.io/#/?id=10.5281%2Fzenodo.5764892)? Do we use some Galaxy tool to generate it? I tried to use TIFF files as input to a few models, but they do not contain a pixel matrix of images in the correct dimensions for the model to take as input. However, NPY files work fine with the models but could be tricky for the end users of this tool to generate such test/input NPY files. Or do you know someone who worked on such models?

Thank you!

chiming in on this on @beatrizserrano 's request:
I'm unaware of the context of this PR, but can report from the bioimage.io side: We demand .npy test files to be provided alongside every model contributed to bioimage.io. Among the utility functions we provide in bioimageio.core there is
https://github.com/bioimage-io/core-bioimage-io-python/blob/f798344213c179a6c836938aff9e4f6c46f8d23c/bioimageio/core/utils/image_helper.py#L136
-- a util function using imageio to attempt to load any given image and then attempting to guess it's axes, see
https://github.com/bioimage-io/core-bioimage-io-python/blob/f798344213c179a6c836938aff9e4f6c46f8d23c/bioimageio/core/utils/image_helper.py#L42

Software using the model zoo (or even specifically bioimageio.core) should try to delegate this guesswork to the end user instead.

note: the links above are from a development branch. These updated functions will be available in the coming bioimageio.core release, currently this functionality is implemented here

tools/bioimaging/bioimage_inference.xml

Fix suggestion in name Co-authored-by: Leonid Kostrykin <[email protected]>

Fix suggestion to include edam annotation Co-authored-by: Leonid Kostrykin <[email protected]>

bgruening · 2024-06-19T07:31:04Z

@anuprulez is this still WIP or can we merge it. @beatrizserrano needs it :)

kostrykin · 2024-06-30T11:51:11Z

Guys, my impression of the comments above is that there has been some confusion. So in order to clear things up, I want to try to summarize the main concerns:

Concern 1: Support of TIFF/PNG

@bgruening was addressing the input file format, which currently is only NPZ. For those people who are the target audience of this tool, this isn't a well-established standard file format such as TIFF or PNG. However, @anuprulez explained the conversion to TIFF or PNG of the output files, pointing out that it is not straightforward.

Lets consider the questions of the input file formats and the output file formats separately.

1.1) Inputs: The NPY/NPZ formats are more general than TIFF and PNG, since they can store arbitrary numpy arrays, with arbitrary data types (even mixed) and number of dimensions. Thus, conversion from TIFF/PNG to NPY/NPZ should be straightforward. Given the concerns above, I think the tool wrapper really should also accept TIFF and PNG input files and do this conversion automatically. This should be as simple as something like this:

im = imread('tiff or png image file path')
np.save('input.npy', im)  # to produce an NPY
np.savez('input.npz', im)  # to produce an NPZ

1.2) Outputs: It was pointed out that the intention of outputting the originally predicted arrays as NPZ was that no information is lost. However, I'm wondering whether converting the data to TIFF is even capable of losing information? My gut feeling is that using TIFF as the output file format should be safe, at least as long as we restrict the wrapper to NPY instead of NPZ (see below).

Concern 2: NPY or NPZ?

Please keep in mind that NPY is "the standard binary file format in NumPy for persisting a single arbitrary NumPy array on disk", and NPZ is "the standard format for persisting multiple NumPy arrays on disk", which can be compressed, but not necessarily (docs). The key difference here is that NPY is for single arrays, NPZ is for multiple arrays.

At this point I'm somewhat confused. I hope that @FynnBe can maybe add something regarding the following two concerns:

2.1) What happens if the NPZ contains more than one array? Do the bioimage.io models process them independently and yield another NPZ, which contains predictions for each array in the NPZ? Or do the models bluntly fail to process an NPZ with more than one array?

2.2.) As far as I understand the comment made by @FynnBe (link), the bioimage.io models are only guaranteed to work with NPY inputs. Doesn't that mean, that the wrapper should actually take/produce NPY instead of NPZ? If this is right, then we should forget about NPZ and restrict our discussions here to PNG, TIFF, and NPY.

cc @beatrizserrano

anuprulez · 2024-07-21T15:12:28Z

I was on holidays. I will have a look at it this week. Can you let me know when you require this feature, I can prioritise it @beatrizserrano thanks!

@kostrykin thank you for the clarification.

beatrizserrano · 2024-07-21T15:36:49Z

Thank you @anuprulez! 🤗

We want to develop tutorials at the Biohackathon in November, so it would be perfect to be able to test the tool well before that. Thanks again!

anuprulez · 2024-07-22T16:06:05Z

@kostrykin thank you for the explanation of NPY and NPZ.

I would like to elaborate the issue of using TIFF or PNG files as input. When these files are read (using OpenCV), they are represented as 2 dimensional image.

Example from Neuron Segmentation in EM model

TIFF input file shape: torch.Size([360, 360])

However, the NPY file has shape torch.Size([1, 1, 32, 256, 256])

When I use the TIFF file as the test input for prediction, I receive the following error:

RuntimeError: Expected 5-dimensional input for 5-dimensional weight [64, 1, 3, 3, 3], but got 2-dimensional input of size [360, 360] instead

The model expects an input with 5 dimensions but the TIFF file is only 2 dimensional. For other models, the input shape varies - sometimes it is 3,4 or 5. When I use the NPY file as input, it works fine and the model predicts the segmentations.

As suggested by @FynnBe, I tried to use imageo and bioimageo packages and the suggested converter methods, but TIFF file dimensions don't match that of the respective model.

The issue is to find a way to transform TIFF or PNG images to be represented in the valid dimensions so that the respective AI model can consume the input files and produce desired output.

Any suggestions here would be really helpful!

Thank you!

@beatrizserrano

kostrykin · 2024-07-22T17:14:17Z

Thanks for the follow-up @anuprulez

I see the challenge that comes with conversion of PNG/TIFF to NPY. However, by accepting only NPY and dropping support for PNG/TIFF, we would simply shift that challenge to the user of the tool, which IMO we shouldn't do.

The core issue is to determine the expected number of dimensions of the input, the conversion then simply is some reshaping, right? I see a couple of options here, in descending order of preference:

Option 1. My suspicion is that the only way to get the dimensions of the TIFF right automatically is to rely on what the model declares as the required input dimension — if it does declare anything. You do model = torch.load(model_path) to load the model, is there anything like model.input_ndim defined? I'm not familiar with the PyTorch API, but unfortunately, torch.load does not specify anything useful for what it returns.

Option 2. You quoted the error "RuntimeError: Expected 5-dimensional input for 5-dimensional weight [64, 1, 3, 3, 3], but got 2-dimensional input of size [360, 360] instead", where is that raised from? By looking into the code for where this check is being performed, we might be able to reverse-engineer how the expected dimension of the models is stored.

Option 3. In the worst-case (i.e. if Option 2 leads to no solution for some reason), I'd fallback to the documentation, like: Point the user to the documentation of the models, where, hopefully, the required number of dimensions is stated. Add an input field to the tool UI, where the user has to write down the number that was looked up from the model documentation.

anuprulez · 2024-07-23T14:47:08Z

@kostrykin thank you for the interesting pointers. I will explore how to get the input shape of the models dynamically.

Fix review comments Co-authored-by: Beatriz Serrano-Solano <[email protected]> Co-authored-by: Leonid Kostrykin <[email protected]>

kostrykin

Much thanks for adding the PNG support and everything!

Just very few comments left, sorry for being a bit picky maybe.

tools/bioimaging/bioimage_inference.xml

kostrykin · 2024-08-01T16:35:53Z

tools/bioimaging/bioimage_inference.xml

+        <token name="@VERSION_SUFFIX@">0</token>
+    </macros>
+    <creator>
+	<organization name="BioImage.IO" url="https://bioimage.io/#/" email="" />


Are you affiliated with BioImage.IO? @anuprulez Otherwise I wouldn't list them as a creator of this tool wrapper.

kostrykin · 2024-08-01T16:37:53Z

tools/bioimaging/bioimage_inference.xml

+	<param name="input_image_file" type="data" format="tiff,png" label="Input image" help="Please provide an input image for the analysis."/>
+	<param name="input_image_input_size" type="text" label="Shape of the input image" help="Provide shape of input image. See chosen model's RDF file to find correct input shape. For example: for the BioImage.IO model MitochondriaEMSegmentationBoundaryModel, the input shape is 256 x 256 x 32 x 1. Enter the shape as '256,256,32,1'."/>


Fix indentation, replace x by ⨉ multiplication symbol, add some missing articles:

Suggested change

<param name="input_image_file" type="data" format="tiff,png" label="Input image" help="Please provide an input image for the analysis."/>

<param name="input_image_input_size" type="text" label="Shape of the input image" help="Provide shape of input image. See chosen model's RDF file to find correct input shape. For example: for the BioImage.IO model MitochondriaEMSegmentationBoundaryModel, the input shape is 256 x 256 x 32 x 1. Enter the shape as '256,256,32,1'."/>

<param name="input_image_file" type="data" format="tiff,png" label="Input image" help="Please provide an input image for the analysis."/>

<param name="input_image_input_size" type="text" label="Shape of the input image" help="Provide the shape of the input image. See the chosen model's RDF file to find the correct input shape. For example, for the BioImage.IO model MitochondriaEMSegmentationBoundaryModel, the input shape is 256 ⨉ 256 ⨉ 32 ⨉ 1. Enter the shape as '256,256,32,1'."/>

I also believe that avoiding the genitive s is a good practice in technical writing, since it helps keeping things clearly structured, and the apostrophe can easily be misperceived as a quotation mark, so I suggested it in my previous review.

Ok, I replaced the x symbol by *. For this sentence Enter the shape as '256,256,32,1', I have left out any quotation or symbols to avoid confusion. It should be entered just like any other text.

@anuprulez I just looked into the RDF file because I thought it would be good to use their notation and saw that they use the x notation :) much sorry! I didn't know that you sticked to the RDF notation… so actually using the x notation which you used originally, then is the best option, I think.

tools/bioimaging/bioimage_inference.xml

Fix review comments Co-authored-by: Leonid Kostrykin <[email protected]>

tools/bioimaging/bioimage_inference.xml

Fix review comments Co-authored-by: Leonid Kostrykin <[email protected]>

tools/bioimaging/bioimage_inference.xml

Add missing articles and replace shape by size Co-authored-by: Leonid Kostrykin <[email protected]>

kostrykin

Thanks @anuprulez!

bgruening · 2024-08-02T10:28:07Z

tools/bioimaging/bioimage_inference.xml

+    <creator>
+        <organization name="BioImage.IO" url="https://bioimage.io/#/" email="" />
+        <person name="Beatriz Serrano-Solano" email="" />
+        <person name="Leonid Kostrykin" email="[email protected]" />


@anuprulez you forgot yourself here.

@beatrizserrano your email?

I would probably use <person givenName="" familyName="" email="" />

tools/bioimaging/bioimage_inference.xml

bgruening · 2024-08-02T10:30:20Z

tools/bioimaging/bioimage_inference.xml

+    </xrefs>
+    <requirements>
+        <requirement type="package" version="3.9.12">python</requirement>
+        <requirement type="package" version="2.3.1">pytorch</requirement>


Suggested change

<requirement type="package" version="2.3.1">pytorch</requirement>

<requirement type="package" version="@TOOL_VERSION@">pytorch</requirement>

was that the intention behind the version number?

My bad, I actually forgot to add this in my review.

I think tool's version should not replace Pytorch's version. Something else changes in the tool (making the tool's version to change) while Pytorch remains that same - could be problematic in future. Since, this is the first tool, I would keep the tool's version as 1.0.0

This wrapper is rather thin and is mostly "just" a wrapper for torch.load and the invocation of the model (the rest is pre- and post-processing). This suggests that the wrapper should have the same version as the wrapped tool, PyTorch in this case. If the wrapper changes in the future, you would usually increment @VERSION_SUFFIX@. This is also what I had suggested in #1391 (comment) and it is more in line with IUC recommendations https://galaxy-iuc-standards.readthedocs.io/en/latest/best_practices/tool_xml.html#tool-versions

sorry, I misinterpreted the previous comment. It's restored based on your comment. Thanks!!

tools/bioimaging/.shed.yml

Fix review Co-authored-by: Björn Grüning <[email protected]>

kostrykin

Just a minor consistency issue

tools/bioimaging/bioimage_inference.xml

Co-authored-by: Leonid Kostrykin <[email protected]>

bgruening · 2024-08-02T15:27:07Z

Thanks everyone!

anuprulez · 2024-08-02T15:35:13Z

Thanks @bgruening for merging.

Thank you very much @kostrykin for spending so much time reviewing it

anuprulez and others added 6 commits March 1, 2024 16:56

add model inference tool

f83be9d

Merge branch 'bgruening:master' into add_bioimaging_inf

9b8d6e0

add tool tests and test data

227bcf9

rebase

f6c48be

Merge branch 'bgruening:master' into add_bioimaging_inf

b742c4b

rebase

47d971a

bgruening reviewed Mar 15, 2024

View reviewed changes

kostrykin reviewed Mar 15, 2024

View reviewed changes

tools/bioimaging/bioimage_inference.xml Outdated Show resolved Hide resolved

kostrykin reviewed Mar 15, 2024

View reviewed changes

tools/bioimaging/bioimage_inference.xml Outdated Show resolved Hide resolved

anuprulez and others added 3 commits March 15, 2024 15:26

fix linting issues

865d720

Update tools/bioimaging/bioimage_inference.xml

448036c

Fix suggestion in name Co-authored-by: Leonid Kostrykin <[email protected]>

Update tools/bioimaging/bioimage_inference.xml

88eaa2a

Fix suggestion to include edam annotation Co-authored-by: Leonid Kostrykin <[email protected]>

anuprulez mentioned this pull request Jun 12, 2024

Integration of open-source AI models in Galaxy platform Helmholtz-AI-Matter/HAICON24-unconference#13

Open

kostrykin mentioned this pull request Jul 18, 2024

Create workflow for AI-based image analysis beatrizserrano/galaxy-image-community#3

Open

4 tasks

update

0dfdd4d

anuprulez and others added 8 commits July 26, 2024 17:33

add model size

490d646

fix dymanic input shapes

73f6a41

Merge branch 'bgruening:master' into add_bioimaging_inf

f50f2b0

remove comments

a0c1708

upate

e609948

fix linting error

0398227

replace test files

4f465e2

update test files

f932162

anuprulez and others added 8 commits August 1, 2024 14:07

fix remove comments

ae04594

Apply suggestions from code review

91ad340

Fix review comments Co-authored-by: Beatriz Serrano-Solano <[email protected]> Co-authored-by: Leonid Kostrykin <[email protected]>

use correct model name

abc33d6

use original value of predicted matrix

8dff2b2

add support for png

bb35e00

add creator

9961ea5

update bioimage name

83c4e22

fix review comments

8655330

kostrykin reviewed Aug 1, 2024

View reviewed changes

anuprulez and others added 2 commits August 2, 2024 09:24

Apply suggestions from code review

e4795b6

Fix review comments Co-authored-by: Leonid Kostrykin <[email protected]>

fix review comments

bf18477

kostrykin reviewed Aug 2, 2024

View reviewed changes

tools/bioimaging/bioimage_inference.xml Outdated Show resolved Hide resolved

kostrykin reviewed Aug 2, 2024

View reviewed changes

tools/bioimaging/bioimage_inference.xml Outdated Show resolved Hide resolved

Apply suggestions from code review

37a02b9

Fix review comments Co-authored-by: Leonid Kostrykin <[email protected]>

kostrykin reviewed Aug 2, 2024

View reviewed changes

tools/bioimaging/bioimage_inference.xml Outdated Show resolved Hide resolved

Apply suggestions from code review

eaabcd4

Add missing articles and replace shape by size Co-authored-by: Leonid Kostrykin <[email protected]>

kostrykin approved these changes Aug 2, 2024

View reviewed changes

bgruening reviewed Aug 2, 2024

View reviewed changes

anuprulez and others added 3 commits August 2, 2024 14:17

Apply suggestions from code review

3ed3ec7

Fix review Co-authored-by: Björn Grüning <[email protected]>

fix review

fbd83a2

restore tool version prefix

0712792

kostrykin reviewed Aug 2, 2024

View reviewed changes

tools/bioimaging/bioimage_inference.xml Outdated Show resolved Hide resolved

tools/bioimaging/bioimage_inference.xml Outdated Show resolved Hide resolved

Apply suggestions from code review

0c3df10

Co-authored-by: Leonid Kostrykin <[email protected]>

bgruening merged commit 57f4673 into bgruening:master Aug 2, 2024
11 checks passed

anuprulez deleted the add_bioimaging_inf branch August 2, 2024 15:35

beatrizserrano mentioned this pull request Aug 15, 2024

Write tutorial using AI model from bioimage.io beatrizserrano/galaxy-image-community#18

Closed

4 tasks

B0r1sD mentioned this pull request Nov 7, 2024

Update documentation for the 'Process image using a BioImage.IO model' Galaxy tool beatrizserrano/galaxy-image-community#26

Open

kostrykin mentioned this pull request Nov 7, 2024

[BHEU2024] Fix bioimage_inference tool with 3D images #1544

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add tool for analysing images using Bioimage AI models #1391

Add tool for analysing images using Bioimage AI models #1391

anuprulez commented Mar 1, 2024 •

edited

Loading

bgruening Mar 15, 2024

anuprulez Mar 15, 2024 •

edited

Loading

anuprulez Mar 18, 2024 •

edited

Loading

FynnBe Mar 18, 2024 •

edited

Loading

bgruening commented Jun 19, 2024

kostrykin commented Jun 30, 2024 •

edited

Loading

anuprulez commented Jul 21, 2024

beatrizserrano commented Jul 21, 2024

anuprulez commented Jul 22, 2024

kostrykin commented Jul 22, 2024 •

edited

Loading

anuprulez commented Jul 23, 2024

kostrykin left a comment

kostrykin Aug 1, 2024

kostrykin Aug 1, 2024

anuprulez Aug 2, 2024

kostrykin Aug 2, 2024

kostrykin left a comment

bgruening Aug 2, 2024

bgruening Aug 2, 2024

kostrykin Aug 2, 2024

anuprulez Aug 2, 2024 •

edited

Loading

kostrykin Aug 2, 2024 •

edited

Loading

anuprulez Aug 2, 2024

kostrykin left a comment

bgruening commented Aug 2, 2024

anuprulez commented Aug 2, 2024

		<param name="input_image_file" type="data" format="tiff,png" label="Input image" help="Please provide an input image for the analysis."/>
		<param name="input_image_input_size" type="text" label="Shape of the input image" help="Provide shape of input image. See chosen model's RDF file to find correct input shape. For example: for the BioImage.IO model MitochondriaEMSegmentationBoundaryModel, the input shape is 256 x 256 x 32 x 1. Enter the shape as '256,256,32,1'."/>

	<requirement type="package" version="2.3.1">pytorch</requirement>
	<requirement type="package" version="@TOOL_VERSION@">pytorch</requirement>

Add tool for analysing images using Bioimage AI models #1391

Add tool for analysing images using Bioimage AI models #1391

Conversation

anuprulez commented Mar 1, 2024 • edited Loading

Choose a reason for hiding this comment

anuprulez Mar 15, 2024 • edited Loading

Choose a reason for hiding this comment

anuprulez Mar 18, 2024 • edited Loading

Choose a reason for hiding this comment

FynnBe Mar 18, 2024 • edited Loading

Choose a reason for hiding this comment

bgruening commented Jun 19, 2024

kostrykin commented Jun 30, 2024 • edited Loading

Concern 1: Support of TIFF/PNG

Concern 2: NPY or NPZ?

anuprulez commented Jul 21, 2024

beatrizserrano commented Jul 21, 2024

anuprulez commented Jul 22, 2024

kostrykin commented Jul 22, 2024 • edited Loading

anuprulez commented Jul 23, 2024

kostrykin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kostrykin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anuprulez Aug 2, 2024 • edited Loading

Choose a reason for hiding this comment

kostrykin Aug 2, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kostrykin left a comment

Choose a reason for hiding this comment

bgruening commented Aug 2, 2024

anuprulez commented Aug 2, 2024

anuprulez commented Mar 1, 2024 •

edited

Loading

anuprulez Mar 15, 2024 •

edited

Loading

anuprulez Mar 18, 2024 •

edited

Loading

FynnBe Mar 18, 2024 •

edited

Loading

kostrykin commented Jun 30, 2024 •

edited

Loading

kostrykin commented Jul 22, 2024 •

edited

Loading

anuprulez Aug 2, 2024 •

edited

Loading

kostrykin Aug 2, 2024 •

edited

Loading