Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature added: Get WSI at mpp #7574

Open
wants to merge 34 commits into
base: dev
Choose a base branch
from

Conversation

NikolasSchmitz
Copy link

@NikolasSchmitz NikolasSchmitz commented Mar 25, 2024

Fixes: #4980

Description

In this pull request, the feature to retrieve a whole slide image at a given mpp (microns per pixel) resolution was implemented for every WSIReader class in the function get_wsi_at_mpp.

While the implementations in the OpenslideWSIReader and CuCIMWSIReader classes were tested thoroughly, I could not find a suitable TIFF file for testing with the TiffFileWSIReader class.

For resizing, I have used PIL.Image.resize for Openslide and TiffFile, and cucim.sklearn.transform.resize for CuCIM. Originally, I used cv2.resize, but since the package isn't listed in requirements-dev.txt, I explored alternative solutions."

Types of changes

  • Non-breaking change (fix or new feature that would not break existing functionality).
  • Breaking change (fix or new feature that would cause existing functionality to change).
  • New tests added to cover the changes.
  • Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
  • Quick tests passed locally by running ./runtests.sh --quick --unittests --disttests.
  • In-line docstrings updated.
  • Documentation updated, tested make html command in the docs/ folder.

Please let me know what you think and how I can improve this feature.

Best
Niko

(Last PR was closed, because I changed the branch name to include the ticket id)

cxlcl and others added 6 commits March 27, 2024 01:47
…roject-MONAI#7308)

### Description

Based on the discussion topic
[here](Project-MONAI#7161 (comment)),
we implemented the Conjugate-Gradient algorithm for linear operator
inversion, and Stein's Unbiased Risk Estimator (SURE) [1] loss for
ground-truth-date free diffusion process guidance that is proposed in
[2] and illustrated in the algorithm below:

<img width="650" alt="Screenshot 2023-12-10 at 10 19 25 PM"
src="https://github.com/Project-MONAI/MONAI/assets/8581162/97069466-cbaf-44e0-b7a7-ae9deb8fd7f2">

The Conjugate-Gradient (CG) algorithm is used to solve for the inversion
of the linear operator in Line-4 in the algorithm above, where the
linear operator is too large to store explicitly as a matrix (such as
FFT/IFFT of an image) and invert directly. Instead, we can solve for the
linear inversion iteratively as in CG.

The SURE loss is applied for Line-6 above. This is a differentiable loss
function that can be used to train/giude an operator (e.g. neural
network), where the pseudo ground truth is available but the reference
ground truth is not. For example, in the MRI reconstruction, the pseudo
ground truth is the zero-filled reconstruction and the reference ground
truth is the fully sampled reconstruction. The reference ground truth is
not available due to the lack of fully sampled.

**Reference**
[1] Stein, C.M.: Estimation of the mean of a multivariate normal
distribution. Annals of Statistics 1981 [[paper
link](https://projecteuclid.org/journals/annals-of-statistics/volume-9/issue-6/Estimation-of-the-Mean-of-a-Multivariate-Normal-Distribution/10.1214/aos/1176345632.full)]
[2] B. Ozturkler et al. SMRD: SURE-based Robust MRI Reconstruction with
Diffusion Models. MICCAI 2023
[[paper link](https://arxiv.org/pdf/2310.01799.pdf)]

### Types of changes
<!--- Put an `x` in all the boxes that apply, and remove the not
applicable items -->
- [x] Non-breaking change (fix or new feature that would not break
existing functionality).
- [ ] Breaking change (fix or new feature that would cause existing
functionality to change).
- [x] New tests added to cover the changes.
- [x] Integration tests passed locally by running `./runtests.sh -f -u
--net --coverage`.
- [x] Quick tests passed locally by running `./runtests.sh --quick
--unittests --disttests`.
- [x] In-line docstrings updated.
- [x] Documentation updated, tested `make html` command in the `docs/`
folder.

---------

Signed-off-by: chaoliu <[email protected]>
Signed-off-by: cxlcl <[email protected]>
Signed-off-by: chaoliu <[email protected]>
Signed-off-by: YunLiu <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: YunLiu <[email protected]>
Co-authored-by: Eric Kerfoot <[email protected]>
Signed-off-by: Nikolas Schmitz <[email protected]>
Signed-off-by: Nikolas Schmitz <[email protected]>
Signed-off-by: Nikolas Schmitz <[email protected]>
Signed-off-by: monai-bot <[email protected]>

Signed-off-by: monai-bot <[email protected]>
Signed-off-by: Nikolas Schmitz <[email protected]>
Signed-off-by: Nikolas Schmitz <[email protected]>
NikolasSchmitz and others added 3 commits March 27, 2024 02:04
Signed-off-by: Nikolas Schmitz <[email protected]>
…ject-MONAI#7569)

Fixes Project-MONAI#7451

### Description
Reduces the length of error messages and error messages being propagated
twice. This helps debug better when long `ConfigComponent`s are being
instantiated. Refer to issue Project-MONAI#7451 for more details

### Types of changes
<!--- Put an `x` in all the boxes that apply, and remove the not
applicable items -->
- [x] Non-breaking change (fix or new feature that would not break
existing functionality).
- [ ] Breaking change (fix or new feature that would cause existing
functionality to change).
- [ ] New tests added to cover the changes.
- [ ] Integration tests passed locally by running `./runtests.sh -f -u
--net --coverage`.
- [x] Quick tests passed locally by running `./runtests.sh --quick
--unittests --disttests`.
- [ ] In-line docstrings updated.
- [ ] Documentation updated, tested `make html` command in the `docs/`
folder.

Signed-off-by: Suraj Pai <[email protected]>
Co-authored-by: Eric Kerfoot <[email protected]>
Fixes Project-MONAI#2872 

### Description

Implementation of mixup, cutmix and cutout as described in the original
papers.
Current implementation support both, the dictionary-based batches and
tuples of tensors.

### Types of changes
<!--- Put an `x` in all the boxes that apply, and remove the not
applicable items -->
- [x] Non-breaking change (fix or new feature that would not break
existing functionality).
- [ ] Breaking change (fix or new feature that would cause existing
functionality to change).
- [x] New tests added to cover the changes.
- [ ] Integration tests passed locally by running `./runtests.sh -f -u
--net --coverage`.
- [x] Quick tests passed locally by running `./runtests.sh --quick
--unittests --disttests`.
- [x] In-line docstrings updated.
- [x] Documentation updated, tested `make html` command in the `docs/`
folder.

---------

Signed-off-by: Juan Pablo de la Cruz Gutiérrez <[email protected]>
Signed-off-by: monai-bot <[email protected]>
Signed-off-by: elitap <[email protected]>
Signed-off-by: Felix Schnabel <[email protected]>
Signed-off-by: YanxuanLiu <[email protected]>
Signed-off-by: ytl0623 <[email protected]>
Signed-off-by: Dženan Zukić <[email protected]>
Signed-off-by: KumoLiu <[email protected]>
Signed-off-by: YunLiu <[email protected]>
Signed-off-by: Ishan Dutta <[email protected]>
Signed-off-by: dependabot[bot] <[email protected]>
Signed-off-by: kaibo <[email protected]>
Signed-off-by: heyufan1995 <[email protected]>
Signed-off-by: binliu <[email protected]>
Signed-off-by: axel.vlaminck <[email protected]>
Signed-off-by: Ibrahim Hadzic <[email protected]>
Signed-off-by: Behrooz <[email protected]>
Signed-off-by: Timothy Baker <[email protected]>
Signed-off-by: Mathijs de Boer <[email protected]>
Signed-off-by: Fabian Klopfer <[email protected]>
Signed-off-by: Lucas Robinet <[email protected]>
Signed-off-by: Lucas Robinet <[email protected]>
Signed-off-by: chaoliu <[email protected]>
Signed-off-by: cxlcl <[email protected]>
Signed-off-by: chaoliu <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: monai-bot <[email protected]>
Co-authored-by: elitap <[email protected]>
Co-authored-by: Felix Schnabel <[email protected]>
Co-authored-by: YanxuanLiu <[email protected]>
Co-authored-by: ytl0623 <[email protected]>
Co-authored-by: Dženan Zukić <[email protected]>
Co-authored-by: Eric Kerfoot <[email protected]>
Co-authored-by: YunLiu <[email protected]>
Co-authored-by: Ishan Dutta <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Kaibo Tang <[email protected]>
Co-authored-by: Yufan He <[email protected]>
Co-authored-by: binliunls <[email protected]>
Co-authored-by: Ben Murray <[email protected]>
Co-authored-by: axel.vlaminck <[email protected]>
Co-authored-by: Mingxin Zheng <[email protected]>
Co-authored-by: Ibrahim Hadzic <[email protected]>
Co-authored-by: Dr. Behrooz Hashemian <[email protected]>
Co-authored-by: Timothy J. Baker <[email protected]>
Co-authored-by: Mathijs de Boer <[email protected]>
Co-authored-by: Mathijs de Boer <[email protected]>
Co-authored-by: Fabian Klopfer <[email protected]>
Co-authored-by: Yiheng Wang <[email protected]>
Co-authored-by: Lucas Robinet <[email protected]>
Co-authored-by: Lucas Robinet <[email protected]>
Co-authored-by: cxlcl <[email protected]>
Signed-off-by: Dr. Behrooz Hashemian <[email protected]>
@bhashemian
Copy link
Member

@NikolasSchmitz, I resolved the conflicts and updated the PR. Let's focus on the functionality and make this PR ready for reviewing. We can take care of any other issue once the PR is ready and please feel free to reach out to me or the working group if you still have any questions. Thanks for taking on this feature.

@NikolasSchmitz NikolasSchmitz marked this pull request as ready for review April 9, 2024 22:55
@NikolasSchmitz
Copy link
Author

Thank you @drbeh ! I now marked it as ready for review.

@ericspod ericspod requested a review from bhashemian April 22, 2024 14:53
monai/data/wsi_reader.py Outdated Show resolved Hide resolved
@ericspod
Copy link
Member

@drbeh would you be able to review this? I think you're best qualified here. I made some minor comments about print and code duplication but otherwise it seems fine to me without having tested it. Thanks!

@JHancox JHancox self-assigned this May 14, 2024
Copy link
Contributor

@JHancox JHancox left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the effort @NikolasSchmitz. A welcome addition! I agree with @ericspod about the refactoring to remove duplication - that would be great.

monai/data/wsi_reader.py Outdated Show resolved Hide resolved
monai/data/wsi_reader.py Outdated Show resolved Hide resolved
@NikolasSchmitz
Copy link
Author

Thank you for the feedback, @JHancox . Unfortunately, I couldn't join the meeting because I was on my way home from ICLR.
I will review the code and provide my updates as soon as possible.

… reduce redundancy; for get_mpp of TiffFileWSIReader: added check to prevent division by zero error.
@NikolasSchmitz
Copy link
Author

I factored out the actual resizing of the image into _resize_to_mpp_res() and kept the tolerance checks in get_wsi_at_mpp(). I had some problems with specific tiff files and the TiffFile backend. Some files had zero as X/Y Resolution, which led to a division by zero error in the function get_mpp (added a check for that in the new commit).
One tiff file from a Philips scanner that I tested had unobtainable mpp values using get_mpp. In this case, the tags do exist (openslide can find them), but they must be read and parsed from the property philips_metadata as XML. I have not implemented a fix for this yet, but please let me know if this is needed. In this particular case, the best option would be to use Openslide as the backend, although there might be users who prefer Tifffile.

@ericspod
Copy link
Member

ericspod commented Aug 1, 2024

There are lines added/removed to two other files that don't need to be changed in this PR, tests/test_regularization.py is causing a conflict for example. I would remove those and we can use black here through Github to resolve formatting issues if we need to.

@NikolasSchmitz
Copy link
Author

There are lines added/removed to two other files that don't need to be changed in this PR, tests/test_regularization.py is causing a conflict for example. I would remove those and we can use black here through Github to resolve formatting issues if we need to.

Thanks for the info. Interesting though, haven't noticed that these files were changed. Perhaps through one of the coding style checks.

@ericspod
Copy link
Member

ericspod commented Aug 7, 2024

There's a few changes to do but it looks good in general. We do need tests for these new methods as well so please look at the existing WSI tests to see what needs to be added to cover new functionality. The flake8 and DCO fixes can be done at the end. Thanks!

@NikolasSchmitz
Copy link
Author

There's a few changes to do but it looks good in general. We do need tests for these new methods as well so please look at the existing WSI tests to see what needs to be added to cover new functionality. The flake8 and DCO fixes can be done at the end. Thanks!

Hi Eric,
I’ve written a few tests to validate the core functionality of get_wsi_at_mpp. However, I’ve encountered an issue with the metadata in the TIFF image used for testing. It appears the MPP value is incorrect or missing. The metadata includes:

'tiff.ResolutionUnit': 'centimeter', 'tiff.XResolution': '10', 'tiff.YResolution': '10'

Here’s a screenshot of the TIFF file opened in QuPath:
image

For comparison, here’s a screenshot of the SVS file opened in QuPath:
image

It seems the SVS file was converted to TIFF, but during the conversion process, the resolution metadata got altered or lost. This discrepancy is likely affecting the accuracy of the tests.

Let me know how you’d like to proceed—should we regenerate the test image with proper metadata or use a different TIFF file for testing? I’ll continue refining the tests in the meantime.

Thanks!

@ericspod
Copy link
Member

I’ve written a few tests to validate the core functionality of get_wsi_at_mpp. However, I’ve encountered an issue with the metadata in the TIFF image used for testing. It appears the MPP value is incorrect or missing. The metadata includes:

I'm also seeing some very different information in the two header from these files I've included below. I don't know how the tiff was generated but maybe @bhashemian would and know what's the discrepancy is.

In [11]: dict(openslide.open_slide("CMU-1.tiff").properties)
Out[11]:
{'openslide.level-count': '9',
'openslide.level[0].downsample': '1',
'openslide.level[0].height': '32914',
'openslide.level[0].tile-height': '256',
'openslide.level[0].tile-width': '256',
'openslide.level[0].width': '46000',
'openslide.level[1].downsample': '2',
'openslide.level[1].height': '16457',
'openslide.level[1].tile-height': '256',
'openslide.level[1].tile-width': '256',
'openslide.level[1].width': '23000',
'openslide.level[2].downsample': '4.0001215362177929',
'openslide.level[2].height': '8228',
'openslide.level[2].tile-height': '256',
'openslide.level[2].tile-width': '256',
'openslide.level[2].width': '11500',
'openslide.level[3].downsample': '8.0002430724355857',
'openslide.level[3].height': '4114',
'openslide.level[3].tile-height': '256',
'openslide.level[3].tile-width': '256',
'openslide.level[3].width': '5750',
'openslide.level[4].downsample': '16.000486144871171',
'openslide.level[4].height': '2057',
'openslide.level[4].tile-height': '256',
'openslide.level[4].tile-width': '256',
'openslide.level[4].width': '2875',
'openslide.level[5].downsample': '32.014322017605849',
'openslide.level[5].height': '1028',
'openslide.level[5].tile-height': '256',
'openslide.level[5].tile-width': '256',
'openslide.level[5].width': '1437',
'openslide.level[6].downsample': '64.050935911470475',
'openslide.level[6].height': '514',
'openslide.level[6].tile-height': '256',
'openslide.level[6].tile-width': '256',
'openslide.level[6].width': '718',
'openslide.level[7].downsample': '128.10187182294095',
'openslide.level[7].height': '257',
'openslide.level[7].tile-height': '256',
'openslide.level[7].tile-width': '256',
'openslide.level[7].width': '359',
'openslide.level[8].downsample': '257.06193261173183',
'openslide.level[8].height': '128',
'openslide.level[8].tile-height': '256',
'openslide.level[8].tile-width': '256',
'openslide.level[8].width': '179',
'openslide.mpp-x': '1000',
'openslide.mpp-y': '1000',
'openslide.quickhash-1': '428aa6abf42c774234a463cb90e2cbf88423afc0217e46ec2e308f31e29f1a9f',
'openslide.vendor': 'generic-tiff',
'tiff.ResolutionUnit': 'centimeter',
'tiff.XResolution': '10',
'tiff.YResolution': '10'}

In [12]: dict(openslide.open_slide("Aperio-CMU-1.svs").properties)
Out[12]:
{'aperio.AppMag': '20',
'aperio.Date': '12/29/09',
'aperio.Filename': 'CMU-1',
'aperio.Filtered': '5',
'aperio.Focus Offset': '0.000000',
'aperio.ICC Profile': 'ScanScope v1',
'aperio.ImageID': '1004486',
'aperio.Left': '25.691574',
'aperio.LineAreaXOffset': '0.019265',
'aperio.LineAreaYOffset': '-0.000313',
'aperio.LineCameraSkew': '-0.000424',
'aperio.MPP': '0.4990',
'aperio.OriginalWidth': '46920',
'aperio.Originalheight': '33014',
'aperio.Parmset': 'USM Filter',
'aperio.ScanScope ID': 'CPAPERIOCS',
'aperio.StripeWidth': '2040',
'aperio.Time': '09:59:15',
'aperio.Top': '23.449873',
'aperio.User': 'b414003d-95c6-48b0-9369-8010ed517ba7',
'openslide.associated.label.height': '463',
'openslide.associated.label.width': '387',
'openslide.associated.macro.height': '431',
'openslide.associated.macro.width': '1280',
'openslide.associated.thumbnail.height': '732',
'openslide.associated.thumbnail.width': '1024',
'openslide.comment': 'Aperio Image Library v10.0.51\r\n46920x33014 [0,100 46000x32914] (256x256) JPEG/RGB Q=30|AppMag = 20|StripeWidth = 2040|ScanScope ID = CPAPERIOCS|Filename = CMU-1|Date = 12/29/09|Time = 09:59:15|User = b414003d-95c6-48b0-9369-8010ed517ba7|Parmset =
USM Filter|MPP = 0.4990|Left = 25.691574|Top = 23.449873|LineCameraSkew = -0.000424|LineAreaXOffset = 0.019265|LineAreaYOffset = -0.000313|Focus Offset = 0.000000|ImageID = 1004486|OriginalWidth = 46920|Originalheight = 33014|Filtered = 5|ICC Profile = ScanScope v1',
'openslide.icc-size': '141992',
'openslide.level-count': '3',
'openslide.level[0].downsample': '1',
'openslide.level[0].height': '32914',
'openslide.level[0].tile-height': '256',
'openslide.level[0].tile-width': '256',
'openslide.level[0].width': '46000',
'openslide.level[1].downsample': '4.0001215362177929',
'openslide.level[1].height': '8228',
'openslide.level[1].tile-height': '256',
'openslide.level[1].tile-width': '256',
'openslide.level[1].width': '11500',
'openslide.level[2].downsample': '16.000486144871171',
'openslide.level[2].height': '2057',
'openslide.level[2].tile-height': '256',
'openslide.level[2].tile-width': '256',
'openslide.level[2].width': '2875',
'openslide.mpp-x': '0.499',
'openslide.mpp-y': '0.499',
'openslide.objective-power': '20',
'openslide.quickhash-1': '30f1a38031fc0e21d81f9d01435ac4af848f6fe2bbf8f7768184336ee5d7e796',
'openslide.vendor': 'aperio',
'tiff.ImageDescription': 'Aperio Image Library v10.0.51\r\n46920x33014 [0,100 46000x32914] (256x256) JPEG/RGB Q=30|AppMag = 20|StripeWidth = 2040|ScanScope ID = CPAPERIOCS|Filename = CMU-1|Date = 12/29/09|Time = 09:59:15|User = b414003d-95c6-48b0-9369-8010ed517ba7|Parms
et = USM Filter|MPP = 0.4990|Left = 25.691574|Top = 23.449873|LineCameraSkew = -0.000424|LineAreaXOffset = 0.019265|LineAreaYOffset = -0.000313|Focus Offset = 0.000000|ImageID = 1004486|OriginalWidth = 46920|Originalheight = 33014|Filtered = 5|ICC Profile = ScanScope v1'
,
'tiff.ResolutionUnit': 'inch'}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

WSIReader support for reading by mpp or magnification
8 participants