Rename multiplier to frames_per_event and move to first dim of shape #726

thomashopkins32 · 2025-01-08T19:51:58Z

This PR does the following:

Renames multiplier -> frames_per_event
Add the frames_per_event as the first dimension of DataKey.shape
Ensure that the index provided by DetectorWriter.get_indices_written() and DetectorWriter.observe_indices_written() is divided by frames_per_event so that it actually captures the correct amount of exposures in each index (except for PandA which explicitly says it only has 1 "frame" per event)
Add unit tests showing that describe() works as intended
~~Add unit tests showing that stream resources are actually batches of exposures~~
Re-order self._writer.open() and self._writer.get_indices_written(). The writer needs to be opened in order to get the indices written. Otherwise, it has no idea what frames_per_event to use when returning the index last written.

I could not actually add tests using bluesky plans and inspecting the data afterword because TriggerInfo is hardcoded in StandardDetector. I think it is a separate issue that should be raised since it would enhance the scope of this PR. I will open an issue for this soon and mention it below.

Otherwise, I have a few open questions regarding my understanding of ophyd-async as well as the implementation which I will also leave as review comments. Please see below.

Closes #576

…n typing

…eview

… description

… dataset description" This reverts commit 488d7eb.

…ape instead

thomashopkins32

Question about shapes when frames_per_event is 1: do we want to always "squeeze" the shape?
I.e. there are a couple of options:

For 2d arrays:
- [1, h, w] -> [h, w] when frames_per_event = 1
- [frames_per_event, h, w] when frames_per_event > 1
For scalar values:
- [1,] -> [] when frames_per_event = 1
- [frames_per_event,] when frames_per_event > 1

Currently, it is set up such that if the result would be a single scalar value, the shape would be replaced with []. Otherwise, the shape always contains the extra dim.

src/ophyd_async/epics/adcore/_core_writer.py

…ve-multiplier-to-first-dim

jwlodek

I think this is close, just a few minor notes. We should try setting this up in the lab and running a test w/ collecting data from different devices w/ different frames_per_event to make sure it behaves as expected (and also to maybe work out the needed changes to the consolidators).

src/ophyd_async/core/_detector.py

src/ophyd_async/core/_hdf_dataset.py

src/ophyd_async/epics/adcore/_core_writer.py

tests/core/test_flyer.py

src/ophyd_async/epics/adcore/_core_writer.py

tests/epics/adaravis/test_aravis.py

jwlodek · 2025-01-13T16:43:11Z

Question about shapes when frames_per_event is 1: do we want to always "squeeze" the shape? I.e. there are a couple of options:
* For 2d arrays:
  
  * `[1, h, w]` -> `[h, w]` when `frames_per_event = 1`
  * `[frames_per_event, h, w]` when `frames_per_event > 1`

* For scalar values:
  
  * `[1,]` -> `[]` when `frames_per_event = 1`
  * `[frames_per_event,]` when `frames_per_event > 1`
Currently, it is set up such that if the result would be a single scalar value, the shape would be replaced with []. Otherwise, the shape always contains the extra dim.

I think I'd be in favor of avoiding such squeezing, because then we'd need a separate parameter to let us know if it had been squeezed or not. Say we have a frames_per_event of 1 w/ a dataset that's 10 x 10. If we squeeze we get [10, 10] as the shape, but there's no way of telling if this is actually a 1D dataset of size 10 w/ 10 frames per event.

…ve-multiplier-to-first-dim

thomashopkins32 · 2025-01-14T16:16:27Z

@jwlodek so the current squeezing behavior for the shape is

For 2d arrays:
- [frames_per_event, h, w] (even if frames_per_event is 1)
For scalar values:
- [1,] -> [] when frames_per_event = 1
- [frames_per_event,] when frames_per_event > 1

The final change I would make based on your comment would be to remove the squeezing on scalar values from [1,] -> [].

thomashopkins32 · 2025-01-14T19:29:52Z

Should be ready to review once more. The new shape behavior is such that the frames_per_event is always the first dimension of shape. If the len(shape) > 1, then the dtype is an array, otherwise, its a number.

thomashopkins32 · 2025-01-24T18:01:16Z

@jwlodek, @jennmald , and I did some testing on actual devices and found a few more issues that need to be resolved. We didn't get through all of the testing we planned for so we will continue next week most likely.

For ophyd-async (completed in a5b1f27) :

PandA needs to be able to handle frames_per_event > 1. @coretl do you know why this was limited to only being 1 for PandA?
The computed total_number_of_triggers needs to be multiplied by frames_per_event

For bluesky:

ConsolidatorBase needs to be reworked based on the new assumption that frames_per_event is the first dim of datum_shape.

For tiled:

TBD

That covers pretty much everything that we tested a debugged on the devices so far. We will see if changes to tiled are necessary in further testing.

coretl · 2025-01-28T10:00:04Z

PandA needs to be able to handle frames_per_event > 1. @coretl do you know why this was limited to only being 1 for PandA?

I'm not sure, I don't think that's a real restriction. If you remove it, what breaks?

jwlodek · 2025-01-28T12:23:18Z

PandA needs to be able to handle frames_per_event > 1. @coretl do you know why this was limited to only being 1 for PandA?

I'm not sure, I don't think that's a real restriction. If you remove it, what breaks?

Nothing actually, we removed it and got everything to work as expected, just into separate streams. We're going to make sure they can fit into the same stream this week

…ts frames_per_event > 1

…ve-multiplier-to-first-dim

jwlodek

We've now tested this and it is working for us, pending the Consolidator PR.

thomashopkins32 · 2025-01-31T17:40:51Z

Actually we decided that it does not work well with tiled just yet. We want tiled to have the frames_per_event explicitly in the shape which is causing issues with reading the data back from the files (due to how chunking works).

Basically, ophyd-async has the descriptor shape with the first dim as frames_per_event. bluesky's consolidators uses this to figure out the proper chunking of the data prior to writing it to the hdf5 file. Then tiled needs to also understand this chunking in order to read the data back from the file and unpack it properly.

The shape of the data from the user perspective should always be (num_events, frames_per_event, ...)

jwlodek and others added 30 commits September 4, 2024 13:16

Starting to work on ad tiff writer

652de13

Resolve merge conflicts

e289ee4

Continue working on tiff writer

f36ec3a

Further work on tiff writer, existing tests now passing.

83dff62

Remove functions moved to superclas from hdf writer

1a52a21

Significant re-org and simplification of ad classes

489cfd8

Ruff formatting

83c6884

Modify ad sim classes to reflect new superclasses

3b4f45a

Modify vimba and kinetix classes

7175b30

Modify aravis and pilatus classes

faf53d6

Update all tests to make sure they still pass with changes

5b9f60f

Some cleanup

8bbfd0e

Merge with upstream

1eab818

Changes to standard detector to account for controller/writer types i…

f6825b4

…n typing

Significant changes to base detector, controller, and writer classes

651b80d

Update detector and controller classes to reflect changes

38a61e8

Make sure panda standard det uses new type hints

aecdf04

Most tests passing

e42fa12

Merge with main and resolve conflicts

07684a4

Revert change in test that was resolved by pydantic version update

6dc09f3

Remove debugging prints

1f7dcd7

Linter fixes

35dd1b1

Fix linter error

8112220

Move creation of writer outside of base AreaDetector class init per r…

ac1e509

…eview

Make sure we don't wait for capture to be done!

8494da4

Merge with upstream

b212432

Merge with upstream

3242d45

Allow for specifying whether or not to use fileio signals for dataset…

488d7eb

… description

Revert "Allow for specifying whether or not to use fileio signals for…

a76b70f

… dataset description" This reverts commit 488d7eb.

Fix linter errors, remove unused enum

7da935e

thomashopkins32 added 4 commits January 8, 2025 10:47

Remove frames_per_event from HDFDataset, use as first dimension of sh…

df48897

…ape instead

Cleanup + ruff checks

bdda567

Added unit tests for describe with > 1 frames_per_event

93f2f97

Add unit tests for collect with > 1 frames_per_event

9410c4f

thomashopkins32 commented Jan 8, 2025

View reviewed changes

src/ophyd_async/epics/adcore/_core_writer.py Outdated Show resolved Hide resolved

src/ophyd_async/epics/adcore/_core_writer.py Outdated Show resolved Hide resolved

src/ophyd_async/epics/adcore/_core_writer.py Outdated Show resolved Hide resolved

Fix docs indentation

9ca8f2a

thomashopkins32 requested a review from jwlodek January 8, 2025 21:15

Merge branch 'main' of https://github.com/bluesky/ophyd-async into mo…

871326a

…ve-multiplier-to-first-dim

jwlodek requested changes Jan 13, 2025

View reviewed changes

thomashopkins32 added 3 commits January 14, 2025 09:27

Merge branch 'main' of https://github.com/bluesky/ophyd-async into mo…

4208d51

…ve-multiplier-to-first-dim

Remove shape from stream resource parameters

33b6b21

Ruff check fixes

3fde1d2

thomashopkins32 added 2 commits January 14, 2025 14:25

Make the first dimension for scalar values always the frames_per_event

161f022

Ruff format

2a92b9d

thomashopkins32 requested review from coretl and jwlodek January 14, 2025 19:26

Forgot one test

1935d69

Total number of triggers scaled by frames_per_event; PandA now suppor…

a5b1f27

…ts frames_per_event > 1

thomashopkins32 mentioned this pull request Jan 30, 2025

Updated ConsolidatorBase to support frames_per_event as first dim of descriptor shape bluesky/bluesky#1876

Open

thomashopkins32 added 3 commits January 30, 2025 11:19

Merge branch 'main' of https://github.com/bluesky/ophyd-async into mo…

cc12995

…ve-multiplier-to-first-dim

Fix tests

7c0dfa2

ruff format

0f3007c

jwlodek approved these changes Jan 31, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rename multiplier to frames_per_event and move to first dim of shape #726

Rename multiplier to frames_per_event and move to first dim of shape #726

thomashopkins32 commented Jan 8, 2025 •

edited

Loading

thomashopkins32 left a comment

jwlodek left a comment

jwlodek commented Jan 13, 2025

thomashopkins32 commented Jan 14, 2025

thomashopkins32 commented Jan 14, 2025

thomashopkins32 commented Jan 24, 2025 •

edited

Loading

coretl commented Jan 28, 2025

jwlodek commented Jan 28, 2025

jwlodek left a comment

thomashopkins32 commented Jan 31, 2025

Rename multiplier to frames_per_event and move to first dim of shape #726

Are you sure you want to change the base?

Rename multiplier to frames_per_event and move to first dim of shape #726

Conversation

thomashopkins32 commented Jan 8, 2025 • edited Loading

thomashopkins32 left a comment

Choose a reason for hiding this comment

jwlodek left a comment

Choose a reason for hiding this comment

jwlodek commented Jan 13, 2025

thomashopkins32 commented Jan 14, 2025

thomashopkins32 commented Jan 14, 2025

thomashopkins32 commented Jan 24, 2025 • edited Loading

coretl commented Jan 28, 2025

jwlodek commented Jan 28, 2025

jwlodek left a comment

Choose a reason for hiding this comment

thomashopkins32 commented Jan 31, 2025

thomashopkins32 commented Jan 8, 2025 •

edited

Loading

thomashopkins32 commented Jan 24, 2025 •

edited

Loading