Rename multiplier to frames_per_event and move to first dim of shape #726

thomashopkins32 · 2025-01-08T19:51:58Z

This PR does the following:

Renames multiplier -> frames_per_event
Add frames_per_event as the first dimension of DataKey.shape
Ensure that the index provided by DetectorWriter.get_indices_written() and DetectorWriter.observe_indices_written() is divided by frames_per_event, so that each index corresponds to a full batch of frames_per_event exposures (except for PandA, which explicitly says it only has 1 "frame" per event)
Add unit tests showing that describe() works as intended
~~Add unit tests showing that stream resources are actually batches of exposures~~
Re-order self._writer.open() and self._writer.get_indices_written(). The writer needs to be opened in order to get the indices written. Otherwise, it has no idea what frames_per_event to use when returning the index last written.
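
As a rough illustration of the index and shape changes listed above, here is a minimal sketch. The helper names `events_written` and `describe_shape` are hypothetical and are not part of ophyd-async; the sketch only assumes that a writer knows how many raw frames it has written and what `frames_per_event` is.

```python
def events_written(frames_written: int, frames_per_event: int) -> int:
    """Convert a raw frame count into a count of complete events (hypothetical helper)."""
    if frames_per_event < 1:
        raise ValueError("frames_per_event must be at least 1")
    # Floor division: a partially filled event is not reported as written.
    return frames_written // frames_per_event


def describe_shape(frame_shape: tuple, frames_per_event: int) -> tuple:
    """Prepend frames_per_event to the per-frame shape (hypothetical helper)."""
    return (frames_per_event, *frame_shape)


# 11 frames with 5 frames per event means 2 complete events so far.
complete = events_written(11, 5)
shape = describe_shape((480, 640), 5)
```

Floor division is one possible choice here: it means an index is only reported once every exposure belonging to that event exists.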

I could not actually add tests using bluesky plans and inspecting the data afterward because TriggerInfo is hardcoded in StandardDetector. I think it is a separate issue that should be raised, since it would expand the scope of this PR. I will open an issue for this soon and mention it below.

Otherwise, I have a few open questions regarding my understanding of ophyd-async as well as the implementation which I will also leave as review comments. Please see below.

Closes #576
Closes #431

thomashopkins32 · 2025-01-24T18:01:16Z

@jwlodek, @jennmald , and I did some testing on actual devices and found a few more issues that need to be resolved. We didn't get through all of the testing we planned for so we will continue next week most likely.

For ophyd-async (completed in a5b1f27) :

PandA needs to be able to handle frames_per_event > 1. @coretl do you know why this was limited to only being 1 for PandA?
The computed total_number_of_triggers needs to be multiplied by frames_per_event

For bluesky:

ConsolidatorBase needs to be reworked based on the new assumption that frames_per_event is the first dim of datum_shape.

For tiled:

TBD

That covers pretty much everything that we tested and debugged on the devices so far. We will see if changes to tiled are necessary in further testing.

coretl · 2025-01-28T10:00:04Z

PandA needs to be able to handle frames_per_event > 1. @coretl do you know why this was limited to only being 1 for PandA?

I'm not sure, I don't think that's a real restriction. If you remove it, what breaks?

jwlodek · 2025-01-28T12:23:18Z

PandA needs to be able to handle frames_per_event > 1. @coretl do you know why this was limited to only being 1 for PandA?

I'm not sure, I don't think that's a real restriction. If you remove it, what breaks?

Nothing actually, we removed it and got everything to work as expected, just into separate streams. We're going to make sure they can fit into the same stream this week

jwlodek

We've now tested this and it is working for us, pending the Consolidator PR.

thomashopkins32 · 2025-01-31T17:40:51Z

Actually we decided that it does not work well with tiled just yet. We want tiled to have the frames_per_event explicitly in the shape which is causing issues with reading the data back from the files (due to how chunking works).

Basically, ophyd-async has the descriptor shape with the first dim as frames_per_event. bluesky's consolidators use this to figure out the proper chunking of the data prior to writing it to the hdf5 file. Then tiled also needs to understand this chunking in order to read the data back from the file and unpack it properly.

The shape of the data from the user perspective should always be (num_events, frames_per_event, ...)
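
To make the intended user-facing shape concrete, here is a small pure-Python sketch. The function names are hypothetical, and it assumes frames are stored contiguously in the file in acquisition order, which this thread does not confirm.

```python
def stacked_shape(num_events: int, datum_shape: tuple) -> tuple:
    """User-facing shape: num_events stacked on top of the descriptor shape."""
    # datum_shape is assumed to already start with frames_per_event.
    return (num_events, *datum_shape)


def frame_range_for_event(event_index: int, frames_per_event: int) -> range:
    """Flat frame indices in the file that belong to one event (assumes contiguous storage)."""
    start = event_index * frames_per_event
    return range(start, start + frames_per_event)


# Descriptor shape (frames_per_event, height, width) for 3 events of 5 frames each.
full = stacked_shape(3, (5, 480, 640))
third_event_frames = list(frame_range_for_event(2, 5))
```

Under these assumptions, any reader that wants to unpack the file correctly has to agree with the writer on `frames_per_event`, which is the coupling between ophyd-async, bluesky and tiled described above.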

jwlodek · 2025-02-21T22:30:19Z

We tested this again, and we think we are happy to merge this pending the sister PR into bluesky being ready / tested as well.

coretl · 2025-02-24T18:15:29Z

src/ophyd_async/core/_detector.py

    """

    @computed_field
    @cached_property
    def total_number_of_triggers(self) -> int:
        return (
-            sum(self.number_of_triggers)
+            sum(self.number_of_triggers) * self.frames_per_event


Either we rename number_of_triggers to number_of_events and this logic is valid, or we remove the * self.frames_per_event.
At the moment we have:

number_of_triggers
livetime and deadtime of each trigger
multiplier (which is effectively triggers_per_event)

We could move to:

number_of_events
livetime and deadtime of each frame
frames_per_event

@jwlodek do you have an opinion?

jwlodek · 2025-02-24

I think the naming you suggest makes sense. number_of_events is a more accurate way of describing what this value is. I agree that livetime/deadtime should be on a per-frame basis; that's how most people will interpret it.

thomashopkins32 · 2025-02-25

I think it should be:

TriggerInfo.number_of_triggers -> TriggerInfo.number_of_events
TriggerInfo.frames_per_event -> TriggerInfo.triggers_per_event

This would make the total_number_of_triggers property meaningful. The word "frame" is really only relevant to detectors that capture 2-D data.

Then we can add that the word "event" here relates to a single StreamDatum index being emitted.

coretl · 2025-02-25

I'm happy with that
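
With the naming proposed in this thread, the computed total could look like the sketch below. This is a standalone function rather than the actual `TriggerInfo` property, and accepting either an int or a list for `number_of_events` is an assumption based on the `sum(...)` call in the diff.

```python
from typing import Sequence, Union


def total_number_of_triggers(
    number_of_events: Union[int, Sequence[int]],
    triggers_per_event: int = 1,
) -> int:
    """Total triggers = total events * triggers per event (sketch, not the real property)."""
    if isinstance(number_of_events, int):
        events = [number_of_events]
    else:
        events = list(number_of_events)
    return sum(events) * triggers_per_event


# Two kickoffs producing 3 and 4 events, each event made of 2 triggers.
total = total_number_of_triggers([3, 4], triggers_per_event=2)
```

Read this way, the multiplication is only meaningful when the first argument counts events, which is the point of the rename.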

thomashopkins32 · 2025-02-26T22:06:12Z

@coretl @jwlodek

This might seem like overkill but please help me check my understanding of the terms. Here is what I understand:

| Level | Term | Meaning | Example |
| --- | --- | --- | --- |
| EPICS (or some other control system) | "capture" | One single data point collected from a detector | A singular frame is collected from an area detector |
| ophyd-async | "trigger" | One or many "captures" that returns a status | 5 frames are collected from an area detector |
| bluesky & beyond | "event" | Occurrence of a single "trigger" that finishes with successful status | 5 frames are collected and a document is emitted |

The TriggerInfo schema is actually pretty confusing when you try to do the following:

trigger_info = TriggerInfo(number_of_triggers=4, frames_per_event=2)
RE(bps.prepare(det, trigger_info))
RE(bp.count([det], num=10))

This example currently fails to run without error.

Is this something that should be possible? I can't even tell what the expected behavior here should be...

This may be an unintended consequence of doing a step scan versus a fly scan but I need to dig a bit further.

Either way, if there is a real issue here, it should probably be handled in a separate PR.

coretl · 2025-02-27T12:53:12Z

This might seem like overkill but please help me check my understanding of the terms.

My take:

| Level | Term | Meaning | Example |
| --- | --- | --- | --- |
| EPICS (or some other control system) | "exposure" | One single data point collected from a detector | A singular frame is collected from an area detector |
| ophyd-async | "trigger" | One software or hardware trigger that is sent to the detector to trigger an exposure (with the given livetime) | A singular frame is collected from an area detector |
| bluesky & beyond | "event" | An atomic row of data in a stream within a scan, synchronous data points from multiple detectors | 5 frames are collected from one detector as a single stream datum index, 1 frame from another as a single stream datum index, and a document is emitted with the motor position at this scan point |

The TriggerInfo schema is actually pretty confusing when you try to do the following:
trigger_info = TriggerInfo(number_of_triggers=4, frames_per_event=2)
RE(bps.prepare(det, trigger_info))
RE(bp.count([det], num=10))

Definitely. There are 3 terms here:

number_of_triggers -> number_of_events, which is the number of events to produce for each kickoff in a flyscan; it should always be set to 1 for a step scan, as each step is an event
frames_per_event -> triggers_per_event which is the number of triggers and therefore the number of frames the detector will produce in each event
num argument to count plan which is the number of events to produce

So number_of_events and num are both specifying the same thing, but in a step scan the number has to be controlled from the plan, so number_of_events must equal 1

This example currently fails to run without error.

This should error with something like "Can only trigger() to contribute to a single event, but detector was prepared with number_of_events=4"
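
A check along those lines might be sketched as follows. `SketchTriggerInfo` and `check_step_scan` are hypothetical stand-ins, not the real ophyd-async classes, and the choice of `ValueError` is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SketchTriggerInfo:
    """Hypothetical stand-in for TriggerInfo, using the proposed field names."""

    number_of_events: int = 1
    triggers_per_event: int = 1


def check_step_scan(info: SketchTriggerInfo) -> None:
    """Reject a prepared TriggerInfo that cannot be satisfied by a single trigger()."""
    if info.number_of_events != 1:
        raise ValueError(
            "Can only trigger() to contribute to a single event, but detector "
            f"was prepared with number_of_events={info.number_of_events}"
        )


# A step scan with several triggers per event is fine; several events is not.
check_step_scan(SketchTriggerInfo(number_of_events=1, triggers_per_event=2))
```

In this sketch the check would run at the start of `trigger()`, so the failure surfaces at the first step of the plan instead of producing mis-shaped data.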

thomashopkins32 · 2025-02-27T14:03:21Z

@coretl Okay I understand now. It makes sense that a step scan can only have 1 trigger, by your definition of trigger.

However, the term "trigger" seems overloaded with bluesky's definition in the sense that a StandardDetector is Triggerable and a bluesky trigger happens once per event (in the case of a step scan, not sure of a fly scan). I suggest we use exposures_per_event to replace frames_per_event.

I will do the renaming part in this PR and add checks + expected functionality (with tests) in another PR.

jwlodek · 2025-02-27T14:04:43Z

I agree with @coretl 's understanding. One thing I thought of however when mulling this over, is that currently we are missing one thing when operating in step scan mode - namely that if I set num_images to say 5 in my AD, and run count with num=3, I'd expect 3 events w/ 5 frames per event, but currently we don't actually have a mechanism for setting frames_per_event in a step scan with trigger - you'd end up with three events of 5 images, but the shape would be incorrect with a frames_per_event of 1.

I think the easiest solution to this would be to add a property to StandardDetector that basically just returns the currently configured frames_per_event for a step scan, with it just returning 1 by default, with the expectation that the implementation should override this. So then for AD based detectors this would get the value of num_images for example, and would set frames_per_event=THE_VALUE_OF_THIS_PROPERTY in the implicit prepare in the trigger method.

Thoughts?
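
The property idea above could be sketched like this. All class and attribute names are hypothetical, and a plain integer attribute stands in for what would really be an asynchronous read of the detector's num_images signal.

```python
class SketchDetector:
    """Hypothetical base class: one frame per event unless a subclass says otherwise."""

    @property
    def step_scan_frames_per_event(self) -> int:
        return 1

    def implicit_trigger_info(self) -> dict:
        # What the implicit prepare inside trigger() would use in a step scan.
        return {
            "number_of_events": 1,
            "frames_per_event": self.step_scan_frames_per_event,
        }


class SketchAreaDetector(SketchDetector):
    """Hypothetical AD-based detector that reports its configured num_images."""

    def __init__(self, num_images: int) -> None:
        self.num_images = num_images  # stand-in for reading the real signal

    @property
    def step_scan_frames_per_event(self) -> int:
        return self.num_images


info = SketchAreaDetector(num_images=5).implicit_trigger_info()
```

With this shape, `count` with `num=3` on a detector configured for 5 images would describe 3 events of 5 frames each, rather than 3 events with a frames_per_event of 1.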

coretl · 2025-02-27T16:36:37Z

I think the easiest solution to this would be to add a property to StandardDetector that basically just returns the currently configured frames_per_event for a step scan, with it just returning 1 by default, with the expectation that the implementation should override this. So then for AD based detectors this would get the value of num_images for example, and would set frames_per_event=THE_VALUE_OF_THIS_PROPERTY in the implicit prepare in the trigger method.

How about we make frames_per_event optional? If it isn't set, the detector should read num_images and use it as the value of frames_per_event; if it is set, the detector should set num_images to frames_per_event.

I suggest we use exposures_per_event to replace frames_per_event.

I'm not sure whether frames or exposures is better, @jwlodek thoughts?

thomashopkins32 · 2025-02-27T22:04:16Z

How about we make frames_per_event optional? If it isn't set, the detector should read num_images and use it as the value of frames_per_event; if it is set, the detector should set num_images to frames_per_event.

I think this should be handled in a different PR. There is some complexity in doing this. Something like this is already done for the TriggerInfo.deadtime but I'm not sure we want to do it the same way for frames_per_event.

We would basically be doing the following:

frames_per_event <- num_images
num_images <- number_of_events * frames_per_event

So if someone re-uses a TriggerInfo that has frames_per_event=None, then num_images could increase over time if number_of_events > 1. I think this user would have to call prepare twice in the same bluesky run though?
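
The growth described here can be reproduced with a toy model of the two assignments. `sketch_prepare` is hypothetical and only mirrors the two pseudo-assignments above; it is not how ophyd-async prepares a detector.

```python
from typing import Optional, Tuple


def sketch_prepare(
    num_images: int,
    number_of_events: int,
    frames_per_event: Optional[int] = None,
) -> Tuple[int, int]:
    """Return (frames_per_event used, new num_images) for one prepare call."""
    if frames_per_event is None:
        # frames_per_event <- num_images
        frames_per_event = num_images
    # num_images <- number_of_events * frames_per_event
    return frames_per_event, number_of_events * frames_per_event


# Re-using the same TriggerInfo (frames_per_event=None, number_of_events=2) twice.
first = sketch_prepare(num_images=5, number_of_events=2)
second = sketch_prepare(num_images=first[1], number_of_events=2)
```

In this model the second prepare reads back the value the first one wrote, so frames_per_event doubles on every call whenever number_of_events is 2.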

* add use/set switch (bluesky#780)
* add use/set switch
* rename to set_use_switch
* Remove functions from ADHDFWriter that are exact copies of superclass functions (bluesky#782)
* Simpler fix standard det (bluesky#784)
* Make sure the capture/arm status coro hasn't already completed before trying to await it
* add offset mode switch and other missing motor fields (bluesky#783)
* add offset mode switch
* add other things in epics motor present in ophyd-sync
* remove homf,homr,movn,tdir and make signal names consistent with ophyd
* make HLS and LLS int
* Testing some ideas
* Renamed number_of_triggers -> number_of_events, frames_per_event -> exposures_per_event
* Cleanup on expected test failure
* Use FailedStatus in test

Co-authored-by: Jack Harper
Co-authored-by: Jakub Wlodek

coretl · 2025-02-28T09:19:16Z

I think this should be handled in a different PR. There is some complexity in doing this. Something like this is already done for the TriggerInfo.deadtime but I'm not sure we want to do it the same way for frames_per_event.

Agreed.

We would basically be doing the following:
frames_per_event <- num_images
num_images <- number_of_events * frames_per_event

I think I've got some terminology confused. In my mind we're mapping:

TriggerInfo.livetime -> ADAcquireTime
TriggerInfo.livetime + TriggerInfo.deadtime -> ADAcquirePeriod
TriggerInfo.number_of_events -> ADNumImages
TriggerInfo.frames_per_event -> ADNumExposures

As defined in https://areadetector.github.io/areaDetector/ADCore/ADDriver.html

All of those fields in TriggerInfo should be optional apart from number_of_events. If there is an optional field, it should take its value from the currently set areaDetector value. number_of_events should always be 1 in a step scan.
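
The mapping and the fallback rule for optional fields could be sketched as below. The dictionary keys reuse the names from the list above; `SketchTriggerInfo` and `to_ad_settings` are hypothetical, and keeping the current ADAcquirePeriod when deadtime is unset is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SketchTriggerInfo:
    """Hypothetical stand-in: only number_of_events is required."""

    number_of_events: int
    livetime: Optional[float] = None
    deadtime: Optional[float] = None
    frames_per_event: Optional[int] = None


def to_ad_settings(info: SketchTriggerInfo, current: dict) -> dict:
    """Map TriggerInfo onto areaDetector values, falling back to the current ones."""
    livetime = info.livetime if info.livetime is not None else current["ADAcquireTime"]
    if info.deadtime is not None:
        period = livetime + info.deadtime
    else:
        # Assumption: with no deadtime given, leave the period as currently set.
        period = current["ADAcquirePeriod"]
    if info.frames_per_event is not None:
        exposures = info.frames_per_event
    else:
        exposures = current["ADNumExposures"]
    return {
        "ADAcquireTime": livetime,
        "ADAcquirePeriod": period,
        "ADNumImages": info.number_of_events,
        "ADNumExposures": exposures,
    }


current = {"ADAcquireTime": 1.0, "ADAcquirePeriod": 1.5, "ADNumImages": 1, "ADNumExposures": 3}
step_scan_settings = to_ad_settings(SketchTriggerInfo(number_of_events=1), current)
```

A step scan would pass `number_of_events=1` and leave the other fields unset, so the detector keeps whatever acquisition settings it already has.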

So if someone re-uses a TriggerInfo that has frames_per_event=None, then the num_images could increase over-time if number_of_events > 1. I think this user would have to call prepare twice in the same bluesky run though?

I think I made a mistake in the current implementation, the passed in TriggerInfo should be immutable and we make a copy with a different deadtime. That would stop any surprises when a TriggerInfo was reused.
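
One way to get that behaviour, sketched with a frozen dataclass as a stand-in (the real TriggerInfo is a pydantic model, so the actual mechanism would differ). `with_default_deadtime` is a hypothetical helper name.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class SketchTriggerInfo:
    """Hypothetical immutable stand-in for TriggerInfo."""

    number_of_events: int
    deadtime: Optional[float] = None


def with_default_deadtime(info: SketchTriggerInfo, detector_deadtime: float) -> SketchTriggerInfo:
    """Return a copy with deadtime filled in, leaving the caller's object untouched."""
    if info.deadtime is not None:
        return info
    return replace(info, deadtime=detector_deadtime)


original = SketchTriggerInfo(number_of_events=1)
prepared = with_default_deadtime(original, detector_deadtime=0.002)
```

Because the caller's object never changes, preparing twice with the same TriggerInfo starts from the same unset fields each time.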

thomashopkins32 · 2025-02-28T13:58:04Z

I think I've got some terminology confused. In my mind we're mapping:
* `TriggerInfo.livetime` -> `ADAcquireTime`

* `TriggerInfo.livetime + TriggerInfo.deadtime` -> `ADAcquirePeriod`

* `TriggerInfo.number_of_events` -> `ADNumImages`

* `TriggerInfo.frames_per_event` -> `ADNumExposures`
As defined in https://areadetector.github.io/areaDetector/ADCore/ADDriver.html

My understanding of the ADDriver is not great, so I will defer to @jwlodek on this one. My understanding was that we want multiple images (or data points) per event. So maybe "exposure" is not the right word and we just use "point" instead.

All of those fields in TriggerInfo should be optional apart from number_of_events. If there is an optional field, it should take its value from the currently set areaDetector value. number_of_events should always be 1 in a step scan.

Agreed! I think this would be great.

I think I made a mistake in the current implementation, the passed in TriggerInfo should be immutable and we make a copy with a different deadtime. That would stop any surprises when a TriggerInfo was reused.

Yes, currently it modifies the existing TriggerInfo.

jwlodek and others added 30 commits September 4, 2024 13:16

Starting to work on ad tiff writer

652de13

Resolve merge conflicts

e289ee4

Continue working on tiff writer

f36ec3a

Further work on tiff writer, existing tests now passing.

83dff62

Remove functions moved to superclas from hdf writer

1a52a21

Significant re-org and simplification of ad classes

489cfd8

Ruff formatting

83c6884

Modify ad sim classes to reflect new superclasses

3b4f45a

Modify vimba and kinetix classes

7175b30

Modify aravis and pilatus classes

faf53d6

Update all tests to make sure they still pass with changes

5b9f60f

Some cleanup

8bbfd0e

Merge with upstream

1eab818

Changes to standard detector to account for controller/writer types in typing

f6825b4

Significant changes to base detector, controller, and writer classes

651b80d

Update detector and controller classes to reflect changes

38a61e8

Make sure panda standard det uses new type hints

aecdf04

Most tests passing

e42fa12

Merge with main and resolve conflicts

07684a4

Revert change in test that was resolved by pydantic version update

6dc09f3

Remove debugging prints

1f7dcd7

Linter fixes

35dd1b1

Fix linter error

8112220

Move creation of writer outside of base AreaDetector class init per review

ac1e509

Make sure we don't wait for capture to be done!

8494da4

Merge with upstream

b212432

Merge with upstream

3242d45

Allow for specifying whether or not to use fileio signals for dataset description

488d7eb

Revert "Allow for specifying whether or not to use fileio signals for dataset description" This reverts commit 488d7eb.

a76b70f

Fix linter errors, remove unused enum

7da935e

Forgot one test

1935d69

Total number of triggers scaled by frames_per_event; PandA now supports frames_per_event > 1

a5b1f27

thomashopkins32 mentioned this pull request Jan 30, 2025

Updated ConsolidatorBase to support frames_per_event as first dim of descriptor shape bluesky/bluesky#1876

Open

thomashopkins32 added 3 commits January 30, 2025 11:19

Merge branch 'main' of https://github.com/bluesky/ophyd-async into move-multiplier-to-first-dim

cc12995

Fix tests

7c0dfa2

ruff format

0f3007c

jwlodek approved these changes Jan 31, 2025

thomashopkins32 added 5 commits February 20, 2025 16:45

Merge branch 'main' into move-multiplier-to-first-dim

91adcea

Pre-commit

0a6df17

Change multiplier -> frames_per_event in _blob_detector_writer.py

0566559

Fix bob detector tests and describe dtype

40a540b

formatting

e1bc098

coretl reviewed Feb 24, 2025

Merge branch 'main' into move-multiplier-to-first-dim

e1363a2

thomashopkins32 and others added 3 commits February 27, 2025 17:55

Merge branch 'move-multiplier-to-first-dim' of https://github.com/thomashopkins32/ophyd-async into move-multiplier-to-first-dim

1e48292

Merge branch 'main' of https://github.com/bluesky/ophyd-async into move-multiplier-to-first-dim

5ac7180
