
Move from mongodb to tiled-backed catalog #339

Merged
16 commits merged into main from rdb on Jan 8, 2025
Conversation

@canismarko (Contributor) commented on Jan 6, 2025:

The bluesky project is moving from storing raw scan documents in a mongo database via databroker to storing tabular data in a Tiled catalog backed by a relational (sqlite or postgres) database. This PR updates Haven and Firefly to use the Tiled catalog by default.

This change is necessary to enable ophyd-async detector data to be read back out by Tiled.

The aim of this PR is to keep top-level features (e.g. the run browser) the same as with the old database structure. This PR does not add features that will be useful with the new catalog, such as filtering by beamline; these will be added in a separate PR.

To avoid losing data, the raw documents should still be saved in the MongoDB database. That way, when we are ready to fully switch over, we can create a new Tiled catalog and stream the MongoDB documents into it. I suggest we keep the old MongoDB instance running until we are confident we no longer need it.
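For context, reading data back out of the Tiled catalog would look roughly like the sketch below, using the standard tiled client. The URI, API key, and scan UID are placeholders, not Haven's actual configuration:

```python
from tiled.client import from_uri

# Connect to the Tiled server fronting the SQLite/PostgreSQL-backed catalog.
# The URI and api_key are illustrative placeholders.
catalog = from_uri("http://localhost:8000", api_key="secret")

# Look up a run by its scan UID; streams appear as child containers whose
# exact layout depends on how the writer organizes documents.
run = catalog["<scan-uid>"]
print(list(run))          # e.g. ["primary", "baseline", ...]
primary = run["primary"]  # drill further down to reach the tabular data
```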

Things to do before merging:

  • add tests
  • write docs
  • update iconfig_testing.toml
  • flake8, black, and isort
  • test at the beamline

@canismarko (Contributor, Author) commented:

As written, the Tiled writer exists as a run engine callback. Previously, we put documents on the Kafka topic and read them out using mongo_consumer.py. Do we want to do that here as well?

Maybe that should be a different PR, where the run_engine factory lets you attach the Kafka producer instead of attaching the Tiled writer and databroker separately?
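For reference, subscribing a Tiled writer to the run engine as a callback looks roughly like this, assuming bluesky's TiledWriter callback and a tiled client; the URI and api_key are placeholders:

```python
from bluesky import RunEngine
from bluesky.callbacks.tiled_writer import TiledWriter
from tiled.client import from_uri

RE = RunEngine()

# Client pointed at the Tiled server (placeholder URI/api_key).
client = from_uri("http://localhost:8000", api_key="secret")

# Every (name, document) pair emitted by the run engine gets written
# into the Tiled catalog by this callback.
tiled_writer = TiledWriter(client)
RE.subscribe(tiled_writer)
```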

@canismarko (Contributor, Author) commented:

Also, I think documentation for this belongs on the wiki, since it extends beyond the scope of Haven.

@canismarko canismarko marked this pull request as ready for review January 6, 2025 21:10
@canismarko canismarko requested a review from Cathyhjj January 7, 2025 19:39
@canismarko (Contributor, Author) commented:

> As written, the Tiled writer exists as a run engine callback. Previously, we put documents on the Kafka topic and read them out using mongo_consumer.py. Do we want to do that here as well?

I added a TiledConsumer module to the queueserver package that reads documents from Kafka and sends them to Tiled. I will do a separate PR that adds a Kafka producer to the run engine (the queueserver does this automatically).
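For illustration only (not the actual TiledConsumer added in this PR), a consumer like that could look roughly like the sketch below, assuming bluesky-kafka's BlueskyConsumer base class and bluesky's TiledWriter callback. The topic name, bootstrap servers, and group id are placeholders, and the exact BlueskyConsumer signature may differ:

```python
from bluesky.callbacks.tiled_writer import TiledWriter
from bluesky_kafka import BlueskyConsumer
from tiled.client import from_uri

# Placeholders: point these at the real Kafka brokers and Tiled server.
client = from_uri("http://localhost:8000", api_key="secret")
tiled_writer = TiledWriter(client)


class TiledConsumerSketch(BlueskyConsumer):
    """Forward each (name, document) pair read from Kafka into Tiled."""

    def process_document(self, topic, name, doc):
        tiled_writer(name, doc)
        return True  # keep polling for more documents


consumer = TiledConsumerSketch(
    topics=["haven.documents"],
    bootstrap_servers="localhost:9092",
    group_id="tiled-consumer",
)
consumer.start()  # blocks, reading from Kafka and writing to Tiled
```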

Review thread on src/firefly/run_browser/widgets.py (outdated, resolved)
A collaborator commented:

Thank you for changing the data structure for me!

Review thread on src/queueserver/tiled_consumer.py (outdated, resolved)
@canismarko canismarko merged commit 2645708 into main Jan 8, 2025
1 check passed
@canismarko canismarko deleted the rdb branch January 8, 2025 00:28