Persisters should error if asked to save a persisted resource which doesn't already exist. #837

tpendragon · 2020-08-10T20:19:21Z

Right now the following works:

resource = metadata_adapter.persister.save(resource: Resource.new)
metadata_adapter.persister.delete(resource: resource)
saved_again = metadata_adapter.persister.save(resource: resource)
saved_again.id == resource.id # => true

However, this causes race conditions with background jobs. It should do the following:

resource = metadata_adapter.persister.save(resource: Resource.new)
metadata_adapter.persister.delete(resource: resource)
saved_again = metadata_adapter.persister.save(resource: resource)
# => Valkyrie::Persistence::ObjectNotFoundError

I'm open to other ideas, instead of ObjectNotFoundError. A quick experiment with AR shows that it just returns true for save, but doesn't do anything.

The text was updated successfully, but these errors were encountered:

dgcliff · 2020-08-10T21:02:10Z

ObjectNotFoundError seems to be the best fit, but perhaps we can highlight for the user the cause of the issue to help them untangle it. If they arrived at the error as a result of a race condition, they may not even know when/where the resource was deleted.

Setting the error message could potentially fulfill both goals.

Valkyrie::Persistence::ObjectNotFoundError.new("Attempted save on persisted resource which was not found") (stand-in language, I'm sure a better explanation is possible)

no-reply · 2020-08-11T00:40:54Z

could this be adapter specific behavior?

my intuition is that making the #persisted? check an API level contract is introducing unneeded state dependence, and likely to end up being more complicated than we bargained for. is there a proposed generic implementation that doesn't involve an extra round-trip to the backend when saving an object for which #persisted? is true?

it seems reasonable (already) that an adapter or backend might want to avoid recreating items with the same ids as any that have already been deleted, and that they might use a variety of strategies to ensure that.

i think i'd also like to hear more about the nature of the race conditions. pulibrary/figgy#4174 provides a hint, but i'm curious to know more.

tpendragon · 2020-08-11T01:19:50Z

I don't think it should be adapter specific, but I'm sure we can find efficient methods of implementation. Fedora, for instance, will have a tombstone and just throw an error. A database save could UPDATE, and solr's cheap to check.

The race condition for us was the following:

A. User uploads an item with 600 pages. 25 workers promptly start working them.
B. Worker A pulls the object it generates derivatives for, creates a derivative. This takes a bit to process.
C. User deletes the object because they've found some mistake and want to reingest. This deletes all the pages.
C. Worker A saves the page it was generating a derivative for. Now the page is suddenly persisted again, and separated from its parent (which is gone.)

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

hackartisan · 2020-10-23T13:37:09Z

Work started on branch master...837-persisters-should-error

~~Note one iCLA will be needed before this can be merged.~~ -- Update: CLA is received / on file

tpendragon · 2020-10-29T21:29:56Z

I added some commits to master...837-persisters-should-error and now only Solr is left, but I don't have a good plan on how to make Solr efficiently.

tpendragon · 2020-10-29T21:36:53Z

I also don't know what to do about #save_all. Erroring beforehand would make it stop in the middle of saving them with some of these implementations.

hackartisan · 2020-11-02T14:41:49Z

Hm. For save_all, perhaps the strategy AR uses would be worth considering - don't error but also don't do anything.

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon added the enhancement label Aug 10, 2020

tpendragon mentioned this issue Aug 10, 2020

Analyze Orphaned FileSets to figure out why they're orphaned. pulibrary/figgy#4174

Open

hackartisan added a commit that referenced this issue Oct 22, 2020

Add a shared persister spec for saving a deleted resource

c191cb3

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

hackartisan added a commit that referenced this issue Oct 22, 2020

Raise ObjectNotFound on save for memory adapter

7950ab9

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon pushed a commit that referenced this issue Jul 28, 2021

Add a shared persister spec for saving a deleted resource

5e6b03c

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon pushed a commit that referenced this issue Jul 28, 2021

Raise ObjectNotFound on save for memory adapter

6eddc43

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon mentioned this issue Jul 28, 2021

Persisters should error if you try to save something which is gone now, but used to be persisted. #867

Merged

tpendragon pushed a commit that referenced this issue Aug 4, 2021

Add a shared persister spec for saving a deleted resource

3650221

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon pushed a commit that referenced this issue Aug 4, 2021

Raise ObjectNotFound on save for memory adapter

26f3396

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon pushed a commit that referenced this issue Sep 13, 2021

Add a shared persister spec for saving a deleted resource

0c564b4

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon pushed a commit that referenced this issue Sep 13, 2021

Raise ObjectNotFound on save for memory adapter

6a8d4b1

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon pushed a commit that referenced this issue Sep 14, 2021

Add a shared persister spec for saving a deleted resource

d06ca96

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon pushed a commit that referenced this issue Sep 14, 2021

Raise ObjectNotFound on save for memory adapter

e85a12a

refs #837 Co-authored-by: Ayse Durmaz <[email protected]>

tpendragon closed this as completed in #867 Oct 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Persisters should error if asked to save a persisted resource which doesn't already exist. #837

Persisters should error if asked to save a persisted resource which doesn't already exist. #837

tpendragon commented Aug 10, 2020

dgcliff commented Aug 10, 2020

no-reply commented Aug 11, 2020

tpendragon commented Aug 11, 2020

hackartisan commented Oct 23, 2020 •

edited

Loading

tpendragon commented Oct 29, 2020 •

edited

Loading

tpendragon commented Oct 29, 2020

hackartisan commented Nov 2, 2020

Persisters should error if asked to save a persisted resource which doesn't already exist. #837

Persisters should error if asked to save a persisted resource which doesn't already exist. #837

Comments

tpendragon commented Aug 10, 2020

dgcliff commented Aug 10, 2020

no-reply commented Aug 11, 2020

tpendragon commented Aug 11, 2020

hackartisan commented Oct 23, 2020 • edited Loading

tpendragon commented Oct 29, 2020 • edited Loading

tpendragon commented Oct 29, 2020

hackartisan commented Nov 2, 2020

hackartisan commented Oct 23, 2020 •

edited

Loading

tpendragon commented Oct 29, 2020 •

edited

Loading