-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Automatic export is triggered multiple times per process #6140
Comments
I think this behavior is correct but misleading :-( The first shown task in the task manager is the task from the workflow. The second task in the task manager is the executed action from the workflow task. As both have using the same words it is confusing. edit: if you have a process with a lot of media data (50gb or more, depending on how fast a file copy is): you will see that on execution of the first task no export is done / no data get copied. Only on execution of the second task the data get exported / data get copied. Look even on the time stamp information on the exported data. |
Did you execute tasks in taskmanager parallel (config An other reason could be on exporting year issues of newspapers with all their daily issues at once. We discovered in this case a crazy, time and resource consuming behavior until |
Normal export by selecting issues an choosing export in the actions on the left is ok and working. Just finishing a whole year at once or even a month or less with "set status of a process up" is causing this. It even starts to export allready finished processes. |
We have the same discovering but without "set status of a process up" (as we did not use this on our workflow as it could break more things than it solve things after our experience). I think that this usage scenario is really not full covered by the current code. I digged into this and the code in the Finding a better and more robust way to export all kinds of processes and including of all kind of hierarchy levels is a complicated thing. |
I think you are right. There are two threads started, but the first probably only acts as a wrapper thread which triggers the actual exporting thread. I suppose we do not need the wrapper thread and can directly trigger the export thread (#6141) but i suppose this will not adress the large problems with the export as discussed here. |
One more note: the second thread is only created and started if the automatic export configuration is set to true. If this is not the case the export will immediately started. So this configuration option has influence of exporting. |
No hard evidence yet, but setting "ASYNCHRONOUS_AUTOMATIC_EXPORT" to false seem to bring some improvements on systems which try to combine multiple "set status of a process up" with autoRunLimit greater than 1.
So maybe it is worth investigating whether the instantiation of a thread, which does not much more than triggering another thread, might lead to problems. |
Using "set status of a process" up is may the cause. It is working different on different configured tasks.
This different behavior of this action decided us to not use this action as a regular user action and even using this as an administrative action is dangerous act as you must know which task will work in which way on using this action and in some situations (exporting is such an action) we did not use it. |
Describe the bug
When a lot of automatic exports are triggered for multiple processes (e.g. after multiple automatic processing steps), it seems that the export for the processes is sometimes triggered two times and the export of the same process appears two times in the task manager.
I tried to trace the behaviour and it seems as if the automatic export is triggered two times, when the previous task is closed and the next task is started and executed automatically. Maybe i am reading the code wrong, but the current implementation seems wrong.
When a task is closed
activateTasksForClosedTask
is called, which will activate the following tasks.kitodo-production/Kitodo/src/main/java/org/kitodo/production/services/workflow/WorkflowControllerService.java
Line 437 in ee97b68
The code inside
activateTask
is then callingprocessAutomaticTask
kitodo-production/Kitodo/src/main/java/org/kitodo/production/services/workflow/WorkflowControllerService.java
Line 614 in ee97b68
which will add all automatic tasks to the list
automaticTasks
kitodo-production/Kitodo/src/main/java/org/kitodo/production/services/workflow/WorkflowControllerService.java
Line 678 in ee97b68
For every task on the list Kitodo will start a
TaskScriptThread
kitodo-production/Kitodo/src/main/java/org/kitodo/production/services/workflow/WorkflowControllerService.java
Lines 464 to 467 in ee97b68
When this
TaskScriptThread
is executed it will get evaluated for the type of task, in the case of the export this will lead to the execution ofexecuteDmsExport
(taskService.executeDmsExport(this.task);
)kitodo-production/Kitodo/src/main/java/org/kitodo/production/thread/TaskScriptThread.java
Line 76 in ee97b68
This is were the problems might begin, because that method will start another
TaskScriptThread
kitodo-production/Kitodo/src/main/java/org/kitodo/export/ExportDms.java
Lines 133 to 138 in ee97b68
In my understanding we now have two threads which will end up doing the exact same thing: Exporting the process.
This is different from
generateImages
which seems to get executed in the already existing thread.kitodo-production/Kitodo/src/main/java/org/kitodo/production/services/data/TaskService.java
Line 598 in ee97b68
We probably should handle the export similar to the
generateImages
and not start a Thread, when there is already a thread for the export.Expected behavior
The export should only be triggered one time per process.
Release
3.6, Master
The text was updated successfully, but these errors were encountered: