Improve performance of DataCollectionFragment #3008

shobhitagarwal1612 · 2025-01-16T03:52:26Z

In previous PRs, we created TaskDataHandler to maintain state of TaskData and TaskSequenceHandler for generating task sequence. TaskSequenceHandler not only is a wrapper for operations related to task sequence but also caches the sequence internally for better performance.

In this PR, we aim to replace custom creation of task sequence in DataCollectionViewModel using TaskSequenceHandler. This would not only abstract out all task sequence related operations but also greatly improve the overall performance and jankiness of the data collection process. Currently, we re-compute the task sequence upto 5-7 times during each screen. This number increases with each screen as the stored values also keeps increasing. Task sequence computation is a heavy operation as it requires filtering tasks based on previously selected values.

This PR shouldn't change the behavior of the app other than improving the performance.

Ran the app locally and submitted multiple jobs to ensure no regressions are introduced. Also, added extensive unit tests for each helper class with core business logic for test coverage.

@gino-m @sufyanAbbasi PTAL?

codecov · 2025-01-18T07:22:26Z

Codecov Report

Attention: Patch coverage is 87.87879% with 4 lines in your changes missing coverage. Please review.

Project coverage is 63.46%. Comparing base (a0975c3) to head (5b5ef4c).

Files with missing lines	Patch %	Lines
...round/ui/datacollection/DataCollectionViewModel.kt	86.66%	1 Missing and 3 partials ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##             master    #3008      +/-   ##
============================================
- Coverage     63.55%   63.46%   -0.09%     
+ Complexity     1267     1258       -9     
============================================
  Files           270      270              
  Lines          6560     6539      -21     
  Branches        927      924       -3     
============================================
- Hits           4169     4150      -19     
+ Misses         1786     1783       -3     
- Partials        605      606       +1

Files with missing lines	Coverage Δ
...ground/ui/datacollection/DataCollectionFragment.kt	`38.54% <ø> (-0.64%)`	⬇️
...ndroid/ground/ui/datacollection/TaskDataHandler.kt	`93.75% <100.00%> (+0.41%)`	⬆️
...id/ground/ui/datacollection/TaskSequenceHandler.kt	`100.00% <100.00%> (ø)`
...round/ui/datacollection/DataCollectionViewModel.kt	`77.01% <86.66%> (-1.79%)`	⬇️

…wModel

This allows the method to be triggered in unit tests as well by default which is not happening at the moment.

…empty string)

…atest

sufyanAbbasi

Thanks so much! The new class is incredibly clean and the helper methods improve the readability greatly! I just had a couple of nits and question about implementation, but otherwise LGTM!

I wanted to call out that Sequences are designed to be lazily evaluated to reduce the overhead of regenerating the list every time, but perhaps I didn't implement it right the first time because I was reprocessing the Tasks list every time 🤦‍♂️:
https://kotlinlang.org/docs/sequences.html#sequence

Therefore, I think that the extra complexity in TaskSequenceHandler that initializes and refreshes the sequence can be made slightly simpler without losing any performance.

Instead, consider caching ONLY the Sequence<TaskSelection> which correlates to the user's selection, and we can still pass in the theoretical TaskSelection when computing the test sequence.

So the method, getTaskSequence(Sequence<TaskSelection>?) (which uses the cached sequence if unspecified, otherwise the test one), will return a new Sequence<Task> every time, but this will always be an O(1) operation (IIUC from reading the Kotlin documentation).

I guess this is just an implementation detail, but my intention is to reduce some cognitive overhead by focusing on the independent variable here, Sequence<TaskSelection> rather than the thing we dynamically generate (the task sequence), and some of the methods can be greatly simplified.

Anyway, just food for thought! LGTM!

sufyanAbbasi · 2025-01-23T00:40:57Z

app/src/main/java/com/google/android/ground/ui/datacollection/DataCollectionViewModel.kt

-    condition: Condition,
-    taskValueOverride: Pair<String, TaskData?>? = null,
-  ): Boolean = condition.fulfilledBy(taskDataHandler.getTaskSelections(taskValueOverride))
+    // Don't call `refreshTaskSequence` as the new value hasn't been set yet.


nit: Consider adding a new method to TaskSequenceHandler called testSequence(taskDataHandler, taskValueOverride) which encapsulates this logic. I feel like if a developer can accidentally mess it up by using the wrong method, then it's better to make it a method with proper documentation.

sufyanAbbasi · 2025-01-23T00:54:08Z

app/src/main/java/com/google/android/ground/ui/datacollection/DataCollectionViewModel.kt

-  /** Returns true if the given [taskId] is last task in sequence. */
-  fun isLastPosition(taskId: String): Boolean = taskId == getTaskSequence().last().id
+  fun isLastPositionWithValue(taskId: String, newValue: TaskData?): Boolean {
+    if (taskDataHandler.getData(taskId) == newValue) {


I just wanted to make sure I understand the logic here, so I'll walk through an example. So say I'm on the "last" task with a select of three options: A, B, and C, with two more conditional tasks ahead of it, Conditional Task A, and Conditional Task B (C will show "Done").

The user changes the selection to A, so setOnValueChanged on the button is called and this function is invoked, so Condition A is now the last task. They click next, we refresh the sequence, and save it to the task's data. Then the user hits back.

So now the user selects B, so because this case is not true, we return false from the next part, or if they select C, the next part returns true.

Now when we select A again, we fall into this if case and we reuse the last sequence generated, so isLast() will always return false?

If so, maybe we can change the comment to Reuse the existing task sequence if the value has already been saved (i.e. after pressing "Next" and going back).

shobhitagarwal1612 mentioned this pull request Jan 16, 2025

[WIP] Integrate TaskSequenceHandler with DataCollectionViewModel #2986

Closed

shobhitagarwal1612 changed the title ~~Improve performance of data collection by using TaskSequenceHandler in DataCollectionViewModel~~ Improve performance by using TaskSequenceHandler in DataCollectionViewModel Jan 16, 2025

shobhitagarwal1612 changed the title ~~Improve performance by using TaskSequenceHandler in DataCollectionViewModel~~ Improve performance of DataCollectionFragment Jan 16, 2025

shobhitagarwal1612 added 8 commits January 18, 2025 16:36

Generate task sequence using TaskSequenceHandler in DataCollectionVie…

321e2e6

…wModel

Inline isLastPositionWithValue in AbstractTaskFragment

8c8ce5e

Introduce a new method to refresh the task sequence cache

28ffaa9

Add unit tests for refreshTaskSequence method

c6285da

Replace init() method with native init {} block

b304024

This allows the method to be triggered in unit tests as well by default which is not happening at the moment.

Initialize the current task id value if missing instead of using "" (…

fbb8f64

…empty string)

Update ktdoc

d5671f1

Automatically added GitHub issue links to TODOs

d55ec01

shobhitagarwal1612 force-pushed the ashobhit/2958/integrate-sequence-handler-latest branch from 75335d5 to d55ec01 Compare January 18, 2025 11:06

Remove unused var submissionId

90a4693

shobhitagarwal1612 requested review from gino-m and sufyanAbbasi and removed request for gino-m January 18, 2025 11:12

shobhitagarwal1612 marked this pull request as ready for review January 18, 2025 11:12

auto-assign bot requested a review from scolsen January 18, 2025 11:12

Merge branch 'master' into ashobhit/2958/integrate-sequence-handler-l…

5b5ef4c

…atest

sufyanAbbasi approved these changes Jan 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of DataCollectionFragment #3008

Improve performance of DataCollectionFragment #3008

shobhitagarwal1612 commented Jan 16, 2025 •

edited

Loading

codecov bot commented Jan 18, 2025 •

edited

Loading

sufyanAbbasi left a comment

sufyanAbbasi Jan 23, 2025

sufyanAbbasi Jan 23, 2025

Improve performance of DataCollectionFragment #3008

Are you sure you want to change the base?

Improve performance of DataCollectionFragment #3008

Conversation

shobhitagarwal1612 commented Jan 16, 2025 • edited Loading

codecov bot commented Jan 18, 2025 • edited Loading

Codecov Report

sufyanAbbasi left a comment

Choose a reason for hiding this comment

sufyanAbbasi Jan 23, 2025

Choose a reason for hiding this comment

sufyanAbbasi Jan 23, 2025

Choose a reason for hiding this comment

shobhitagarwal1612 commented Jan 16, 2025 •

edited

Loading

codecov bot commented Jan 18, 2025 •

edited

Loading