Creating data splitters for moabb evaluation #624

Open · wants to merge 29 commits into base: develop

Conversation

brunaafl (Collaborator) commented:

Based on Issue #612 (comment), I've created three data splitters, one for each of the three types of moabb evaluation: WithinSubjectSplitter, CrossSessionSplitter, and CrossSubjectSplitter, defined in splitters.py; two evaluation splits, OfflineSplit and TimeSeriesSplit; and one meta-split, SamplerSplit, defined in the meta_splitters.py file.

For the intra-subject splitters (the Within and CrossSession splitters), I assumed that the data and metadata from all subjects were already known and loaded, which may conflict with the lazy loading used in these cases. Therefore, I based them on 'Individual' versions (IndividualWithin and IndividualCrossSession) that only assume metadata from a specific subject.

I also ended up creating two draft versions (Group and LazyEvaluation, in unified_eval.py) of an evaluation integrating all modalities, with LazyEvaluation trying to load data for the intra-subject evaluations only when needed. However, after looking at #481 and #486, this may not be the best or easiest solution, so I stopped working on it since it may not be that useful.

I'm now working on building the tests and refining and fixing the code.
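
To make the intended interface concrete, here is a minimal sketch of what a within-subject splitter could look like, assuming a scikit-learn-style generator that yields train/test indices from a metadata DataFrame with subject and session columns. The class name follows the PR description, but the signature and internals are illustrative assumptions, not the code in this PR.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold


class WithinSubjectSplitter:
    """Yield k-fold train/test indices computed within each (subject, session)."""

    def __init__(self, n_folds=5):
        self.n_folds = n_folds

    def split(self, y, metadata, **kwargs):
        # metadata is assumed to be a pandas DataFrame with 'subject' and
        # 'session' columns aligned with the label array y.
        for subject in metadata.subject.unique():
            subject_mask = metadata.subject == subject
            for session in metadata.session[subject_mask].unique():
                mask = (subject_mask & (metadata.session == session)).to_numpy()
                indices = np.where(mask)[0]
                cv = StratifiedKFold(n_splits=self.n_folds, shuffle=True, **kwargs)
                for train, test in cv.split(indices, y[indices]):
                    yield indices[train], indices[test]
```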

@bruAristimunha (Collaborator) commented on Jun 10, 2024:


Let's add this file in another PR

@bruAristimunha (Collaborator) left a comment:


More documentation, please.

@PierreGtch (Collaborator) left a comment:


Good job already :)
I left a few comments, but didn't look at unified_eval since you said you dropped it.

(Four resolved review threads on moabb/evaluations/metasplitters.py, now outdated.)
sessions = metadata.session.unique()
subjects = metadata.subject.unique()

if len(runs) > 1:
A collaborator commented:

So if len(runs) > 1, then calib_size is ignored.

Is this the desired behaviour, @brunaafl @bruAristimunha?

If yes, this must be made very clear in the doc.

brunaafl (Collaborator, Author) replied:

It was what I intended, but I don't know if it is the best solution, though.
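
For readers of the thread, a hypothetical sketch of the branch being discussed, under the assumption (not confirmed here) that with several runs the first run serves as calibration data, while with a single run the first calib_size trials are held out; the helper name and fallback logic are purely illustrative.

```python
import numpy as np


def calibration_mask(session_metadata, calib_size):
    # Illustrative assumption of the intent discussed above: with more than
    # one run, the first run is used for calibration and calib_size is
    # ignored; with a single run, the first calib_size trials are held out.
    runs = session_metadata.run.unique()
    if len(runs) > 1:
        return (session_metadata.run == runs[0]).to_numpy()
    mask = np.zeros(len(session_metadata), dtype=bool)
    mask[:calib_size] = True
    return mask
```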

(Additional resolved review threads on moabb/evaluations/metasplitters.py and moabb/evaluations/splitters.py.)
@tomMoral (Collaborator) left a comment:


I only reviewed the WithinSessionSplitter. I think the PR is a little too big for an easy review.
What I propose is that we do a first PR with only this WithinSessionSplitter, fix everything for that one, and then move on to the extra ones. This requires a little bit of copy-paste but should not be too hard to do.

From what I see, I think we need to improve the management of random_state to get reproducible splits. Another thing that would be nice is to support a random order for the splits (random over patients and sessions), but maybe that is more work and can be done after the first PR with the reproducible splitter.

Adding a test on the reproducible order would be nice.


"""

def __init__(self, n_folds=5):
A collaborator commented:

It would be nice to add a shuffle option to yield these splits in a random order (random over patients/sessions and folds).

This would allow subsampling a number of folds with diverse patients/sessions if we don't want to run the full CV procedure.
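
A small sketch of the suggested option, assuming the splitter grows shuffle and random_state arguments; the class name, grouping logic, and attribute names are illustrative rather than the final API.

```python
from sklearn.utils import check_random_state


class WithinSessionSplitter:
    def __init__(self, n_folds=5, shuffle=True, random_state=None):
        self.n_folds = n_folds
        self.shuffle = shuffle
        self.random_state = random_state

    def _ordered_groups(self, metadata):
        # Enumerate (subject, session) pairs, optionally in a reproducible
        # random order, so a subset of folds already covers diverse
        # subjects/sessions without running the full CV procedure.
        groups = list(metadata.groupby(["subject", "session"]).groups.keys())
        if self.shuffle:
            check_random_state(self.random_state).shuffle(groups)
        return groups
```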

assert isinstance(self.n_folds, int)

subjects = metadata.subject.values
cv = StratifiedKFold(n_splits=self.n_folds, shuffle=True, **kwargs)
A collaborator commented:

We need to be able to set a random_state for each of these CVs in order to have reproducible splits.
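
One way this could be addressed, sketched under the assumption that the splitter carries a random_state attribute and seeds each inner CV from a shared generator (the seeding scheme is illustrative):

```python
from sklearn.model_selection import StratifiedKFold
from sklearn.utils import check_random_state

# random_state would come from the splitter (e.g. self.random_state); it is a
# plain variable here so the snippet stands alone.
random_state = 42
rng = check_random_state(random_state)

# Each per-subject/per-session StratifiedKFold gets its own seed drawn from
# the shared generator, so the splits are reproducible for a fixed seed.
cv = StratifiedKFold(
    n_splits=5,
    shuffle=True,
    random_state=rng.randint(2**31 - 1),
)
```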

@bruAristimunha (Collaborator) commented:

I will help you @brunaafl
