Relate restricted runs to CatalogQuery objects (ENT-9405) #947

pwnage101 · 2024-09-19T16:52:44Z

Two main parts of this:

Pre-fetch restricted runs from discovery as needed. This is required because they might not yet be imported into the database at the time we are processing a CatalogQuery which allows them, but we need them imported at that point in time so that we can relate them to the CatalogQuery.
Actually create a new M2M relationship between ContentMetadata and CatalogQuery, and relate restricted runs to queries as defined in the content filter (original goal of this ticket).

ENT-9405

iloveagent57 · 2024-09-24T13:08:57Z

enterprise_catalog/apps/catalog/models.py

+    catalog_queries_for_restricted_course_run = models.ManyToManyField(
+        CatalogQuery,
+        related_name='restricted_course_runs',
+    )


Can we create an explicit M2M model to represent this and use through?

iloveagent57 · 2024-09-24T13:18:25Z

enterprise_catalog/apps/catalog/models.py

+    # At this point, we know the `unrestricted_metadata` partition contains
+    # courses and runs that should be related to the CatalogQuery as restricted
+    # content, but it's still only a subset of restricted content that needs to
+    # be related. What's missing are any restricted runs that didn't match the
+    # original content filter, but whose parent (course) key DID match.
+    restricted_metadata_content_keys = [get_content_key(entry) for entry in restricted_metadata]
+    LOGGER.info(
+        'Retrieved %d restricted content items (%d unique) from course-discovery for catalog query %s',
+        len(restricted_metadata_content_keys),
+        len(set(restricted_metadata_content_keys)),
+        catalog_query,
+    )
+    resticted_run_metadata = get_restricted_runs_from_discovery(
+        # Intentionally pass the un-partitioned metadata list so that we can
+        # discover all restricted runs matching any course that matches the
+        # content_filter, restricted-only or not.
+        metadata,
+        catalog_query,
+        dry_run,
+    )
+    associated_restricted_run_keys = associate_restricted_runs_with_query(
+        # Union the following metadata lists:
+        # * `restricted_metadata`:
+        #     Includes any restricted-only courses, or restricted runs directly
+        #     matching the content filter, BUT not necessarily all restricted
+        #     runs beneath matching courses.
+        # * `restricted_run_metadata`:
+        #     Includes all restricted runs part of any course matching the
+        #     content filter, BUT no restricted-only courses.
+        restricted_metadata | resticted_run_metadata,
+        catalog_query,
+        dry_run
+    )
+    LOGGER.info(
+        'Associated %d restricted runs (%d unique) with catalog query %s',
+        len(associated_restricted_run_keys),
+        len(set(associated_restricted_run_keys)),
+        catalog_query,
+    )


Maybe throw this whole chunk into a new function?

iloveagent57 · 2024-09-24T13:36:37Z

enterprise_catalog/apps/catalog/models.py

+    # Partition metadata dicts by unrestricted vs. restricted.
+    restricted_metadata = []
+    unrestricted_metadata = []
+    for metadata_dict in metadata:
+        if is_content_restricted(metadata_dict):
+            restricted_metadata += metadata_dict
+        else:
+            unrestricted_metadata += metadata_dict


Is it necessary to mix the restricted run fetching in with the unrestricted fetching? Since we're going to make a second set of requests to discovery anyway, can we do as just two completely separate chunks of work?

Request the content filter without requesting restricted runs.

Request only the restricted runs defined in the content filter.

pwnage101 added 2 commits September 18, 2024 15:56

TODO

c01a109

ENT-9405

squash

7b8da9f

pwnage101 changed the title ~~Pwnage101/ent 9405~~ Relate restricted runs to CatalogQuery objects (ENT-9405) Sep 19, 2024

pwnage101 mentioned this pull request Sep 19, 2024

feat: fetch and filter out restricted course runs #939

Closed

2 tasks

pwnage101 added 7 commits September 19, 2024 16:35

squash

a04b877

squash

ddac2a0

squash

23cd813

squash

c9b1a4a

squash

ab9fae8

squash

eb9ed02

squash

aa1f93d

iloveagent57 reviewed Sep 24, 2024

View reviewed changes

iloveagent57 mentioned this pull request Oct 8, 2024

refactor: refactor get_metadata_by_query() to allow extra params #965

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Relate restricted runs to CatalogQuery objects (ENT-9405) #947

Relate restricted runs to CatalogQuery objects (ENT-9405) #947

pwnage101 commented Sep 19, 2024 •

edited

Loading

iloveagent57 Sep 24, 2024

iloveagent57 Sep 24, 2024

iloveagent57 Sep 24, 2024

Relate restricted runs to CatalogQuery objects (ENT-9405) #947

Are you sure you want to change the base?

Relate restricted runs to CatalogQuery objects (ENT-9405) #947

Conversation

pwnage101 commented Sep 19, 2024 • edited Loading

iloveagent57 Sep 24, 2024

Choose a reason for hiding this comment

iloveagent57 Sep 24, 2024

Choose a reason for hiding this comment

iloveagent57 Sep 24, 2024

Choose a reason for hiding this comment

pwnage101 commented Sep 19, 2024 •

edited

Loading