Fix Python regressions in 1.9.0beta #857

benc-db · 2024-11-26T19:07:39Z

Description

Fixing two regressions related to Command submission of python models.

1.) Exceptions were not properly getting propagated to the user
2.) I had accidentally brought back the race condition that the community had already fixed for us.

Checklist

I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have updated the CHANGELOG.md and added information about my change to the "dbt-databricks next" section.

benc-db · 2024-11-26T19:08:33Z

dbt/adapters/databricks/api_client.py

+            if self.status(cluster_id) not in ["RUNNING", "PENDING"]:
+                raise DbtRuntimeError(f"Error starting terminated cluster.\n {response.content!r}")
+            else:
+                logger.debug("Presuming race condition, waiting for cluster to start")


When we query state after error, if it's pending or running it means the start failed due to race condition as another thread got the cluster started.

should it be info rather than debug or debug with status_code?

Anything that is info will show in the normal dbt output, so we are very conservative about what we log with info.
This is normal operation; if they are using multiple python models and the command api they will hit this, so I don't think this is worth bringing to the users' attention if it's running or pending. For whatever reason, the cluster start API errors if you ask to start a cluster that is already in the process of starting. If it's not running or pending, they'll get the full output from raising the error.

benc-db · 2024-11-26T19:09:39Z

dbt/adapters/databricks/api_client.py

-        )
+        ).json()
+
+        if response["results"]["resultType"] == "error":


Lost this in the refactor: for some reason Command exec will give a state of 'Finished' rather then 'Error' some times, and then stuff the error in the results.

what if the result or resultType does not exist

these are published parts of the Databricks REST API. If these don't exist, the entire API becomes untrustworthy.

benc-db · 2024-11-26T19:09:55Z

dbt/adapters/databricks/impl.py

@@ -85,7 +85,7 @@
 SHOW_TABLE_EXTENDED_MACRO_NAME = "show_table_extended"
 SHOW_TABLES_MACRO_NAME = "show_tables"
 SHOW_VIEWS_MACRO_NAME = "show_views"
-GET_COLUMNS_COMMENTS_MACRO_NAME = "get_columns_comments"
+


Noticed this wasn't referenced anywhere when doing my debugging.

benc-db · 2024-11-26T19:10:16Z

dbt/adapters/databricks/python_models/python_submissions.py

@@ -70,6 +71,8 @@ def __init__(

    @override
    def submit(self, compiled_code: str) -> None:
+        logger.debug("Submitting Python model using the Command API.")


Adding debug statement so that we can quickly determine submission method from logs.

benc-db · 2024-11-26T19:10:52Z

dbt/include/databricks/macros/adapters/columns.sql

@@ -0,0 +1,27 @@
+


Moved these to somewhere more sensible (during investigation realized these were in persist_docs, but are sometimes called outside of persisting docs).

alexguo-db

LGTM

fix a couple of python bugs?

9339b22

benc-db requested review from andrefurlan-db and rcypher-databricks as code owners November 26, 2024 19:07

benc-db commented Nov 26, 2024

View reviewed changes

benc-db requested review from jackyhu-db and alexguo-db November 26, 2024 19:11

fixed unit tests

07c9fcd

alexguo-db approved these changes Nov 26, 2024

View reviewed changes

jackyhu-db approved these changes Nov 26, 2024

View reviewed changes

benc-db merged commit 6398033 into main Nov 26, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Python regressions in 1.9.0beta #857

Fix Python regressions in 1.9.0beta #857

benc-db commented Nov 26, 2024

benc-db Nov 26, 2024

jackyhu-db Nov 26, 2024 •

edited

Loading

benc-db Nov 26, 2024

benc-db Nov 26, 2024

jackyhu-db Nov 26, 2024

benc-db Nov 26, 2024

benc-db Nov 26, 2024

benc-db Nov 26, 2024

benc-db Nov 26, 2024

alexguo-db left a comment

Fix Python regressions in 1.9.0beta #857

Fix Python regressions in 1.9.0beta #857

Conversation

benc-db commented Nov 26, 2024

Description

Checklist

Choose a reason for hiding this comment

jackyhu-db Nov 26, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alexguo-db left a comment

Choose a reason for hiding this comment

jackyhu-db Nov 26, 2024 •

edited

Loading