SNOW-1887901: fix partitioning logic #2944

Open · wants to merge 3 commits into base: dev/data-source
Conversation

@sfc-gh-yuwang (Collaborator) commented Jan 27, 2025

  1. Which Jira issue is this PR addressing? Make sure that there is an accompanying issue to your PR.

    Fixes SNOW-1887901

  2. Fill out the following pre-review checklist:

    • I am adding a new automated test(s) to verify correctness of my new code
      • If this test skips Local Testing mode, I'm requesting review from @snowflakedb/local-testing
    • I am adding new logging messages
    • I am adding a new telemetry message
    • I am adding new credentials
    • I am adding a new dependency
    • If this is a new feature/behavior, I'm adding the Local Testing parity changes.
    • I acknowledge that I have ensured my changes to be thread-safe. Follow the link for more information: Thread-safe Developer Guidelines
  3. Please describe how your code solves the related issue.

    Fixes the partitioning logic so that it now generates the same partitions as Spark does.
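For context, here is a minimal sketch of Spark-style partition predicate generation, assuming integer bounds only; the function name and signature are illustrative and are not the PR's actual implementation.

```python
# Minimal sketch of Spark-style partitioning: compute a stride from the bounds
# and emit one WHERE predicate per partition. Illustrative only, integer bounds.
def generate_partitions(column: str, lower: int, upper: int, num_partitions: int) -> list:
    stride = (upper - lower) // num_partitions  # width of each partition
    predicates = []
    current = lower
    for i in range(num_partitions):
        l_bound = f"{column} >= {current}" if i != 0 else ""
        current += stride
        u_bound = f"{column} < {current}" if i != num_partitions - 1 else ""
        if u_bound == "":
            where = l_bound                           # last partition: open-ended above
        elif l_bound == "":
            where = f"{u_bound} OR {column} IS NULL"  # first partition also takes NULLs
        else:
            where = f"{l_bound} AND {u_bound}"
        predicates.append(where)
    return predicates

# generate_partitions("id", 0, 100, 4) ->
#   ['id < 25 OR id IS NULL', 'id >= 25 AND id < 50',
#    'id >= 50 AND id < 75', 'id >= 75']
```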

@sfc-gh-yuwang sfc-gh-yuwang marked this pull request as ready for review January 27, 2025 23:04
@sfc-gh-yuwang sfc-gh-yuwang requested review from a team as code owners January 27, 2025 23:04
@sfc-gh-yuwang sfc-gh-yuwang requested review from sfc-gh-jdu and sfc-gh-jrose and removed request for a team January 27, 2025 23:04
    if column_type != int
    else int(processed_lower_bound + i * stride)
)
l_bound = (
    f"{column} >= {self._to_external_value(current_value, column_type)}"
@sfc-gh-aling (Contributor) commented Jan 29, 2025


In Spark:

partitionColumn must be a numeric, date, or timestamp column

Do we handle all three types here? Also, could you check the PySpark behavior when a column of an unsupported type is passed?
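One hedged way to probe the PySpark behavior asked about here; the JDBC URL, table, and column names below are placeholders, and the exact error Spark raises is deliberately not asserted:

```python
# Hypothetical probe: pass a string column as partitionColumn to Spark's JDBC reader.
# Spark documents that partitionColumn must be numeric, date, or timestamp, so this
# is expected to fail during analysis; the exact exception type/message may vary.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

try:
    df = spark.read.jdbc(
        url="jdbc:postgresql://localhost:5432/db",  # placeholder connection
        table="my_table",                           # placeholder table
        column="varchar_col",                       # string column: unsupported type
        lowerBound="a",
        upperBound="z",
        numPartitions=4,
        properties={"user": "user", "password": "pw"},
    )
except Exception as e:
    print(type(e).__name__, e)
```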

@sfc-gh-yuwang (Collaborator, Author) commented:
It turns out I had already written the code to support all of these types, so I just added a test here.

@@ -8,6 +8,8 @@
from unittest.mock import MagicMock
import pytest

from snowflake.snowpark.types import IntegerType
Contributor commented:
Let's add tests for a couple more types, like float, decimal, datetime, and date.
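A hedged sketch of what such a parametrized test could look like; generate_partitions is an illustrative stand-in for the partitioner under review (unlike the integer-only sketch earlier, it is assumed here to accept numeric, decimal, date, and datetime bounds):

```python
# Illustrative parametrized boundary test; the partitioner's real name, signature,
# and module are assumptions, not the repository's actual API.
import datetime
import decimal

import pytest


@pytest.mark.parametrize(
    "lower, upper",
    [
        (0, 100),                                                           # int
        (0.0, 100.0),                                                       # float
        (decimal.Decimal("0"), decimal.Decimal("100")),                     # decimal
        (datetime.date(2020, 1, 1), datetime.date(2020, 12, 31)),           # date
        (datetime.datetime(2020, 1, 1), datetime.datetime(2020, 12, 31)),   # datetime
    ],
)
def test_partition_boundaries(lower, upper):
    predicates = generate_partitions("COL", lower, upper, num_partitions=4)
    assert len(predicates) == 4
    assert "COL >=" not in predicates[0]   # first partition is open-ended below
    assert "COL <" not in predicates[-1]   # last partition is open-ended above
```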

Contributor commented:
Also add a test for an unsupported type.
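And a hedged sketch of an unsupported-type test; the error type is an assumption about the real partitioner, mirroring Spark's rule that the partition column must be numeric, date, or timestamp:

```python
# Illustrative unsupported-type test; ValueError is assumed here, and the real
# partitioner may raise a different error for a string partition column.
import pytest


def test_partition_column_unsupported_type():
    with pytest.raises(ValueError):
        generate_partitions("STR_COL", "a", "z", num_partitions=4)
```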
