Commit

remove debug statements
Signed-off-by: Praateek <[email protected]>
praateekmahajan committed Oct 30, 2024
1 parent a921c6e commit 143b114
Showing 1 changed file with 0 additions and 5 deletions.
5 changes: 0 additions & 5 deletions nemo_curator/modules/fuzzy_dedup.py
@@ -1141,11 +1141,6 @@ def _batched_merge_and_write(
text_part_offset:end_text_offset
]

- print("==== HERE ====")
- print(subset_text_df.compute().to_pandas().to_dict("records"))
- print(subset_bucket_df.compute().to_pandas().to_dict("records"))
- print("==== HERE OVER ====")

try:
# NOTE: If we have more text-df partitions than bucket-map
# partitions, we are more likely to see an OverflowError