Commit

Update docs/user-guide/gpudeduplication.rst
Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: Praateek Mahajan <[email protected]>
praateekmahajan and sarahyurick authored Feb 6, 2025
1 parent cba7fcd commit bcb7cea
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/user-guide/gpudeduplication.rst
@@ -197,7 +197,7 @@ Python API
cache_dir="/path/to/dedup_outputs", # must be cleared between runs
id_field="my_id",
text_field="text",
- perform_removal=False, # dictates if deduplicated dataset or duplicates are returned
+ perform_removal=False, # dictates if deduplicated dataset or IDs of duplicates are returned
seed=42,
char_ngrams=24,
num_buckets=20,
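
The reworded comment distinguishes two possible return values: the deduplicated dataset, or only the IDs of the documents flagged as duplicates. A minimal plain-Python sketch of that distinction follows. It is not the library's implementation: `resolve_duplicates`, `duplicate_groups`, and the sample records are hypothetical names introduced for illustration, and the mapping of `perform_removal=True` to "return the deduplicated dataset" is an assumption read off the comment's wording.

```python
def resolve_duplicates(records, duplicate_groups, id_field="my_id", perform_removal=False):
    """Illustrative only: mimic the two return modes described in the comment.

    records          -- list of dicts, each carrying an ID under `id_field`
    duplicate_groups -- hypothetical input: lists of IDs judged to be duplicates
    """
    # Keep the first ID of every group; every other member counts as a duplicate.
    ids_to_remove = set()
    for group in duplicate_groups:
        ids_to_remove.update(group[1:])

    if perform_removal:
        # Assumed behaviour: return the dataset with duplicates dropped.
        return [r for r in records if r[id_field] not in ids_to_remove]

    # Otherwise return only the IDs of the duplicates, leaving the data untouched.
    return sorted(ids_to_remove)


records = [
    {"my_id": 1, "text": "hello world"},
    {"my_id": 2, "text": "hello world!"},
    {"my_id": 3, "text": "something else"},
]
groups = [[1, 2]]  # hypothetical: documents 1 and 2 are near-duplicates

ids_only = resolve_duplicates(records, groups, perform_removal=False)
deduped = resolve_duplicates(records, groups, perform_removal=True)
```

With `perform_removal=False` the caller gets back a list of IDs and still has to filter the dataset; with `perform_removal=True` the filtering has already been applied.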