Skip to content

How to train a model when the label contains a comma? #14588

Answered by GreatV
diegonc asked this question in Q&A
Discussion options

You must be logged in to vote

To address the issue of training a model when the label contains a comma, let’s break it down step by step based on the problem description and related documents:


Problem: Labels with Commas are Split

When running ppocr/utils/gen_label.py, labels like 999,000000 are incorrectly split into 999\t000000 due to the script treating the comma as a delimiter. This happens because many OCR label generation tools use specific delimiters (e.g., commas or tabs) to separate image paths and their corresponding labels in the ground truth file.


Solution: Handling Labels with Commas

To ensure that your labels containing commas are not split incorrectly, you can modify the label generation script (gen_l…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by diegonc
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants