Multiple Labels #105

funkyvoong · 2023-05-30T20:41:20Z

examine plant specimen page structure
multiple labels (original label, update label - with newer name and different collector)

kabilanmohanraj · 2023-07-25T08:51:41Z

Updates (24th July 2023):

Label segmentation task:
1.1 Reading about docAI in general, and LayoutLM model versions and their implementations.
1.2 Working to label images to fine-tune the LayoutLM model.

kabilanmohanraj · 2023-08-15T12:53:11Z

Updates (15th August):

Integrated the DETR model into the original CRAFT-TrOCR-TaxonNERD pipeline. Now, DETR-CRAFT-TrOCR-TaxonNERD. Masked the non-label regions instead of cropping the labels.
Resolved Python module dependency issues mentioned during last week's meeting; all models work coherently in one environment (will commit the conda export file).
Upgraded DETR inference pipeline to use DETRImageProcessor instead of DETRFeatureExtractor (this upgrade is necessary as DETRFeatureExtractor is to be removed from transformers==5.0).
Evaluating the performance of the pipeline with and without DETR (for label extraction).
4.1 Having issues with the TaxonNERD step with label extraction => some outputs from this step are shifted one or two indices causing the accuracy to plummet to 4% (72% previously). This is caused by images where no labels are extracted, returning fully masked images (a very subtle issue with the loop construct).

Pending:
1. Retrain the DETR model with more labeled data. I have labeled only a few images this week.
2. ReadMe for DETR and Classification model.
3. Yet to push this week's work to the repo. The codebase has not been cleaned, as I am still debugging the pipeline. Please feel free to take a look at the up to date codebase on SCC.
4. Rank labels based on the year in each of them. (Not as straight forward as I thought, will leave notes on attempts)

funkyvoong added the priority 2 label Jun 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multiple Labels #105

Multiple Labels #105

funkyvoong commented May 30, 2023

kabilanmohanraj commented Jul 25, 2023 •

edited

Loading

kabilanmohanraj commented Aug 15, 2023 •

edited

Loading

Multiple Labels #105

Multiple Labels #105

Comments

funkyvoong commented May 30, 2023

kabilanmohanraj commented Jul 25, 2023 • edited Loading

kabilanmohanraj commented Aug 15, 2023 • edited Loading

kabilanmohanraj commented Jul 25, 2023 •

edited

Loading

kabilanmohanraj commented Aug 15, 2023 •

edited

Loading