Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiple Labels #105

Open
funkyvoong opened this issue May 30, 2023 · 2 comments
Open

Multiple Labels #105

funkyvoong opened this issue May 30, 2023 · 2 comments

Comments

@funkyvoong
Copy link
Contributor

  • examine plant specimen page structure
  • multiple labels (original label, update label - with newer name and different collector)
@kabilanmohanraj
Copy link
Collaborator

kabilanmohanraj commented Jul 25, 2023

Updates (24th July 2023):

  1. Label segmentation task:
    1.1 Reading about docAI in general, and LayoutLM model versions and their implementations.
    1.2 Working to label images to fine-tune the LayoutLM model.

@kabilanmohanraj
Copy link
Collaborator

kabilanmohanraj commented Aug 15, 2023

Updates (15th August):

  1. Integrated the DETR model into the original CRAFT-TrOCR-TaxonNERD pipeline. Now, DETR-CRAFT-TrOCR-TaxonNERD. Masked the non-label regions instead of cropping the labels.
  2. Resolved Python module dependency issues mentioned during last week's meeting; all models work coherently in one environment (will commit the conda export file).
  3. Upgraded DETR inference pipeline to use DETRImageProcessor instead of DETRFeatureExtractor (this upgrade is necessary as DETRFeatureExtractor is to be removed from transformers==5.0).
  4. Evaluating the performance of the pipeline with and without DETR (for label extraction).
    4.1 Having issues with the TaxonNERD step with label extraction => some outputs from this step are shifted one or two indices causing the accuracy to plummet to 4% (72% previously). This is caused by images where no labels are extracted, returning fully masked images (a very subtle issue with the loop construct).

Pending:
1. Retrain the DETR model with more labeled data. I have labeled only a few images this week.
2. ReadMe for DETR and Classification model.
3. Yet to push this week's work to the repo. The codebase has not been cleaned, as I am still debugging the pipeline. Please feel free to take a look at the up to date codebase on SCC.

4. Rank labels based on the year in each of them. (Not as straight forward as I thought, will leave notes on attempts)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants