Chapter 2: Cannot load emotion dataset #151

Open
10 tasks
IlyaMichlin opened this issue Nov 30, 2024 · 3 comments
Comments

@IlyaMichlin

Information

The problem arises in chapter:

  • [x] Introduction
  • Text Classification
  • Transformer Anatomy
  • Multilingual Named Entity Recognition
  • Text Generation
  • Summarization
  • Question Answering
  • Making Transformers Efficient in Production
  • Dealing with Few to No Labels
  • Training Transformers from Scratch
  • Future Directions

Describe the bug

Calling load_dataset("emotion") raises FileNotFoundError: Couldn't find file at https://www.dropbox.com/s/1pzkadrvffbqw6o/train.txt?dl=1.

I then tried the alternative dataset SetFit/emotion and got ValueError: Invalid pattern: '**' can only be an entire path component

The second error is caused by a newer version of fsspec. Pinning it to fsspec==2023.9.2 fixed that error.
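
For anyone checking whether their environment is affected, here is a minimal sketch that compares the installed fsspec version against the pin mentioned above. The helper names (version_tuple, is_newer_than_pin) are made up for illustration, and the comparison is an assumption-laden shortcut: it only handles plain dotted numeric versions and ignores pre-release suffixes.

```python
from importlib import metadata


def version_tuple(version: str) -> tuple:
    """Turn '2023.9.2' into (2023, 9, 2); non-numeric parts are ignored."""
    return tuple(int(part) for part in version.split(".") if part.isdigit())


def is_newer_than_pin(installed: str, pinned: str = "2023.9.2") -> bool:
    """True if the installed version is strictly newer than the pinned one."""
    return version_tuple(installed) > version_tuple(pinned)


try:
    installed = metadata.version("fsspec")
    if is_newer_than_pin(installed):
        print(f"fsspec {installed} is newer than the pin; consider fsspec==2023.9.2")
    else:
        print(f"fsspec {installed} is at or below the pin")
except metadata.PackageNotFoundError:
    print("fsspec is not installed in this environment")
```

If the check reports a newer version, pinning as described above (or upgrading datasets, which may also resolve the incompatibility) is worth trying.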

To Reproduce

Steps to reproduce the behavior:

  1. Create python env from requirements.txt
  2. Try to load emotion dataset

Expected behavior

@evanvolgas

I'm hitting the same problem.

@wwewwt

wwewwt commented Dec 24, 2024

Update the datasets library:
!pip install -U transformers datasets

Then load the dataset with:
emotions = load_dataset("dair-ai/emotion")

@IlyaMichlin @evanvolgas

@VitaminCplus

VitaminCplus commented Jan 20, 2025

I am using Kaggle and had the same error after following the steps above. I resolved it by first clearing my Hugging Face cache.

Try entering:

!rm -rf ~/.cache/huggingface/datasets

and then re-run these lines:

from datasets import load_dataset
emotion = load_dataset("dair-ai/emotion")
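
Since several dataset identifiers come up in this thread, one option is to try them in order and keep the first that loads. The following is only a sketch: load_first_available is a hypothetical helper, not part of the datasets library, and it catches only the two error types reported in this issue (FileNotFoundError and ValueError).

```python
from typing import Callable, Iterable, Optional


def load_first_available(candidates: Iterable[str], loader: Optional[Callable] = None):
    """Return (repo_id, dataset) for the first candidate that loads."""
    if loader is None:
        # Imported lazily so the helper can be exercised without `datasets` installed.
        from datasets import load_dataset as loader

    errors = {}
    for repo_id in candidates:
        try:
            return repo_id, loader(repo_id)
        except (FileNotFoundError, ValueError) as exc:
            # The two error types reported in this thread.
            errors[repo_id] = f"{type(exc).__name__}: {exc}"
    raise RuntimeError(f"No candidate could be loaded: {errors}")


# Example (needs network access and the `datasets` package):
# repo_id, emotions = load_first_available(["dair-ai/emotion", "SetFit/emotion"])
```

The loader argument exists so the fallback logic can be tested with a stand-in function; in normal use it can be left out.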
