Skip to content

Commit

Permalink
Tune data packaging / upload in Hugging Face
Browse files Browse the repository at this point in the history
  • Loading branch information
Jeronymous committed Oct 15, 2024
1 parent 5f91690 commit 9c11c81
Show file tree
Hide file tree
Showing 10 changed files with 465 additions and 225 deletions.
Empty file.
49 changes: 49 additions & 0 deletions assets/hugging_face/README_dataset_header.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,49 @@
pretty_name: Lucie Training Dataset
license: cc-by-nc-sa-4.0
language:
- en
- fr
- de
- es
- it
- code
multilinguality:
- multilingual
task_categories:
- text-generation
- text2text-generation
task_ids:
- language-modeling
tags:
- text-generation
- conditional-text-generation
viewer: true
configs:
- config_name: default
data_files:
- split: train
path: data/*/*/*parquet
- config_name: en
data_files:
- split: train
path: data/*/en/*parquet
- config_name: fr
data_files:
- split: train
path: data/*/fr/*parquet
- config_name: de
data_files:
- split: train
path: data/*/de/*parquet
- config_name: es
data_files:
- split: train
path: data/*/es/*parquet
- config_name: it
data_files:
- split: train
path: data/*/it/*parquet
- config_name: code
data_files:
- split: train
path: data/*/code/*parquet
File renamed without changes.
File renamed without changes.
Loading

0 comments on commit 9c11c81

Please sign in to comment.