A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages
language = [
"amh", "ewe", "hau", "ibo", "kin",
"lin", "lug", "orm", "sna", "sot",
"swa", "twi", "wol", "xho", "yor", "zul"
]
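For readers unfamiliar with the three-letter codes, the sketch below maps each one to a language name. The names are the standard ISO 639 reference names and are not taken from the dataset itself; `LANGUAGE_NAMES` is a hypothetical helper introduced only for illustration.

```python
# The 16 language codes exactly as listed above.
language = [
    "amh", "ewe", "hau", "ibo", "kin",
    "lin", "lug", "orm", "sna", "sot",
    "swa", "twi", "wol", "xho", "yor", "zul",
]

# Hypothetical lookup table: ISO 639 code -> common English language name.
LANGUAGE_NAMES = {
    "amh": "Amharic",
    "ewe": "Ewe",
    "hau": "Hausa",
    "ibo": "Igbo",
    "kin": "Kinyarwanda",
    "lin": "Lingala",
    "lug": "Luganda",
    "orm": "Oromo",
    "sna": "Shona",
    "sot": "Sesotho",
    "swa": "Swahili",
    "twi": "Twi",
    "wol": "Wolof",
    "xho": "Xhosa",
    "yor": "Yoruba",
    "zul": "Zulu",
}

# Print each code next to its name, in the order used by the list above.
for code in language:
    print(f"{code}: {LANGUAGE_NAMES[code]}")
```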
The Injongo dataset is available at Masakhane-NLU: Conversation AI and Benchmark datasets for African languages
Item Example:
split,domain,intent,text,spans,logical_form
test,banking,balance,በ አባይ ባንክ አካውንት ለሶፋ የሚሆን ገንዘብ አለኝ,"2:9:SL:BANK_NAME,17:19:SL:SHOPPING_ITEM",[IN:balance [SL:BANK_NAME አባይ ባንክ] [SL:SHOPPING_ITEM ሶፋ] ]
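A minimal sketch of how one such row could be read. It assumes that each entry in `spans` has the form `start:end:label`, with character offsets into `text` and an exclusive end, which is consistent with the example row above but is not stated by the dataset authors. `parse_spans` is a hypothetical helper, not part of the repository.

```python
import csv
import io

# The example row from above, with its CSV header.
SAMPLE = (
    "split,domain,intent,text,spans,logical_form\n"
    "test,banking,balance,በ አባይ ባንክ አካውንት ለሶፋ የሚሆን ገንዘብ አለኝ,"
    "\"2:9:SL:BANK_NAME,17:19:SL:SHOPPING_ITEM\","
    "[IN:balance [SL:BANK_NAME አባይ ባንክ] [SL:SHOPPING_ITEM ሶፋ] ]\n"
)


def parse_spans(text, spans):
    """Return (label, surface_text) pairs for a spans string.

    Assumption: each item is "start:end:label" with character offsets
    into `text` and an exclusive end index.
    """
    if not spans:
        return []
    slots = []
    for item in spans.split(","):
        # Split only twice so the label keeps its own colon ("SL:BANK_NAME").
        start, end, label = item.split(":", 2)
        slots.append((label, text[int(start):int(end)]))
    return slots


row = next(csv.DictReader(io.StringIO(SAMPLE)))
slots = parse_spans(row["text"], row["spans"])

print(row["intent"])
for label, value in slots:
    print(label, "->", value)
```

Under this assumption the extracted slot values match the bracketed slots in `logical_form`, which gives a quick sanity check when loading other rows.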
pip install -e .
Additional Dependencies:
- vllm: https://docs.vllm.ai/en/latest/getting_started/installation/gpu/index.html
- tgi: https://huggingface.co/docs/text-generation-inference/installation_nvidia
More details of the code can be explored in the numbered Jupyter notebooks (*.ipynb).
OPENAI_API_KEY=sk-proj-
GEMINI_API_KEY=ABCD
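A minimal sketch of checking that both keys are set before running anything that calls the APIs. It assumes the two lines above live in a `KEY=VALUE` file such as `.env`, which the source does not state. `parse_env` and `missing_keys` are hypothetical helpers, not part of the repository.

```python
# Keys named in the configuration lines above.
REQUIRED_KEYS = ("OPENAI_API_KEY", "GEMINI_API_KEY")


def parse_env(text):
    """Parse simple KEY=VALUE lines, skipping blanks and '#' comments."""
    env = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        env[key.strip()] = value.strip()
    return env


def missing_keys(env):
    """Return the required keys that are absent or empty in `env`."""
    return [key for key in REQUIRED_KEYS if not env.get(key)]


# Placeholder values copied from the lines above, not real credentials.
example = "OPENAI_API_KEY=sk-proj-\nGEMINI_API_KEY=ABCD\n"
env = parse_env(example)
print("missing:", missing_keys(env))
```

To check a real file, the contents can be read with `open(path, encoding="utf-8").read()` and passed to `parse_env`.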