🍎 Apples-to-Apples: Comparing the Performance of Hate Speech Detection Models in Context

Context: Project for CS6471 course at Georgia Tech, Spring 2022.

Authors:

Seema Baddam
Richard Huang
Kai McKeever

Installation phase

Please refer to install.md.

Datasets

Datasets used:

Offensive Language Identification Dataset
Implicit Hate Speech Dataset
Racism is a Virus Dataset

Please refer to datasets.md for more details.

Preprocessing phase

Before attempting the training phase, please use this command to preprocess the data:

### Start preprocessing | Default to all dataset
python -m src.utils.preprocess_utils --dataset_name all

Training phase

Please refer to training.md for more details.

We provide the trained models here. To use them, please put them in the saved-models/ folder.

Cross-domain Evaluation phase

Please refer to evaluation.md for more details.

Interpretation with XAI phase (Word cloud + Distribution plots)

⚠️ DISCLAIMER: This part of the study contains words or language that are considered profane, vulgar, or offensive by some readers. ⚠️

Please refer to interpret.md for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
data		data
docs		docs
figures		figures
gridsearch-results		gridsearch-results
saved-models		saved-models
src		src
stats-results		stats-results
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
gridsearch_config.yml		gridsearch_config.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🍎 Apples-to-Apples: Comparing the Performance of Hate Speech Detection Models in Context

Installation phase

Datasets

Preprocessing phase

Training phase

Cross-domain Evaluation phase

Interpretation with XAI phase (Word cloud + Distribution plots)

About

Releases

Packages

Contributors 5

Languages

License

richouzo/cs6471-project

Folders and files

Latest commit

History

Repository files navigation

🍎 Apples-to-Apples: Comparing the Performance of Hate Speech Detection Models in Context

Installation phase

Datasets

Preprocessing phase

Training phase

Cross-domain Evaluation phase

Interpretation with XAI phase (Word cloud + Distribution plots)

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages