Hate Speech Machine Learning

This project seeks to answer the following questions:

Is it possible to effectively classify posts on online forums as hate speech or not hate speech?
Is it possible to procedurally generate interventionary responses to such instances of online hate speech?

For all the fun details about our process and results, check out our project write-up.

To Run

To run the classifier, run python3 main.py in the directory containing main.py. To see the process we used to test different classification models and hyperparameters, uncomment the block of code involving the performance_tester variable in main.py before running the program.
To run the Textgenrnn response generator, run python3 text_generation.py in the same directory.
To run the Sequence-to-Sequence response generator, run python3 seq2seq.py in the same directory.

Our code works best using Keras version 2.2.4, installed through regular Pip (i.e. not Anaconda).

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
data		data
.gitignore		.gitignore
Hate_Speech_Writeup.pdf		Hate_Speech_Writeup.pdf
README.md		README.md
SVM_performance_tester.py		SVM_performance_tester.py
data_processor.py		data_processor.py
main.py		main.py
seq2seq.py		seq2seq.py
support_vector_machine.py		support_vector_machine.py
text_generation.py		text_generation.py
textgenrnn_weights.hdf5		textgenrnn_weights.hdf5