wcgan/cs5246project

This README contains instructions to reproduce the results of our CS5246 project: Performing Sentiment Analysis With BERT.

The dataset used for the performance comparison between models is in the folder /data, split into 3 files: rt-polarity.train, rt-polarity.dev, and rt-polarity.test.
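
As a minimal loading sketch, assuming each line holds a tab-separated sentence and label (the exact layout is fixed by the files in /data; adjust the split if yours differs):

# Minimal loader sketch -- assumes "sentence\tlabel" per line.
def load_split(path):
    sentences, labels = [], []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.rstrip("\n")
            if not line:
                continue
            sentence, label = line.rsplit("\t", 1)
            sentences.append(sentence)
            labels.append(label)
    return sentences, labels

train_x, train_y = load_split("data/rt-polarity.train")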


1. To obtain results for the RNN model:

First ensure that you are using a Python 3.6 environment with PyTorch 0.4.1 installed.

Train the model by running:

python rnn.py --train_file data/rt-polarity.train \
  --val_file data/rt-polarity.dev \
  --emb_file_txt [path to GloVe 300d] \
  --output_file RNN_Output/model_file \
  --epochs 10

Then, obtain the testing accuracy with:

python run_rnn.py data/rt-polarity/test.tsv RNN_Output/model_file
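
For orientation, this is roughly the shape of model that rnn.py trains: an embedding layer initialized from GloVe 300d, a recurrent encoder, and a linear classifier. The sketch below is an illustrative assumption, not the repo's exact architecture:

import torch.nn as nn

class RNNClassifier(nn.Module):
    # Illustrative sketch only -- rnn.py defines the actual model.
    def __init__(self, vocab_size, emb_dim=300, hidden_dim=256, num_classes=2):
        super(RNNClassifier, self).__init__()
        self.embedding = nn.Embedding(vocab_size, emb_dim)  # filled from GloVe 300d
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, num_classes)

    def forward(self, token_ids):
        embedded = self.embedding(token_ids)   # (batch, seq, emb)
        _, (h_n, _) = self.lstm(embedded)      # final hidden state: (1, batch, hidden)
        return self.fc(h_n.squeeze(0))         # (batch, num_classes)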


2. To obtain results for the CNN model:

First ensure that you are using a Python 3.6 environment with PyTorch 0.4.1, torchtext 0.4.0, and spaCy installed.

Train the model by running:

python cnn.py train

Then, obtain the testing accuracy with:

python cnn.py test
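
cnn.py is a convolutional text classifier; a common shape for such a model (a Kim-style CNN over word embeddings) is sketched below. The filter sizes and dimensions are assumptions for illustration, not the repo's exact settings:

import torch
import torch.nn as nn
import torch.nn.functional as F

class TextCNN(nn.Module):
    # Illustrative Kim-style text CNN; cnn.py defines the actual model.
    def __init__(self, vocab_size, emb_dim=300, num_filters=100,
                 filter_sizes=(3, 4, 5), num_classes=2):
        super(TextCNN, self).__init__()
        self.embedding = nn.Embedding(vocab_size, emb_dim)
        self.convs = nn.ModuleList(
            [nn.Conv1d(emb_dim, num_filters, k) for k in filter_sizes])
        self.fc = nn.Linear(num_filters * len(filter_sizes), num_classes)

    def forward(self, token_ids):                       # token_ids: (batch, seq)
        x = self.embedding(token_ids).transpose(1, 2)   # (batch, emb, seq)
        pooled = [F.adaptive_max_pool1d(F.relu(conv(x)), 1).squeeze(2)
                  for conv in self.convs]               # max-over-time per filter size
        return self.fc(torch.cat(pooled, dim=1))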


3. To obtain results for the bmLSTM model:

First ensure that you are using a Python 3.6 environment with PyTorch 0.4.1 installed.

The pretrained model is available at https://github.com/openai/generating-reviews-discovering-sentiment/tree/master/model

Train the model by running:

python run_bmLSTM.py
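
bmLSTM here is the byte multiplicative LSTM from the OpenAI "sentiment neuron" work linked above. Its cell differs from a vanilla LSTM by an extra multiplicative state that modulates the recurrent input; a sketch of one step, following Krause et al. (2016), with assumed weight names:

import torch

def mlstm_cell(x, h_prev, c_prev, W):
    # One multiplicative-LSTM step; W is a dict of assumed weights:
    # wmx (in, hid), wmh (hid, hid), wx (in, 4*hid), wm (hid, 4*hid), b (4*hid,)
    m = (x @ W["wmx"]) * (h_prev @ W["wmh"])   # multiplicative state
    z = x @ W["wx"] + m @ W["wm"] + W["b"]     # all four gates at once
    i, f, o, u = torch.chunk(z, 4, dim=1)
    c = torch.sigmoid(f) * c_prev + torch.sigmoid(i) * torch.tanh(u)
    h = torch.sigmoid(o) * torch.tanh(c)
    return h, c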


4. To obtain results for the original BERT model:

First ensure that you are using a Python 3.6 environment with PyTorch 0.4.1 and pytorch-pretrained-bert 0.3.0 installed.

Then, make sure the training file is named "train.tsv" and the testing file is named "test.tsv" (both under the directory passed to --data_dir below).

Then, simply run:

python run_classifier_new.py \
  --task_name SST-2 \
  --do_train \
  --do_test \
  --do_lower_case \
  --data_dir data/rt-polarity/ \
  --bert_model bert-base-uncased \
  --max_seq_length 128 \
  --train_batch_size 32 \
  --learning_rate 2e-5 \
  --num_train_epochs 2.0 \
  --output_dir Output/base/ \
  --load_dir Output/base/ \
  --save_dir Output/base/ \
  --gradient_accumulation_steps 32 \
  --eval_batch_size 1 \
  --model base
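
A note on the flags: assuming run_classifier_new.py keeps the behavior of the original pytorch-pretrained-bert run_classifier.py, --train_batch_size is divided by --gradient_accumulation_steps, so each forward pass sees a micro-batch of 1 and gradients are summed over 32 micro-batches before one optimizer update; the effective batch size stays 32. A self-contained sketch of that pattern, with a stand-in model:

import torch
import torch.nn as nn

model = nn.Linear(10, 2)                 # stand-in for BERT
optimizer = torch.optim.SGD(model.parameters(), lr=2e-5)
loss_fn = nn.CrossEntropyLoss()
accumulation_steps = 32

optimizer.zero_grad()
for step in range(accumulation_steps):
    x = torch.randn(1, 10)               # micro-batch of size 1
    y = torch.randint(0, 2, (1,))
    loss = loss_fn(model(x), y) / accumulation_steps  # scale so the summed
    loss.backward()                      # gradients match one full batch
optimizer.step()                         # one update per 32 micro-batches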


5. To fine-tune BERT with the attention approach, ensure the dependencies from step 4 are installed.

Then, simply run:

python run_classifier_new.py \
  --task_name SST-2 \
  --do_train \
  --do_test \
  --do_lower_case \
  --data_dir data/rt-polarity/ \
  --bert_model bert-base-uncased \
  --max_seq_length 128 \
  --train_batch_size 32 \
  --learning_rate 5e-5 \
  --num_train_epochs 3.0 \
  --output_dir Output/attn/ \
  --load_dir Output/attn/ \
  --save_dir Output/attn/ \
  --gradient_accumulation_steps 32 \
  --eval_batch_size 1 \
  --model attention \
  --seed 5246

Note that the model classes are defined in bertclassifier_new.py. Besides the attention model, we include 4 other experimental model options:

- "1hid": adds a single-hidden-layer MLP on top of the final [CLS] hidden state
- "2hid": adds a two-hidden-layer MLP instead
- "noparam": introduces no new parameters
- "full": uses residual connections from all intermediate [CLS] hidden states
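
As a mental model for the attention option: learn attention weights over the per-layer [CLS] hidden states and classify their weighted sum. The sketch below is an assumption for illustration; the project's real definition lives in bertclassifier_new.py:

import torch
import torch.nn as nn

class AttentionClsHead(nn.Module):
    # Sketch: attend over the [CLS] state from each encoder layer.
    def __init__(self, hidden_size=768, num_classes=2):
        super(AttentionClsHead, self).__init__()
        self.query = nn.Parameter(torch.randn(hidden_size))
        self.classifier = nn.Linear(hidden_size, num_classes)

    def forward(self, cls_states):                  # (batch, layers, hidden)
        scores = cls_states @ self.query            # (batch, layers)
        weights = torch.softmax(scores, dim=1).unsqueeze(2)
        pooled = (weights * cls_states).sum(dim=1)  # weighted sum of layers
        return self.classifier(pooled)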


6. To fine-tune BERT with adversarial training, please refer to the folder /bert_adv.
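
The adversarial recipe itself is documented in that folder. As general background only (not necessarily the method used there), a common way to adversarially fine-tune a text model is an FGM-style perturbation of the embedding matrix between two backward passes:

import torch

def fgm_perturb(embedding, epsilon=1.0):
    # Fast-gradient-method step (Miyato et al., 2017): move the embedding
    # weights along the gradient of the loss. General background only;
    # the project's actual approach lives in /bert_adv.
    grad = embedding.weight.grad
    if grad is None:
        return None
    norm = grad.norm()
    if norm == 0:
        return None
    delta = epsilon * grad / norm
    embedding.weight.data.add_(delta)
    return delta                 # subtract later to restore the weights

After loss.backward(), call fgm_perturb, run a second forward/backward pass on the perturbed embeddings, restore the weights with embedding.weight.data.sub_(delta), then step the optimizer.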
