-
Notifications
You must be signed in to change notification settings - Fork 12
[SEP] token removed from input #2
Comments
Hi! Yes, it is removed since the |
The example in the README for pytorch-pretrained-bert (for v6.1 https://github.com/huggingface/pytorch-transformers/blob/8f46cd105752c1f1218a2716ea423454273ff08b/README.md) takes [SEP] token also for constructing the segment ids similar to the paper:
Is the removal done for for the |
Thanks for noticing this! The code was made for an older version of pytorch-pretrained-bert. It seems that it doesn't work like this anymore. This is how it worked before: |
The version this code works on is |
The dataset reader seems to trim away the [SEP] token from the input. Should it be removed?
OpenNMT-APE/onmt/inputters/text_dataset.py
Line 112 in 89be3c1
The text was updated successfully, but these errors were encountered: