Status: Archive (code is provided as-is, no updates expected)

finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

Currently this code reproduces the ROCStories Cloze Test result reported in the paper. To run it: `python train.py --dataset rocstories --desc rocstories --submit --analysis --data_dir [path to data here]`

Note: The code is currently non-deterministic due to various GPU ops. The median accuracy of 10 runs with this codebase (using default hyperparameters) is 85.8%, slightly lower than the single-run result of 86.5% reported in the paper.
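
Because single runs vary, a median over several runs is the fairer comparison. The following is a minimal sketch, not part of this repository, of how one might repeat the command above and aggregate the results. The helper names (`build_command`, `run_training`, `median_accuracy`) are hypothetical. The sketch does not parse the training logs: it assumes you collect each run's accuracy yourself, and it does not check whether repeated runs with the same `--desc` overwrite earlier outputs.

```python
import statistics
import subprocess
import sys


def build_command(data_dir, python=sys.executable):
    """Return the argv list for the training command quoted in this README."""
    return [
        python, "train.py",
        "--dataset", "rocstories",
        "--desc", "rocstories",
        "--submit",
        "--analysis",
        "--data_dir", data_dir,
    ]


def run_training(data_dir, n_runs=10):
    """Launch the training command n_runs times, one after another."""
    for _ in range(n_runs):
        # check=True raises CalledProcessError if a run exits with an error.
        subprocess.run(build_command(data_dir), check=True)


def median_accuracy(accuracies):
    """Median of per-run accuracies (in percent), gathered by hand or by your own parser."""
    return statistics.median(accuracies)
```

For example, `run_training("[path to data here]")` would launch ten runs, after which `median_accuracy` can be called on the ten accuracy values you recorded.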

The ROCStories dataset can be downloaded from the associated website.