You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
|
|
4 years ago | |
|---|---|---|
| datas | 4 years ago | |
| log | 4 years ago | |
| model | 4 years ago | |
| save/rocstories | 4 years ago | |
| submission | 4 years ago | |
| .gitignore | 4 years ago | |
| LICENSE | 4 years ago | |
| README.md | 4 years ago | |
| analysis.py | 4 years ago | |
| datasets.py | 4 years ago | |
| opt.py | 4 years ago | |
| pip3 | 4 years ago | |
| text_utils.py | 4 years ago | |
| train.py | 4 years ago | |
| utils.py | 4 years ago | |
README.md
Status: Archive (code is provided as-is, no updates expected)
finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
Currently this code implements the ROCStories Cloze Test result reported in the paper by running:
python train.py --dataset rocstories --desc rocstories --submit --analysis --data_dir [path to data here]
Note: The code is currently non-deterministic due to various GPU ops. The median accuracy of 10 runs with this codebase (using default hyperparameters) is 85.8% - slightly lower than the reported single run of 86.5% from the paper.
The ROCStories dataset can be downloaded from the associated website.