# gpt-2
Code and samples from the paper ["Language Models are Unsupervised Multitask Learners"](https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf).
For now, we have only released a smaller (117M parameter) version of GPT-2.
See more details in our [blog post](https://blog.openai.com/better-language-models/).
## Installation
Clone this repository and `cd` into its directory for the remaining commands:
```
git clone https://github.com/openai/gpt-2.git && cd gpt-2
```
Then follow the instructions for either a native or Docker installation.
### Native Installation
Download the model data:
```
sh download_model.sh 117M
```
The remaining steps can optionally be done in a virtual environment using tools such as `virtualenv` or `conda`.
Install TensorFlow 1.12 (with GPU support if you have a GPU and want everything to run faster):
```
pip3 install tensorflow==1.12.0
```
or
```
pip3 install tensorflow-gpu==1.12.0
```
Install the other Python packages:
```
pip3 install -r requirements.txt
```
### Docker Installation
Build the Dockerfile and tag the created image as `gpt-2`:
```
docker build --tag gpt-2 -f Dockerfile.gpu . # or Dockerfile.cpu
```
Start an interactive bash session from the `gpt-2` Docker image.
You can opt to use the `--runtime=nvidia` flag if you have access to an NVIDIA GPU
and a valid install of [nvidia-docker 2.0](https://github.com/nvidia/nvidia-docker/wiki/Installation-(version-2.0)).
```
docker run --runtime=nvidia -it gpt-2 bash
```
## Usage
| WARNING: Samples are unfiltered and may contain offensive content. |
| --- |
Some of the examples below may include Unicode text characters. Set the environment variable:
```
export PYTHONIOENCODING=UTF-8
```
to force the standard streams into UTF-8 mode.
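A quick way to confirm the override took effect is to check the stream's encoding from Python (a minimal sketch; the exact reported name may vary by platform and Python version):

```python
import sys

# With PYTHONIOENCODING=UTF-8 set, stdout uses UTF-8 regardless of the
# locale's default, so non-ASCII sample text prints without errors.
print(sys.stdout.encoding)
print("naïve café — 日本語")
```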
### Unconditional sample generation
To generate unconditional samples from the small model:
```
python3 src/generate_unconditional_samples.py | tee /tmp/samples
```
There are various flags for controlling the samples:
```
python3 src/generate_unconditional_samples.py --top_k 40 --temperature 0.7 | tee /tmp/samples
```
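To build intuition for what these two flags do, here is a simplified Python sketch of temperature scaling and top_k truncation over a vector of next-token logits (an illustration only, not the repository's sampling code, which lives in `src/sample.py`):

```python
import math
import random

def sample_token(logits, temperature=0.7, top_k=40):
    """Pick one token index from raw logits, mimicking the effect of the
    --temperature and --top_k flags (simplified sketch)."""
    # Lower temperature sharpens the distribution; 1.0 leaves it unchanged.
    scaled = [l / temperature for l in logits]
    # top_k truncation: discard everything outside the k highest logits.
    if top_k > 0:
        cutoff = sorted(scaled, reverse=True)[min(top_k, len(scaled)) - 1]
        scaled = [s if s >= cutoff else float("-inf") for s in scaled]
    # Softmax over the surviving logits, then draw one index.
    peak = max(scaled)
    weights = [math.exp(s - peak) for s in scaled]
    return random.choices(range(len(weights)), weights=weights)[0]
```

With `top_k=1` this reduces to greedy decoding (always the highest-scoring token); with `top_k=0` no truncation is applied.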
To check flag descriptions, use:
```
python3 src/generate_unconditional_samples.py -- --help
```
### Conditional sample generation
To give the model custom prompts, you can use:
```
python3 src/interactive_conditional_samples.py --top_k 40
```
To check flag descriptions, use:
```
python3 src/interactive_conditional_samples.py -- --help
```
## GPT-2 samples
| WARNING: Samples are unfiltered and may contain offensive content. |
| --- |
While we have not yet released GPT-2 itself, you can see some samples from it in the `gpt-2-samples` folder.
We show unconditional samples with default settings (temperature 1 and no truncation), with temperature 0.7, and with top_k 40 truncation.
We show conditional samples, with contexts drawn from `WebText`'s test set, with default settings (temperature 1 and no truncation), with temperature 0.7, and with top_k 40 truncation.
## Future work
We may release code for evaluating the models on various benchmarks.
We are still considering releasing the larger models.