gpt-2/README.md

# gpt-2

Code and samples from the paper ["Language Models are Unsupervised Multitask Learners"](https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf).

We have currently released small (117M parameter) and medium (345M parameter) versions of GPT-2.

See more details in our [blog post](https://blog.openai.com/better-language-models/).

## Usage

This repository is meant to be a starting point for researchers and engineers to experiment with GPT-2.

### Some caveats

- GPT-2 models' robustness and worst case behaviors are not well-understood.  As with any machine-learned model, carefully evaluate GPT-2 for your use case, especially if used without fine-tuning or in safety-critical applications where reliability is important.
- The dataset our GPT-2 models were trained on contains many texts with [biases](https://twitter.com/TomerUllman/status/1101485289720242177) and factual inaccuracies, and thus GPT-2 models are likely to be biased and inaccurate as well.
- To avoid having samples mistaken as human-written, we recommend clearly labeling samples as synthetic before wide dissemination.  Our models are often incoherent or inaccurate in subtle ways, which takes more than a quick read for a human to notice.

### Work with us

Please [let us know](mailto:languagequestions@openai.com) if you’re doing interesting research with or working on applications of GPT-2!  We’re especially interested in hearing from and potentially working with those who are studying
- Potential malicious use cases and defenses against them (e.g. the detectability of synthetic text)
- The extent of problematic content (e.g. bias) being baked into the models and effective mitigations

## Development

See [DEVELOPERS.md](./DEVELOPERS.md)

## Contributors

See [CONTRIBUTORS.md](./CONTRIBUTORS.md)

## GPT-2 samples

| WARNING: Samples are unfiltered and may contain offensive content. |
| --- |

While we have not yet released GPT-2 itself, you can see some samples from it in the `gpt-2-samples` folder.
We show unconditional samples with default settings (temperature 1 and no truncation), with temperature 0.7, and with truncation with top_k 40.
We show conditional samples, with contexts drawn from `WebText`'s test set, with default settings (temperature 1 and no truncation), with temperature 0.7, and with truncation with top_k 40.

## Citation

Please use the following bibtex entry:
```
@article{radford2019language,
  title={Language Models are Unsupervised Multitask Learners},
  author={Radford, Alec and Wu, Jeff and Child, Rewon and Luan, David and Amodei, Dario and Sutskever, Ilya},
  year={2019}
}
```

## Future work

We may release code for evaluating the models on various benchmarks.

We are still considering release of the larger models.

## License

[MIT](./LICENSE)
-												First commit

											
										
										
											2019-02-10 20:22:00 -08:00
+								# gpt-2
-												README updates

											
										
										
											2019-02-14 08:43:50 -08:00
+								Code and samples from the paper ["Language Models are Unsupervised Multitask Learners"](https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf).
-												updates for 345M model

											
										
										
											2019-05-02 20:39:33 -07:00
+								We have currently released small (117M parameter) and medium (345M parameter) versions of GPT-2.
-												README updates

											
										
										
											2019-02-14 08:43:50 -08:00
 								See more details in our [blog post](https://blog.openai.com/better-language-models/).
-												First commit

											
										
										
											2019-02-10 20:22:00 -08:00
-												update readme with usage caveats and calls for research

This write-up was loosely inspired in part by Mitchell et al.’s work on
[Model Cards for Model Reporting](https://arxiv.org/abs/1810.03993).
Adding such model usage sections could be good practice in general for
open source research projects with potentially broad applications.

											
										
										
											2019-03-06 11:30:53 -08:00
+								## Usage
-												updates for 345M model

											
										
										
											2019-05-02 20:39:33 -07:00
+								This repository is meant to be a starting point for researchers and engineers to experiment with GPT-2.
-												update readme with usage caveats and calls for research

This write-up was loosely inspired in part by Mitchell et al.’s work on
[Model Cards for Model Reporting](https://arxiv.org/abs/1810.03993).
Adding such model usage sections could be good practice in general for
open source research projects with potentially broad applications.

											
										
										
											2019-03-06 11:30:53 -08:00
 								### Some caveats
-												updates for 345M model

											
										
										
											2019-05-02 20:39:33 -07:00
+								- GPT-2 models' robustness and worst case behaviors are not well-understood.  As with any machine-learned model, carefully evaluate GPT-2 for your use case, especially if used without fine-tuning or in safety-critical applications where reliability is important.
 								- The dataset our GPT-2 models were trained on contains many texts with [biases](https://twitter.com/TomerUllman/status/1101485289720242177) and factual inaccuracies, and thus GPT-2 models are likely to be biased and inaccurate as well.
-												update readme with usage caveats and calls for research

This write-up was loosely inspired in part by Mitchell et al.’s work on
[Model Cards for Model Reporting](https://arxiv.org/abs/1810.03993).
Adding such model usage sections could be good practice in general for
open source research projects with potentially broad applications.

											
										
										
											2019-03-06 11:30:53 -08:00
+								- To avoid having samples mistaken as human-written, we recommend clearly labeling samples as synthetic before wide dissemination.  Our models are often incoherent or inaccurate in subtle ways, which takes more than a quick read for a human to notice.
 								### Work with us
-												updates for 345M model

											
										
										
											2019-05-02 20:39:33 -07:00
+								Please [let us know](mailto:languagequestions@openai.com) if you’re doing interesting research with or working on applications of GPT-2!  We’re especially interested in hearing from and potentially working with those who are studying
-												update readme with usage caveats and calls for research

This write-up was loosely inspired in part by Mitchell et al.’s work on
[Model Cards for Model Reporting](https://arxiv.org/abs/1810.03993).
Adding such model usage sections could be good practice in general for
open source research projects with potentially broad applications.

											
										
										
											2019-03-06 11:30:53 -08:00
+								- Potential malicious use cases and defenses against them (e.g. the detectability of synthetic text)
 								- The extent of problematic content (e.g. bias) being baked into the models and effective mitigations
-												add contributors md and move dev docs out

											
										
										
											2019-03-06 12:15:51 -08:00
+								## Development
-												First commit

											
										
										
											2019-02-10 20:22:00 -08:00
-												add contributors md and move dev docs out

											
										
										
											2019-03-06 12:15:51 -08:00
+								See [DEVELOPERS.md](./DEVELOPERS.md)
-												separate out tensorflow install

											
										
										
											2019-02-19 17:48:19 -08:00
-												add contributors md and move dev docs out

											
										
										
											2019-03-06 12:15:51 -08:00
+								## Contributors
-												Add a Dockerfile and document usage in README

											
										
										
											2019-02-14 18:22:14 -05:00
-												add contributors md and move dev docs out

											
										
										
											2019-03-06 12:15:51 -08:00
+								See [CONTRIBUTORS.md](./CONTRIBUTORS.md)
-												Add documentation for help flags (#81)

add description for flags
											
										
										
											2019-02-27 12:31:38 +05:30
-												reorganize and add temp 0.7

											
										
										
											2019-02-19 00:43:31 -08:00
+								## GPT-2 samples
-												more warning

											
										
										
											2019-02-19 17:57:33 -08:00
+								| WARNING: Samples are unfiltered and may contain offensive content. |
 								| --- |
-												reorganize and add temp 0.7

											
										
										
											2019-02-19 00:43:31 -08:00
+								While we have not yet released GPT-2 itself, you can see some samples from it in the `gpt-2-samples` folder.
 								We show unconditional samples with default settings (temperature 1 and no truncation), with temperature 0.7, and with truncation with top_k 40.
-												add conditional samples

											
										
										
											2019-02-19 17:21:46 -08:00
+								We show conditional samples, with contexts drawn from `WebText`'s test set, with default settings (temperature 1 and no truncation), with temperature 0.7, and with truncation with top_k 40.
-												reorganize and add temp 0.7

											
										
										
											2019-02-19 00:43:31 -08:00
-												updates

											
										
										
											2019-02-28 15:51:34 -08:00
+								## Citation
 								Please use the following bibtex entry:
 								```
 								@article{radford2019language,
 								  title={Language Models are Unsupervised Multitask Learners},
 								  author={Radford, Alec and Wu, Jeff and Child, Rewon and Luan, David and Amodei, Dario and Sutskever, Ilya},
 								  year={2019}
 								}
 								```
-												add samples

											
										
										
											2019-02-14 00:17:55 -08:00
+								## Future work
 								We may release code for evaluating the models on various benchmarks.
 								We are still considering release of the larger models.
-												updates

											
										
										
											2019-02-28 15:51:34 -08:00
 								## License
-												update readme with usage caveats and calls for research

This write-up was loosely inspired in part by Mitchell et al.’s work on
[Model Cards for Model Reporting](https://arxiv.org/abs/1810.03993).
Adding such model usage sections could be good practice in general for
open source research projects with potentially broad applications.

											
										
										
											2019-03-06 11:30:53 -08:00
+								[MIT](./LICENSE)