Bonus: An abnormal number of pants
I trained GPT-2 on a collection of Halloween costumes, and saw interesting improvements over previous neural nets that I trained on the same dataset. The costumes in my main blog post represented a compromise, though, between a low-temperature (low-chaos) output that copied too many costumes from the training data, and a high-temperature (high-chaos) ou…