All posts about #1984



Exploring the latent space of GPT-2

I started playing with OpenAI’s new model GPT-2 (117M). This is the smaller model they published on Github. They laid out their reasons for not publishing the real thing in a blogpost. Bottom line: The model is so good, it might be too easily weaponized by malicious actors on …