One way to answer the question is to look for information that could have come from only one place. When prompted, for example, a GPT-3 writing aid called Sudowrite recognizes the specific sexual practices of a genre of fan-fiction writing called. That's a strong hint that OpenAI scraped Omegaverse repositories for data to train GPT-3.
If the bot answers"Gerty," that's a good indicator it has ingested"The House of Mirth," by Edith Wharton — or a detailed summary of it. Show the bot 100 samples from a given book and see how many it gets right. That's the book's score.After crunching the numbers, Bamman's team had a list. In addition to the modern public-school canon — Charles Dickens and Jack London, Frankenstein and Dracula — there are a few fun outliers.