I've noticed a number of people using AI Dungeon to test GPT-3's abilities. While it's a great way to see how GPT-3 can power an interesting application. It's a poor test of GPT-3's abilities in general. The first generation of any custom prompt is actually GPT-2.
19
27
11
170
Replying to @nickwalton00
Are there any other differences you can tell us about? Prepending, separating, or wrapping input? Fine tuning on some story focused corpus? Context size limits? Something else?

4:53 PM · Aug 2, 2020

1
0
0
1
Replying to @DerekMc00
We cut off the generation at certain points (trailing sentences etc...) Disable certain tokens to improve performance or make generation safer, fine-tune on text adventures and only use the last ~1000 tokens of context.
5
0
1
11
So that’s why it tends to go off rails after a bit. It’s fun for a while but it’s too open ended to be a proper game. What options have you considered? Maybe ask GPT to make a summary story before forgetting and then keep that? Stronger memory of place, genre and characters?
0
0
0
6