My (very limited) understanding of AI models is that the input "shape" has to be well defined.
I.e. a vision network expects 1 input per pixel (or more for encoding color) and so it's up to you to "format" your given image into what the model expects.
But what about GPT-3, which takes in "free text"? The animations in the post show 2048 input nodes; does this mean it can only take in a maximum of 2048 tokens, or will it somehow scale beyond that?
Correct, you can only input up to 2048 tokens total (a big improvement over GPT-2's 1024-token limit). You can use a sliding window to continue generating beyond that.
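A minimal sketch of the sliding-window idea in Python, assuming a hypothetical generate_next(context) call that returns one new token (the real API call would be whatever your model library exposes):

    # Sliding-window generation past the context limit.
    # generate_next(context) is a hypothetical function that returns
    # one new token given at most MAX_CTX tokens of context.
    MAX_CTX = 2048

    def generate_long(prompt_tokens, n_new_tokens):
        tokens = list(prompt_tokens)
        for _ in range(n_new_tokens):
            context = tokens[-MAX_CTX:]  # keep only the most recent window
            tokens.append(generate_next(context))
        return tokens

The trade-off is that anything scrolled out of the window is forgotten, so the model can lose track of earlier context.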
However, the self-attention computation scales quadratically with input length, which makes training models with larger context windows more difficult (which is why Reformer uses workarounds like locality-sensitive-hashing attention to increase the input size).
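To see where the quadratic term comes from: every one of the n input tokens attends to all n tokens, so the attention score matrix has n * n entries. A quick back-of-the-envelope check:

    # Doubling the context length roughly quadruples the size of the
    # attention score matrix (and the compute/memory to fill it).
    for n in [1024, 2048, 4096]:
        print(n, n * n)
    # 1024 1048576
    # 2048 4194304
    # 4096 16777216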
Yes, the input size is limited. In addition, each token may be a whole word or only part of a word, depending on how common it is: the byte-pair-encoding tokenizer gives common words a single token and splits uncommon words into several sub-word pieces, each of which counts against the limit.
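You can see this splitting behavior yourself. This sketch assumes the Hugging Face transformers package is installed; GPT-3 uses a BPE vocabulary similar to GPT-2's, so the GPT-2 tokenizer is a reasonable stand-in:

    # Illustrates BPE tokenization: common words are a single token,
    # rare words are split into several sub-word pieces.
    from transformers import GPT2Tokenizer

    tok = GPT2Tokenizer.from_pretrained("gpt2")
    print(tok.tokenize("the"))  # a common word: typically one token
    print(tok.tokenize("antidisestablishmentarianism"))  # several pieces

So a 2048-token limit is usually somewhat fewer than 2048 English words, and fewer still for unusual vocabulary.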