If I understood your question correctly,
You want to give an input a word (token) and the model to give you multiple sequential words based on the previous output ?
If that is the case, would a model that takes a word and predicts a word work well for you ? Then you can feedback the current output as input and repeat.
The issue there is that a model like that would be learning the statistical distribution of the training data. i.e. it means that if you give the model a word, it would output the word that is statistically repeating after.
Either way, you can use RNN layers (GRU, LSTM) to build such model.
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…