$$ P(\omega_0,\omega_1,\omega_2,\ldots,\omega_s)=P(\omega_0)\,P(\omega_1\mid\omega_0)\,P(\omega_2\mid\omega_0,\omega_1)\cdots P(\omega_s\mid\omega_0,\ldots,\omega_{s-1}) $$
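
The factorization above can be checked numerically. The following is a minimal sketch, not a model implementation: the helper name `sequence_log_prob` and the conditional probabilities are made up for illustration. It sums log-probabilities rather than multiplying raw probabilities, a common way to avoid underflow on long sequences.

```python
import math

def sequence_log_prob(conditional_probs):
    """Return log P(w_0, ..., w_s) given the per-step conditionals.

    conditional_probs[i] is assumed to hold P(w_i | w_0, ..., w_{i-1}),
    with conditional_probs[0] being the unconditional P(w_0).
    """
    return sum(math.log(p) for p in conditional_probs)

# Hypothetical values: P(w_0), P(w_1 | w_0), P(w_2 | w_0, w_1)
cond = [0.5, 0.4, 0.25]

log_joint = sequence_log_prob(cond)
joint = math.exp(log_joint)  # same as the product 0.5 * 0.4 * 0.25
```
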
$$ P(\text{token}_i) = \frac{e^{\frac{\text{logit}_i}{T}}}{\sum_j e^{\frac{\text{logit}_j}{T}}} $$
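
To see how $T$ reshapes the distribution, here is a small pure-Python sketch of the formula above. The function name `softmax_with_temperature` and the example logits are assumptions for illustration. Subtracting the maximum scaled logit before exponentiating is a standard numerical-stability step that leaves the result unchanged.

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Compute P(token_i) = exp(logit_i / T) / sum_j exp(logit_j / T)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Shift by the max so exp() never overflows; the ratio is unaffected.
    shift = max(scaled)
    exps = [math.exp(s - shift) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for a three-token vocabulary
logits = [2.0, 1.0, 0.0]

sharp = softmax_with_temperature(logits, temperature=0.5)  # T < 1: more peaked
base = softmax_with_temperature(logits, temperature=1.0)   # T = 1: plain softmax
flat = softmax_with_temperature(logits, temperature=2.0)   # T > 1: closer to uniform
```

With these inputs the top token's probability shrinks as $T$ grows, while each distribution still sums to 1.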