https://discuss.huggingface.co/t/llama-2-generation-config-top-p-0-6/49916 https://discuss.huggingface.co/t/llama2-pad-token-for-batched-inference/48020