From 8874310d3a6fcfda5d8e5a32777f9bab7221472c Mon Sep 17 00:00:00 2001 From: ai-modelscope Date: Mon, 28 Apr 2025 23:27:56 +0800 Subject: [PATCH] Update README.md --- README.md | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 1214c99..4adec64 100644 --- a/README.md +++ b/README.md @@ -23,6 +23,9 @@ Qwen3 is the latest generation of large language models in Qwen series, offering For more details, including benchmark evaluation, hardware requirements, and inference performance, please refer to our [blog](https://qwenlm.github.io/blog/qwen3/), [GitHub](https://github.com/QwenLM/Qwen3), and [Documentation](https://qwen.readthedocs.io/en/latest/). +> [!TIP] +> If you encounter significant endless repetitions, please refer to the [Best Practices](#best-practices) section for optimal sampling parameters, and set the ``presence_penalty`` to 1.5. + ## Quickstart The code of Qwen3 has been in the latest Hugging Face `transformers` and we advise you to use the latest version of `transformers`. @@ -94,7 +97,7 @@ For deployment, you can use `vllm>=0.8.5` or `sglang>=0.4.5.post2` to create an > [!TIP] > The `enable_thinking` switch is also available in APIs created by vLLM and SGLang. -> Please refer to [our documentation](https://qwen.readthedocs.io/) for more details. +> Please refer to our documentation for [vLLM](https://qwen.readthedocs.io/en/latest/deployment/vllm.html#thinking-non-thinking-modes) and [SGLang](https://qwen.readthedocs.io/en/latest/deployment/sglang.html#thinking-non-thinking-modes) users. ### `enable_thinking=True`