今天在X上看到提升GPT-4.5表現的作法,雖然我沒有GPT-4.5可以實際驗證,但這個作法跟李飛飛教授之前提昇大語言模型表現的方式有點像,都是強迫LLM回答前一定要思考。只是一個是透過Prompt一個是透過微調。
Prompt:
```
First, think deeply for five minutes (at a minimum — if after five minutes, you still don't have the optimal response, keep thinking until you do) about the best way to do this, inside <thinking> tags, and then respond with your answer.
```
沒有留言:
張貼留言