Post-training is an art form.



Shaping model behavior means sculpting character. Every reward is a tradeoff.

GPT-5 was a reset: balancing helpfulness with honesty, warmth without sycophancy.

The assistant shouldn't just answer. It should feel right to use.
EVERY-0.07%
GPT7.31%
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • 6
  • Repost
  • Share
Comment
0/400
BackrowObservervip
· 44m ago
Training is too troublesome, I don't understand.
View OriginalReply0
bridge_anxietyvip
· 10h ago
Tsk, it's pure craftsmanship.
View OriginalReply0
SerumSqueezervip
· 08-08 07:02
Who is paying for the training fee this time?
View OriginalReply0
SighingCashiervip
· 08-08 06:47
Who taught this underlying model? It's quite something.
View OriginalReply0
New_Ser_Ngmivip
· 08-08 06:40
gpt5 is your dad.
View OriginalReply0
JustAnotherWalletvip
· 08-08 06:34
Ah, don't make it too complicated.
View OriginalReply0
Trade Crypto Anywhere Anytime
qrCode
Scan to download Gate app
Community
English
  • 简体中文
  • English
  • Tiếng Việt
  • 繁體中文
  • Español
  • Русский
  • Français (Afrique)
  • Português (Portugal)
  • Bahasa Indonesia
  • 日本語
  • بالعربية
  • Українська
  • Português (Brasil)