TAMING OVERCONFIDENCE IN LLMS: REWARD CALIBRATION IN RLHF

Jixuan Leng, Chengsong Huang, Banghua Zhu, Jiaxin Huang

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review
