<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>置信度校准 on Answer</title>
    <link>https://answer.freetools.me/tags/%E7%BD%AE%E4%BF%A1%E5%BA%A6%E6%A0%A1%E5%87%86/</link>
    <description>Recent content in 置信度校准 on Answer</description>
    <generator>Hugo -- 0.152.2</generator>
    <language>zh-cn</language>
    <lastBuildDate>Thu, 12 Mar 2026 15:13:23 +0800</lastBuildDate>
    <atom:link href="https://answer.freetools.me/tags/%E7%BD%AE%E4%BF%A1%E5%BA%A6%E6%A0%A1%E5%87%86/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>置信度校准：当大模型说&#34;我有80%把握&#34;时，它真的知道自己在说什么吗？</title>
      <link>https://answer.freetools.me/%E7%BD%AE%E4%BF%A1%E5%BA%A6%E6%A0%A1%E5%87%86%E5%BD%93%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%AF%B4%E6%88%91%E6%9C%8980%E6%8A%8A%E6%8F%A1%E6%97%B6%E5%AE%83%E7%9C%9F%E7%9A%84%E7%9F%A5%E9%81%93%E8%87%AA%E5%B7%B1%E5%9C%A8%E8%AF%B4%E4%BB%80%E4%B9%88%E5%90%97/</link>
      <pubDate>Thu, 12 Mar 2026 15:13:23 +0800</pubDate>
      <guid>https://answer.freetools.me/%E7%BD%AE%E4%BF%A1%E5%BA%A6%E6%A0%A1%E5%87%86%E5%BD%93%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%AF%B4%E6%88%91%E6%9C%8980%E6%8A%8A%E6%8F%A1%E6%97%B6%E5%AE%83%E7%9C%9F%E7%9A%84%E7%9F%A5%E9%81%93%E8%87%AA%E5%B7%B1%E5%9C%A8%E8%AF%B4%E4%BB%80%E4%B9%88%E5%90%97/</guid>
      <description>深入解析大语言模型置信度校准的核心问题：从2017年Guo等人的开创性论文出发，系统阐述ECE、可靠性图等评估方法，揭示LLM过度自信的深层原因，详解温度缩放、Platt Scaling等校准技术，并探讨医疗AI、幻觉检测等关键应用场景。涵盖RLHF对校准的损害、verbalized confidence的新进展，以及&amp;#34;knowing when not to know&amp;#34;这一AI安全的核心命题。</description>
    </item>
  </channel>
</rss>
