<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Hallucination Detection on Answer</title>
    <link>https://answer.freetools.me/tags/%E5%B9%BB%E8%A7%89%E6%A3%80%E6%B5%8B/</link>
    <description>Recent content in Hallucination Detection on Answer</description>
    <generator>Hugo -- 0.152.2</generator>
    <language>zh-cn</language>
    <lastBuildDate>Thu, 12 Mar 2026 15:13:23 +0800</lastBuildDate>
    <atom:link href="https://answer.freetools.me/tags/%E5%B9%BB%E8%A7%89%E6%A3%80%E6%B5%8B/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Confidence Calibration: When a Large Model Says &#34;I&#39;m 80% Sure,&#34; Does It Really Know What It Is Talking About?</title>
      <link>https://answer.freetools.me/%E7%BD%AE%E4%BF%A1%E5%BA%A6%E6%A0%A1%E5%87%86%E5%BD%93%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%AF%B4%E6%88%91%E6%9C%8980%E6%8A%8A%E6%8F%A1%E6%97%B6%E5%AE%83%E7%9C%9F%E7%9A%84%E7%9F%A5%E9%81%93%E8%87%AA%E5%B7%B1%E5%9C%A8%E8%AF%B4%E4%BB%80%E4%B9%88%E5%90%97/</link>
      <pubDate>Thu, 12 Mar 2026 15:13:23 +0800</pubDate>
      <guid>https://answer.freetools.me/%E7%BD%AE%E4%BF%A1%E5%BA%A6%E6%A0%A1%E5%87%86%E5%BD%93%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%AF%B4%E6%88%91%E6%9C%8980%E6%8A%8A%E6%8F%A1%E6%97%B6%E5%AE%83%E7%9C%9F%E7%9A%84%E7%9F%A5%E9%81%93%E8%87%AA%E5%B7%B1%E5%9C%A8%E8%AF%B4%E4%BB%80%E4%B9%88%E5%90%97/</guid>
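      <description>An in-depth look at the core problem of confidence calibration in large language models: starting from the seminal 2017 paper by Guo et al., it systematically covers evaluation methods such as ECE and reliability diagrams, reveals the underlying causes of LLM overconfidence, explains calibration techniques such as temperature scaling and Platt Scaling, and discusses key application scenarios such as medical AI and hallucination detection. It also covers the damage RLHF does to calibration, recent progress on verbalized confidence, and &amp;#34;knowing when not to know&amp;#34; as a core question of AI safety.</description>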
    </item>
    <item>
      <title>Logprobs in Depth: The Hidden Information in Large Model Outputs</title>
      <link>https://answer.freetools.me/logprobs%E6%B7%B1%E5%BA%A6%E8%A7%A3%E6%9E%90%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%BE%93%E5%87%BA%E7%9A%84%E9%9A%90%E8%97%8F%E4%BF%A1%E6%81%AF/</link>
      <pubDate>Thu, 12 Mar 2026 07:08:36 +0800</pubDate>
      <guid>https://answer.freetools.me/logprobs%E6%B7%B1%E5%BA%A6%E8%A7%A3%E6%9E%90%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%BE%93%E5%87%BA%E7%9A%84%E9%9A%90%E8%97%8F%E4%BF%A1%E6%81%AF/</guid>
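      <description>From information-theoretic foundations to engineering practice: an in-depth look at the technical principles of logprobs, their numerical stability, and their use in confidence estimation and hallucination detection.</description>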
    </item>
  </channel>
</rss>
