<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Tokenization on Answer</title>
    <link>https://answer.freetools.me/tags/tokenization/</link>
    <description>Recent content in Tokenization on Answer</description>
    <generator>Hugo -- 0.152.2</generator>
    <language>zh-cn</language>
    <lastBuildDate>Thu, 12 Mar 2026 07:50:29 +0800</lastBuildDate>
    <atom:link href="https://answer.freetools.me/tags/tokenization/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Token ID：大模型如何用一个数字代表一个词</title>
      <link>https://answer.freetools.me/token-id%E5%A4%A7%E6%A8%A1%E5%9E%8B%E5%A6%82%E4%BD%95%E7%94%A8%E4%B8%80%E4%B8%AA%E6%95%B0%E5%AD%97%E4%BB%A3%E8%A1%A8%E4%B8%80%E4%B8%AA%E8%AF%8D/</link>
      <pubDate>Thu, 12 Mar 2026 07:50:29 +0800</pubDate>
      <guid>https://answer.freetools.me/token-id%E5%A4%A7%E6%A8%A1%E5%9E%8B%E5%A6%82%E4%BD%95%E7%94%A8%E4%B8%80%E4%B8%AA%E6%95%B0%E5%AD%97%E4%BB%A3%E8%A1%A8%E4%B8%80%E4%B8%AA%E8%AF%8D/</guid>
      <description>从分词器的词表构建到Embedding查找表，深入解析Token ID的数学本质、实现原理、以及在模型推理中的完整生命周期。涵盖BPE算法的Token ID分配机制、不同语言的Token效率差异、权重共享原理，以及Token ID如何影响模型的算术能力和多语言表现。</description>
    </item>
    <item>
      <title>为什么大模型连两位数加法都算不准：从tokenization到启发式神经元的完整技术解析</title>
      <link>https://answer.freetools.me/%E4%B8%BA%E4%BB%80%E4%B9%88%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%BF%9E%E4%B8%A4%E4%BD%8D%E6%95%B0%E5%8A%A0%E6%B3%95%E9%83%BD%E7%AE%97%E4%B8%8D%E5%87%86%E4%BB%8Etokenization%E5%88%B0%E5%90%AF%E5%8F%91%E5%BC%8F%E7%A5%9E%E7%BB%8F%E5%85%83%E7%9A%84%E5%AE%8C%E6%95%B4%E6%8A%80%E6%9C%AF%E8%A7%A3%E6%9E%90/</link>
      <pubDate>Thu, 12 Mar 2026 03:10:03 +0800</pubDate>
      <guid>https://answer.freetools.me/%E4%B8%BA%E4%BB%80%E4%B9%88%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%BF%9E%E4%B8%A4%E4%BD%8D%E6%95%B0%E5%8A%A0%E6%B3%95%E9%83%BD%E7%AE%97%E4%B8%8D%E5%87%86%E4%BB%8Etokenization%E5%88%B0%E5%90%AF%E5%8F%91%E5%BC%8F%E7%A5%9E%E7%BB%8F%E5%85%83%E7%9A%84%E5%AE%8C%E6%95%B4%E6%8A%80%E6%9C%AF%E8%A7%A3%E6%9E%90/</guid>
      <description>深入解析大语言模型算术能力受限的技术根源：从tokenization对数字的不一致切分、神经网络&amp;#34;启发式袋&amp;#34;机制替代真正算法、到位置编码导致数位信息丢失。基于ICLR 2025等前沿研究，揭示为什么能通过律师考试的AI却算不对两位数加法，以及这一发现对AI系统设计的深层启示。</description>
    </item>
    <item>
      <title>大模型的Padding陷阱：为什么Decoder推理必须左填充，而BERT却用右填充？</title>
      <link>https://answer.freetools.me/%E5%A4%A7%E6%A8%A1%E5%9E%8B%E7%9A%84padding%E9%99%B7%E9%98%B1%E4%B8%BA%E4%BB%80%E4%B9%88decoder%E6%8E%A8%E7%90%86%E5%BF%85%E9%A1%BB%E5%B7%A6%E5%A1%AB%E5%85%85%E8%80%8Cbert%E5%8D%B4%E7%94%A8%E5%8F%B3%E5%A1%AB%E5%85%85/</link>
      <pubDate>Thu, 12 Mar 2026 02:54:34 +0800</pubDate>
      <guid>https://answer.freetools.me/%E5%A4%A7%E6%A8%A1%E5%9E%8B%E7%9A%84padding%E9%99%B7%E9%98%B1%E4%B8%BA%E4%BB%80%E4%B9%88decoder%E6%8E%A8%E7%90%86%E5%BF%85%E9%A1%BB%E5%B7%A6%E5%A1%AB%E5%85%85%E8%80%8Cbert%E5%8D%B4%E7%94%A8%E5%8F%B3%E5%A1%AB%E5%85%85/</guid>
      <description>深入解析大模型中padding、truncation与attention mask的协同工作原理。从Decoder-only模型的生成机制出发，揭示为什么GPT推理必须使用左填充，而BERT使用右填充。涵盖位置编码交互、序列打包优化、Flash Attention处理、训练推理差异等核心技术细节。</description>
    </item>
    <item>
      <title>为什么千亿参数模型的词表只有32K？从压缩效率到计算最优的完整解析</title>
      <link>https://answer.freetools.me/%E4%B8%BA%E4%BB%80%E4%B9%88%E5%8D%83%E4%BA%BF%E5%8F%82%E6%95%B0%E6%A8%A1%E5%9E%8B%E7%9A%84%E8%AF%8D%E8%A1%A8%E5%8F%AA%E6%9C%8932k%E4%BB%8E%E5%8E%8B%E7%BC%A9%E6%95%88%E7%8E%87%E5%88%B0%E8%AE%A1%E7%AE%97%E6%9C%80%E4%BC%98%E7%9A%84%E5%AE%8C%E6%95%B4%E8%A7%A3%E6%9E%90/</link>
      <pubDate>Wed, 11 Mar 2026 19:30:52 +0800</pubDate>
      <guid>https://answer.freetools.me/%E4%B8%BA%E4%BB%80%E4%B9%88%E5%8D%83%E4%BA%BF%E5%8F%82%E6%95%B0%E6%A8%A1%E5%9E%8B%E7%9A%84%E8%AF%8D%E8%A1%A8%E5%8F%AA%E6%9C%8932k%E4%BB%8E%E5%8E%8B%E7%BC%A9%E6%95%88%E7%8E%87%E5%88%B0%E8%AE%A1%E7%AE%97%E6%9C%80%E4%BC%98%E7%9A%84%E5%AE%8C%E6%95%B4%E8%A7%A3%E6%9E%90/</guid>
      <description>从压缩效率到计算最优的完整解析：为什么千亿参数模型的词表只有32K？深入探讨词表大小对模型性能、多语言处理效率、内存占用的影响，以及NeurIPS 2024论文揭示的最优词表大小计算方法。</description>
    </item>
    <item>
      <title>为什么大模型连简单的加法都会算错？从分词陷阱到算术推理的技术困境</title>
      <link>https://answer.freetools.me/%E4%B8%BA%E4%BB%80%E4%B9%88%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%BF%9E%E7%AE%80%E5%8D%95%E7%9A%84%E5%8A%A0%E6%B3%95%E9%83%BD%E4%BC%9A%E7%AE%97%E9%94%99%E4%BB%8E%E5%88%86%E8%AF%8D%E9%99%B7%E9%98%B1%E5%88%B0%E7%AE%97%E6%9C%AF%E6%8E%A8%E7%90%86%E7%9A%84%E6%8A%80%E6%9C%AF%E5%9B%B0%E5%A2%83/</link>
      <pubDate>Mon, 09 Mar 2026 02:47:33 +0800</pubDate>
      <guid>https://answer.freetools.me/%E4%B8%BA%E4%BB%80%E4%B9%88%E5%A4%A7%E6%A8%A1%E5%9E%8B%E8%BF%9E%E7%AE%80%E5%8D%95%E7%9A%84%E5%8A%A0%E6%B3%95%E9%83%BD%E4%BC%9A%E7%AE%97%E9%94%99%E4%BB%8E%E5%88%86%E8%AF%8D%E9%99%B7%E9%98%B1%E5%88%B0%E7%AE%97%E6%9C%AF%E6%8E%A8%E7%90%86%E7%9A%84%E6%8A%80%E6%9C%AF%E5%9B%B0%E5%A2%83/</guid>
      <description>为什么大模型连简单的加法都会算错？从分词陷阱到算术推理的技术困境</description>
    </item>
    <item>
      <title>令牌化与加密的本质分歧：为何支付数据保护不能只靠密码学</title>
      <link>https://answer.freetools.me/%E4%BB%A4%E7%89%8C%E5%8C%96%E4%B8%8E%E5%8A%A0%E5%AF%86%E7%9A%84%E6%9C%AC%E8%B4%A8%E5%88%86%E6%AD%A7%E4%B8%BA%E4%BD%95%E6%94%AF%E4%BB%98%E6%95%B0%E6%8D%AE%E4%BF%9D%E6%8A%A4%E4%B8%8D%E8%83%BD%E5%8F%AA%E9%9D%A0%E5%AF%86%E7%A0%81%E5%AD%A6/</link>
      <pubDate>Fri, 06 Mar 2026 05:02:45 +0800</pubDate>
      <guid>https://answer.freetools.me/%E4%BB%A4%E7%89%8C%E5%8C%96%E4%B8%8E%E5%8A%A0%E5%AF%86%E7%9A%84%E6%9C%AC%E8%B4%A8%E5%88%86%E6%AD%A7%E4%B8%BA%E4%BD%95%E6%94%AF%E4%BB%98%E6%95%B0%E6%8D%AE%E4%BF%9D%E6%8A%A4%E4%B8%8D%E8%83%BD%E5%8F%AA%E9%9D%A0%E5%AF%86%E7%A0%81%E5%AD%A6/</guid>
      <description>深入解析令牌化与加密在支付安全领域的本质区别。从2013年Target数据泄露事件到EMVCo支付令牌化规范，系统梳理令牌化如何从根本上改变数据保护范式。基于PCI DSS合规指南、IEEE密码学研究论文和Visa全球支付数据，揭示为何令牌化能实现加密无法企及的安全效果——将敏感数据彻底移出攻击面，而非仅仅使其&amp;#34;不可读&amp;#34;。</description>
    </item>
  </channel>
</rss>
