<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>数据并行 on Answer</title>
    <link>https://answer.freetools.me/tags/%E6%95%B0%E6%8D%AE%E5%B9%B6%E8%A1%8C/</link>
    <description>Recent content in 数据并行 on Answer</description>
    <generator>Hugo -- 0.152.2</generator>
    <language>zh-cn</language>
    <lastBuildDate>Fri, 13 Mar 2026 07:09:28 +0800</lastBuildDate>
    <atom:link href="https://answer.freetools.me/tags/%E6%95%B0%E6%8D%AE%E5%B9%B6%E8%A1%8C/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>梯度同步：为什么分布式训练卡在通信瓶颈上二十年无法突破？</title>
      <link>https://answer.freetools.me/%E6%A2%AF%E5%BA%A6%E5%90%8C%E6%AD%A5%E4%B8%BA%E4%BB%80%E4%B9%88%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83%E5%8D%A1%E5%9C%A8%E9%80%9A%E4%BF%A1%E7%93%B6%E9%A2%88%E4%B8%8A%E4%BA%8C%E5%8D%81%E5%B9%B4%E6%97%A0%E6%B3%95%E7%AA%81%E7%A0%B4/</link>
      <pubDate>Fri, 13 Mar 2026 07:09:28 +0800</pubDate>
      <guid>https://answer.freetools.me/%E6%A2%AF%E5%BA%A6%E5%90%8C%E6%AD%A5%E4%B8%BA%E4%BB%80%E4%B9%88%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83%E5%8D%A1%E5%9C%A8%E9%80%9A%E4%BF%A1%E7%93%B6%E9%A2%88%E4%B8%8A%E4%BA%8C%E5%8D%81%E5%B9%B4%E6%97%A0%E6%B3%95%E7%AA%81%E7%A0%B4/</guid>
      <description>从AllReduce算法的演进到梯度压缩、通信重叠、ZeRO优化等技术突破，深入解析分布式训练中梯度同步通信瓶颈的技术本质与工程权衡，揭示为什么这个问题困扰AI工程界二十年。</description>
    </item>
    <item>
      <title>千亿参数模型如何塞进有限显卡ZeRO如何用分片消除数据并行的内存冗余</title>
      <link>https://answer.freetools.me/%E5%8D%83%E4%BA%BF%E5%8F%82%E6%95%B0%E6%A8%A1%E5%9E%8B%E5%A6%82%E4%BD%95%E5%A1%9E%E8%BF%9B%E6%9C%89%E9%99%90%E6%98%BE%E5%8D%A1zero%E5%A6%82%E4%BD%95%E7%94%A8%E5%88%86%E7%89%87%E6%B6%88%E9%99%A4%E6%95%B0%E6%8D%AE%E5%B9%B6%E8%A1%8C%E7%9A%84%E5%86%85%E5%AD%98%E5%86%97%E4%BD%99/</link>
      <pubDate>Mon, 09 Mar 2026 05:54:35 +0800</pubDate>
      <guid>https://answer.freetools.me/%E5%8D%83%E4%BA%BF%E5%8F%82%E6%95%B0%E6%A8%A1%E5%9E%8B%E5%A6%82%E4%BD%95%E5%A1%9E%E8%BF%9B%E6%9C%89%E9%99%90%E6%98%BE%E5%8D%A1zero%E5%A6%82%E4%BD%95%E7%94%A8%E5%88%86%E7%89%87%E6%B6%88%E9%99%A4%E6%95%B0%E6%8D%AE%E5%B9%B6%E8%A1%8C%E7%9A%84%E5%86%85%E5%AD%98%E5%86%97%E4%BD%99/</guid>
      <description>深入解析ZeRO（零冗余优化器）如何通过三阶段分片技术消除数据并行的内存冗余。从混合精度训练的内存消耗分析入手，详细阐述优化器状态、梯度、参数分片的数学原理，对比ZeRO与模型并行、流水线并行的通信开销，并介绍ZeRO-Offload和ZeRO-Infinity如何突破GPU内存墙。</description>
    </item>
  </channel>
</rss>
