<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>可靠性工程 on Answer</title>
    <link>https://answer.freetools.me/categories/%E5%8F%AF%E9%9D%A0%E6%80%A7%E5%B7%A5%E7%A8%8B/</link>
    <description>Recent content in 可靠性工程 on Answer</description>
    <generator>Hugo -- 0.152.2</generator>
    <language>zh-cn</language>
    <lastBuildDate>Sat, 07 Mar 2026 04:00:23 +0800</lastBuildDate>
    <atom:link href="https://answer.freetools.me/categories/%E5%8F%AF%E9%9D%A0%E6%80%A7%E5%B7%A5%E7%A8%8B/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>为何要在生产环境故意制造故障？从Netflix的猴子军团到混沌工程的十五年演进</title>
      <link>https://answer.freetools.me/%E4%B8%BA%E4%BD%95%E8%A6%81%E5%9C%A8%E7%94%9F%E4%BA%A7%E7%8E%AF%E5%A2%83%E6%95%85%E6%84%8F%E5%88%B6%E9%80%A0%E6%95%85%E9%9A%9C%E4%BB%8Enetflix%E7%9A%84%E7%8C%B4%E5%AD%90%E5%86%9B%E5%9B%A2%E5%88%B0%E6%B7%B7%E6%B2%8C%E5%B7%A5%E7%A8%8B%E7%9A%84%E5%8D%81%E4%BA%94%E5%B9%B4%E6%BC%94%E8%BF%9B/</link>
      <pubDate>Sat, 07 Mar 2026 04:00:23 +0800</pubDate>
      <guid>https://answer.freetools.me/%E4%B8%BA%E4%BD%95%E8%A6%81%E5%9C%A8%E7%94%9F%E4%BA%A7%E7%8E%AF%E5%A2%83%E6%95%85%E6%84%8F%E5%88%B6%E9%80%A0%E6%95%85%E9%9A%9C%E4%BB%8Enetflix%E7%9A%84%E7%8C%B4%E5%AD%90%E5%86%9B%E5%9B%A2%E5%88%B0%E6%B7%B7%E6%B2%8C%E5%B7%A5%E7%A8%8B%E7%9A%84%E5%8D%81%E4%BA%94%E5%B9%B4%E6%BC%94%E8%BF%9B/</guid>
      <description>深入解析混沌工程十五年演进历程，从Netflix 2008年数据库灾难到Chaos Monkey的诞生，系统阐述故障注入方法论、爆炸半径控制机制、主流工具对比与实施路径。涵盖稳态假设定义、tc/netem网络故障注入、Google DiRT演练实践、ROI量化分析，以及从传统测试到混沌工程的本质区别。</description>
    </item>
    <item>
      <title>健康检查为何成了分布式系统的隐形杀手——从TCP端口探测到语义健康检测的二十年陷阱</title>
      <link>https://answer.freetools.me/%E5%81%A5%E5%BA%B7%E6%A3%80%E6%9F%A5%E4%B8%BA%E4%BD%95%E6%88%90%E4%BA%86%E5%88%86%E5%B8%83%E5%BC%8F%E7%B3%BB%E7%BB%9F%E7%9A%84%E9%9A%90%E5%BD%A2%E6%9D%80%E6%89%8B%E4%BB%8Etcp%E7%AB%AF%E5%8F%A3%E6%8E%A2%E6%B5%8B%E5%88%B0%E8%AF%AD%E4%B9%89%E5%81%A5%E5%BA%B7%E6%A3%80%E6%B5%8B%E7%9A%84%E4%BA%8C%E5%8D%81%E5%B9%B4%E9%99%B7%E9%98%B1/</link>
      <pubDate>Fri, 06 Mar 2026 11:08:39 +0800</pubDate>
      <guid>https://answer.freetools.me/%E5%81%A5%E5%BA%B7%E6%A3%80%E6%9F%A5%E4%B8%BA%E4%BD%95%E6%88%90%E4%BA%86%E5%88%86%E5%B8%83%E5%BC%8F%E7%B3%BB%E7%BB%9F%E7%9A%84%E9%9A%90%E5%BD%A2%E6%9D%80%E6%89%8B%E4%BB%8Etcp%E7%AB%AF%E5%8F%A3%E6%8E%A2%E6%B5%8B%E5%88%B0%E8%AF%AD%E4%B9%89%E5%81%A5%E5%BA%B7%E6%A3%80%E6%B5%8B%E7%9A%84%E4%BA%8C%E5%8D%81%E5%B9%B4%E9%99%B7%E9%98%B1/</guid>
      <description>从AWS Builder&amp;#39;s Library的深度健康检查分层，到Colin Breck的Kubernetes探针踩坑实录，再到Netflix的应用层DDoS雪崩效应，系统梳理健康检查二十年来的设计演进与工程陷阱。深入剖析浅层检查与深层检查的本质权衡、级联故障的触发机制、健康检查风暴的成因，以及Fail-Open机制、反馈回路、并发限制等最佳实践。基于Google SRE、Lyft Envoy、gRPC健康协议等权威信源，揭示一个被严重误解的分布式系统核心组件。</description>
    </item>
  </channel>
</rss>
