Jack ShiraziFastly, Google and Amazon’s “Bug Already Present” Pattern Caused 3 Biggest Outages This YearIn all cases, a bug that wasn’t triggered until long after release caused a cascade of failures4 min read·Jun 30, 2021----
Jack ShiraziWhat Reliability Engineers can learn from Google’s December 2020 OAuth OutageFive Takeways to Apply to Your Reliability Strategy8 min read·Jun 30, 2021----
Jack ShiraziWhat Reliability Engineers can learn from Amazon’s November 2020 Kinesis OutageSix Takeways to Apply to Your Reliability Strategy5 min read·Jun 30, 2021----
Jack ShiraziinExpedia Group TechnologyHow To Export Medium Stats for Monthly AnalysisLearn how to generate and retain monthly stats on Medium3 min read·Jun 23, 2021--4--4
Jack ShiraziinExpedia Group TechnologyMicroservices Are Not a Technical Solution, They’re a Teamwork SolutionThe first answer to “make this more reliable” isn’t “make it a set of microservices”3 min read·Apr 29, 2021--4--4
Jack ShiraziinExpedia Group TechnologyTraffic Shedding, Rate Limiting, Backpressure, Oh My!How to stop your service from getting overloaded3 min read·Mar 25, 2021--1--1
Jack ShiraziinExpedia Group TechnologyThe Cost of 100% ReliabilityHow do reliability costs stack up? Where do they come from?6 min read·Mar 31, 2020----
Jack ShiraziinExpedia Group TechnologyPractical JVM GC tuning for everyoneThe modern garbage collection tuning procedure for JVMs3 min read·Feb 20, 2020--2--2
Jack ShiraziinExpedia Group TechnologyDevOps = Dev + ErrorBudget + OpsIn this decade we entered a new era of IT: rapid release as a standard. This practice has grown from niche to such a common practice that…5 min read·Jun 19, 2019--1--1
Jack ShiraziinThe Hotels.com Technology BlogOptimizing your server by limiting request overheadsSuccess is a double-edged sword — increased request volume and more edge case requests stress your server. Many a server has failed…3 min read·Jan 17, 2019----