mpm + safety   6

Atomic Durability - How do databases recover from a crash ?
In this post we're going to talk about the four different ways of having durable transactions.
database  consistency  safety  storage 
7 weeks ago by mpm
Probabilistically Bounded Staleness
Instead of relying on anecdotal evidence, we can quantitatively demonstrate why eventual consistency is "good enough" for many users. We can predict the expected consistency of an eventually consistent data store using models we've developed, called Probabilistically Bounded Staleness
distributed  consistency  base  safety 
april 2012 by mpm
Building on Quicksand
Reliable systems have always been built out of unreliable components. Early on, the reliable components were small such as mirrored disks or ECC (Error Correcting Codes) in core memory. These systems were designed such that failures of these small components were transparent to the application. Later, the size of the unreliable components grew larger and semantic challenges crept into the application when failures occurred.
reliability  safety  availability  integrity 
december 2008 by mpm
Perspectives - Degraded Operations Mode
all services should expect to be overloaded and all services should expect mass failures. Very few do and I see related down-time in the news every month or so.
reliability  availability  safety 
september 2008 by mpm
Article: Could we have saved the Death Star?
Had Darth Vader employed formal methods to the design of his Death Star, perhaps it would not have been vulnerable to the Starfighter attack that led to its destruction.
availability  safety 
july 2008 by mpm
Staggering Failure of the Software Sort
An alert that’s needed because the manual reboot that’s needed for the auto-reboot that’s needed for the system that doesn’t work, doesn’t work.
safety  maintainability  fault-removal 
may 2008 by mpm

Copy this bookmark: