...
Optimistic availability is a characteristic of distributed systems that are coordinated by a master node which allows the system to tolerate arbitrary node failure while preserving availability as long as the master node and at least one relevant node agree on the state of the system. In laymen's terms: given sufficient replication, if a node fails, we're optimistic that the system will remain available.
- The master node is a single point of failure. It is possible to have multiple masters and/or leader election in the event that the master goes down.
- The master node is only required to coordinate state changes and does not, itself, need to store much data beyond system-wide state (i.e. latest version, etc).
- If the master node is available, we can be optimistic that the system is available (the read or write will be serviced) as long as one relevant replica with the same state is available.
- We can guarantee CID because a single operation:
- will transition the database from one consistent state to another if the database is available or fail
be done in isolation because the master node will write lock its buffer of writes to coordinate concurrent operations (serializability) AND the WRITE PROTOCOL ensures that writes are only sent to nodes that were previously available (meaning they are the correct sequence of writes) and writes are only sent to previously unavailable nodes sequentially during the REPAIR PROCESS
will be held on the master node if there is at least one node that is not available to ensure that it is never lost
In order to get A (atomicity) and full transactions, the master node must store atomic groups of operations in a buffer and replay them against the current state of the system when its time to commit (TODO: think about this some more)