0029d62434d9045bc3e8b2eb48ae696e30336e92 - arm/linux

commit	0029d62434d9045bc3e8b2eb48ae696e30336e92	[log] [tgz]
author	Lars Ellenberg <lars.ellenberg@linbit.com>	Thu Jun 14 18:02:52 2012 +0200
committer	Philipp Reisner <philipp.reisner@linbit.com>	Tue Jul 24 14:09:53 2012 +0200
tree	d1e2ce7a3130e5b489320b12b3568c8a58828697
parent	88437879fbd0ac5cde3f683f8eee455bbf83aaac [diff]

drbd: do not reset rs_pending_cnt too early

Fix asserts like
  block drbd0: in got_BlockAck:4634: rs_pending_cnt = -35 < 0 !

We reset the resync lru cache and related information (rs_pending_cnt),
once we successfully finished a resync or online verify, or if the
replication connection is lost.

We also need to reset it if a resync or online verify is aborted
because a lower level disk failed.

In that case the replication link is still established,
and we may still have packets queued in the network buffers
which want to touch rs_pending_cnt.

We do not have any synchronization mechanism to know for sure when all
such pending resync related packets have been drained.

To avoid this counter to go negative (and violate the ASSERT that it
will always be >= 0), just do not reset it when we lose a disk.

It is good enough to make sure it is re-initialized before the next
resync can start: reset it when we re-attach a disk.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

3 files changed

tree: d1e2ce7a3130e5b489320b12b3568c8a58828697