From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 3673 invoked by alias); 9 Jul 2010 15:08:17 -0000 Received: (qmail 3657 invoked by uid 9478); 9 Jul 2010 15:08:16 -0000 Date: Fri, 09 Jul 2010 15:08:00 -0000 Message-ID: <20100709150816.3655.qmail@sourceware.org> From: jbrassow@sourceware.org To: lvm-devel@redhat.com, lvm2-cvs@sourceware.org Subject: LVM2 ./WHATS_NEW lib/metadata/mirror.c Mailing-List: contact lvm2-cvs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: lvm2-cvs-owner@sourceware.org X-SW-Source: 2010-07/txt/msg00039.txt.bz2 CVSROOT: /cvs/lvm2 Module name: LVM2 Changes by: jbrassow@sourceware.org 2010-07-09 15:08:15 Modified files: . : WHATS_NEW lib/metadata : mirror.c Log message: Finish fix for bug 607347: failing both redundant mirror log legs... A previous check-in added logic to handle the case where both images of a mirrored log failed. It solved the problem by simply removing the log entirely - leaving the parent mirror with a 'core' log. This worked for most cases. However, if there was a small delay between the failures of the two mirrored log devices, the mirror would hang, LVM would hang, and no additional LVM commands could be issued. When the first leg of the log fails, it signals the need for repair. Before 'lvconvert --repair' is run by dmeventd, the second leg fails. 'lvconvert' would see both devices as failed and try to remove the log entirely. When it came time to suspend the parent mirror to update the configuration, the suspend would hang because it couldn't get any I/O through the mirrored log, which was plugged waiting for corrective action. The solution is to replace the log with an error target to clear any pending writes before removing it. This allows the parent mirror to suspend and make the proper changes. Patches: http://sourceware.org/cgi-bin/cvsweb.cgi/LVM2/WHATS_NEW.diff?cvsroot=lvm2&r1=1.1659&r2=1.1660 http://sourceware.org/cgi-bin/cvsweb.cgi/LVM2/lib/metadata/mirror.c.diff?cvsroot=lvm2&r1=1.122&r2=1.123 --- LVM2/WHATS_NEW 2010/07/08 12:24:04 1.1659 +++ LVM2/WHATS_NEW 2010/07/09 15:08:12 1.1660 @@ -1,5 +1,6 @@ Version 2.02.71 - =============================== + Fix possible hang when all mirror images of a mirrored log fail. Do not log backtrace in valid _lv_resume() code path. Cleanup help strings in configure.in. Prompt if metadataignore with vgextend or pvchange would adjust vg_mda_copies. --- LVM2/lib/metadata/mirror.c 2010/07/02 21:16:50 1.122 +++ LVM2/lib/metadata/mirror.c 2010/07/09 15:08:15 1.123 @@ -864,9 +864,72 @@ lv->status &= ~MIRROR_NOTSYNCED; if (!replace_lv_with_error_segment(lv)) return_0; - } else if (remove_log) + } else if (remove_log) { detached_log_lv = detach_mirror_log(mirrored_seg); + /* + * The log may be removed due to repair. If the log + * happens to be a mirrored log, then there is a special + * case we need to consider. One of the images of a + * mirrored log can fail followed shortly afterwards by + * a failure of the second. This means that the top-level + * mirror is waiting for writes to the log to finish, but + * they never will unless the mirrored log can be repaired + * or replaced with an error target. Since both the devices + * have failed, we must replace with error target - it is + * the only way to release the pending writes. + */ + if (lv_is_mirrored(detached_log_lv) && + (detached_log_lv->status & PARTIAL_LV)) { + struct lv_segment *seg = first_seg(detached_log_lv); + + log_very_verbose("%s being removed due to failures", + detached_log_lv->name); + + /* + * We are going to replace the mirror with an + * error segment, but before we do, we must remember + * all of the LVs that must be deleted later (i.e. + * the sub-lv's) + */ + for (m = 0; m < seg->area_count; m++) { + seg_lv(seg, m)->status &= ~MIRROR_IMAGE; + lv_set_visible(seg_lv(seg, m)); + if (!(lvl = dm_pool_alloc(lv->vg->cmd->mem, + sizeof(*lvl)))) + return 0; + lvl->lv = seg_lv(seg, m); + dm_list_add(&tmp_orphan_lvs, &lvl->list); + } + + if (!replace_lv_with_error_segment(detached_log_lv)) { + log_error("Failed error target substitution for %s", + detached_log_lv->name); + return_0; + } + + if (!vg_write(detached_log_lv->vg)) { + log_error("intermediate VG write fail."); + return 0; + } + + if (!suspend_lv(detached_log_lv->vg->cmd, + detached_log_lv)) { + log_error("Failed to suspend %s", + detached_log_lv->name); + vg_revert(detached_log_lv->vg); + return 0; + } + + if (!vg_commit(detached_log_lv->vg)) + return_0; + + if (!resume_lv(detached_log_lv->vg->cmd, + detached_log_lv)) + return 0; + } + } + /* * To successfully remove these unwanted LVs we need to * remove the LVs from the mirror set, commit that metadata