sourceware.org Git - lvm2.git/log

]> sourceware.org Git - lvm2.git/log

Peter Rajnoha [Fri, 4 Oct 2013 10:30:33 +0000 (12:30 +0200)]

thin: rename thin_pool_chunk_size_calculation -> ..size_policy and rename "default" policy to "generic"

Just to be consistent with existing naming we use.

commit | commitdiff | tree

Alasdair G Kergon [Tue, 1 Oct 2013 20:20:10 +0000 (21:20 +0100)]

cmdline: Add --ignoreskippedcluster.

Accept --ignoreskippedcluster with pvs, vgs, lvs, pvdisplay, vgdisplay,
lvdisplay, vgchange and lvchange to avoid the 'Skipping clustered
VG' errors when requesting information about a clustered VG
without using clustered locking and still exit with success.

The messages can still be seen with -v.

commit | commitdiff | tree

Peter Rajnoha [Mon, 30 Sep 2013 09:19:09 +0000 (11:19 +0200)]

libdm: export DM_UDEV_SUBSYSTEM_FLAG names for subystem udev flags

Just like we have symbolic names assigned to general DM udev flags
(DM_UDEV_* flags), we have the same for any subsystem flags now
(DM_SUBSYSTEM_UDEV_FLAG*), making it easier to use.

commit | commitdiff | tree

Peter Rajnoha [Mon, 30 Sep 2013 09:11:18 +0000 (11:11 +0200)]

udev: make subsystem rules responsible for importing subsystem flags

Each subsystem rule that needs to import any of DM_SUBSYSTEM_UDEV_FLAG*
flags is responsible for doing so. This simply moves control of these
flags from general 10-dm.rules to any subsystem rule using these flags
as each subsystem knows better how to handle these flags on its own.

commit | commitdiff | tree

Peter Rajnoha [Mon, 30 Sep 2013 06:58:57 +0000 (08:58 +0200)]

thin: better dbg msgs and avoid uninit. value on chunk size recalc

commit | commitdiff | tree

Peter Rajnoha [Mon, 30 Sep 2013 06:17:56 +0000 (08:17 +0200)]

fix: also make commit b4637 work without dmeventd

commit | commitdiff | tree

Peter Rajnoha [Fri, 27 Sep 2013 13:37:16 +0000 (15:37 +0200)]

udev: fix 3min udev timeout so that it is applied for all LVM volumes

The timeout should be set before any volume skipping.

commit | commitdiff | tree

Peter Rajnoha [Fri, 27 Sep 2013 11:58:55 +0000 (13:58 +0200)]

fix: make it possible to compile with --disable-devmapper again

Some code has been added recently which makes it impossible to compile
when "configure --disable-devmapper" is used. This patch just shuffles
the code around so it's under proper #ifdef DEVMAPPER_SUPPORT.

commit | commitdiff | tree

Jonathan Brassow [Thu, 26 Sep 2013 16:30:07 +0000 (11:30 -0500)]

RAID: Fix _sufficient_pes_free calculation for RAID

lib/metadata/lv_manip.c:_sufficient_pes_free() was calculating the
required space for RAID allocations incorrectly due to double
accounting.  This resulted in failure to allocate when available
space was tight.

When RAID data and metadata areas are allocated together, the total
amount is stored in ah->new_extents and ah->alloc_and_split_meta is
set.  '_sufficient_pes_free' was adding the necessary metadata extents
to ah->new_extents without ever checking ah->alloc_and_split_meta.
This often led to double accounting of the metadata extents.  This
patch checks 'ah->alloc_and_split_meta' to perform proper calculations
for RAID.

This error is only present in the function that checks for the needed
space, not in the functions that do the actual allocation.

commit | commitdiff | tree

Jonathan Brassow [Thu, 26 Sep 2013 03:35:52 +0000 (22:35 -0500)]

WHATS_NEW: description for previous commit

commit 098896fb2905cd02d02b7271be489e32d86c073d failed to include
description of what was fixed.

"Conversion from linear to mirror or RAID1 now honors
mirror_segtype_default."

commit | commitdiff | tree

Jonathan Brassow [Thu, 26 Sep 2013 03:25:43 +0000 (22:25 -0500)]

mirror/RAID: Honor mirror_segtype_default when converting from linear

1) When converting from an x-way mirror/raid1 to a y-way mirror/raid1,
the default behaviour should be to stay the same segment type.

2) When converting from linear to mirror or raid1, the default behaviour
should honor the mirror_segtype_default.

3) When converting and the '--type' argument is specified, the '--type'
argument should be honored.

catch such conditions, but errors in the tests caused the issue to go
unnoticed. The code has been fixed to perform #2 properly, the tests
have been corrected to properly test for #2, and a few other tests
were changed to explicitly specify the '--type mirror' when necessary.

commit | commitdiff | tree

Peter Rajnoha [Wed, 25 Sep 2013 14:02:38 +0000 (16:02 +0200)]

profile: add thin-performance.profile

Define a "performance" profile for thin pools which is exactly:
- allocation/thin_pool_zero = 0
- thin_pool_chunk_size_calculation = "performance"

commit | commitdiff | tree

Peter Rajnoha [Wed, 25 Sep 2013 14:01:47 +0000 (16:01 +0200)]

thin: calculate thin pool chunk size based on device IO hints

If "default" thin pool chunk size calculation method is selected,
use minimum_io_size, otherwise optimal_io_size for "performance"
device hint exposed in sysfs. If there appear to be PVs with
different hints presented, use their least common multiple.

If the hint is less than the default value defined for the
calculation method, use the default value instead.

commit | commitdiff | tree

Peter Rajnoha [Wed, 25 Sep 2013 14:00:52 +0000 (16:00 +0200)]

thin: use appropriate default value based on allocation/thin_pool_chunk_size_calculation setting

If thin_pool_chunk_size_calculation is set to "default", use 64KiB,
otheriwse 512KiB for "performance".

commit | commitdiff | tree

Peter Rajnoha [Wed, 25 Sep 2013 13:59:30 +0000 (15:59 +0200)]

conf: add allocation/thin_pool_chunk_size_calculation

Add allocation/thin_pool_chunk_size_calculation lvm.conf
option to select a method for calculating thin pool chunk
sizes and define two possible values - "default" and "performance".

commit | commitdiff | tree

Jonathan Brassow [Wed, 25 Sep 2013 02:32:53 +0000 (21:32 -0500)]

Test/clean-up: Indent clean-up and additional RAID resize test

Better indenting and a test for bug 1005434 (parity RAID should
extend in a contiguous fashion).

commit | commitdiff | tree

Jonathan Brassow [Wed, 25 Sep 2013 02:32:10 +0000 (21:32 -0500)]

RAID: Fix broken allocation policies for parity RAID types

A previous commit (b6bfddcd0a830d0c9312bc3ab906cb3d1b7a6dd9) which
was designed to prevent segfaults during lvextend when trying to
extend striped logical volumes forgot to include calculations for
RAID4/5/6 parity devices. This was causing the 'contiguous' and
'cling_by_tags' allocation policies to fail for RAID 4/5/6.

The solution is to remember that while we can compare
ah->area_count == prev_lvseg->area_count
for non-RAID, we should compare
(ah->area_count + ah->parity_count) == prev_lvseg->area_count
for a general solution.

commit | commitdiff | tree

Peter Rajnoha [Tue, 24 Sep 2013 12:03:42 +0000 (14:03 +0200)]

lvmconf: use_lvmetad=0 on --enable-cluster, reset to default on --disable-cluster

lvmetad is not yet supported in clustered environment so
disable it automatically if using lvmconf --enable-cluster
and reset it to default value if using lvmconf --disable-cluster.

Also, add a few comments in lvm.conf about locking_type vs. use_lvmetad
if setting it for clustered environment.

commit | commitdiff | tree

Peter Rajnoha [Tue, 24 Sep 2013 07:35:20 +0000 (09:35 +0200)]

tools: don't install separate command symlink for lvm devtypes

commit | commitdiff | tree

Alasdair G Kergon [Mon, 23 Sep 2013 20:50:14 +0000 (21:50 +0100)]

lvs: Add seg_size_pe field.

Requested
https://www.redhat.com/archives/linux-lvm/2013-July/msg00112.html

commit | commitdiff | tree

Alasdair G Kergon [Mon, 23 Sep 2013 18:59:47 +0000 (19:59 +0100)]

Merge branch 'master' of /data/agk/git/lvm2-upstream

commit | commitdiff | tree

Alasdair G Kergon [Mon, 23 Sep 2013 18:55:50 +0000 (19:55 +0100)]

cmdline: Accept PE ranges as start+length.

commit | commitdiff | tree

Alasdair G Kergon [Mon, 23 Sep 2013 18:53:18 +0000 (19:53 +0100)]

pvmove: clean exit on failed pvmove restart

At present, before the pvmove command can be used to restart pvmove
polling, the LVs concerned need to be activated e.g. with lvchange
-ay.

commit | commitdiff | tree

Alasdair G Kergon [Mon, 23 Sep 2013 18:50:34 +0000 (19:50 +0100)]

pvmove: Accept PE ranges as start+length.

commit | commitdiff | tree

Alasdair G Kergon [Mon, 23 Sep 2013 18:46:28 +0000 (19:46 +0100)]

pvmove: clean exit on failed pvmove restart

At present, before the pvmove command can be used to restart pvmove
polling, the LVs concerned need to be activated e.g. with lvchange -ay.

commit | commitdiff | tree

Alasdair G Kergon [Mon, 23 Sep 2013 14:55:11 +0000 (15:55 +0100)]

post-release

commit | commitdiff | tree

Alasdair G Kergon [Mon, 23 Sep 2013 14:43:37 +0000 (15:43 +0100)]

release 2.02.102

18 files changed, 137 insertions(+), 203 deletions(-)

commit | commitdiff | tree

Christine Caulfield [Mon, 23 Sep 2013 12:23:00 +0000 (13:23 +0100)]

clvmd: Fix node up/down handing in corosync module

The corosync cluster interface for clvmd did not correctly
deal with node up/down events so that when a node was removed
from the cluster clvmd would prevent remote operations
from happening, as it thought the node was up but not
running clvmd.

This patch fixes that code by simplifying the case to node
being up or down - which was the original intention
and is supported by pacemaker and CPG in the higher layers.

Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>

commit | commitdiff | tree

Zdenek Kabelac [Thu, 19 Sep 2013 12:04:18 +0000 (14:04 +0200)]

tests: drop extra loop

Loop was added as a result of unworking singlenode locking, since
this should be now fixed - remove this workaround.

commit | commitdiff | tree

Zdenek Kabelac [Thu, 19 Sep 2013 12:03:39 +0000 (14:03 +0200)]

tests: drop pointeless test

Writing directly to thin-pool and reading it is not a good idea.

commit | commitdiff | tree

Zdenek Kabelac [Thu, 19 Sep 2013 12:03:03 +0000 (14:03 +0200)]

tests: fix typo VEROSE->VERBOSE in Makefile.in

commit | commitdiff | tree

Zdenek Kabelac [Wed, 18 Sep 2013 08:48:51 +0000 (10:48 +0200)]

tests: hide expected error message

commit | commitdiff | tree

Zdenek Kabelac [Thu, 5 Sep 2013 09:32:42 +0000 (11:32 +0200)]

Makefile: add missing deps

Add missing deps for device-mapper build of scripts dir.
Cleanup multiple SUBDIR lines together.

commit | commitdiff | tree

Zdenek Kabelac [Thu, 19 Sep 2013 20:18:16 +0000 (22:18 +0200)]

activation: use improved lv_info

Call lv_info() with info == NULL to query for local active presence.

commit | commitdiff | tree

Zdenek Kabelac [Thu, 19 Sep 2013 12:05:55 +0000 (14:05 +0200)]

lvinfo: allow to use lv_info with NULL info

When NULL info struct is passed in - function is usable
as a quick query for lv_is_active_locally() - with a bonus
we may query for layered device.

So it could be seen as a more efficient lv_is_active_locally().

commit | commitdiff | tree

Zdenek Kabelac [Wed, 18 Sep 2013 15:22:14 +0000 (17:22 +0200)]

raid: add lv_is_raid()

More readable then status bit flag masking...

commit | commitdiff | tree

Zdenek Kabelac [Thu, 5 Sep 2013 11:14:19 +0000 (13:14 +0200)]

dmeventd: simplify fifos init

Simplier code.

commit | commitdiff | tree

Zdenek Kabelac [Mon, 23 Sep 2013 09:01:46 +0000 (11:01 +0200)]

cleanup: drop unused report parameter

Do not pass unused dm_report pointer.

commit | commitdiff | tree

Zdenek Kabelac [Mon, 23 Sep 2013 07:59:37 +0000 (09:59 +0200)]

cleanup: add log_error error path messages

commit | commitdiff | tree

Zdenek Kabelac [Mon, 23 Sep 2013 07:44:53 +0000 (09:44 +0200)]

cleanup: shorter code

Put assignment on declaration line.

commit | commitdiff | tree

Zdenek Kabelac [Thu, 19 Sep 2013 20:23:43 +0000 (22:23 +0200)]

cleanup: release all memory in error path

Just ensure no memory will stay in pool even in error path.

commit | commitdiff | tree

Zdenek Kabelac [Fri, 20 Sep 2013 13:52:25 +0000 (15:52 +0200)]

cleanup: skip call of detect...

SInce we know the pool was locked and we want to reloc pool again,
just use '1' directly.

commit | commitdiff | tree

Zdenek Kabelac [Thu, 5 Sep 2013 09:32:42 +0000 (11:32 +0200)]

cleanup: monitoring more readable

Put continue path into one code segment.

commit | commitdiff | tree

Jonathan Brassow [Fri, 20 Sep 2013 16:33:29 +0000 (11:33 -0500)]

test: Blacklist fedora-19 kernel for RAID4/5/6 dev replace tests

A know issue with kmem_cach is causing failures while testing
RAID 4/5/6 device replacement. Blacklist the offending kernel
so that these tests are not performed there.

commit | commitdiff | tree

Alasdair G Kergon [Fri, 20 Sep 2013 12:56:29 +0000 (13:56 +0100)]

release 2.02.101

112 files changed, 4131 insertions(+), 1312 deletions(-)

commit | commitdiff | tree

Peter Rajnoha [Fri, 20 Sep 2013 07:42:27 +0000 (09:42 +0200)]

udev: remove unused line in 69-dm-lvm-metad.rules

The explicit check for *_raid_member is not actually needed as
this gets filtered out by the ENV{ID_FS_TYPE}!="LVM2_member|LVM1_member" rule.

commit | commitdiff | tree

Jonathan Brassow [Fri, 20 Sep 2013 03:33:01 +0000 (22:33 -0500)]

RAID: Don't allow syncaction changes on non-RAID LVs

Don't allow syncaction or other RAID-type messages on non-RAID
logical volumes.

commit | commitdiff | tree

Petr Rockai [Sun, 8 Sep 2013 22:01:44 +0000 (00:01 +0200)]

clvmd: Avoid a 3-way deadlock in dead-client cleanup.

commit | commitdiff | tree

Petr Rockai [Wed, 18 Sep 2013 10:53:11 +0000 (12:53 +0200)]

vgconvert: Do not call lvmetad_vg_remove (path shared with vgcfgbackup).

commit | commitdiff | tree

Petr Rockai [Wed, 18 Sep 2013 09:22:49 +0000 (11:22 +0200)]

vgcfgrestore: Remove VG rom lvmetad later, to better deal with errors.

commit | commitdiff | tree

Petr Rockai [Wed, 18 Sep 2013 08:34:38 +0000 (10:34 +0200)]

vgcfgrestore: Remove the VG from lvmetad before overwriting it.

commit | commitdiff | tree

Jonathan Brassow [Wed, 18 Sep 2013 04:03:57 +0000 (23:03 -0500)]

man: lvs man page was not accurate on the volume health bit of lv_attr

The 'm'ismatches flag only shows after a "check" scrubbing operation -
not after a "check" or "repair" as indicated by the man page.

commit | commitdiff | tree

Alasdair G Kergon [Wed, 18 Sep 2013 00:24:19 +0000 (01:24 +0100)]

dmsetup: Detect invalid sector supplied to message.

atoll doesn't check for errors, so invalid sector numbers were silently
accepted in the "dmsetup message" command.

(Mikulas)

commit | commitdiff | tree

Alasdair G Kergon [Wed, 18 Sep 2013 00:16:48 +0000 (01:16 +0100)]

tools: Avoid overflow in _get_int_arg.

Use strtoull instead of strtol so that argument size is not cut
to 31 bytes on machines with 32-bit long.

(Mikulas)

commit | commitdiff | tree

Alasdair G Kergon [Wed, 18 Sep 2013 00:13:06 +0000 (01:13 +0100)]

libdm: avoid leak if dm_task_set_* fn called again

(Mikulas)

commit | commitdiff | tree

Alasdair G Kergon [Wed, 18 Sep 2013 00:09:15 +0000 (01:09 +0100)]

reporting: Add devtypes command.

Add internal devtypes reporting command to display built-in recognised
block device types.  (The output does not include any additional
types added by a configuration file.)

> lvm devtypes -o help
  Device Types Fields
  -------------------
    devtype_all            - All fields in this section.
    devtype_name           - Name of Device Type exactly as it appears in /proc/devices.
    devtype_max_partitions - Maximum number of partitions. (How many device minor numbers get reserved for each device.)
    devtype_description    - Description of Device Type.

> lvm devtypes
  DevType       MaxParts Description
  aoe                 16 ATA over Ethernet
  ataraid             16 ATA Raid
  bcache               1 bcache block device cache
  blkext               1 Extended device partitions
...

commit | commitdiff | tree

Jonathan Brassow [Mon, 16 Sep 2013 20:37:48 +0000 (15:37 -0500)]

WHATS_NEW: Better description for commit 82228ac

More correct description of changes made to disallow thin+mirror.

commit | commitdiff | tree

Alasdair G Kergon [Mon, 16 Sep 2013 19:20:26 +0000 (20:20 +0100)]

debug: Use // for commented out debug #defines

The traditional style used for optional editable definitions
/* #define X /* */
produces a bogus warning from gcc -Wall.

Rather than suppressing this with -Wno-comment, switch over to
the // comment style.

commit | commitdiff | tree

Alasdair G Kergon [Mon, 16 Sep 2013 18:57:14 +0000 (19:57 +0100)]

Makefiles: Fix CC variable override.

The CC override in commit f42b2d4bbf16345e5b5457f4298e751d5c134776
caused the built-in value to be used instead of the configured value
when it wasn't being overridden.

The behaviour is explained here:
http://stackoverflow.com/questions/18007326/how-to-change-default-values-of-variables-like-cc-in-makefile

commit | commitdiff | tree

Alasdair G Kergon [Mon, 16 Sep 2013 15:56:55 +0000 (16:56 +0100)]

filters: Add bcache.

N.B. Using bcache devices as PVs is still experimental.
Problems should be reported to the appropriate mailing lists.

commit | commitdiff | tree

Peter Rajnoha [Mon, 16 Sep 2013 14:28:28 +0000 (16:28 +0200)]

doc: add a note about device filtering while lvmetad is used

commit | commitdiff | tree

Zdenek Kabelac [Mon, 16 Sep 2013 13:38:42 +0000 (15:38 +0200)]

tests: no activate LV

Since our current vgcfgbackup/restore doesn't deal
with difference of active volumes between current and
restored set of volumes - run test with inactive LVs.

commit | commitdiff | tree

Zdenek Kabelac [Mon, 16 Sep 2013 11:46:17 +0000 (13:46 +0200)]

tests: update vgcfgrestore test

Test vgcfgrestore when seqno of restored mda is smaller then current
metadata on device.

commit | commitdiff | tree

Peter Rajnoha [Mon, 16 Sep 2013 09:47:09 +0000 (11:47 +0200)]

systemd: run lvm2-activation-net.service after lvm2-activation.service

The lvm2-activation-net.service was ordered only with respect to iscsi
and fcoe service before. In addition to that, we also need ordering
with respect to lvm2-activation.service to prevent parallel vgchange -aay
runs which may cause some problems during activation.
See also https://bugs.gentoo.org/show_bug.cgi?id=480066.

With this patch, the ordering is firmly set to:
lvm2-activation-early.service -> lvm2-activation.service -> lvm2-activation-net.service

Thanks to Alexander Tsoy for the original patch (modified a bit here):
https://www.redhat.com/archives/lvm-devel/2013-September/msg00049.html

commit | commitdiff | tree

Zdenek Kabelac [Mon, 16 Sep 2013 09:18:31 +0000 (11:18 +0200)]

tests: enhance pvmove testing

Rewrite check lv_on and add new lv_tree_on
Move more pvmove test unrelated code out to check & get sections
(so they do not obfuscate trace output unnecesserily)

Use new lv_tree_on()
NOTE: unsure how the snapshot origin should be accounted here.

Split pmove-all-segments into separate tests for raid and thins
(so the test output properly shows what has been skipped in test)

commit | commitdiff | tree

Zdenek Kabelac [Mon, 16 Sep 2013 09:15:38 +0000 (11:15 +0200)]

tests: use check for raid test

Use test-suite built-in check command for this test
(creates better logs, and reduces amount of valgrind-ed commands)

commit | commitdiff | tree

Zdenek Kabelac [Mon, 16 Sep 2013 09:12:37 +0000 (11:12 +0200)]

tests: update check and get

Update usage of "" around shell vars.
trim needs to trim both sides now.
trim also removes debug.log since it's only called when lvm command
has finished properly (so if something fails afterward, there
is no missleading debug trace in the log)

commit | commitdiff | tree

Zdenek Kabelac [Mon, 16 Sep 2013 09:05:03 +0000 (11:05 +0200)]

tests: update die

'die' evaluates given string - so \n could be used for
multiline error report

Also remove debug.log since the command finished properly when we
call 'die'

Note: we should not call 'die' after lvm command failure.

commit | commitdiff | tree

Zdenek Kabelac [Mon, 16 Sep 2013 09:02:58 +0000 (11:02 +0200)]

tests: add help function

Add mkdev_md5sum to create and checksum given LV.
Add dev_md5sum to verify device has matching md5 sum.

commit | commitdiff | tree

Jonathan Brassow [Thu, 12 Sep 2013 18:23:53 +0000 (13:23 -0500)]

TEST: Unaccounted possible output causing failure

lvchange-raid.sh checks to ensure that the 'p'artial flag takes
precedence over the 'w'ritemostly flag by disabling and reenabling
a device in the array.  Most of the time this works fine, but
sometimes the kernel can notice the device failure before it is
reenabled.  In that case, the attr flag will not return to 'w', but
to 'r'efresh.  This is because 'r'efresh also takes precedence over
the 'w'ritemostly flag.  So, we also do a quick check for 'r' and
not just 'w'.

commit | commitdiff | tree

Peter Rajnoha [Thu, 12 Sep 2013 12:16:54 +0000 (14:16 +0200)]

WHATS_NEW: some missing lines

commit | commitdiff | tree

Peter Rajnoha [Thu, 12 Sep 2013 11:46:20 +0000 (13:46 +0200)]

udev: keep DM_ACTIVATION and DM_UDEV_PRIMARY_SOURCE_FLAG meaning as before commit 8d1d835

The DM_ACTIVATION and DM_UDEV_PRIMARY_SOURCE_FLAG needs to be kept the
way it was for backward compatibility (e.g. the old rules are still
in initramfs). This way the check in whether the device should be
scanned in 69-dm-lvmetad.rules is even easier.

commit | commitdiff | tree

Zdenek Kabelac [Thu, 12 Sep 2013 11:30:12 +0000 (13:30 +0200)]

tests: extend harness with output of /var/log/messages

Add a very simple hack for embeding /var/log/messages into
the tests output - it's not ideal since it sometimes breaks lines,
but still gives valuable info.

commit | commitdiff | tree

Zdenek Kabelac [Thu, 12 Sep 2013 09:29:18 +0000 (11:29 +0200)]

tests: singlenode cleanup for prev commit

Add few more comments and cleanup some warnings.

commit | commitdiff | tree

Zdenek Kabelac [Wed, 11 Sep 2013 21:46:20 +0000 (23:46 +0200)]

tests: singlenode updates

Add more 'realistic' simulation of dlm locking.
Previous version was not capable to maintain multiple locks.
Current version doesn't handle multiqueues for locks,
so the ordering is different.

commit | commitdiff | tree

Zdenek Kabelac [Thu, 12 Sep 2013 08:33:51 +0000 (10:33 +0200)]

clvmd: add missing debug newline

Just missing new line.

commit | commitdiff | tree

Zdenek Kabelac [Mon, 9 Sep 2013 11:06:05 +0000 (13:06 +0200)]

cleanup: hide gcc warning

Older gcc is giving misleading warning:

metadata/lv_manip.c:4018: warning: ‘seg’ may be used uninitialized in
this function

But warning free compilation is better.

commit | commitdiff | tree

Jonathan Brassow [Wed, 11 Sep 2013 20:58:44 +0000 (15:58 -0500)]

Mirror/Thin: Disallow thinpools on mirror logical volumes

The same corner cases that exist for snapshots on mirrors exist for
any logical volume layered on top of mirror.  (One example is when
a mirror image fails and a non-repair LVM command is the first to
detect it via label reading.  In this case, the LVM command will hang
and prevent the necessary LVM repair command from running.)  When
a better alternative exists, it makes no sense to allow a new target
to stack on mirrors as a new feature.  Since, RAID is now capable of
running EX in a cluster and thin is not active-active aware, it makes
sense to pair these two rather than mirror+thinpool.

As further background, here are some additional comments that I made
when addressing a bug related to mirror+thinpool:
(https://bugzilla.redhat.com/show_bug.cgi?id=919604#c9)
I am going to disallow thin* on top of mirror logical volumes.
Users will have to use the "raid1" segment type if they want this.

This bug has come down to a choice between:
1) Disallowing thin-LVs from being used as PVs.
2) Disallowing thinpools on top of mirrors.

The problem is that the code in dev_manager.c:device_is_usable() is unable
to tell whether there is a mirror device lower in the stack from the device
being checked.  Pretty much anything layered on top of a mirror will suffer
from this problem.  (Snapshots are a good example of this; and option #1
above has been chosen to deal with them.  This can also be seen in
dev_manager.c:device_is_usable().)  When a mirror failure occurs, the
kernel blocks all I/O to it.  If there is an LVM command that comes along
to do the repair (or a different operation that requires label reading), it
would normally avoid the mirror when it sees that it is blocked.  However,
if there is a snapshot or a thin-LV that is on a mirror, the above code
will not detect the mirror underneath and will issue label reading I/O.
This causes the command to hang.

Choosing #1 would mean that thin-LVs could never be used as PVs - even if
they are stacked on something other than mirrors.

Choosing #2 means that thinpools can never be placed on mirrors.  This is
probably better than we think, since it is preferred that people use the
"raid1" segment type in the first place.  However, RAID* cannot currently
be used in a cluster volume group - even in EX-only mode.  Thus, a complete
solution for option #2 must include the ability to activate RAID logical
volumes (and perform RAID operations) in a cluster volume group.  I've
already begun working on this.

commit | commitdiff | tree

Peter Rajnoha [Wed, 11 Sep 2013 11:15:53 +0000 (13:15 +0200)]

WHATS_NEW: one more for commit 8d1d835

commit | commitdiff | tree

Peter Rajnoha [Wed, 11 Sep 2013 10:58:43 +0000 (12:58 +0200)]

WHATS_NEW: for commit 8d1d8350 and 72a9d4f

commit | commitdiff | tree

Peter Rajnoha [Wed, 11 Sep 2013 10:42:29 +0000 (12:42 +0200)]

udev: override new udev default timeout of 30s to original 3min

New versions of udev changed the default event timeout to 30s
from original 3min. This causes problems with LVM processes that
starve because of the IO load caused by some LVM actions (e.g.
mirror/raid synchronization).

Reinstate the 3min udev timeout for now until we optimize this
in a way that even the 30s timeout is sufficient.

commit | commitdiff | tree

Jonathan Brassow [Tue, 10 Sep 2013 21:33:22 +0000 (16:33 -0500)]

RAID: Make RAID single-machine-exclusive capable in a cluster

Creation, deletion, [de]activation, repair, conversion, scrubbing
and changing operations are all now available for RAID LVs in a
cluster - provided that they are activated exclusively.

The code has been changed to ensure that no LV or sub-LV activation
is attempted cluster-wide.  This includes the often overlooked
operations of activating metadata areas for the brief time it takes
to clear them.  Additionally, some 'resume_lv' operations were
replaced with 'activate_lv_excl_local' when sub-LVs were promoted
to top-level LVs for removal, clearing or extraction.  This was
necessary because it forces the appropriate renaming actions the
occur via resume in the single-machine case, but won't happen in
a cluster due to the necessity of acquiring a lock first.

The *raid* tests have been updated to allow testing in a cluster.
For the most part, this meant creating devices with '-aey' if they
were to be converted to RAID.  (RAID requires the converting LV to
be EX because it is a condition of activation for the RAID LV in
a cluster.)

commit | commitdiff | tree

Peter Rajnoha [Tue, 10 Sep 2013 13:49:05 +0000 (15:49 +0200)]

udev: fix pvscan --cache -aay to trigger on relevant events

This patch fixes the way the special devices are handled
(special in this context means that they're not usable
after the usual ADD event like other generic devices):

  - DM and MD devices are pvscanned only when they are just set up.
    This is the first CHANGE event that makes the device  usable
    (the DM_UDEV_PRIMARY_SOURCE_FLAG is set for DM and the
     md/array_state sysfs attribute is present for MD).
    Whether the device is activated is remembered via
    DM_ACTIVATED (for DM) and LVM_MD_PV_ACTIVATED (for MD)
    udev environment variable. This is then used to decide
    whether we should fire the pvscan on ADD event to
    support coldplugging. For any (artificial) ADD event
    generated during coldplug, the device must be already
    set up properly to fire the pvscan on it.

  - Similar for loop devices. For loop devices, only CHANGE
    events are relevant (so there's a CHANGE after the loop
    device is set up as well as detached). Whether the loop
    has just been activated is detected via loop/backing_file
    sysfs attribute presence. The activation state is remembered
    via LVM_LOOP_PV_ACTIVATED udev environment variable.

  - Do not pvscan multipath device components (underlying paths).

  - Do not pvscan RAID device components.

  - Also, set LVM_SCANNED="1" udev environment variable for
    debug purposes (it's visible in the lvmdump -u that takes
    the current udev database). This variable is set once
    the pvscan is triggered.

The table below summarises when the pvscan is triggered
(marked with X, X* means fire only if the special dev is properly set up):

      | real ADD | real CHANGE | artificial ADD | artificial CHANGE | remove
=============================================================================
DM    |          |      X      |       X*       |                   |   X
MD    |          |      X      |       X*       |                   |
loop  |          |      X      |       X*       |                   |
other |    X     |             |       X        |                   |   X

commit | commitdiff | tree

Peter Rajnoha [Tue, 10 Sep 2013 10:38:18 +0000 (12:38 +0200)]

udev: DM_ID_FS_TYPE should be ID_FS_TYPE when comparing with old value

commit | commitdiff | tree

Jonathan Brassow [Mon, 9 Sep 2013 20:07:28 +0000 (15:07 -0500)]

Misc/RAID: Enable resume_lv to handle some renaming conflicts.

When images and their associated metadata are removed from a RAID1 LV,
the remaining sub-LVs are "shifted" down to fill the gaps.  For
example, if there is a 3-way mirror:
[0][1][2]
and we remove device#0, the devices will be shifted down
[1][2]
and renamed.
[0][1]

This can create a problem for resume_lv (specifically,
dm_tree_activate_children) during the renaming process though.  This
is because it will attempt to rename the higher indexed sub-LVs first
and find that it cannot because there are currently other sub-LVs with
that name.  The solution is to check for a conflicting name before
attempting to rename.  If a conflict is found and that conflicting
sub-LV is also in the process of renaming, we can defer the current
rename until the conflicting sub-LV has renamed and cleared the
conflict.

Now that resume_lv can handle these types of rename conflicts, we can
remove the workaround in RAID that was attempting to resume a RAID1
LV from the bottom-up in order to force a proper rename in assending
order before attempting a resume on the top-level LV.  This "hack"
only worked for single machine use-cases of LVM.  Clearing this up
paves the way for exclusive activation of RAID LVs in a cluster.

commit | commitdiff | tree

Peter Rajnoha [Mon, 9 Sep 2013 11:47:25 +0000 (13:47 +0200)]

udev: also inform lvmetad about lost LVM1 PV label

Addendum to 4d3b5724e0b51782000a45027de00e0fed1c9833
which covered only LVM2 PV labels.

commit | commitdiff | tree

Zdenek Kabelac [Mon, 9 Sep 2013 10:15:48 +0000 (12:15 +0200)]

cleanup: drop within comment gcc warning

toollib.c:69:24: warning: "/*" within comment

commit | commitdiff | tree

Zdenek Kabelac [Mon, 9 Sep 2013 10:14:00 +0000 (12:14 +0200)]

test: timeout when no write happens since last written line

Change current test abort after 3 minutes, to abort after 3 minutes
without written output line.

commit | commitdiff | tree

Zdenek Kabelac [Sat, 7 Sep 2013 00:48:19 +0000 (02:48 +0200)]

deactivate: drop readahead calc in deactivation

Skip readahead when device will be deactivated.

commit | commitdiff | tree

Zdenek Kabelac [Fri, 6 Sep 2013 12:07:39 +0000 (14:07 +0200)]

thin: validation catch multiseg thin pool/volumes

Multisegment thin pools and volumes are not supported.
Catch such error code path early.

commit | commitdiff | tree

Zdenek Kabelac [Sat, 7 Sep 2013 00:46:48 +0000 (02:46 +0200)]

thin: fix monitoring of thin pool volume

Properly skip unmonitoring of thin pool volume in deactivation code
path. Code makes sure if there is just any thin pool user
it stays monitored with all its resources.

commit | commitdiff | tree

Zdenek Kabelac [Fri, 6 Sep 2013 08:54:50 +0000 (10:54 +0200)]

thin: fix resize of stacked thin pool volume

When the pool is created from non-linear target the more complex rules
have to be used and stacking needs to properly decode args for _tdata
LV. Also proper allocation policies are being used according to those
set in lvm2 metadata for data and metadata LVs.

Also properly check for active pool and extra code to active it
temporarily.

With this fix it's now possible to use:

lvcreate -L20 -m2 -n pool vg --alloc anywhere
lvcreate -L10 -m2 -n poolm vg --alloc anywhere
lvconvert --thinpool vg/pool --poolmetadata vg/poolm

lvresize -L+10 vg/pool

commit | commitdiff | tree

Petr Rockai [Fri, 6 Sep 2013 14:35:49 +0000 (16:35 +0200)]

test: Add the 64b fc17 kernel to the mirror recovery blacklist.

commit | commitdiff | tree

Alasdair G Kergon [Fri, 6 Sep 2013 00:49:43 +0000 (01:49 +0100)]

toollib: tweak background forking

Log what is forked and replace #if 1 with DEBUG_CHILD.

commit | commitdiff | tree

Alasdair G Kergon [Fri, 6 Sep 2013 00:47:41 +0000 (01:47 +0100)]

logging: Write Completed message before resetting.

commit | commitdiff | tree

Alasdair G Kergon [Fri, 6 Sep 2013 00:43:24 +0000 (01:43 +0100)]

tools: Use backgroundfork_ARG for pvscan -b

Change pvscan -b to use a new backgroundfork_ARG instead of
background_ARG so as not to affect pvmove -b and lvconvert -b.

commit | commitdiff | tree

Petr Rockai [Wed, 4 Sep 2013 14:20:35 +0000 (16:20 +0200)]

test: Include tests that timed out in the final summary.

commit | commitdiff | tree

Jonathan Brassow [Tue, 3 Sep 2013 18:17:01 +0000 (13:17 -0500)]

pvmove: Skip pvmove of RAID, thin, snapshot, origin, and mirror LVs in cluster

pvmove of the above types should only have been enabled in single machine
mode.

commit | commitdiff | tree

Petr Rockai [Tue, 3 Sep 2013 18:05:22 +0000 (20:05 +0200)]

test: Fix a spurious failure in skip_if_mirror_recovery_broken.

commit | commitdiff | tree

Jonathan Brassow [Tue, 3 Sep 2013 15:51:21 +0000 (10:51 -0500)]

TEST: pvmove-all-segtypes.sh should not be run in a cluster

LVM2 upstream repository (moved from fedorahosted in Feb 2017)

RSS Atom

This page took 0.076712 seconds and 5 git commands to generate.