scoutfs

mirror of https://github.com/versity/scoutfs.git synced 2026-01-05 11:45:09 +00:00

Author	SHA1	Message	Date
Zach Brown	2a6d827e7a	Add test for changing devices Add a relatively small initial test for swapping devices around. Signed-off-by: Zach Brown <zab@versity.com>	2023-04-17 12:47:50 -07:00
Zach Brown	6ded240089	Add t_rc test execution helper function Add a quick wrapper to run commands whose output is saved while only echoing their return code. Signed-off-by: Zach Brown <zab@versity.com>	2023-04-17 12:47:50 -07:00
Zach Brown	99a20bc383	Put scratch mount point in test tmp dirs Some tests had grown a bad pattern of making a mount point for the scratch mount in the root /mnt directory. Change them to use a mount point in their test's temp directory outside the testing fs. Signed-off-by: Zach Brown <zab@versity.com>	2023-04-17 12:47:50 -07:00
Zach Brown	f1264c7e47	Add test to rename into root directory The ancestor tests in rename were preventing renaming into the root directory. Signed-off-by: Zach Brown <zab@versity.com>	2023-03-08 11:00:59 -08:00
Zach Brown	9ba2ee5c88	Add testing of O_TMPFILE umask There were kernels that didn't apply the current umask to inode modes created with O_TMPFILE without acls. Let's have a test running to make sure that we're not surprised if we come across one. Signed-off-by: Zach Brown <zab@versity.com>	2023-01-09 14:49:23 -08:00
Zach Brown	fe33a492c2	Make o_tmpfile test more generic The o_tmpfile test only did one thing, clean it up a bit so we can add more tests to the file. Signed-off-by: Zach Brown <zab@versity.com>	2023-01-09 10:14:40 -08:00
Zach Brown	77c0ff89fb	Rename stage-tmpfile to o_tmpfile We had a one-off test that was overly specific to staging from tmpfile. This renames it to a more generic test where we can add more tests of o_tmpfile in general. Signed-off-by: Zach Brown <zab@versity.com>	2023-01-09 10:07:15 -08:00
Zach Brown	6d4916954b	Add basic-truncate test Signed-off-by: Zach Brown <zab@versity.com>	2022-12-06 10:31:31 -08:00
Zach Brown	41174867ed	Add t_get_sysfs_mount_option test func Add a quick little function to get the value of a mount option. Signed-off-by: Zach Brown <zab@versity.com>	2022-12-02 12:28:13 -08:00
Zach Brown	701f1a9538	Add test that checks duplicate meta_seq entries Add a quick test of the index items to make sure that rapid inode updates don't create duplicate meta_seq items. Signed-off-by: Zach Brown <zab@versity.com>	2022-11-15 13:26:32 -08:00
Zach Brown	d5ddf1ecac	Fix option save/restore test helpers The test shell helpers for saving and restoring mount options were trying to put each mount's option value in an array. It meant to build the array key by concatenating the option name and the mount number. But it didn't isolate the option "name" variable when evaluating it, instead always evaluating "name_" to nothing and building keys for all options that only contained the mount index. This then broke when tests attempted to save and restore multiple options. Signed-off-by: Zach Brown <zab@versity.com>	2022-10-17 09:12:21 -07:00
Zach Brown	e27ea22fe4	Add run-tests -T option to increase trace size Add an option to increase the trace buffer size during the run. Signed-off-by: Zach Brown <zab@versity.com>	2022-10-14 14:03:36 -07:00
Zach Brown	51fe5a4ceb	Add -o mount option argument to run-tests Add a run-tests option that lets us append an option string to all mounts performed during the tests. Signed-off-by: Zach Brown <zab@versity.com>	2022-10-14 14:03:36 -07:00
Zach Brown	3847c4fe63	Add data-prealloc test Signed-off-by: Zach Brown <zab@versity.com>	2022-10-14 14:03:35 -07:00
Zach Brown	78405bb5fd	Remove ACL tests from xfstests expunge list Signed-off-by: Zach Brown <zab@versity.com>	2022-09-28 10:36:10 -07:00
Zach Brown	98e514e5f4	Add failure message to xattr length test The simple-xattr-unit test had a helper that failed by exiting with non-zero instead of emitting a message. Let's make it a bit easier to see what's going on. Signed-off-by: Zach Brown <zab@versity.com>	2022-09-28 10:36:10 -07:00
Zach Brown	acb94dd9b7	Add test of large fragmented free lists Add a test which gives the server a transaction with a free list block that contains blknos that each dirty an individiaul btree blocks in the global data free extent btree. Signed-off-by: Zach Brown <zab@versity.com>	2022-07-29 11:25:01 -07:00
Zach Brown	dff366e1a4	Add lock invalidation and recovery test Add a test which tries to have lock recovery processed during lock invalidation on clients. Signed-off-by: Zach Brown <zab@versity.com>	2022-04-27 12:22:18 -07:00
Zach Brown	26ae9c6e04	Verify local unmount testing fence script The fence script we use for our single node multi-mount tests only knows how to fence by using forced unmount to destroy a mount. As of now, the tests only generate failing nodes that need to be fenced by using forced unmount as well. This results in the awkward situation where the testing fence script doesn't have anything to do because the mount is already gone. When the test fence script has nothing to do we might not notice if it isn't run. This adds explicit verification to the fencing tests that the script was really run. It adds per-invocation logging to the fence script and the test makes sure that it was run. While we're at it, we take the opportunity to tidy up some of the scripting around this. We use a sysfs file with the data device major:minor numbers so that the fencing script can find and unmount mounts without having to ask them for their rid. They may not be operational. Signed-off-by: Zach Brown <zab@versity.com>	2022-03-28 14:52:08 -07:00
Zach Brown	875583b7ef	Add t_fs_is_leader test helper The t_server_nr and t_first_client_nr helpers iterated over all the fs numbers examining their quorum/is_leader files, but clients don't have a quorum/ directory. This was causing spurious outputs in tests that were looking for servers but didn't find it in the first quorum fs number and made it down into the clients. Give them a helper that returns 0 for being a leader if the quorum/ dir doesn't exist. Signed-off-by: Zach Brown <zab@versity.com>	2022-03-15 16:09:55 -07:00
Zach Brown	e2c90339c5	Add orphan-inodes to race final deletion We're seeing some trouble with very specific race conditions. This updates the orphan-inodes test to try and force final inode deletion during eviction, the orphan scan worker, and opening inodes by handle to all race and hit an inode number at the same time. Signed-off-by: Zach Brown <zab@versity.com>	2022-03-11 14:30:17 -08:00
Zach Brown	4a0b14a4f2	Wait for stdin open in orphan-inodes test The orphan inode test often uses a trick where it runs sleep in the abckground with a file as stdin as a means of holding files open. This can very rarely fail if the background sleep happens to be first schedled after the unlink of the file it's reading as stdin. A small delay gives it a chance to run and open the file before its unlinked. It's still possible to lose the race, of course, but so far this has been good enough. Signed-off-by: Zach Brown <zab@versity.com>	2022-03-10 11:43:11 -08:00
Zach Brown	90518a0fbd	Add handle_fsetxattr test utility Add a quick little binary that spins opening an inode by a handle and calling fsetxattr. Signed-off-by: Zach Brown <zab@versity.com>	2022-03-10 11:43:11 -08:00
Zach Brown	cd23cc61ca	Add mount option test bash functions Add some test functions which work with mount options. Signed-off-by: Zach Brown <zab@versity.com>	2022-03-10 11:43:11 -08:00
Zach Brown	b2834d3c28	Add basic bad mount testing Add some tests which exercise the kinds of reasonable mistakes that people will make in the field. Signed-off-by: Zach Brown <zab@versity.com>	2022-02-21 10:44:38 -08:00
Zach Brown	48966b42bb	Add simple fallocate test Signed-off-by: Zach Brown <zab@versity.com>	2022-02-17 11:20:08 -08:00
Zach Brown	9fa2c6af89	Use get-allocated-inos in orphan-inodes test The orphan inodes test needs to test if inode items exist as it manipulates inodes. It used to open the inode by a handle but we're fixing that to not allow opening unlinked files. The get-allocated-inos ioctl tests for the presence of items owned by the inode regardless of any other vfs state so we can use it to verify what scoutfs is doing as we work with the vfs inodes. Signed-off-by: Zach Brown <zab@versity.com>	2022-01-24 09:40:08 -08:00
Zach Brown	813ce24d79	Move local-force-unmount test script into tests/ The local-force-unmount fenced fencing script only works when all the mounts are on the local host and it uses force unmount. It is only used in our specific local testing scripts. Packaging it as an example lead people to believe that it could be used to cobble together a multi-host testing network, however temporary. Move it from being in utils and packged to being private to our tests so that it doesn't present an attractive nuisance. Signed-off-by: Zach Brown <zab@versity.com>	2022-01-19 11:33:34 -08:00
Bryant G. Duffy-Ly	8a504cd5ae	Add client/server unmount race on lock_recov unit test This unit test reproduces the race we have between client and server diong lock recovery while farewell is processed. Signed-off-by: Bryant G. Duffy-Ly <bduffyly@versity.com>	2022-01-12 21:29:00 -06:00
Zach Brown	93bc52cc54	Merge pull request #60 from bgly/bduffyly/block_stale_reads Fix block-stale-read test case	2021-11-24 10:25:26 -08:00
Bryant G. Duffy-Ly	0abcd5a004	Take generic/025/078 off expunge list adding 23/24 We want to enable the test case for: generic/023 - tests that renameat2 syscall exists generic/024 - renameat2 with NOREPLACE flag Move both generic/025 and 078 to the no run list so that we can test the unsupported output if the flags were passed that were not supported. Example output: generic/025 [not run] fs doesn't support RENAME_EXCHANGE generic/078 [not run] fs doesn't support RENAME_WHITEOUT Signed-off-by: Bryant G. Duffy-Ly <bduffyly@versity.com>	2021-11-19 17:54:19 -06:00
Bryant G. Duffy-Ly	888ad8ec5c	Add renameat2 unit test case The goal of the test case is to have two mount points with two async calls made to do renameat2. This allows for two calls to race to call renameat2 RENAME_NOREPLACE. When this happens you expect one of them to fail with a -EEXIST. This would validate that the new flag works. Essentially one of the two calls to renameat should hit the new RENAME_NOREPLACE code and exit early. Signed-off-by: Bryant G. Duffy-Ly <bduffyly@versity.com>	2021-11-19 17:54:13 -06:00
Bryant G. Duffy-Ly	16ea0ef671	Add syscall wrapper for renameat2 Signed-off-by: Bryant G. Duffy-Ly <bduffyly@versity.com>	2021-11-19 17:54:08 -06:00
Bryant G. Duffy-Ly	3ae0ebd0d8	Fix block-stale-read test case The current test case attempts to create a state to read by calling setattr and getattr in attempt to force block cache reads. It so happens that this does not always force cache block reads, which in rare cases causes this test case to fail. The new test case removes all the extra bouncing around of mount points and we just directly call scoutfs df which will walk everyone's allocators to summarize the block counts, which is guaranteed to exist. Therefore, we do not have to create any sort of state prior to trying to force a read. Signed-off-by: Bryant G. Duffy-Ly <bduffyly@versity.com>	2021-11-19 15:41:54 -06:00
Bryant G. Duffy-Ly	38ee2defd5	Add a filter for forced unmount error output [85164.299902] scoutfs f.8c19e1.r.facf2e error: server error writing btree blocks: -5 [144308.589596] scoutfs f.c9397a.r.8ae97f error: server error -5 freeing merged btree blocks: looping commit del/upd freeing item [174646.005596] scoutfs f.15f0b3.r.1862df error: server error -5 freeing merged btree blocks: final commit del/upd freeing item [146653.893676] scoutfs f.c7f188.r.34e23c error: server error writing super block: -5 [273218.436675] scoutfs f.dd4157.r.f0da7e error: server failed to bind to 127.0.0.1:42002, err -98 [376832.542823] scoutfs f.049985.r.1a8987 error: error -5 reading quorum block 19 to update event 1 term 3 The above is an example output that will be filtered out Signed-off-by: Bryant G. Duffy-Ly <bduffyly@versity.com>	2021-11-08 07:36:02 -06:00
Zach Brown	f02d68f567	Merge pull request #55 from versity/zab/v1_format_version Zab/v1 format version	2021-11-03 10:18:50 -07:00
Zach Brown	1b4d291bf7	Fix xattr update out of bounds access As we update xattrs we need to update any existing old items with the contents of the new xattr that uses those items. The loop that updated existing items only took the old xattr size into account and assumed that the new xattr would use those items. If the new xattr size used fewer parts then the attempt to update all the old parts that weren't covered by the new size would go very wrong. The length of the region in the new xattr would be negative so it'd try to use the max part length. Worse, it'd copy these max part length regions outside the input new xattr buffer. Typically this would land in addressible memory and copy garbage into the unused old items before they were later deleted. However, it could access so far outside the input buffer that it could cross a page boudary into inaccessible memory and fault. We saw this in the field while trying to repeatedly incrementally shrink a large xattr. This fixes the loop that updates overlapping items between the new and old xattr to start with the smaller of their two item counts. Now it will only update items that are actually used by both xattrs and will only safely access the new xattr input buffer. Signed-off-by: Zach Brown <zab@versity.com>	2021-11-01 11:33:17 -07:00
Zach Brown	932a842ae3	Remove valid_bytes from stat _more ioctls The idea here was that we'd expand the size of the struct and valid_bytes would tell the kernel which fields were present in userspace's struct. That doesn't combine well with the ioctl convention of having the size of the type baked into the ioctl number. We'll remove this to make the world less surprising. If we expand the interface we'd add additional ioctls and types. Signed-off-by: Zach Brown <zab@versity.com>	2021-10-28 12:30:47 -07:00
Zach Brown	75f9aabe75	Allow compacting logs down to a single page The k-way merge function at the core of the srch file entry merging had some bookkeeping math (calculating number of parents) that couldn't handle merging a single incoming entry stream, so it threw a warning and returned an error. When refusing to handle that case, it was assuming that caller was trying to merge down a single log file which doesn't make any sense. But in the case of multiple small unsorted logs we can absolutely end up with their entries stored in one sorted page. We have one sorted input page that's merging multiple log files. The merge function is also the path that writes to the output file so we absolutely need to handle this case. We more carefully calculate the number of parents, clamping it to one parent when we'd otherwise get "(roundup(1) -> 1) - 1 == 0" when calculating the number of parents from the number of inputs. We can relax the warning and error to refuse to merge nothing. The test triggers this case by putting single search entries in the log files for mounts and unmounting them to force rotation of the mount log files into mergable rotated log files. Signed-off-by: Zach Brown <zab@versity.com>	2021-10-28 12:30:47 -07:00
Zach Brown	4e00f95014	run-tests builds our targets with -j The test harness might as well use all cpus when building. It's reasonably safe to assume both that the test systems are otherwise idle and that the build is likely to succeed. Signed-off-by: Zach Brown <zab@versity.com>	2021-10-28 12:30:46 -07:00
Bryant Duffy-Ly	501953d69e	Fix mkdir-rename-rmdir test script The current script gets stuck in an infinite loop when the test suite is started with 1 mount point. This is due to the advancement part of the script in which it advances the ops for each mount. The current while loop checks for when the op_mnt wraps by checking if it equals 0. But the problem is we set each of the op_mnts to 0 during the advancement, so when it wraps it still equates to 0, so it is an infinite loop. Therefore, the fix is to check at the end of the loop check if the last op's mount number wrapped. If so just break out. Signed-off-by: Bryant Duffy-Ly <bduffyly@versity.com>	2021-10-21 11:41:02 -05:00
Zach Brown	ab92d8d251	Add quick test for racing creates Add a quick test to make sure that create is validating stale dentries before deciding if it should create or return -eexist. Signed-off-by: Zach Brown <zab@versity.com>	2021-09-13 14:41:07 -07:00
Zach Brown	b9a0f1709f	Add xattr .totl. tag Add the .totl. xattr tag. When the tag is set the end of the name specifies a total name with 3 encoded u64s separated by dots. The value of the xattr is a u64 that is added to the named total. An ioctl is added to read the totals. Signed-off-by: Zach Brown <zab@versity.com>	2021-09-13 14:41:07 -07:00
Zach Brown	2a33b9faf0	Add some error testing to srch-basic-functionality When the xattr inode searchs fail the test will eventually fail when the output differs, but that could take a while. Have it fail much sooner so that we can have tighter debugging interations and trace ring buffer contents that are likely to be a lot closer to the first failure. Signed-off-by: Zach Brown <zab@versity.com>	2021-08-25 10:14:38 -07:00
Zach Brown	b0a08eb922	Remove lock grace period We had some logic to try and delay lock invalidation while the lock was still actively in use. This was trying to reduce the cost of pathological lock conflict cases but it had some severe fairness problems. It was first introduced to deal with bad patterns in userspace that no longer exist and it was built on top of the LSM transaction machinery that also no longer exists. It hasn't aged well. Instead of introducing invalidation latency in the hopes that it leads to more batched work, which it can't always, let's aim more towards reducing latency in all parts of the write-invalidate-read path and also aim towards reducing contention in the first place. Signed-off-by: Zach Brown <zab@versity.com>	2021-08-25 10:14:38 -07:00
Zach Brown	6d0694f1b0	Add resize_devices ioctl and scoutfs command Add a scoutfs command that uses an ioctl to send a request to the server to safely use a device that has grown. Signed-off-by: Zach Brown <zab@versity.com>	2021-07-30 13:26:32 -07:00
Zach Brown	a9baeab22e	stage_tmpfile test gets current data_version The stage_tmpfile test util was written when fallocate didn't update data_version for size extensions. It is more correct to get the data_version after fallocate changes data_versions for however many transactions, extent allocations, and i_size extensions it took to allocate space. Signed-off-by: Zach Brown <zab@versity.com>	2021-07-30 13:22:42 -07:00
Zach Brown	e4dca8ddcc	Don't shutdown quorum if server startup fails The quorum service shuts down if it sees errors that mean that it can't do its job. This is mostly fatal errors gathering resources at startup or runtime IO errors but it was also shutting down if server startup fails. That's not quite right. This should be treated like the server shutting down on errors. Quorum needs to stay around to participate in electing the next server. Fence timeouts could trigger this. A quorum mount could crash, the next server without a fence script could have a fence request timeout and shutdown, and now the third remaining server is left to indefinitely send vote requests into the void. With this fixed, continuing that example, the quorum service in the second mount remains to elect the third server with a working fence script after the second server shuts down after its fence request times out. Signed-off-by: Zach Brown <zab@versity.com>	2021-07-30 11:34:52 -07:00
Zach Brown	120c2d342a	Add create_xattr_loop test tool Add a quick tool that creates xattrs in a tight loop. Signed-off-by: Zach Brown <zab@versity.com>	2021-07-07 14:13:14 -07:00
Zach Brown	73bf916182	Return ENOSPC as space gets low Returning ENOSPC is challenging because we have clients working on allocators which are a fraction of the whole and we use COW transactions so we need to be able to allocate to free. This adds support for returning ENOSPC to client posix allocators as free space gets low. For metadata, we reserve a number of free blocks for making progress with client and server transactions which can free space. The server sets the low flag in a client's allocator if we start to dip into reserved blocks. In the client we add an argument to entering a transaction which indicates if we're allocating new space (as opposed to just modifying existing data or freeing). When an allocating transaction runs low and the server low flag is set then we return ENOSPC. Adding an argument to transaciton holders and having it return ENOSPC gave us the opportunity to clean it up and make it a little clearer. More work is done outside the wait_event function and it now specifically waits for a transaction to cycle when it forces a commit rather than spinning until the transaction worker acquires the lock and stops it. For data the same pattern applies except there are no reserved blocks and we don't COW data so it's a simple case of returning the hard ENOSPC when the data allocator flag is set. The server needs to consider the reserved count when refilling the client's meta_avail allocator and when swapping between the two meta_avail and meta_free allocators. We add the reserved metadata block count to statfs_more so that df can subtract it from the free meta blocks and make it clear when enospc is going to be returned for metadata allocations. We increase the minimum device size in mkfs so that small testing devices provide sufficient reserved blocks. And finally we add a little test that makes sure we can fill both metadata and data to ENOSPC and then recover by deleting what we filled. Signed-off-by: Zach Brown <zab@versity.com>	2021-07-07 14:13:14 -07:00

1 2 3

128 Commits