Very old copy/paste bug here: we want to update new_inode's ctime
instead. old_inode is already updated.
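A minimal sketch of the corrected assignment, assuming the surrounding
rename code follows the usual VFS pattern for timestamps:

    /* both inodes' ctimes must move forward on rename; the bug set
     * old_inode's ctime twice and never touched new_inode's */
    struct timespec64 now = current_time(old_inode);

    old_inode->i_ctime = now;   /* was already correct */
    new_inode->i_ctime = now;   /* the missing update */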
Signed-off-by: Auke Kok <auke.kok@versity.com>
We need to ensure we're emitting dents with the proper position,
and we already have the position as part of our dent. The only caveat
is to increment ctx->pos once beyond the list to make sure the caller
doesn't call us again.
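Roughly the emit loop this describes (the dent fields and list name
are illustrative; dir_emit() is the VFS helper):

    loff_t last_pos = ctx->pos;

    list_for_each_entry(dent, &dents, head) {
            ctx->pos = dent->pos;
            if (!dir_emit(ctx, dent->name, dent->name_len,
                          dent->ino, dent->type))
                    return 0;
            last_pos = dent->pos;
    }
    /* park the position one past the list so we aren't called again */
    ctx->pos = last_pos + 1;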
Signed-off-by: Auke Kok <auke.kok@versity.com>
While debugging a double unlock error we hit this condition, and
debugging would have been a lot easier had we enforced the simple
constraint that the lock users count can't be decremented when it's
already 0.
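A hedged sketch of the kind of guard this adds, with hypothetical
struct and field names:

    static void lock_users_dec(struct lock_info *info)
    {
            /* loudly catch underflow instead of silently wrapping */
            if (WARN_ON_ONCE(info->users == 0))
                    return;
            info->users--;
    }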
Signed-off-by: Auke Kok <auke.kok@versity.com>
Similar to fiemap, readdir and walk_inodes, this method could page
fault during put_user, potentially causing a deadlock.
Signed-off-by: Auke Kok <auke.kok@versity.com>
Similar to the readdir and fiemap vfs methods, we can't copy to user
while holding cluster locks. The previous comment about it being safe
no longer applies, and this could deadlock.
Rewrite the loop to iterate and store entries in a page, then flush
the page contents while not holding a cluster lock.
Signed-off-by: Auke Kok <auke.kok@versity.com>
Now that we support mmap writes, at any point in time we could
page fault and lock for writes. That means that, just like readdir,
we can no longer lock and copy_to_user, since it may also page fault
and thus deadlock.
We statically allocate 32 extent entries on the stack and use these
to shuffle out a batch of fiemap entries at a time, locking and
unlocking around collecting and fiemap_fill_next_extent().
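A sketch of the batching loop; fiemap_fill_next_extent() is the real
kernel helper, while the locking and collection calls here are
illustrative stand-ins:

    struct cached_extent {
            u64 logical, phys, len;
            u32 flags;
    } ext[32];
    int nr, i, ret = 0;

    do {
            /* gather a batch of up to 32 extents under the cluster lock */
            cluster_lock(inode);
            nr = collect_extents(inode, ext, 32, &pos);
            cluster_unlock(inode);

            /* fill entries unlocked; the copy to user may page fault */
            for (i = 0; i < nr && ret == 0; i++)
                    ret = fiemap_fill_next_extent(fieinfo, ext[i].logical,
                                                  ext[i].phys, ext[i].len,
                                                  ext[i].flags);
    } while (nr == 32 && ret == 0);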
Signed-off-by: Auke Kok <auke.kok@versity.com>
dir_emit() will copy_to_user, which can pagefault. If this happens while
cluster locked, we could deadlock.
We use a single page to stage dir_emit data, and iterate between
fetching dirents while locked, and emitting them while not locked.
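A sketch of the resulting two-phase loop (the staging format and fill
helper are illustrative; dir_emit() is the real VFS helper):

    for (;;) {
            /* phase 1: copy dirents into our page under the cluster lock */
            cluster_lock(inode);
            nr = fill_dirent_page(inode, page, ctx->pos);
            cluster_unlock(inode);
            if (nr <= 0)
                    break;

            /* phase 2: emit unlocked, where copy_to_user may fault */
            for (de = first_dirent(page); nr-- > 0; de = next_dirent(de)) {
                    ctx->pos = de->pos;
                    if (!dir_emit(ctx, de->name, de->name_len,
                                  de->ino, de->type))
                            return 0;
            }
    }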
Signed-off-by: Auke Kok <auke.kok@versity.com>
These two sections of readdir compat are wholly obsolete and can be
hard dropped, which restores the method to look like current upstream
code.
The compat code was added in ddd1a4e.
Signed-off-by: Auke Kok <auke.kok@versity.com>
We merely trace exit values and position, and ignore length.
Because vm_fault_t is __bitwise, sparse will loudly complain about a
plain cast to u32, so we must use __force (on el8). ret will be 512
(VM_FAULT_LOCKED) in normal cases.
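The cast in question looks roughly like this (the tracepoint name is
illustrative):

    /* vm_fault_t is __bitwise, so sparse rejects a plain (u32) cast;
     * __force marks the conversion as intentional */
    trace_scoutfs_filemap_fault_done(inode, vmf->pgoff, (__force u32)ret);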
Signed-off-by: Auke Kok <auke.kok@versity.com>
Now that all of these should be passing, we enable all mmap() tests in
xfstests, and update the golden output with the new tests.
Signed-off-by: Auke Kok <auke.kok@versity.com>
Two test programs are added. The run time is about one minute on my
el7 instance.
The test script finishes up with a read/write mmap test on offline
extents to verify the data wait paths in those functions.
One program will perform vfs read/write and mmap read/write calls on
the same file across 5 threads (mounts) repeatedly. The goal is to
ensure there are no locking issues between read/write paths.
The second test program performs consistency checking on a file that is
repeatedly written/read using memory maps and normal reads and writes,
and the content is verified after every operation.
Signed-off-by: Auke Kok <auke.kok@versity.com>
Add support for writable MAP_SHARED mmap()ings. Avoid issues with late
writepage()s building transactions by doing the block_write_begin() work in
scoutfs_data_page_mkwrite(). Ensure the page is marked dirty and prepared
for write, then let the VM complete the write when the page is flushed or
invalidated.
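A hedged sketch of the handler's shape; the block preparation helper
stands in for the block_write_begin() work described above:

    static vm_fault_t scoutfs_data_page_mkwrite(struct vm_fault *vmf)
    {
            struct page *page = vmf->page;
            struct inode *inode = file_inode(vmf->vma->vm_file);

            lock_page(page);
            if (page->mapping != inode->i_mapping) {
                    /* raced with truncate/invalidate, retry the fault */
                    unlock_page(page);
                    return VM_FAULT_NOPAGE;
            }

            /* allocate and map blocks now so that a later writepage
             * never has to build a transaction */
            if (prepare_page_write(inode, page) < 0) {
                    unlock_page(page);
                    return VM_FAULT_SIGBUS;
            }

            set_page_dirty(page);
            /* VM_FAULT_LOCKED is the 512 seen in the tracing patch */
            return VM_FAULT_LOCKED;
    }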
Signed-off-by: Benjamin LaHaise <bcrl@kvack.org>
Signed-off-by: Auke Kok <auke.kok@versity.com>
The list alloc blocks have an array of blknos that are offset by a start
field in the block header. The print code wasn't using that and was
always referencing the beginning of the array, which could miss blocks.
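Illustratively, with guessed field names, the print loop needs to
index from the start offset rather than from element zero:

    u32 start = le32_to_cpu(lblk->start);
    u32 nr = le32_to_cpu(lblk->nr);

    for (i = 0; i < nr; i++)
            printf("    blkno %llu\n", (unsigned long long)
                   le64_to_cpu(lblk->blknos[start + i]));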
Signed-off-by: Zach Brown <zab@versity.com>
Since kABI migrations across minor versions are a thing of the past
going forward, we now:
- Detect if we're on EL9
- If so, add a requirement on the various flavors of release package
  for that specific major.minor version
This appropriately disallows upgrades across minor versions.
Additional blkdev/bdev changes now cause this call to be removed as
well, resulting in us having to use yet another API to do the same for
el9_5.
The changes are a little more subtle, as the bdev_mount() call now
passes a custom bd_holder_ops that we must match or else trigger a
WARN_ON, so we switch to using sbi as our holder arg instead.
Make sure to use bdev_fput() and not fput(), since we don't want our
private data cleanup deferred, which fails xfstests generic/604.
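Roughly, under the el9_5-era (upstream ~v6.9) API; the open and put
calls are upstream's, while the holder ops symbol here is ours and
illustrative:

    struct file *bdev_file;

    bdev_file = bdev_file_open_by_path(path,
                                       BLK_OPEN_READ | BLK_OPEN_WRITE,
                                       sbi, &scoutfs_bd_holder_ops);
    if (IS_ERR(bdev_file))
            return PTR_ERR(bdev_file);

    /* use file_bdev(bdev_file) while mounted */

    /* on teardown: bdev_fput(), not fput(), so the bdev release and
     * our private cleanup aren't deferred (see generic/604) */
    bdev_fput(bdev_file);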
Signed-off-by: Auke Kok <auke.kok@versity.com>
The assignments to it are no longer needed at all. All references
can be dropped since v6.4-rc4-163-g0d625446d0a4.
Signed-off-by: Auke Kok <auke.kok@versity.com>
Since v6.5-rc1-7-g9b6304c1d537, current_time() is no longer
extern, so we need to update this grep regex to continue to match.
Signed-off-by: Auke Kok <auke.kok@versity.com>
Add a quick test that races readers and shrinking to stress lock object
refcount racing between concurrent lock request handling threads in the
lock server.
Signed-off-by: Zach Brown <zab@versity.com>
Right now a client requesting a null mode for a lock will cause
invalidations of all existing granted modes of the lock across the
cluster.
This is unnecessarily broad. The absolute requirement is that a null
request invalidates other existing granted modes on the client. That's
how the client safely resolves shrinking's desire to free locks while
the locks are in use. It relies on turning it into the race between use
and remote invalidation.
But that only requires invalidating existing grants from the requesting
client, not all clients. It is always safe for null grants to coexist
with all grants on other clients. Consider the existing mechanics
involving null modes. First, null locks are instantiated on the client
before sending any requests at all. At any given time newly allocated
null locks are coexisting with all existing locks across the cluster.
Second, the server frees the client entry tracking struct the moment it
sends a null grant to the client. From that point on the client's null
lock can not have any impact on the rest of the lock holders because the
server has forgotten about it.
So we add this case to the server's test that two client lock modes are
compatible. We take the opportunity to comment the heck out of this
function instead of making it a dense boolean composition. The only
functional change is the addition of this case; the existing cases are
refactored but unchanged.
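Sketched, the added case sits at the top of the compatibility test
(the names and mode constants here are an illustrative paraphrase, not
the real function):

    /* null coexists with anything granted to *other* clients; a null
     * request still conflicts with, and so invalidates, the
     * requesting client's own grants */
    if (a->mode == LOCK_MODE_NULL || b->mode == LOCK_MODE_NULL)
            return a->rid != b->rid;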
Signed-off-by: Zach Brown <zab@versity.com>
When freeing acked responses in the net layer we sweep the send and
resend queues looking for queued responses up to the sequence number
we've had acked. The code that did this used a weird pattern of
returning ints and adding them, which gave me pause. Clean it up to
use bools and bitwise or (|, not the short-circuiting ||) to more
obviously communicate what's going on.
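After cleanup the pattern looks something like this (queue and wait
names guessed); | evaluates both sweeps, where || would skip the
second once the first found work:

    bool freed;

    freed = free_acked_responses(&conn->send_queue, acked_seq) |
            free_acked_responses(&conn->resend_queue, acked_seq);
    if (freed)
            wake_up(&conn->waitq);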
Signed-off-by: Zach Brown <zab@versity.com>
Over time some fields have been added to the lock struct which haven't
been added to the lock tracing output. Add some of the more relevant
lock fields to tracing.
Signed-off-by: Zach Brown <zab@versity.com>
Lock object lifetimes in the lock server are protected by reference
counts. References are acquired while holding a lock on an rbtree.
Unfortunately, the decision to free lock objects wasn't tested while
also holding that lock on the rbtree. A caller putting their object
would test the refcount, then wait to get the rbtree lock to remove it
from the tree.
There's a possible race where the decision is made to remove the object
but another reference is added before the object is removed. This was
seen in testing and manifested as an incoming request handling path
adding a request message to the object before it was freed, losing the
message.
Clients would then hang on a lock that never saw a response because
their request was freed with the lock object.
The fix is to hold the rbtree lock when testing the refcount and
deciding to free. It adds a bit more contention but not significantly
so, given the wild existing contention on a per-fs spinlocked rbtree.
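The fixed pattern can be sketched with refcount_dec_and_lock(), which
performs the final decrement to zero only while holding the lock,
closing the window where a new reference could sneak in (struct and
field names illustrative):

    static void server_lock_put(struct lock_server *server,
                                struct server_lock *slock)
    {
            if (refcount_dec_and_lock(&slock->refcount, &server->lock)) {
                    rb_erase(&slock->node, &server->locks);
                    spin_unlock(&server->lock);
                    kfree(slock);
            }
    }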
Signed-off-by: Zach Brown <zab@versity.com>
Previously, any t_skip would cause the final test result to be a failure
because up until now no test should have been skipped.
However, with format-version-forward-back not being compatible with
el9, we are going to rely solely on el7/8 testing for that test, and
therefore we have to allow skipping of this test on el9 and newer OS
versions.
We add `t_skip_permitted` to signal this from the test case to the
run-tests.sh script. A new exit code is passed, and all accounting is
updated to reflect that a test was skipped, but this was permitted. We
modify format-version-forward-back to use this new exit path.
Signed-off-by: Auke Kok <auke.kok@versity.com>
I'm seeing more and more of these as audit is enabled in the el8 and
el9 images I am using for testing, and during ENOSPC tests this has a
chance of triggering process accounting suspension and subsequent
resume.
Signed-off-by: Auke Kok <auke.kok@versity.com>
In v1.18-10-g5507ee5, we changed the test code away from loopback
to device-mapper, which simplified our DUT setup code.
However, this results in occasional `device changed size` messages
now being emitted by the `dm` driver instead of the `loop` kernel
module. We now have to ignore these kernel messages as well.
Signed-off-by: Auke Kok <auke.kok@versity.com>