scoutfs

mirror of https://github.com/versity/scoutfs.git synced 2026-01-03 10:55:20 +00:00

Author	SHA1	Message	Date
Auke Kok	519b47a53c	mmap() trace events. We merely trace exit values and position, and ignore length. Because vm_fault_t is __bitwise, sparse will loudly complain about a plain cast to u32, so we must __force (on el8). ret will be 512 in normal cases. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Auke Kok	92f704d35a	Enable all xfstests mmap() tests. Now that all of these should be passing, we enable all mmap() tests in xfstests, and update the golden output with the new tests. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Auke Kok	311bf75902	Add mmap tests. Two test programs are added. The run time is about 1min on my el7 instance. The test script finishes up with a read/write mmap test on offline extents to verify the data wait paths in those functions. One program will perform vfs read/write and mmap read/write calls on the same file from across 5 threads (mounts) repeatedly. The goal is to assure there are no locking issues between read/write paths. The second test program performs consistency checking on a file that is repeatedly written/read using memory maps and normal reads and writes, and the content is verified after every operation. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Benjamin LaHaise	3788d67101	Add support for writable shared mmap()ings Add support for writable MAP_SHARED mmap()ings. Avoid issues with late writepage()s building transactions by doing the block_write_begin() work in scoutfs_data_page_mkwrite(). Ensure the page is marked dirty and prepared for write, then let the VM complete the write when the page is flushed or invalidated. Signed-off-by: Benjamin LaHaise <bcrl@kvack.org> Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Benjamin LaHaise	b7a3d03711	Add support for read only mmap() Adds the required memory mapped ops struct and page fault handler for reads. Signed-off-by: Benjamin LaHaise <bcrl@kvack.org> Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Zach Brown	934f6c7648	Merge pull request #199 from versity/zab/v1.23 v1.23 Release	2024-12-11 17:02:52 -08:00
Zach Brown	a88972b50e	v1.23 Release Finish the release notes for the 1.23 release. Signed-off-by: Zach Brown <zab@versity.com> v1.23	2024-12-11 13:07:44 -08:00
Zach Brown	3e71f49260	Merge pull request #195 from versity/auke/el9_5 RHEL9.5 kernel support	2024-12-03 14:27:57 -08:00
Zach Brown	8a082e3f99	Merge pull request #197 from versity/greg/block-el9-minor-upgrades Block EL9 minor version upgrades	2024-12-03 14:09:17 -08:00
Greg Cymbalski	110d5ea0d5	Block EL9 minor version upgrades Since kABI migrations across minor versions is a thing of the past going forward, we now: - Detect if we're on EL9 - If so, add a requirement on the various flavors of release package to that specific major.minor version This appropriately does not allow upgrades across minor versions.	2024-12-02 16:04:24 -08:00
Auke Kok	669de459a7	bdev_open_by_path is now removed as well. Additional blkdev/bdev changes now cause this call to be removed as well resulting in us having to use yet another API to do the same for el9_5. The changes are a little more subtle as now the bdev_mount() call passes a custom bd_holder_ops that we must match or else throw a WARN_ON, so we switch to using sbi as our holder arg instead. Make sure to bdev_fput and not fput, since we don't want to have our private data cleanup deferred, failing xfstests generic/604. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-11-27 18:52:39 -08:00
Auke Kok	621271f8cf	backing_dev_info is entirely removed. The assignments to it is no longer needed at all. All references can be dropped since v6.4-rc4-163-g0d625446d0a4. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-11-08 13:32:21 -05:00
Auke Kok	d1092cdbe9	current_time() is no longer extern. Since v6.5-rc1-7-g9b6304c1d537, current_time() is no longer extern, so we need to update this grep regex to continue to match. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-11-08 13:21:03 -05:00
Zach Brown	aed7169fac	Merge pull request #194 from versity/zab/v1.22 v1.22 Release	2024-11-03 15:06:40 -08:00
Zach Brown	7f313f2818	v1.22 Release Finish the release notes for the 1.22 release. Signed-off-by: Zach Brown <zab@versity.com> v1.22	2024-11-01 13:05:52 -07:00
Zach Brown	6b4e666952	Merge pull request #193 from versity/zab/hung_lock_fixes Zab/hung lock fixes	2024-10-31 16:56:51 -07:00
Zach Brown	4a26059d00	Add lock-shrink-read-race test Add a quick test that races readers and shrinking to stress lock object refcount racing between concurrent lock request handling threads in the lock server. Signed-off-by: Zach Brown <zab@versity.com>	2024-10-31 15:35:11 -07:00
Zach Brown	19e78c32fc	Allow null lock compatibility between nodes Right now a client requesting a null mode for a lock will cause invalidations of all existing granted modes of the lock across the cluster. This is unneccessarily broad. The absolute requirement is that a null request invalidates other existing granted modes on the client. That's how the client safely resolves shrinking's desire to free locks while the locks are in use. It relies on turning it into the race between use and remote invalidation. But that only requires invalidating existing grants from the requesting client, not all clients. It is always safe for null grants to coexist with all grants on other clients. Consider the existing mechanics involving null modes. First, null locks are instatiated on the client before sending any requests at all. At any given time newly allocated null locks are coexisting with all existing locks across the cluster. Second, the server frees the client entry tracking struct the moment it sends a null grant to the client. From that point on the client's null lock can not have any impact on the rest of the lock holders because the server has forgotten about it. So we add this case to the server's test that two client lock modes are compatible. We take the opportunity to comment the heck out of this function instead of making it a dense boolean composition. The only functional change is the addition of this case, the existing cases are refactored but unchanged. Signed-off-by: Zach Brown <zab@versity.com>	2024-10-31 15:34:59 -07:00
Zach Brown	8c1a45c9f5	Use bools instead of weird addition as or in net When freeing acked reesponses in the net layer we sweep the send and resend queues looking for queued responses up to the sequence number we've had acked. The code that did this used a weird pattern of returning ints and adding them which gave me pause. Clean it up to use bools and or (not short-circuiting \|\|) to more obviously communicate what's going on. Signed-off-by: Zach Brown <zab@versity.com>	2024-10-30 13:38:12 -07:00
Zach Brown	5a6eb569f3	Add some lock debugging trace fields Over time some fields have been added to the lock struct which haven't been added to the lock tracing output. Add some of the more relevant lock fields to tracing. Signed-off-by: Zach Brown <zab@versity.com>	2024-10-30 13:16:04 -07:00
Zach Brown	69d9040e68	Close lock server use-after-free race Lock object lifetimes in the lock server are protected by reference counts. References are acquired while holding a lock on an rbtree. Unfortunately, the decision to free lock objects wasn't tested while also holding that lock on the rbtree. A caller putting their object would test the refcount, then wait to get the rbtree lock to remove it from the tree. There's a possible race where the decision is made to remove the object but another reference is added before the object is removed. This was seen in testing and manifest as an incoming request handling path adding a request message to the object before it is freed, losing the message. Clients would then hang on a lock that never saw a response because their request was freed with the lock object. The fix is to hold the rbtree lock when testing the refcount and deciding to free. It adds a bit more contention but not significantly so, given the wild existing contention on a per-fs spinlocked rbtree. Signed-off-by: Zach Brown <zab@versity.com>	2024-10-30 13:04:13 -07:00
Zach Brown	d94ec29ffa	Merge pull request #192 from versity/greg/with-debug-kmod Generate debug packages	2024-10-24 15:35:03 -07:00
Greg Cymbalski	70c36ae394	Generate debug packages We had previously explicitly disabled this; let's start generating them.	2024-10-24 14:56:09 -07:00
Zach Brown	1d08a58add	Merge pull request #151 from versity/auke/el9 EL9 support.	2024-10-04 11:46:47 -07:00
Auke Kok	fc7876e844	Allow certain tests to skip, but not fail exit condition. Previously, any t_skip would cause the final test result to be a failure because up until now no test should have been skipped. However, with format-version-forward-back not being compatible with el9, we are going to rely on el7/8 testing for that test soleley, and therefore we have to allow skipping of this test on el9 and newer OS versions. We add `t_skip_permitted` to signal this from the test case to the run-tests.sh script. A new exit code is passed, and all accounting is updated to reflect that a test was skipped, but this was permitted. We modify format-version-forward-back to use this new exit path. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	5337b9e221	Ingore Process accounting resumed dmesg. I'm seeing more and more of these as audit is enabled in el8 and el9 images I am using for testing, and during ENOSPC tests this has a chance of triggering process accounting suspension, and subsequent resume. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	8a22bdd366	Ignore device mapper size change dmesg output. In v1.18-10-g5507ee5, we changed the test code away from loopback to device-mapper, which simplified our DUT setup code. However, this results in the occasional `device changed size` messages now being emitted by the `dm` driver instead of the `loop` kernel module. We have to additionally ignore these kernel messages from now as well. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	235ab133a7	We must provide a_ops->dirty_folio and invalidate_folio. In v5.17-rc4-53-g3a3bae50af5d, we can no longer omit having this method unhooked as the mm caller blindly calls it now. In-kernel filesystems all were fixed in this change. aops->invalidatepage was the old aops method that would free pages with private attached data. This method is replaced with the new invalidate_folio method. If this method is NULL, the memory will become orphaned. (v5.17-rc4-29-gf50015a596fa) Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	9335d2eb86	Don't --track when checking out a tag. I've pushed a tag/release to scoutfs-xfstests-dev instead of a full blown branch. This seems simpler and cleaner than using branches, because we're going to end up rebasing these things a lot. However, we can't --track tags, so, if the branch name passed to -x is actually a tag instead of a branch, we have to omit the --track option here. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	97b081de3f	Switch xfstests tag over in CI jobs using this marker file. CI testing needs to know which xfstests branch to use on all OSs. We can't just use the el9 xfstests branch on el9 only, because we need to run the same el9 xfstests on el8 and el7 as well, otherwise testing will just fail. So, we put a marker file in our git repo that tells us that we're not going to use the default `scoutfs` branch from scoutfs-xfstests-dev but our own special tag or branch. The CI job then should pass the proper -x {branch} flag to the run-tests.sh script. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	21b5032365	Add new xfstests that we won't support or don't pass The new version of xfstests adds a _lot_ more tests to our mix. Many of the new ones will auto enable or auto skip as needed. There are tests we can't or won't support that will be in future xfstests. Disable them now so we can avoid dealing with them later. Quite a few fall into "we don't support these types of mounting yet", mostly bind-mount or dm-mapper things. We disable all the swapfile tests flatout. A few tests fail on el7 but not el8/9 but we don't have a way to run them without failing yet, so disable them as well. Update golden with the proper new array of tests. This all requires the `auke/scoutfs-el9` branch in `versity/scoutfs-xfstests-dev`. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	4723f4f9ab	Disable format-version-forward-back test on el9+. Using t_skip, we just skip this test on el9. If we ever want to add a formatversion 2->3 test, perhaps we should just add a separate test script, instead of going over a static array. But let's not worry about this too much right now. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	0a8b3f4e94	Fix basic-posix-acl test output on el9 It turns out that on el9, `bash -c` prints out `bash: line 1: cd..` instead of `line 0:` on el7 or el8. So discard all the stderr from these `cd` lines entirely and just rely on the expected echo output to stdout. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	8a4b0967cb	Add fiemap output through scoutfs util. There's filefrag already, and that works, but, it's output is very inconsistent between various OS release versions, and it has already meant that we'd needed to adjust tests to account for these little but insignificant changes. A lot more work than useful. It's even more changed in el9. This adds `scoutfs get-fiemap FILE` and prints out block extent info with flags that we care about as an abbreviated letter: U for Unwritten, L for Last, and O for Unknown (as in, "offline"). The -P/--physical and -L/--logical options turn off logical or physical offset display, in case you only want to see the offsets in either units. You can pass -b/--byte to display offsets and lengths in byte values. The block size will then be obtained from fstat() of the queried file (4096 for scoutfs). I've removed all uses of filefrag from our scoutfs tests. Xfstests still calls it but their internal diff takes care of that issue. Where needed and appropriate, the tests are adjusted so that the output of `scoutfs get-fiemap` is as close as it can to what it used to be, so that reading the test results allows the quick view of what might have been going wrong. There are some output strings I have not bothered to update because there's no real value to updating every output string to match, and we just adjust the golden file accordingly. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 15:38:34 -07:00
Auke Kok	606c519e96	Simple-staging doesn't actually test overflow. This isn't a simple case where we can use u64_region_wraps because length is s32. Let's actually test an overflow case instead of a case that doesn't overflow, though. We still should properly add an overflow test here as well. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	7d0e7e29f8	Avoid integer wrapping pitfalls for (off, len) pairs. We use check_add_overflow(a, b, d) here to validate that (off, len) pairs do not exceed the max value type. The kernel conveniently has several macros to sort out the problems with signed or unsigned types. However, we're not interested in purely seeing whether (a + b) overflows, because we're using this for (off, len) overflow checks, where the bytes we read are from 0 to len -1. We must therefore call this check with (b) being "len - 1". I've made sure that we don't accidentally fail when (len == 0) in all cases by making sure we've already checked this condition before, and moving code around as needed to ensure that (len > 0) in all cases where we check. The macro check_add_overflow requires a (d) argument in which temporarily the result of the addition is stored and then checked to see if an overflow occurred. We put a `tmp` variable on the stack of the correct type as needed to make the checks function. simple-release-extents test mistakenly relied on this buggy wrap code, so it needs fixing. The move-blocks test also got it wrong. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	69de6d7a74	Check for zero len in scoutfs_data_wait_check We consistently enter scoutfs_data_wait_check when len == 0 from scoutfs_aio_write() which directly passes the i_size_read() value, and for cases where we `echo >> $FILE` this is always reached. This can cause the wrapping check to fail since `0 + (0 - 1) < 0` which triggers the WARN_ON_ONCE wrap check that needs updating to allow certain operations on huge files. More importantly we can just omit all these checks if `len == 0` anyway, since they should always succeed and never should require taking all the locks. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	ac00f5cedb	Free after getline(), even if fail, and catch eof() on el9 getline() allocates the space for the return value even if there is an error, so when it returns an error, we still have to free() it. In el9, when reading stdin we will get errno=0 returned (no error) when we hit the end of stdin. This behavior is different from el7/8. We don't want to throw an error here to avoid failing the test, since it doesn't. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	6d42d260cf	xargs option conflict now a warning in el9 The warnings thrown by el9's version of xargs are unexpected output and cause this test to fail. When using the -I option (replace) the -n 1 arguments are always assumed. In el7/8 no warnings were printed. We can just remove `-n 1` since the argument is never needed. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	00ebe92186	Add stddef.h to util.h to avoid duplicate offsetof() def. In el9 releases, our includes declare offsetof() before our header chain includes stddef.h, which doesn't properly check if offsetof is already defined, leading to a redefinition. Just include stddef at all times here. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	570c05898c	Correct endian conversion length (blkno is le64) Trivial correction of wrong bitlength conversion. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	c298360a49	blkdev api changes - pass holder to replace FMODE_EXCL Passing a holder ptr to these functions now replaces the FMODE_EXCL flag. _put no longer needs flags for this reason, but the holder instead. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	95f4e56546	Introduce blk_mode_t instead of abuse of fmode_t v6.4-rc2-198-g05bdb9965305 adds a new type for passing flags instead of abusing fmode_t flags. They are essentially the same flags just in a new type. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	d5c2768f04	.tmpfile method now passed a struct file, which must be opened. v6.0-rc6-9-g863f144f12ad changes the VFS method to pass in a struct file and not a dentry in preperation for tmpfile support in fuse. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	676d429264	Assume el9 is the same as el8 for rpmbuild purposes. The current spec template can't handle future major el releases gracefully and fails to build entirely. We isolate all changes so that they are either "el7 specific" or generic. This rids us entirely of el8 specific conditionals. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	5b260e6b54	block_write_begin() no longer is being passed aop_flags. The flag is now obsolete, we don't need to set flags here or pass them. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	e2b06f2c92	mpage_readpage() is now replaced with mpage_read_folio. Folios are the new data types used for passing pages. For now, folios only appear to have a single page. Future kernels will change that. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	546b437df7	Shrinkers are now registered with a name. v5.19-rc4-52-ge33c267ab70d Adds shrinker names to the registration call to aid with shrinker debugging, which is highly opaque. To enable you'll have to recompile the kernel with CONFIG_SHRINKER_DEBUG=y though, since it's disabled by default in OSV kernels. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	381f4543b7	Use iter based read/write to support splice and thus sendfile(). The iter based read/write calls can support splice in el9 if we hook up these calls, otherwise splice will stop working. ->write() similar to: v3.15-rc4-330-g8d0207652cbe. ->read() to generic implementation. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00
Auke Kok	418a441604	kernel_setsockopt no longer available. We instead opt to use sock_setsockopt which is generally exactly the same and can be easily converted to map to kernel_setsockopt without impacting the code significantly. There are 3 methods we're calling with usec timeval's, and that is significantly different now that this requires a bit more compat code so we split these out to separate compat functions to handle them. Some of the TCP sock functions also have a slightly different signature that we want to split them out (struct socket vs. sock). Some further no longer return success, either. Signed-off-by: Auke Kok <auke.kok@versity.com>	2024-10-03 12:41:05 -07:00

1 2 3 4 5 ...

1995 Commits