scoutfs

mirror of https://github.com/versity/scoutfs.git synced 2026-02-07 11:10:44 +00:00

Author	SHA1	Message	Date
Auke Kok	f8d2cd759e	Shrinker API v4. Yet another major shrinker API evolution in v6.6-rc4-53-gc42d50aefd17. The struct shrinker now has to be dynamically allocated. This is purposely a backwards incompatible break. We add another KC_ wrapper around the new shrinker_alloc() and move some initialization around to make this as much as possible low impact, but compatible with the old APIs through substitution. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:07:09 -04:00
Auke Kok	d13bbb27eb	bio_add_page is now __must_check The return type always has been int, so, we just need to add return value checking and do something with it. We could return -ENOMEM here as well, either way it'll fall all the way through no matter what. This is since v6.4-rc2-100-g83f2caaaf9cb. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:07:09 -04:00
Auke Kok	c846b1ecf9	Adjust for __assign_str() losing second argument. In v6.8-9146-gc759e609030c, the second argument for __assign_str() was removed, as the second parameter is already derived from the __string() definition and no longer needed. We have to do a little digging in headers here to find the definition. Note the missing `;` at a few places... it has to be added now. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:07:09 -04:00
Auke Kok	5e85f11e82	RIP bd_inode. v6.9-rc4-29-g203c1ce0bb06 removes bd_inode. The canonical replacement is bd_mapping->host, where applicable. We have one use where we directly need the mapping instead of the inode, as well. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:07:09 -04:00
Auke Kok	c3bc36d4d0	Fix compiler warnings for flex array definitions. Instead of defining a struct that ends with a flex array member with `val[0]`, the compiler now balks at this since technically, the spec considers this unsanitary. As a result however, we can't memcpy to `struct->val` since that's a pointer and now we're writing something of a different length (u8's in our case) into something that's of pointer size. So there we have to do the opposite, and memcpy to &struct->val[0]. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:07:09 -04:00
Auke Kok	27d68e86c3	unaligned.h moved from asm/ to linux/ In v6.12-rc1-3-g5f60d5f6bbc1, asm/unaligned.h only included asm-generic/unaligned.h and that was cleaned up from architecture specific things. Everyone should now include linux/unaligned.h and the former include was removed. A quick peek at server.c shows that while included, it no longer uses any function from this header at all, so it can just be dropped. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:06:33 -04:00
Auke Kok	3eaa56a73c	Account for difference in `stat` output format for device nodes. The new format in el10 has non-hex output, separated by a comma. Add the additional filter string so this works as expected. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:06:33 -04:00
Auke Kok	dd533810c5	Fix el10 not skipping the format-version-forward-back test. The logic only accounted for single-digit versions. With el10, that breaks. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:06:33 -04:00
Auke Kok	fb17d59ef1	Use a/m/c_time accessor functions. In v6.6-rc5-1-g077c212f0344, one can no longer directly access the inode m_time and a_time etc. We have to go through these static inline functions to get to them. The compat is matched closely to mimic the new functions. Further back, ctime accessors were added in v6.5-rc1-7-g9b6304c1d537, and need to be applied as well. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:06:33 -04:00
Auke Kok	840f9af935	Stop using egrep. egrep is no longer in el10, so replace it with `grep -E` everywhere. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:06:31 -04:00
Auke Kok	e9799636ed	prandom_bytes and family removed, switch to get_random_bytes variants. In v6.1-rc5-2-ge9a688bcb193, get_random_u32_below() becomes available and can start replacing prandom_bytes_max(). Switch to it where we can. get_random_bytes() has been available since el7, so also replace prandom_bytes() where we're using it. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-05-08 18:05:44 -04:00
Zach Brown	6f19d0bd36	Merge pull request #216 from versity/zab/stop_ending_dirty_data_freed Zab/stop ending dirty data freed	2025-05-08 11:18:23 -07:00
Zach Brown	8aa1a98901	Merge pull request #210 from versity/auke/perf-irq-took-too-long Filter out perf `interrupt took too long` dmesg.	2025-04-30 10:04:00 -07:00
Zach Brown	888b1394a6	Retry client commit and get log trees separately The client transaction commit worker has a series of functions that it calls to commit the current transaction and open the next one. If any of them fail, it retries all of them from the beginning each time until they all succeed. This pattern behaves badly since we added the strict get_trans_seq and commit_trans_seq latching in the log_trees. The server will only commit the items for a get or commit request once, and will fail a commit request if it isn't given the seq that matches the current item. If the server gets an error it can have persisted items while sending an error to the client. If this error was for a get request, then the client will retry all of its transaction write functions. This includes the commit request which is now using a stale seq and will fail indefinitely. This is visible in the server log as: error -5 committing client logs for rid e57e37132c919c4f: invalid log trees item get_trans_seq The solution is to retry the commit and get phases independently. This way a failed get will be retried on its own without running through the commit phase that had succeeded. The client will eventually get the next seq that it can then safely commit. Signed-off-by: Zach Brown <zab@versity.com>	2025-04-29 11:46:38 -07:00
Zach Brown	e457694f19	Don't send dirty data_freed blocks to client At the end of get_log_trees we can try and drain the data_freed extent tree, which can take multiple commits. If a commit fails then the blocks are still dirty in memory. We can't send references to those blocks to the client. We have to return an error and not send the log_trees, like the main get_log_trees does. The client will retry and eventually get a log_trees that references blocks that were successfully committed. Signed-off-by: Zach Brown <zab@versity.com>	2025-04-29 11:46:38 -07:00
Zach Brown	459de5b478	Merge pull request #211 from versity/auke/tapf-output TAP formatted output.	2025-04-15 14:25:06 -07:00
Auke Kok	24031cde1d	TAP formatted output. Stored as `results/scoutfs.tap`, this file contains TAP format 14 generated test results. Embedded in the output is some metadata so that these files can be aggregated and stored in a unique and deduplicating way, by using a generated UUID at the start of testing. The file itself also catches git ID, date, and kernel version, as well as the (possibly altered) test sequence used. Any test that has diff or dmesg output will be considered failed, and a copy of the relevant data is included as comments. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-04-15 12:02:41 -07:00
Zach Brown	04cc41719c	Merge pull request #209 from versity/auke/basic-truncate-yes-pipefail Ignore pipefail alternative error when not a tty.	2025-04-14 13:15:03 -07:00
Auke Kok	1b47e9429e	Filter out perf `interrupt took too long` dmesg. Example: ``` [ 2469.638414] perf: interrupt took too long (2507 > 2500), lowering kernel.perf_event_max_sample_rate to 79000 ``` Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-04-14 12:06:58 -07:00
Auke Kok	7ea084082d	Ignore pipefail alternative error when not a tty. This happens with the basic-truncate test, only. It's the only user of the `yes` program. The `yes` command normally fails gracefully under the usual runs that are attached to some terminal. But when the test script runs entirely under something else, it will throw a needless error message that pollutes the test output: `yes: standard output: Broken pipe` Adjust the redirect to omit all stderr for `yes` in this case. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-04-14 11:13:39 -07:00
Zach Brown	f565451f76	Merge pull request #208 from versity/zab/v1.24 v1.24 Release	2025-03-17 11:18:42 -07:00
Zach Brown	05f14640fb	v1.24 Release Finish the release notes for the 1.24 release. Signed-off-by: Zach Brown <zab@versity.com> v1.24	2025-03-14 12:19:30 -07:00
Zach Brown	609fc56cd6	Merge pull request #203 from versity/auke/new_inode_ctime Fix new_inode ctime assignment.	2025-02-25 15:23:16 -08:00
Zach Brown	a4b5a256eb	Merge pull request #175 from versity/auke/mmap Support for mmap() writable mappings.	2025-02-20 14:03:01 -08:00
Zach Brown	f701ce104c	Merge pull request #204 from versity/zab/remove_wordexp Remove wordexp expansion of utils path argument	2025-02-19 09:27:15 -08:00
Zach Brown	c6dab3c306	Remove wordexp expansion of utils path argument scoutfs cli commands were using a helper that tried to perform word expansion on the path argument. This was done with the intent of providing the convenience of shell expansion (env vars, ~) within the cli command argument. But it breaks paths that accidentally have their file names match the syntax that wordexp supports. "[ ]" tripped up files in the wild. We don't need to provide shell expansion functionality in our argument parsing. The shell can do that. The cli must pass the arguments straight through, no parsing at all. Signed-off-by: Zach Brown <zab@versity.com>	2025-02-18 11:55:37 -08:00
Auke Kok	e3e2cfceec	Fix new_inode ctime assignment. Very old copy/paste bug here, we want to update new_inode's ctime instead. old_inode already is updated. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-02-18 13:15:49 -05:00
Zach Brown	5a10c79409	Merge pull request #201 from versity/auke/fixes_pre_parallel_restore Misc. fixes and changes to support parallel_restore and check.	2025-02-02 06:53:25 -08:00
Auke Kok	e9d147260c	Fix ctx->pos updating to properly handle dent gaps We need to assure we're emitting dents with the proper position and we already have them as part of our dent. The only caveat is to increment ctx->pos once beyond the list to make sure the caller doesn't call us once more. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-27 14:49:04 -05:00
Auke Kok	6c85879489	Assert unlock doesn't underflow lock user count. While debugging a double unlock error we hit this condition and debugging would have been a lot easier had we enforced this simple constraint that we can't decrement the lock users count if it's already 0. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-27 14:49:04 -05:00
Auke Kok	8b76a53cf3	Avoid cluster locking while put_user() in _allocated_inos. Similar to fiemap, readdir and walk_inodes, this method could have put_user during a page fault, causing potentially a deadlock. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-27 14:49:04 -05:00
Auke Kok	e76a171c40	Avoid faulting while cluster locked in _walk_inodes. Similar to readdir and fiemap vfs methods, we can't copy to user while holding cluster locks. The previous comment about it being safe no longer applies, and this could deadlock. Rewrite the loop to iterate and store entries in a page, then flush the page contents while not holding a clusterlock. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-27 14:49:04 -05:00
Auke Kok	8cb08507d6	Do not copy to user while holding locks in scoutfs_data_fiemap() Now that we support mmap writes, at any point in time we could pagefault and lock for writes. That means - just like readdir - we can no longer lock and copy_to_user, since it also may page fault and thus deadlock. We statically allocate 32 extent entries on the stack and use these to shuffle out fiemap entries at a time, locking and unlocking around collecting and fiemap_fill_extent_next. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-27 14:49:04 -05:00
Auke Kok	cad12d5ce8	Avoid deadlock in _readdir() due to copy_to_user(). dir_emit() will copy_to_user, which can pagefault. If this happens while cluster locked, we could deadlock. We use a single page to stage dir_emit data, and iterate between fetching dirents while locked, and emitting them while not locked. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-27 14:49:04 -05:00
Auke Kok	e59a5f8ebd	Readdir w/offset validation. Verify using xfs_io that readdir offsets match expected output. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-27 14:49:04 -05:00
Auke Kok	1bcd1d4d00	Drop readdir pre-.iterate() compat (el7.5ish). These 2 sections of compat for readdir are wholly obsolete and can be hard dropped, which restores the method to look like current upstream code. This was added in `ddd1a4e`. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Auke Kok	b944f609aa	remap_pages ops becomes obsolete.	2025-01-23 14:28:40 -05:00
Auke Kok	519b47a53c	mmap() trace events. We merely trace exit values and position, and ignore length. Because vm_fault_t is __bitwise, sparse will loudly complain about a plain cast to u32, so we must __force (on el8). ret will be 512 in normal cases. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Auke Kok	92f704d35a	Enable all xfstests mmap() tests. Now that all of these should be passing, we enable all mmap() tests in xfstests, and update the golden output with the new tests. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Auke Kok	311bf75902	Add mmap tests. Two test programs are added. The run time is about 1min on my el7 instance. The test script finishes up with a read/write mmap test on offline extents to verify the data wait paths in those functions. One program will perform vfs read/write and mmap read/write calls on the same file from across 5 threads (mounts) repeatedly. The goal is to assure there are no locking issues between read/write paths. The second test program performs consistency checking on a file that is repeatedly written/read using memory maps and normal reads and writes, and the content is verified after every operation. Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Benjamin LaHaise	3788d67101	Add support for writable shared mmap()ings Add support for writable MAP_SHARED mmap()ings. Avoid issues with late writepage()s building transactions by doing the block_write_begin() work in scoutfs_data_page_mkwrite(). Ensure the page is marked dirty and prepared for write, then let the VM complete the write when the page is flushed or invalidated. Signed-off-by: Benjamin LaHaise <bcrl@kvack.org> Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Benjamin LaHaise	b7a3d03711	Add support for read only mmap() Adds the required memory mapped ops struct and page fault handler for reads. Signed-off-by: Benjamin LaHaise <bcrl@kvack.org> Signed-off-by: Auke Kok <auke.kok@versity.com>	2025-01-23 14:28:40 -05:00
Zach Brown	295f751aed	Add test_bit to utils bitmap Add test_bit() to the trivial utils bitmap.c implementation. Signed-off-by: Zach Brown <zab@versity.com>	2025-01-22 09:58:58 -08:00
Zach Brown	7f6032d9b4	Add lk rbtree wrapper Import the kernel's rbtree implementation with a wrapper so we can use it from userspace. Signed-off-by: Zach Brown <zab@versity.com>	2025-01-22 09:58:49 -08:00
Zach Brown	7e3a6537ec	Add userspace version of our dirent name hash Signed-off-by: Zach Brown <zab@versity.com>	2025-01-22 09:58:41 -08:00
Zach Brown	49b7b70438	Add userspace version of our mode to type Signed-off-by: Zach Brown <zab@versity.com>	2025-01-22 09:58:31 -08:00
Zach Brown	de0fdd1f9f	Promote userspace btree block initialization Signed-off-by: Zach Brown <zab@versity.com>	2025-01-22 09:58:23 -08:00
Zach Brown	a6d7de3c00	Add fls64() alias for userspace flsll() Signed-off-by: Zach Brown <zab@versity.com>	2025-01-22 09:58:16 -08:00
Zach Brown	2c2c127c5e	Add put_unaligned_leXX() for userspace Signed-off-by: Zach Brown <zab@versity.com>	2025-01-22 09:58:10 -08:00
Zach Brown	9491c784e7	Add srch_encode_entry() for userspace utils Signed-off-by: Zach Brown <zab@versity.com>	2025-01-22 09:57:56 -08:00
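
Several commits in the list above (cad12d5ce8, e76a171c40, 8cb08507d6) describe the same pattern: collect entries into a staging buffer while the cluster lock is held, then drop the lock before copying anything to the caller, because the copy may page fault and take the lock again. The following is a minimal userspace sketch of that loop, not the scoutfs code. Every name in it (`struct stage`, `fill_locked`, `emit_unlocked`, `walk`, `STAGE_MAX`) is hypothetical, and a plain flag stands in for the cluster lock.

```c
#include <assert.h>
#include <stddef.h>

#define STAGE_MAX 4	/* hypothetical staging capacity, stands in for one page */

/* Hypothetical staging buffer filled while "locked". */
struct stage {
	int ents[STAGE_MAX];
	size_t nr;
};

/* A plain flag standing in for the cluster lock (assumption of this sketch). */
static int lock_held;

/* Copy up to STAGE_MAX source entries into the stage while the lock is held. */
static size_t fill_locked(const int *src, size_t src_nr, size_t pos,
			  struct stage *st)
{
	lock_held = 1;
	st->nr = 0;
	while (pos < src_nr && st->nr < STAGE_MAX)
		st->ents[st->nr++] = src[pos++];
	lock_held = 0;

	return pos;
}

/* Hand staged entries to the caller; this must never run under the lock. */
static size_t emit_unlocked(const struct stage *st, int *dst, size_t dst_pos)
{
	size_t i;

	assert(!lock_held);
	for (i = 0; i < st->nr; i++)
		dst[dst_pos++] = st->ents[i];

	return dst_pos;
}

/* Alternate between filling under the lock and emitting without it. */
static size_t walk(const int *src, size_t src_nr, int *dst)
{
	struct stage st;
	size_t pos = 0;
	size_t out = 0;

	while (pos < src_nr) {
		pos = fill_locked(src, src_nr, pos, &st);
		out = emit_unlocked(&st, dst, out);
	}

	return out;
}
```

In this sketch, calling `walk()` on a ten-entry array makes three passes through the loop (4, 4, and 2 entries), and the destination is never written while `lock_held` is set.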


2044 Commits
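
Commit 888b1394a6 in the list above fixes a retry bug: when the commit and get phases were retried together, a failed get re-sent a commit that had already succeeded, now carrying a stale seq. The sketch below shows the corrected shape, assuming a fake server that only counts requests. The names `struct fake_server`, `send_commit`, `send_get`, and `commit_then_get` are hypothetical, and real code would presumably also delay between attempts and check for shutdown.

```c
#include <assert.h>

/* Hypothetical stand-in for the server side; it only counts requests. */
struct fake_server {
	int commit_calls;
	int get_calls;
	int get_failures_left;	/* how many get requests will still fail */
};

/* The commit request always succeeds in this sketch. */
static int send_commit(struct fake_server *srv)
{
	srv->commit_calls++;
	return 0;
}

/* The get request fails with -5 (EIO) until the failure budget is used up. */
static int send_get(struct fake_server *srv)
{
	srv->get_calls++;
	if (srv->get_failures_left > 0) {
		srv->get_failures_left--;
		return -5;
	}
	return 0;
}

/*
 * Retry each phase on its own: a failed get is retried without re-sending
 * the commit that already succeeded.
 */
static void commit_then_get(struct fake_server *srv)
{
	while (send_commit(srv) != 0)
		;
	while (send_get(srv) != 0)
		;
}
```

With three injected get failures, the commit request is sent once and the get request four times. A single loop that restarted from the beginning on any error would have sent the commit four times as well.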