Update for cli args and options changes. Reorder subcommands to match
scoutfs built-in help.
Consistent ScoutFS capitalization.
Tighten up some descriptions and verbiage for consistency and omit
descriptions of internals in a few spots.
Add SEE ALSO for blockdev(8) and wipefs(8).
Signed-off-by: Andy Grover <agrover@versity.com>
Make it static and then use it for both argp_parse and
cmd_register_argp.
Split commands into five groups to make their purpose easier to
understand.
Mention that each command has its own help text, and that we go out of
our way to spare the user from having to give the fs path.
Signed-off-by: Andy Grover <agrover@versity.com>
This has some fancy parsing going on, and I decided to just leave it
in the main function instead of going to the effort to move it all
to the parsing function.
Signed-off-by: Andy Grover <agrover@versity.com>
Support max-meta-size and max-data-size using KMGTP units with rounding.
Detect other fs signatures using blkid library.
Detect ScoutFS super using magic value.
Move read_block() from print.c into util.c since blkid also needs it.
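As a rough illustration, KMGTP unit parsing with binary multiples could
look like the sketch below; parse_size() and its exact behavior are
hypothetical, not the actual mkfs code.

```c
#include <stdlib.h>
#include <string.h>
#include <errno.h>

/* Parse a number with an optional K/M/G/T/P binary-unit suffix.
 * Illustrative only; returns 0 on success, -EINVAL on bad input. */
static int parse_size(const char *str, unsigned long long *ret)
{
	static const char units[] = "KMGTP";
	unsigned long long val;
	const char *u;
	char *endp;

	errno = 0;
	val = strtoull(str, &endp, 0);
	if (errno || endp == str)
		return -EINVAL;

	if (*endp != '\0') {
		u = strchr(units, *endp);
		if (!u || endp[1] != '\0')
			return -EINVAL;
		val <<= 10 * (u - units + 1);	/* K=2^10 .. P=2^50 */
	}

	*ret = val;
	return 0;
}
```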
Signed-off-by: Andy Grover <agrover@versity.com>
Print a warning when printing a data device, since the user probably
wanted the metadata device.
Change read_block() to return an error value. Otherwise there are
confusing ENOMEM messages when pread() fails, e.g. when trying to print
/dev/null.
Signed-off-by: Andy Grover <agrover@versity.com>
Make offset and length optional. Allow size units (KMGTP) to be used
for offset/length.
release: Since off/len are no longer given in 4k blocks, round offset
down and length up to 4KiB. Emit a message if rounding occurs.
Make version a required option.
stage: change the argument ordering to src (the archive file) then dest
(the staged file).
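The offset/length rounding described above amounts to aligning the
region outward to 4KiB boundaries; a minimal sketch (names
hypothetical):

```c
#include <stdint.h>

#define BLOCK_SIZE 4096ULL

/* Round offset down and length up so the 4KiB-aligned region still
 * covers the original byte range.  Illustrative only. */
static void round_region(uint64_t *off, uint64_t *len)
{
	uint64_t start = *off & ~(BLOCK_SIZE - 1);
	uint64_t end = (*off + *len + BLOCK_SIZE - 1) & ~(BLOCK_SIZE - 1);

	*off = start;
	*len = end - start;
}
```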
Signed-off-by: Andy Grover <agrover@versity.com>
Implement a fallback mechanism for opening paths to a filesystem. If
explicitly given, use that. If env var is set, use that. Otherwise, use
current working directory.
Use wordexp to expand ~, $HOME, etc.
Signed-off-by: Andy Grover <agrover@versity.com>
Finally get rid of the last silly vestige of the ancient 'ci' name and
update the scoutfs_inode_info pointers to si. This is just a global
search and replace; no functional change.
Signed-off-by: Zach Brown <zab@versity.com>
Add a test which stages a file in multiple parts while a long-lived
process is blocking on offline extents trying to compare the file to the
known contents.
Signed-off-by: Zach Brown <zab@versity.com>
Now that we have full precision extents a writer with i_mutex and a page
lock can be modifying large extent items which cover much of the
surrounding pages in the file. Readers can be in a different page with
only the page lock and try to work with extent items as the writer is
deleting and creating them.
We add a per-inode rwsem which just protects file extent item
manipulation. We try to acquire it as close to the item use as possible
in data.c which is the only place we work with file extent items.
This stops rare read corruption we were seeing where get_block in a
reader was racing with extent item deletion in a stager at a further
offset in the file.
Signed-off-by: Zach Brown <zab@versity.com>
Move the main scoutfs README.md from the old kmod/ location into the top
of the new single repository. We update the language and instructions
just a bit to reflect that we can check out and build the module and
utilities from the single repo.
Signed-off-by: Zach Brown <zab@versity.com>
The README in tests/ had gone a bit stale. While it was originally
written to be a README.md displayed in the github repo, we can
still use it in place as a quick introduction to the tests.
Signed-off-by: Zach Brown <zab@versity.com>
When we had three repos the run-tests harness helped by checking
branches in kmod and utils repos to build and test. Now that we have
one repo we can just use the sibling kmod/ and utils/ dirs in the repo.
Signed-off-by: Zach Brown <zab@versity.com>
Now that we're in one repo, utils can get its format and ioctl headers
from the authoritative kmod files. When we're building a dist tarball
we copy the files over so that the build from the dist tarball can use
them.
Signed-off-by: Zach Brown <zab@versity.com>
For some reason, the make dist rule in kmod/ put the spec file in a
scoutfs-$ver/ directory, instead of scoutfs-kmod-$ver/ like the rest of
the files and instead of scoutfs-utils-$ver/ that the spec file for
utils is put in the utils dist tarball.
This adds -kmod to the path for the spec file so that it matches the
rest of the kmod dist tarball.
Signed-off-by: Zach Brown <zab@versity.com>
Add a trivial top-level Makefile that just runs Make in all the subdirs.
This will probably expand over time.
Signed-off-by: Zach Brown <zab@versity.com>
Add a utility that mimics our search_xattrs ioctl with directory entry
walking and fgetxattr as efficiently as it can so we can use it to test
large file populations.
Signed-off-by: Zach Brown <zab@versity.com>
The search_xattrs ioctl is only going to find entries for xattrs with
the .srch. tag which create srch entries as they're created and
destroyed. Export the xattr tag parsing so that the ioctl can return
-EINVAL for xattrs which don't have the scoutfs prefix and the .srch.
tag.
Signed-off-by: Zach Brown <zab@versity.com>
Hash collisions can lead to multiple xattr ids in an inode being found
for a given name hash value. If this happens we only want to return the
inode number once.
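The dedup amounts to remembering the last inode emitted and skipping
repeats; a minimal sketch assuming entries arrive sorted by inode
number (names illustrative):

```c
#include <stddef.h>
#include <stdint.h>

/* Copy inode numbers to out[], emitting each inode only once even
 * when hash collisions produce several entries for it. */
static size_t unique_inos(const uint64_t *inos, size_t nr, uint64_t *out)
{
	size_t i, n = 0;

	for (i = 0; i < nr; i++) {
		if (n == 0 || out[n - 1] != inos[i])
			out[n++] = inos[i];
	}
	return n;
}
```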
Signed-off-by: Zach Brown <zab@versity.com>
Compacting very large srch files can use all of a given operation's
metadata allocator. When this happens we record the compaction's
position in the srch files in the pending item.
We could lose entries when this happens because the kway_next callback
would advance the srch file position as it read entries and put them in
the tournament tree leaves, not as it put them in the output file. We'd
continue from the entries that were next to go in the tournament leaves,
not from what was in the leaves.
This refactors the kway merge callbacks to differentiate between getting
entries at the position and advancing the positions. We initialize the
tournament leaves by getting entries at the positions and only advance
the position as entries leave the tournament tree and are either stored
in the output srch files or are dropped.
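The get/advance split can be sketched with a minimal peek/advance
iterator over sorted streams; the tournament tree is elided and all
names are hypothetical:

```c
#include <stddef.h>

struct pos {
	const int *ents;
	size_t nr;
	size_t at;
};

/* Return the entry at the current position without consuming it. */
static const int *peek(struct pos *p)
{
	return p->at < p->nr ? &p->ents[p->at] : NULL;
}

/* Consume the current entry, only after it has been stored. */
static void advance(struct pos *p)
{
	p->at++;
}

/* Merge two sorted streams; a position only advances as its entry is
 * written to the output, mirroring the refactored callbacks. */
static size_t merge(struct pos *a, struct pos *b, int *out)
{
	size_t n = 0;

	for (;;) {
		const int *ea = peek(a);
		const int *eb = peek(b);

		if (!ea && !eb)
			break;
		if (!eb || (ea && *ea <= *eb)) {
			out[n++] = *ea;
			advance(a);
		} else {
			out[n++] = *eb;
			advance(b);
		}
	}
	return n;
}
```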
Signed-off-by: Zach Brown <zab@versity.com>
In the rare case that searching for xattrs only finds deletions within
its window it retries the search past the window. The end entry is
inclusive and is the last entry that can be returned. When retrying the
search we need to start from the entry after that to ensure forward
progress.
Signed-off-by: Zach Brown <zab@versity.com>
We have to limit the number of srch entries that we'll track while
performing a search for all the inodes that contain xattrs that match
the search hash value.
As we hit the limit on the number of entries to track we have to drop
entries. As we drop entries we can't return any inodes for entries
past the dropped entries. We were updating the end point of the search
as we dropped entries past the tracked set, but we weren't updating the
search end point if we dropped the last currently tracked entry.
And we were setting the end point to the dropped entry, not to the entry
before it. This could lead us to spuriously returning deleted entries
if we drop the creation entry and then allow tracking its deletion
later.
This fixes both those problems. We now properly set the end point to
just before the dropped entry for all entries that we drop.
Signed-off-by: Zach Brown <zab@versity.com>
The k-way merge used by srch file compaction only dropped the second
entry in a pair of duplicate entries. Duplicate entries are both
supposed to be removed so that entries for removed xattrs don't take up
space in the files.
This both drops the second entry and removes the first encoded entry.
As we encode entries we remember their starting offset and the previous
entry that they were encoded from. When we hit a duplicate entry
we undo the encoding of the previous entry.
This only works within srch file blocks. We can still have duplicate
entries that span blocks, but that's unlikely and relatively harmless.
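Dropping both halves of a duplicate pair can be sketched over a flat
output array: remember where the previous entry was encoded and rewind
it when its duplicate arrives (illustrative only, real entries are
variable-length encoded):

```c
#include <stddef.h>

/* Append entries to out[], removing both members of an adjacent
 * duplicate pair by undoing the previously encoded entry. */
static size_t merge_drop_dups(const int *ents, size_t nr, int *out)
{
	size_t n = 0;
	size_t i = 0;

	while (i < nr) {
		if (n > 0 && out[n - 1] == ents[i]) {
			n--;		/* undo the previous encoding */
			i++;		/* and drop the duplicate too */
		} else {
			out[n++] = ents[i++];
		}
	}
	return n;
}
```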
Signed-off-by: Zach Brown <zab@versity.com>
The search_xattrs ioctl looks for srch entries in srch files that map
the caller's hashed xattr name to inodes. As it searches it maintains a
range of entries that it is looking for. When it searches sorted srch
files for entries it first performs a binary search for the start of the
range and then iterates over the blocks until it reaches the end of its
range.
The binary search for the start of the range was a bit wrong. If the
start of the range was less than all the blocks then the binary search
could wrap the left index, try to get a file block at a negative index,
and return an error for the search.
This is relatively hard to hit in practice. You have to search for the
xattr name with the smallest hashed value and have a sorted srch file
that's just the right size so that blk offset 0 is the last block
compared in the binary search, which sets the right index to -1. With
lots of xattrs, or sorted files of a different length, the search works
fine.
This fixes the binary search so that it specifically records the first
block offset that intersects with the range and tests that the left and
right offsets haven't been inverted. Now that we're not breaking out of
the binary search loop we can more obviously put each block reference
that we get.
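The shape of the corrected search can be sketched as a lower-bound
binary search that records the first intersecting block and terminates
when the indexes cross, rather than wrapping; types and names here are
hypothetical:

```c
#include <stdint.h>

/* Find the first block whose greatest key is >= start, i.e. the first
 * block that can intersect the search range.  last_keys[] is sorted;
 * returns -1 when no block intersects.  Never wraps below zero. */
static int64_t first_block(const uint64_t *last_keys, int64_t nr,
			   uint64_t start)
{
	int64_t left = 0;
	int64_t right = nr - 1;
	int64_t found = -1;

	while (left <= right) {
		int64_t mid = left + (right - left) / 2;

		if (last_keys[mid] >= start) {
			found = mid;	/* candidate, keep looking left */
			right = mid - 1;
		} else {
			left = mid + 1;
		}
	}
	return found;
}
```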
Signed-off-by: Zach Brown <zab@versity.com>
The srch code was putting btree item refs even outside of success
paths. This is harmless, but refs only need to be put when btree ops
return success and have set the reference.
Dirty items in a client transaction are stored in OS pages. When the
transaction is committed each item is stored in its position in a dirty
btree block in the client's existing log btree. Allocators are refilled
between transaction commits so a given commit must have sufficient meta
allocator space (avail blocks and unused freed entries) for all the
btree blocks that are dirtied.
The number of btree blocks that are written, thus the number of cow
allocations and frees, depends on the number of blocks in the log btree
and the distribution of dirty items amongst those blocks. In a typical
load items will be near each other and many dirty items in smaller
kernel pages will be stored in fewer larger btree blocks.
But with the right circumstances, the ratio of dirty pages to dirty
blocks can be much smaller. With a very large directory and random
entry renames you can easily have 1 btree block dirtied for every page
of dirty items.
Our existing meta allocator fill targets and the number of dirty item
cache pages we allowed did not properly take this into account. It was
possible (and, it turned out, relatively easy to test for with a huge
directory and random renames) to run out of meta avail blocks while
storing dirty items in dirtied btree blocks.
This rebalances our targets and thresholds to make it more likely that
we'll have enough allocator resources to commit dirty items. Instead of
having an arbitrary limit on the number of dirty item cache pages, we
require that a given number of dirty item cache pages have a given
number of allocator blocks available.
We require a decent number of available blocks for each dirty page, so
we increase the server's target number of blocks to give the client so
that it can still build large transactions.
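The relationship can be expressed as a simple check: commit once the
dirty pages, at an assumed worst-case blocks-per-page ratio, can no
longer be covered by the remaining allocator blocks. The ratio and
names below are hypothetical, not the tuned values.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumed worst-case btree blocks dirtied per dirty item cache page. */
#define BLOCKS_PER_DIRTY_PAGE 2

/* True when the commit can no longer be guaranteed to have enough
 * meta allocator blocks for the pages' dirtied btree blocks. */
static bool must_commit(uint64_t dirty_pages, uint64_t avail_blocks)
{
	return dirty_pages * BLOCKS_PER_DIRTY_PAGE >= avail_blocks;
}
```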
This code is conservative and should not be a problem in practice, but
it's theoretically possible to build a log btree and set of dirty items
that would dirty more blocks than this code assumes. We will probably
revisit this as we add proper support for ENOSPC.
Signed-off-by: Zach Brown <zab@versity.com>
The srch system checks that it has allocator space while deleting srch
files and while merging them and dirtying output blocks. Update the
callers to check for the correct number of avail or freed blocks needed
between each check.
Signed-off-by: Zach Brown <zab@versity.com>
Previously, scoutfs_alloc_meta_lo_thresh() returned true when a small
static number of metadata blocks were either available to allocate or
had space for freeing. This didn't make a lot of sense as the correct
number depends on how many allocations each caller will make during
their atomic transaction.
Rework the call to take an argument for the number of avail or freed
blocks to test for. This first pass just uses the existing number;
we'll update the callers next.
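The reworked interface takes the caller's required count instead of a
static constant; roughly (structure and names illustrative):

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical allocator counters: blocks available to allocate and
 * remaining room for recording freed blocks. */
struct alloc_counts {
	uint64_t avail;
	uint64_t freed;
};

/* True when fewer than nr blocks can be allocated or fewer than nr
 * frees can still be recorded, so the caller must commit first.
 * A sketch of the threshold check taking a per-caller count. */
static bool meta_lo_thresh(const struct alloc_counts *ac, uint64_t nr)
{
	return ac->avail < nr || ac->freed < nr;
}
```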
Signed-off-by: Zach Brown <zab@versity.com>
Add a test that randomly renames entries in a single large directory.
This has caught bugs in the reservation of allocator resources for
client transactions.
Signed-off-by: Zach Brown <zab@versity.com>
Prefer named to anonymous enums. This helps readability a little.
Use enum as param type if possible (a couple spots).
Remove unused enum in lock_server.c.
Define enum spbm_flags using shift notation for consistency.
Rename get_file_block()'s "gfb" parameter to "flags" for consistency.
Signed-off-by: Andy Grover <agrover@versity.com>