versitygw

mirror of https://github.com/versity/versitygw.git synced 2026-07-02 16:54:25 +00:00

Author	SHA1	Message	Date
Ben McClelland	9610ef8a4e	fix: prevent connection errors in space/quotas error paths When uploads hit ENOSPC or EDQUOT, the server was returning the correct S3 error but could close the connection while unread request bytes were still in flight, which caused TCP resets and surfaced as broken pipe/connection reset errors in SDKs instead of a clean Insufficient Storage response. This change drains the remaining upload body before returning the error so the response can be delivered and the connection can close gracefully, preserving correct client-visible behavior under disk-full and quota-exceeded conditions. Fixes #2209	2026-06-24 09:18:25 -07:00
Ben McClelland	27f04ad5ea	feat: add windows functional test coverage and fix some windows behavior This change adds Windows functional test execution in CI and updates backend handling so windows filesystem error/path semantics map correctly to expected S3 outcomes. The only meta supported on windows right now is sidecar, so the tests in windows mode also skip sidecar skips. Future work is to address the skips and/or more clearly document the unsupported/incompatible behavior on windows. The windows support will still remain best effort, but these tests should at least flag when future changes introduce incompatible behavior on windows.	2026-06-18 11:34:05 -07:00
niksis02	f08f76fea4	feat: support x-amz-website-redirect-location Integrate x-amz-website-redirect-location across object metadata flows so uploads, copies, multipart creation, HEAD, and GET preserve and return redirect locations, and website hosting applies object-level redirects from the stored value.	2026-06-10 12:41:55 +04:00
niksis02	1625c5963e	feat: improve static website hosting support Enhances the static website hosting implementation with more complete S3-compatible behavior across request handling, backend storage, validation, CORS, and errors. Adds dedicated website endpoint handling for GET, HEAD, and OPTIONS requests, including index document resolution, error document serving, redirect-all support, pre-fetch and post-error routing rules, query string preservation in redirects, public access checks before object reads, and method-not-allowed responses. Improves error handling for website responses by returning S3-compatible HTML error bodies with request IDs, host IDs, x-amz-error-code, x-amz-error-message, and specialized error fields. This also fixes website-related validation errors to return more accurate S3-style error codes and messages, including invalid redirect protocols, invalid HTTP redirect/error codes, conflicting routing rule replacements, routing rule limits, and oversized website configuration requests. Adds website CORS support for GET, HEAD, and OPTIONS preflight requests, including bucket CORS lookup through website host bucket resolution, allowed origin/method/header validation, exposed header handling, ETag exposure, Vary headers, max-age handling, and CORS access-denied responses. Adds debug logging around website configuration parsing, validation failures, CORS checks, backend lookup failures, and internal website error paths to make failures easier to diagnose. Adds compressed website configuration storage so larger configs fit backend metadata limits, including gzip storage for POSIX extended attributes and base64-encoded compressed metadata for Azure. Also adds Azure PutBucketWebsite, GetBucketWebsite, and DeleteBucketWebsite support. Adds and expands test coverage for website config validation, S3-compatible HTML error bodies, website routing behavior, public access enforcement, HEAD behavior, CORS handling, PutBucketWebsite limits, and end-to-end website hosting through a Docker-based dnsmasq test setup and CI workflow.	2026-06-10 12:41:55 +04:00
Marc Singer	ac5c3b4a86	Add WebsiteConfiguration types, validation, and S3 error codes Add S3 bucket website configuration types with XML serialization support in s3response/website.go. Includes IndexDocument, ErrorDocument, RedirectAllRequestsTo, and RoutingRules with full validation matching AWS S3 behavior. Add corresponding S3 error codes: ErrNoSuchWebsiteConfiguration, ErrInvalidWebsiteConfiguration, ErrInvalidWebsiteSuffix, and ErrInvalidWebsiteRedirectCode. Unit tests cover validation logic, XML round-trip, and parsing. Signed-off-by: Marc Singer <marc@singer.gg> Add website backend interface and implementations for posix and s3proxy Add PutBucketWebsite, GetBucketWebsite, and DeleteBucketWebsite methods to the Backend interface with BackendUnsupported stubs that return ErrNotImplemented. Posix backend stores website config as a metadata attribute (key: 'website') following the same pattern as CORS. ScoutFS inherits via embedding. S3Proxy backend stores website config in the metadata bucket with prefix 'vgw-meta-website-', consistent with existing ACL/policy/CORS metadata storage. Returns ErrNoSuchWebsiteConfiguration when not found. Signed-off-by: Marc Singer <marc@singer.gg> Add website API controllers and wire into router Add PutBucketWebsite, GetBucketWebsite, and DeleteBucketWebsite controller methods following the same pattern as CORS. Controllers parse and validate WebsiteConfiguration XML, check IAM authorization, and delegate to the backend. Replace the three HandleErrorRoute(ErrNotImplemented) stubs in the router with the new controller methods. Regenerate the backend mock to include the new interface methods. Signed-off-by: Marc Singer <marc@singer.gg> Add website index document middleware and wire into router Add ResolveWebsiteIndex middleware that rewrites directory-like object keys (empty or ending with /) to include the IndexDocument suffix when website hosting is enabled. Also handles RedirectAllRequestsTo by returning 301. Wire the middleware into both GetObject and HeadObject handler chains in the router, positioned after BucketObjectNameValidator and before auth. Signed-off-by: Marc Singer <marc@singer.gg>	2026-06-10 12:40:45 +04:00
niksis02	cd0b4e6d9d	fix: normalize object keys during bucket policy evaluation Object key validation allowed internal parent-directory segments such as `public/../private.txt`. Bucket policy and auth checks evaluated the raw key, so a policy allowing bucket/public/* could match the request while posix backend later resolved the key with `filepath.Join` and accessed `bucket/private.txt`. Add backend-specific object key normalization to close that mismatch. The Backend interface now exposes `NormalizeObjectKey` so authorization can evaluate resources using the same key shape a backend will use for storage access. Backends that do not collapse object paths, including Azure and the S3 proxy, inherit `BackendUnsupported.NormalizeObjectKey`. That implementation returns the input key unchanged, avoiding unnecessary normalization and keeping policy evaluation unpolluted for object stores where ../ is part of the key name. posix/scoutfs normalize keys with filepath.Join so policy resources and request keys are compared after internal dot segments are collapsed. Bucket policy evaluation now normalizes both the incoming object key and object resource patterns from the policy before matching. Object lock governance bypass policy checks use the same backend normalizer as well, so retention and legal hold authorization cannot diverge from backend path resolution.	2026-05-27 22:20:39 +04:00
niksis02	9f786b3c2c	feat: global error refactoring Fixes #2123 Fixes #2120 Fixes #2116 Fixes #2111 Fixes #2108 Fixes #2086 Fixes #2085 Fixes #2083 Fixes #2081 Fixes #2080 Fixes #2073 Fixes #2072 Fixes #2071 Fixes #2069 Fixes #2044 Fixes #2043 Fixes #2042 Fixes #2041 Fixes #2040 Fixes #2039 Fixes #2036 Fixes #2035 Fixes #2034 Fixes #2028 Fixes #2020 Fixes #1842 Fixes #1810 Fixes #1780 Fixes #1775 Fixes #1736 Fixes #1705 Fixes #1663 Fixes #1645 Fixes #1583 Fixes #1526 Fixes #1514 Fixes #1493 Fixes #1487 Fixes #959 Fixes #779 Closes #823 Closes #85 Refactor global S3 error handling around structured error types and centralized XML response generation. All S3 errors now share the common APIError base for the fields every error has: Code, HTTP status code, and Message. Non-traditional errors that need AWS-compatible XML fields now have dedicated typed errors in the s3err package. Each typed error implements the shared S3Error behavior so controllers and middleware can handle errors consistently while still emitting error-specific XML fields. Add a dedicated InvalidArgumentError type because InvalidArgument is used widely across request validation, auth, copy source handling, object lock validation, multipart validation, and header parsing. The new InvalidArgument path uses explicit InvalidArgErrorCode constants with predefined descriptions and ArgumentName values, keeping call sites readable while preserving the correct InvalidArgument XML shape and optional ArgumentValue. New structured errors added in s3err: - `AccessForbiddenError`: Method, ResourceType - `BadDigestError`: CalculatedDigest, ExpectedDigest - `BucketError`: BucketName - `ContentSHA256MismatchError`: ClientComputedContentSHA256, S3ComputedContentSHA256 - `EntityTooLargeError`: ProposedSize, MaxSizeAllowed - `EntityTooSmallError`: ProposedSize, MinSizeAllowed - `ExpiredPresignedURLError`: ServerTime, XAmzExpires, Expires - `InvalidAccessKeyIdError`: AWSAccessKeyId - `InvalidArgumentError`: Description, ArgumentName, ArgumentValue - `InvalidChunkSizeError`: Chunk, BadChunkSize - `InvalidDigestError`: ContentMD5 - `InvalidLocationConstraintError`: LocationConstraint - `InvalidPartError`: UploadId, PartNumber, ETag - `InvalidRangeError`: RangeRequested, ActualObjectSize - `InvalidTagError`: TagKey, TagValue - `KeyTooLongError`: Size, MaxSizeAllowed - `MetadataTooLargeError`: Size, MaxSizeAllowed - `MethodNotAllowedError`: Method, ResourceType, AllowedMethods - `NoSuchUploadError`: UploadId - `NoSuchVersionError`: Key, VersionId - `NotImplementedError`: Header, AdditionalMessage - `PreconditionFailedError`: Condition - `RequestTimeTooSkewedError`: RequestTime, ServerTime, MaxAllowedSkewMilliseconds - `SignatureDoesNotMatchError`: AWSAccessKeyId, StringToSign, SignatureProvided, StringToSignBytes, CanonicalRequest, CanonicalRequestBytes Fix CompleteMultipartUpload validation in the Azure backend so missing or empty `ETag` values return the appropriate S3 error instead of allowing a gateway panic. Fix presigned authentication expiration validation to compare server time in `UTC`, matching the `UTC` timestamp used by presigned URL signing. Add request ID and host ID support across S3 requests. Each request now receives AWS S3-like identifiers, returned in response headers as `x-amz-request-id` and `x-amz-id-2` and included in all XML error responses as RequestId and HostId. The generated ID structure is designed to resemble AWS S3 request IDs and host IDs. The request signature calculation/validation for streaming uploads was previously delayed until the request body was fully read, both for Authorization header authentication and presigned URLs. Now, the signature is validated immediately in the authorization middlewares without reading the request body, since the signature calculation itself does not depend on the request body. Instead, only the `x-amz-content-sha256` SHA-256 hash calculation is delayed.	2026-05-21 23:49:34 +04:00
Ben McClelland	6fe0841cc3	Merge pull request #2125 from versity/sis/compress-mp-metadata fix: store object multipart upload metadata compressed	2026-05-13 11:54:17 -07:00
Ben McClelland	2c0844ad88	fix: ignore implicit directories for Get/HeadObject Directories that exist on the filesystem but were not explicitly created via S3 (put-object with a key ending in '/') do not have an etag value. ListObjectsV2 already uses the presence of this attribute to decide whether to include a directory as an object. GetObject and HeadObject were not performing this check, so they would successfully return directories that ListObjectsV2 would not list. Add the etag attribute check in GetObject and HeadObject: if a directory path is requested but has no etag xattr, return 404. This makes all three operations agree on which directories are S3 objects. Fixes #2130	2026-05-12 18:56:00 -07:00
niksis02	5cb5541006	fix: store object multipart upload metadata compressed Store multipart upload metadata through shared backend helpers so POSIX and Azure use the same encode/decode path. POSIX stores raw gzipped JSON in metadata stores, while Azure stores base64-encoded gzip for string metadata. Retrieval falls back to the legacy raw JSON format for existing objects. Storing the mp metadata compressed in posix will guarantee that for any allowed number of parts, the metadata won't exceed the xattr threshold(64KB).	2026-05-08 22:59:34 +04:00
Ben McClelland	21da1d7e70	Merge pull request #2097 from versity/dependabot/go_modules/dev-dependencies-3789651bc5 chore(deps): bump the dev-dependencies group with 7 updates	2026-05-04 16:24:32 -07:00
niksis02	d2fa265fb8	feat: support sha512, md5, xxhash3, xxhash64, xxhash128 data integrity checksums Integrate the new S3 checksum types in the gateway, including `SHA512`, `MD5`, `XXHASH64`, `XXHASH3`, and `XXHASH128`. This adds checksum calculation, validation, schema handling, and test coverage for the expanded checksum support. These external packages have been used: - `github.com/zeebo/xxh3` for `XXHASH3` and `XXHASH128` - `github.com/cespare/xxhash/v2` for `XXHASH64` Adjust integration tests because `aws-sdk-go-v2/service/s3` does not support automatic checksum calculation for the new checksum algorithms and returns an SDK-level error when only the checksum algorithm is provided. Only precalculated checksum values are acceptable for these checksum types. References: - `https://github.com/aws/aws-sdk-go-v2/issues/3404` - `https://github.com/aws/aws-sdk-go-v2/issues/3403`	2026-05-04 08:50:39 -07:00
Ben McClelland	8ae566d44e	feat: add new ErrNoSpaceLeftOnDevice API error for ENOSPC errors Add a new non-AWS error ErrNoSpaceLeftOnDevice (HTTP 507 Insufficient Storage) to s3err. Update all call sites in the posix backend that could return ENOSPC and return the new error when the underlying filesystem has no space remaining. Fixes #2093	2026-04-28 10:34:48 -07:00
Ben McClelland	8c49a3335e	fix: add explicit sidecar metadata cleanup on object/bucket deletion Unlike xattr metadata which is tied to the filesystem object and removed automatically, sidecar metadata lives in a parallel directory tree and must be deleted explicitly. Add DeleteAttributes calls after removing bucket directories, null-version files, and versioned object files. Also add an os.Stat existence check in GetObjectTagging and PutObjectTagging when no versionId is given, since sidecar's StoreAttribute/RetrieveAttribute do not naturally return ErrNotExist for missing objects the way xattr operations do.	2026-04-22 11:55:11 -07:00
niksis02	d6fb9547b8	fix: correct 206 Partial Content response status for ranged GetObject and HeadObject Fixes #2052 Fixes #2056 Fixes #2057 Previously, GetObject and HeadObject used the request's `Range` header to determine the response status code, which caused incorrect 206 responses for invalid Range header values. The status is now driven by whether res.ContentRange is set in the response, rather than by the presence of a range in the request. Backends (posix and azure) now set Content-Range for PartNumber=1 on non-multipart objects, skipping zero-size objects where no range applies. HeadObject was also fixed to return 206 when Content-Range is present, and to only return checksums when the full object is requested.	2026-04-21 02:13:04 +04:00
niksis02	62e8cddbc7	fix: make CompleteMultipartUpload idempotent and add part-number support to GetObject/HeadObject Closes #1064 Use the multipart ETag as the in-progress directory suffix instead of the static `.inprogress` marker so that concurrent CompleteMultipartUpload calls for the same upload ID are all treated as successful (idempotent) rather than racing, where only one succeeded and the rest returned NoSuchUpload. After finalizing the multipart upload, store an `mp-metadata` xattr on the assembled object that records the upload ID and cumulative byte offsets for each part. GetObject and HeadObject now use this metadata to serve individual part ranges via the `partNumber` query parameter, returning a successful response instead of returning NotImplemented. Add two new S3 error codes: - `ErrInvalidPartNumberRange` (416 RequestedRangeNotSatisfiable) — returned when the requested part number exceeds the number of parts in the upload. - `ErrRangeAndPartNumber` (400 BadRequest) — returned when both a Range header and a partNumber query parameter are specified on the same request.	2026-04-20 20:45:58 +04:00
Ben McClelland	2b918d585e	chore: fix spellings and unused function args No logic changes, just janitorial cleanup	2026-04-15 10:32:14 -07:00
Ben McClelland	efd1885d21	Merge pull request #2023 from versity/sis/move-versionid-validation-backend fix: move versionId validation to backend	2026-04-10 11:23:20 -07:00
Anael ORLINSKI	a673900b51	fix for multipart upload when using sidecar meta	2026-04-07 23:37:47 +02:00
niksis02	b473aa0545	fix: move versionId validation to backend Closes #1813 We use a specific `versionId` format(`ulid` package) to generate versionIds in posix, which is not compatible to S3. The versionId validation was performed in frontend which is a potential source of failure for s3 proxy configured on an s3 service which doesn't use ulid for versionId generation(e.g. aws S3). These changes move the specific `ulid` versionId validation to posix to not force any specific versionId format in the gateway.	2026-04-07 01:56:51 +04:00
Ben McClelland	1fca33e738	Merge pull request #2006 from versity/ben/racing-put-delete fix: retry link on ENOENT caused by racing DeleteObject	2026-04-02 09:57:48 -07:00
Ben McClelland	c26012905c	fix: retry link on ENOENT caused by racing DeleteObject A concurrent PutObject and DeleteObject on the same prefix directory can race: PutObject opens an O_TMPFILE in MetaTmpDir (not yet visible in the fs) DeleteObject removes the last visible object in the prefix directory and calls removeParents(), which rmdir's the now-empty prefix directory PutObject's link() tries to link the fd into a parent directory that no longer exists Fix by detecting ENOENT in the final link step (Linkat, Rename, and MoveFile) and retrying after recreating the parent directory. Also extract linkatOTmpfile() to consolidate the Linkat+EEXIST→Renameat logic that was previously inline in link(). Fixes #1988	2026-04-02 08:14:44 -07:00
niksis02	052f2364cc	feat: implement x-amz-source-expected-bucket-owner for CopyObject and UploadPartCopy Closes #1897 Extract the `X-Amz-Source-Expected-Bucket-Owner` header for CopyObject and UploadPartCopy. Verify the source bucket owner in the backend and if the provided access key id doesn't match, return an `AccessDenied` error.	2026-04-01 21:44:33 +04:00
Ben McClelland	e0209ebab4	Merge pull request #1997 from versity/sis/copyobject-threshold fix: enforce 5gb copy source object size threshold.	2026-03-31 12:27:11 -07:00
niksis02	59002b2650	feat: implement integration tests for browser-based POST object	2026-03-31 22:47:04 +04:00
niksis02	bbe246e8ec	fix: enforce 5gb copy source object size threshold. Fixes #1896 Enforces the S3 `5 GiB` copy source size limit across the posix and azure backends for `CopyObject` and `UploadPartCopy`, returning `InvalidRequest` when the source object exceeds the threshold. The limit is now configurable via `--copy-object-threshold` (`VGW_COPY_OBJECT_THRESHOLD`, default 5 GiB). A new `--mp-max-parts flag` (`VGW_MP_MAX_PARTS`, default `10000`) has been added to make multipart upload parts number limit configurable. No integration test has been added, as GitHub Actions cannot reliably handle large objects.	2026-03-31 22:44:03 +04:00
Ben McClelland	13b3dc5267	fix: serialize concurrent CompleteMultipartUpload calls via rename When two requests raced to complete the same multipart upload, the first caller to finish would remove the part files and upload directory. The second caller, already past the initial existence check, would then fail mid-flight with confusing errors such as ErrInvalidPart or an I/O error when trying to open a part that no longer exists. Fix this by atomically renaming the upload directory from <uploadID> to <uploadID>.inprogress at the very start of CompleteMultipartUploadWithCopy, before any part data is read. A concurrent caller will now find the original directory absent and receive a clean NoSuchUpload error. A deferred rename restores the original name if the complete does not succeed, allowing the client to retry. ListMultipartUploads is updated to skip any directories whose name ends in .inprogress so in-flight completes do not appear as pending uploads.	2026-03-27 16:14:20 -07:00
Ben McClelland	0234f3ecc7	Merge pull request #1992 from versity/ben/cleanup-fd fix: cleanup file descriptor leaks with chown fails	2026-03-27 11:43:31 -07:00
Ben McClelland	495b38a899	fix: abort scoutfs multipart uploads on error after successful moveblocks The scoutfs backend uses the move blocks ioctl when combining parts into the final multipart upload object. Once a move blocks from any part is successful, the original data is no longer in the part file. If the multipart upload fails and retries, future complete multipart upload calls will not have the correct data within the part files anymore. To prevent this case, once a move blocks call is successful for an upload, any future failure for the complete upload is set to auto-abort the upload to force clients to re-upload the part data again.	2026-03-26 16:28:42 -07:00
Ben McClelland	927d1d668a	fix: cleanup file descriptor leaks with chown fails We were missing a few cases of cleaning up temp files and file descriptors in the openTmpFile Chown() error cases.	2026-03-26 15:45:20 -07:00
Jakob van Santen	22f04312a7	fix: cleanup tempfiles when error prevents calling link When a client provides an invalid or incomplete body for a single-part upload, the handler returns before the link stage. Factor tempfile removal into cleanup() to catch all cases.	2026-03-20 08:25:44 -07:00
Jakob van Santen	56cb36d45a	feat: add option to disable copy-file-range for multipart uploads The standard io.Copy() will attempt to use copy_file_range when available. However, this can cause problems with certain filesystems. Add an option that will prevent io.Copy() from being able to use copy_file_range to force a standard data copy between file descriptors when needed for completing multipart uploads.	2026-03-20 08:23:01 -07:00
Jakob van Santen	d8f927519a	Do not expose directories that can't be buckets	2026-03-18 15:33:10 +01:00
Ben McClelland	97cc6bf23b	chore: run go modernize tool This is a fixup of the codebase using: go run golang.org/x/tools/go/analysis/passes/modernize/cmd/modernize@latest -fix ./... This has no bahvior changes, and only updates safe changes for modern go features.	2026-03-10 09:47:37 -07:00
niksis02	97bb70509f	fix: change the way object metadata is stored in posix Fixes #1909 Previously, the mapping between object metadata and posix object was as follows: for each metadata key, we stored a separate xattr with the `user.X-Amz-Meta.<key>` prefix. This resulted in syscall overhead when storing and deleting large numbers of metadata keys. In addition, very long metadata keys caused failures because most posix filesystems limit xattr key lengths to 127–255 bytes, while S3 does not enforce such a per-key limit. The logic has now been changed so that all object metadata is stored in a single xattr, `user.metadata`, as a JSON key/value object. For backward compatibility, metadata GET operations still fall back to the old mechanism (`metadata key -> xattr key`) when `user.metadata` is not present. A new CLI utility has been added to convert all legacy object metadata to the new metadata format within the provided directory. Example usage: ``` versitygw utils convert-xattr-metadata path/to/bucket ``` or ``` versitygw utils cxm path/to/bucket ``` It is recommended to run this command on bucket directories to convert all legacy metadata for every object in the bucket.	2026-03-06 01:44:14 +04:00
niksis02	8550dba36f	feat: add tagging support for directory objects in posix Closes #1857 Adds object Tagging support for directory objects in `PutObject` posix. Updates the integration tests to test object metadata and tagging both for file and directory objects.	2026-03-02 18:56:50 +04:00
Ben McClelland	62da7be9cd	Merge pull request #1907 from versity/ben/posix-cleanup chore: fix typos and error return wrapping types in posix	2026-03-02 05:44:07 -08:00
Ben McClelland	fc5c0a36a6	chore: fix typos and error return wrapping types in posix This is just a cleanup of typos, error messages and error types to correctly wrap the returned errors in the posix backend. No major logic changes.	2026-02-28 10:38:29 -08:00
Ben McClelland	7695be56b0	fix: store part checksums at destination path in UploadPartCopy In sidecar mode, the three StoreAttribute/storeChecksums calls after the copy loop were using objPath (the source object path) instead of the destination part's bucket and partPath. This caused checksums and the internal part-crc64nvme to be written under the source object's sidecar directory, making them unresolvable when CompleteMultipartUploadWithCopy tried to retrieve them. All three stores now use *upi.Bucket and partPath, consistent with the etag store directly below them.	2026-02-28 10:24:15 -08:00
Ben McClelland	809ba9e580	Merge pull request #1899 from versity/sis/upload-part-crc64nvme feat: optimize multipart upload checksum calculation.	2026-02-28 09:55:34 -08:00
Ben McClelland	98acad9c99	Merge pull request #1887 from versity/sis/complete-mp-checksum fix: store final checksum on CompleteMultipartUpload	2026-02-28 09:54:55 -08:00
Ben McClelland	0ad928a4d8	Merge pull request #1894 from versity/sis/getobject-directory-object-checksum feat: adds checksums for directory objects in posix	2026-02-28 09:39:51 -08:00
niksis02	d03a33110d	feat: optimize multipart upload checksum calculation. This PR optimizes multipart upload checksum handling. When a checksum algorithm/type is specified at multipart-upload initiation, each `UploadPart` request computes, validates, and stores the corresponding part checksum. During `CompleteMultipartUpload`, the final checksum is derived either via composite checksum calculation or by composing the CRC-family checksums. When no checksum algorithm is specified during multipart-upload initiation, each `UploadPart` may supply a different checksum algorithm for data-integrity verification. To support this scenario, a new mechanism has been implemented: for every `UploadPart`, a crc64nvme checksum is always computed. * If the client uses crc64nvme for the part upload, a single hash reader is used. * Otherwise, two hash readers are used—one for crc64nvme and one for the user-provided checksum. The crc64nvme value is stored in part xattrs under `user.part-crc64nvme` and later used during `CompleteMultipartUpload` as a composable checksum source. In `CompleteMultipartUpload`, the hash reader is entirely removed; the gateway no longer re-reads part data to compute the final checksum. The logic now follows two distinct paths: 1. Checksum algorithm/type specified at MP initiation * All required per-part checksums have already been stored. * If the checksum type is `FULL_OBJECT`, the gateway uses the composable path. * If the type is `COMPOSITE`, the gateway follows the checksum-combining path. 2. No checksum algorithm specified at MP initiation * The gateway loads the stored per-part `crc64nvme` values and composes them to compute the final checksum. The previous `composableCRC` check has been removed because all `FULL_OBJECT` algorithms are inherently composable (`crc32`, `crc32c`, `crc64nvme`). Validation now relies solely on `checksum.Type`.	2026-02-27 15:13:21 +04:00
niksis02	4ebe40829e	fix: store final checksum on CompleteMultipartUpload Previously, if no object checksum type/algorithm was specified when initiating a multipart upload, the CompleteMultipartUpload request would compute the final object’s CRC64NVME checksum but not persist it. This logic has now been fixed, and in the scenario described above the checksum is stored on the final object. There should no longer be any case where a CompleteMultipartUpload request finishes without persisting the final object checksum.	2026-02-27 15:12:57 +04:00
Ben McClelland	b3eac9781f	feat: add concurrency limiter to scoutfs This brings scoutfs in-line with the posix concurrency limiter. This fixes a hang with scoutfs due to not correctly initializing the concurrency in posix leading to a concurrency of 0 allowed. This also adds a sane default to the posix concurrency when not initialized.	2026-02-26 17:34:29 -08:00
niksis02	24364754fd	feat: adds checksums for directory objects in posix Add data-integrity checksum support in `PutObject` in the POSIX backend for directory objects. Since the only way to upload a directory object is via `PutObject`, this logic validates and stores the checksum of the empty payload. Support for `GetObject` has also been added to retrieve and return directory-object checksums.	2026-02-26 22:36:56 +04:00
niksis02	4b11f540cb	feat: add posix concurrency-limiter Closes #1815 Implements posix actions concurrency limiter. Since posix actions perform filesystem-heavy syscalls, a semaphore-based limiter is introduced to cap the maximum number of concurrent posix actions. When the limit is reached, additional action calls block until a slot becomes available. For internal posix calls, the `no_acquire_slot` context key is used to prevent acquiring the limiter multiple times within a single action (e.g., PutObject internally calling PutObjectLegalHold). The posix concurrency limit can be configured via the gateway posix subcommand flag (--concurrency) or the environment variable `VGW_POSIX_CONCURRENCY`. The default value is `5000`.	2026-02-17 14:59:32 +04:00
niksis02	2365f9f1ae	fix: fixes list-limiters parsing and validation Fixes #1809 Fixes #1806 Fixes #1804 Fixes #1794 This PR focuses on correcting so-called "list-limiter" parsing and validation. The affected limiters include: `max-keys`, `max-uploads`, `max-parts`, `max-buckets`, `max-uploads` and `part-number-marker`. When a limiter value is outside the integer range, a specific `InvalidArgument` error is now returned. If the value is a valid integer but negative, a different `InvalidArgument` error is produced. `max-buckets` has its own validation rules: completely invalid values and values outside the allowed range (`1 <= input <= 10000`) return distinct errors. For `ListObjectVersions`, negative `max-keys` values follow S3’s special-case behavior and return a different `InvalidArgument` error message. Additionally, `GetObjectAttributes` now follows S3 semantics for `x-amz-max-parts`: S3 ignores invalid values, so the gateway now matches that behavior.	2026-02-06 14:21:56 +04:00
niksis02	2e6794007c	feat: adds delimiter support in ListMultipartUploads Fixes #1792 Fixes #1747 Fixes #1797 Fixes #1799 This PR primarily introduces delimiter support and several bug fixes for the `ListMultipartUploads` action in the POSIX and Azure backends. Delimiter handling is now implemented — when a delimiter is present in multipart-upload object key names, the backend collects and returns the appropriate common prefixes. This functionality is achieved by introducing a common multipart-upload lister in the backend package. All backends (Azure, POSIX) now use this lister. The lister accepts a list that is already sorted and filtered by `KeyMarker` and `Prefix`. Previously, the `KeyMarker` was required to exactly match an existing multipart-upload object key. This restriction is removed. The listing now relies on a lexicographical comparison between the provided `KeyMarker` and existing multipart-upload object keys. Validation for `UploadIdMarker` is also added to correctly return an `InvalidArgument` error for invalid upload IDs. If `KeyMarker` is missing, the `UploadIdMarker` is ignored entirely. If `KeyMarker` is provided, a valid upload ID is one that matches an upload belonging to the first object key after the KeyMarker. For example, if the `KeyMarker` is `foo`, but the provided `UploadIdMarker` corresponds to an upload under `quxx`, it is invalid. It must match one of the uploads for the next object key equal to `foo`. Finally, this PR fixes multipart-upload sorting. Multipart uploads must be sorted primarily lexicographically by their object key, and secondarily—when multiple uploads share the same object key—by their initiation time in ascending order.	2026-02-06 14:16:16 +04:00
niksis02	7017ffa2a3	fix: removes object metadata loading from posix ListParts In the POSIX `ListParts` implementation, there was a code snippet that loaded the object metadata, even though it wasn’t needed and never used in the response. This redundant code has now been removed.	2026-01-28 21:12:56 +04:00

1 2 3 4 5 ...

334 Commits