versitygw

mirror of https://github.com/versity/versitygw.git synced 2026-07-31 20:36:18 +00:00

Author	SHA1	Message	Date
Ben McClelland	e2821fc855	feat: add option to disable s3proxy client data integrity checks AWS introduced a relatively new option for data integrity checks that not all non-AWS servers support yet. See this for more info: https://docs.aws.amazon.com/AmazonS3/latest/userguide/checking-object-integrity.html This change adds a new option, disable-data-integrity-check, to disable the data integrity checks in the client SDK for servers that may not yet support them. Use this only when the S3 service behind the proxy does not support the data integrity features. Fixes #1867	2026-02-21 11:49:20 -08:00
niksis02	46bcc8af35	fix: fixes object default Content-Type Fixes #1849 If no `Content-Type` is provided during object upload, S3 defaults it to `application/octet-stream`. This behavior was missing in the gateway, causing backends to persist an empty `Content-Type`, which Fiber then overrides with its default `text/plain`. The behavior has now been corrected for the object upload operations: `PutObject`, `CreateMultipartUpload`, and `CopyObject`.	2026-02-18 01:44:52 +04:00
niksis02	4b11f540cb	feat: add posix concurrency-limiter Closes #1815 Implements posix actions concurrency limiter. Since posix actions perform filesystem-heavy syscalls, a semaphore-based limiter is introduced to cap the maximum number of concurrent posix actions. When the limit is reached, additional action calls block until a slot becomes available. For internal posix calls, the `no_acquire_slot` context key is used to prevent acquiring the limiter multiple times within a single action (e.g., PutObject internally calling PutObjectLegalHold). The posix concurrency limit can be configured via the gateway posix subcommand flag (--concurrency) or the environment variable `VGW_POSIX_CONCURRENCY`. The default value is `5000`.	2026-02-17 14:59:32 +04:00
Andrii Bratanin and GitHub	9c212997dc	feat: allow anonymous access for s3proxy backend * Update client.go to support anonymous S3 access * Update s3.go to make access and secret parameters optional * Update example.conf to clarify S3 access and secret usage Fixes #1836	2026-02-11 11:03:02 -08:00
Ben McClelland and GitHub	3856f99904	Merge pull request #1839 from versity/sis/deleteobject-if-match-etag-quotes fix: fixes DeleteObject if-match quoted comparison	2026-02-11 08:18:35 -08:00
niksis02	89aa822a40	fix: fixes DeleteObject if-match quoted comparison Fixes #1835 If-Match in DeleteObject is a precondition header that compares the client-provided ETag with the server-side ETag before deleting the object. Previously, the comparison failed when the client sent an unquoted ETag, because server ETags are stored with quotes. The implementation now trims quotes from both the input ETag and the server ETag before comparison to avoid mismatches. Both quoted and unquoted ETags are valid according to S3.	2026-02-11 16:45:36 +04:00
Ben McClelland	e702a4860a	fix: CopyObject with URL-encoded special chars CopyObject was failing with NoSuchKey when source keys contained special characters like {} or spaces. The X-Amz-Copy-Source header is URL-encoded by clients, but ParseCopySource wasn't decoding before filesystem access. Added url.QueryUnescape() to properly decode bucket and object names, fixing copy operations for keys with special characters. Fixing this also uncovered an error in Azure blob URL encoding with similar special character handling. That fix is included here so that the integration tests pass. Fixes #1832 Fixes #1637	2026-02-10 14:55:18 -08:00
Ben McClelland and GitHub	e805003872	Merge pull request #1816 from versity/sis/max-limiter-errors fix: fixes list-limiters parsing and validation	2026-02-06 09:28:18 -08:00
Ben McClelland and GitHub	b2a9b383ae	Merge pull request #1803 from versity/sis/list-mp-delimiter feat: adds delimiter support in ListMultipartUploads	2026-02-06 09:27:32 -08:00
niksis02	2365f9f1ae	fix: fixes list-limiters parsing and validation Fixes #1809 Fixes #1806 Fixes #1804 Fixes #1794 This PR focuses on correcting so-called "list-limiter" parsing and validation. The affected limiters include: `max-keys`, `max-uploads`, `max-parts`, `max-buckets`, and `part-number-marker`. When a limiter value is outside the integer range, a specific `InvalidArgument` error is now returned. If the value is a valid integer but negative, a different `InvalidArgument` error is produced. `max-buckets` has its own validation rules: completely invalid values and values outside the allowed range (`1 <= input <= 10000`) return distinct errors. For `ListObjectVersions`, negative `max-keys` values follow S3’s special-case behavior and return a different `InvalidArgument` error message. Additionally, `GetObjectAttributes` now follows S3 semantics for `x-amz-max-parts`: S3 ignores invalid values, so the gateway now matches that behavior.	2026-02-06 14:21:56 +04:00
niksis02	2e6794007c	feat: adds delimiter support in ListMultipartUploads Fixes #1792 Fixes #1747 Fixes #1797 Fixes #1799 This PR primarily introduces delimiter support and several bug fixes for the `ListMultipartUploads` action in the POSIX and Azure backends. Delimiter handling is now implemented — when a delimiter is present in multipart-upload object key names, the backend collects and returns the appropriate common prefixes. This functionality is achieved by introducing a common multipart-upload lister in the backend package. All backends (Azure, POSIX) now use this lister. The lister accepts a list that is already sorted and filtered by `KeyMarker` and `Prefix`. Previously, the `KeyMarker` was required to exactly match an existing multipart-upload object key. This restriction is removed. The listing now relies on a lexicographical comparison between the provided `KeyMarker` and existing multipart-upload object keys. Validation for `UploadIdMarker` is also added to correctly return an `InvalidArgument` error for invalid upload IDs. If `KeyMarker` is missing, the `UploadIdMarker` is ignored entirely. If `KeyMarker` is provided, a valid upload ID is one that matches an upload belonging to the first object key after the KeyMarker. For example, if the `KeyMarker` is `foo`, but the provided `UploadIdMarker` corresponds to an upload under `quxx`, it is invalid. It must match one of the uploads for the next object key equal to `foo`. Finally, this PR fixes multipart-upload sorting. Multipart uploads must be sorted primarily lexicographically by their object key, and secondarily—when multiple uploads share the same object key—by their initiation time in ascending order.	2026-02-06 14:16:16 +04:00
John W Higgins	bcf341bdaa	Add "lexical" sort to walker.go	2026-02-01 16:10:23 -08:00
John W Higgins	73e2df4105	Base import of cutdown version of io/fs/walk.go from golang as walker.go	2026-01-29 13:10:01 -08:00
John W Higgins	bef8f38f3e	Revert of `34da1833` and `8e18b431` for backend/walk.go Revert the changes with regards to lexical sorting but only for the walk.go code itself. Leave the tests alone.	2026-01-29 13:10:01 -08:00
niksis02	7017ffa2a3	fix: removes object metadata loading from posix ListParts In the POSIX `ListParts` implementation, there was a code snippet that loaded the object metadata, even though it wasn’t needed and never used in the response. This redundant code has now been removed.	2026-01-28 21:12:56 +04:00
khambrecht and GitHub	f29206afb6	fix: put object on windows when parent directories don't already exist The previous logic did not allow put-object on Windows when the parent directory did not already exist, and would not always return the correct error if an ancestor in the path already existed as a file. The problem is that os.Stat behaves differently on Windows than on Unix-like systems in the function PutObjectWithPostFunc in backend/posix/posix.go. On Unix-like systems, os.Stat returns ENOTDIR if the parent object is a file instead of a directory. On Windows, if the parent object does not exist at all, os.Stat returns ERROR_PATH_NOT_FOUND, which is mapped to ENOTDIR. That mapping is inappropriate in this case: the return code is incorrectly interpreted as if the parent object were a file rather than nonexistent, which then leads to a failed upload. This fix validates the existing parent structure on put to make sure the correct error is returned or the put is successful. Fixes #1702	2026-01-27 08:51:06 +09:00
Ben McClelland and GitHub	01fc142c1e	fix: correct spelling for debuglogger.InternalError() (#1784)	2026-01-24 06:44:54 -08:00
niksis02	86e2b02e55	fix: fixes delete markers access for some actions Fixes #1766 Fixes #1750 This PR focuses on two bug fixes: First, it blocks access to delete `DeleteMarkers` for the following operations by returning a `MethodNotAllowed` error: `PutObjectTagging`, `GetObjectTagging`, `DeleteObjectTagging`, `PutObjectLegalHold`, `GetObjectLegalHold`, `PutObjectRetention`, and `GetObjectRetention`. Second, it removes the access check that previously prevented deleting a delete marker locked by a bucket default retention rule. A delete marker should always be allowed to be deleted.	2026-01-20 16:24:46 +04:00
niksis02	43559e646e	fix: fixes non-existing object deletion with versionId Fixes #1757 Fixes #1758 When attempting to delete a non-existing object in a versioning-enabled bucket while specifying a `versionId`, VersityGW previously returned an internal error if the object had a parent file object, and an `InvalidArgument` error if the object did not exist. This PR fixes both behaviors and now returns a successful response that includes the `versionId`.	2026-01-16 15:00:47 +04:00
niksis02	2a7e76a44f	fix: fixes missing bucket object lock config error Fixes #1751 When an object lock–related operation is performed on an object in a bucket where Object Lock is not enabled, an `InvalidRequest` error is returned; however, the error message differs for some actions. This PR introduces a new error, `ErrMissingObjectLockConfigurationNoSpaces`, for `PutObject`, `CopyObject`, and `CreateMultipartUpload` to maintain compatibility with S3 in terms of the error message. It also adds the missing integration tests for these actions.	2026-01-14 13:41:50 +04:00
niksis02	06f4f0ac15	fix: skips object lock check in DeleteObject without versionId. Fixes #1741 An object delete request without a `versionId` results in the creation of a new delete marker in versioning-enabled buckets. Even if the latest object version is locked, a new delete marker must still be created. This implementation skips the object lock check for delete requests in versioning-enabled buckets when the `versionId` is missing, allowing the delete marker to be created as expected. Additionally, it introduces a flag in the `createObjVersion` method in POSIX to remove unnecessary xattr attributes from an object after creating a new object version. A delete marker must not carry object-specific attributes such as tagging, legal hold, or retention. Currently, the cleanup is limited to legal hold and retention attributes, but this list will be expanded after fixing issue #1751.	2026-01-13 16:50:54 +04:00
Ben McClelland and GitHub	d05d29010d	Merge pull request #1739 from versity/sis/create-bucket-and-owner feat: implements admin CreateBucket endpoint/cli command	2026-01-12 10:09:58 -08:00
Ben McClelland and GitHub	b1e9dead5d	Merge pull request #1748 from loktionovam/fix-meta-sidecar-cleanup-performance fix: optimize sidecar empty-dir checks	2026-01-12 08:56:33 -08:00
niksis02	2561ef9708	feat: implements admin CreateBucket endpoint/cli command Closes #1731 Implements the admin `CreateBucket` (`PATCH /:bucket/create`) endpoint and CLI command, which create a new bucket with the provided owner access key ID. The endpoint internally calls the S3 `CreateBucket` API, storing the new owner information in the request context under the `bucket-owner` key. This value is then retrieved by the S3 API layer and the backends. The endpoint uses the custom `x-vgw-owner` HTTP header to pass the bucket owner access key ID. The admin CLI command mirrors `aws s3api create-bucket` and supports all flags implemented by the gateway (for example, `--create-bucket-configuration`, `--acl`, `--object-ownership`, etc.).	2026-01-12 14:32:52 +04:00
Aleksandr Loktionov	b78d21c3db	fix: optimize sidecar empty-dir checks	2026-01-12 06:51:18 -03:00
Dave Cottlehuber	0cab42d9fe	xattr: use different namespace prefixes for FreeBSD vs other platforms Go's stdlib seems to handle the FreeBSD user. namespace directly, or FreeBSD itself doesn't require it. Make this a platform-specific feature. Fixes: #1745	2026-01-10 16:43:33 +00:00
Ben McClelland and GitHub	841a012ce0	Merge pull request #1728 from versity/sis/get-object-empty-tagging fix: removes the NoSuchTagSet error in GetObjectTagging	2026-01-03 20:51:08 -08:00
niksis02	06a45124b1	fix: removes the NoSuchTagSet error in GetObjectTagging Fixes #1686 GetObjectTagging previously returned a `NoSuchTagSet` error when no object tags were set. This has been fixed, and an empty tag set is now returned instead.	2026-01-02 23:31:35 +04:00
niksis02	a75aa9bad5	fix: fixes if-none-match precondition header logic in object write operations Fixes #1708 This PR focuses on evaluating the `x-amz-if-none-match` precondition header for object PUT operations. If any value other than `*` is provided, a `NotImplemented` error is returned. If `If-Match` is used together with `If-None-Match`, regardless of the value combination, a `NotImplemented` error is returned. When only `If-None-Match: *` is specified, a `PreconditionFailed` error is returned if the object already exists in `PutObject` or `CompleteMultipartUpload`; if the object does not exist, object creation is allowed.	2026-01-02 22:59:13 +04:00
Ben McClelland and GitHub	4cbd58cc66	Merge pull request #1717 from loktionovam/fix-meta-sidecar-cleanup fix: cleanup sidecar metadata empty dirs	2025-12-31 00:44:19 -08:00
niksis02	61308d2fbf	fix: return NoSuchKey if a precondition header is present and object doesn't exist in PutObject, CompleteMultipartUpload Fixes #1709 If any precondition header (`If-Match`, `If-None-Match`) is present in `PutObject` or `CompleteMultipartUpload` and there's no object in the bucket with the given key, a `NoSuchKey` error is now returned. Previously the headers were simply ignored and new object creation was allowed.	2025-12-30 12:02:49 +04:00
Aleksandr Loktionov and Aleksandr Loktionov	edac345c23	fix: cleanup sidecar metadata empty dirs	2025-12-29 08:24:05 -03:00
niksis02	5aa2a822e8	fix: Makes precondition headers insensitive to whether the value is quoted Fixes #1710 The `If-Match` and `If-None-Match` precondition header values represent object ETags. ETags are generally quoted; however, S3 evaluates precondition headers equivalently whether the ETag is quoted or not, comparing only the underlying value and ignoring the quotes if present. The new implementation trims quotes from the ETag in both the input precondition header and the object metadata, ensuring that comparisons are performed purely on the ETag value and are insensitive to quoting.	2025-12-28 13:51:33 +04:00
niksis02	d507f206f3	fix: fixes the GetObjectAttributes panic in s3 proxy The error check for the SDK call in `GetObjectAttributes` in the S3 proxy backend was missing, which caused the gateway to panic in all cases where the SDK method returned an error. The error check has now been added so that the method returns an error when the SDK call fails.	2025-12-15 17:24:45 +04:00
niksis02	c58f9b20e0	feat: adds integration tests for unsigned streaming payload trailer uploads	2025-12-03 01:32:18 +04:00
niksis02	d861dc8e30	fix: fixes unsigned streaming upload parsing and checksum calculation Fixes #1600 Fixes #1603 Fixes #1607 Fixes #1626 Fixes #1632 Fixes #1652 Fixes #1653 Fixes #1656 Fixes #1657 Fixes #1659 This PR focuses mainly on unsigned streaming payload trailer request payload parsing and checksum calculation. For streaming uploads, there are essentially two ways to specify checksums: 1. via `x-amz-checksum-*` headers, 2. via `x-amz-trailer`; if neither is given, the checksum should default to crc64nvme. Previously, the implementation calculated the checksum only from `x-amz-checksum-*` headers. Now, `x-amz-trailer` is also treated as a checksum-related header and indicates the checksum algorithm for streaming requests. If `x-amz-trailer` is present, the payload must include a trailing checksum; otherwise, an error is returned. `x-amz-trailer` and any `x-amz-checksum-*` header cannot be used together; doing so results in an error. If `x-amz-sdk-checksum-algorithm` is specified, then either `x-amz-trailer` or one of the `x-amz-checksum-*` headers must also be present, and the algorithms must match. If they don’t, an error is returned. The old implementation used to return an internal error when no `x-amz-trailer` was received in streaming requests or when the payload didn’t contain a trailer. This is now fixed. Checksum calculation used to happen twice in the gateway (once in the chunk reader and once in the backend). A new `ChecksumReader` is introduced to prevent double computation, and the trailing checksum is now read by the backend from the chunk reader. The logic for stacking `io.Reader`s in the Fiber context is preserved, but extended: once a `ChecksumReader` is stacked, all following `io.Reader`s are wrapped with `MockChecksumReader`, which simply delegates to the underlying checksum reader. In the backend, a simple type assertion on `io.Reader` provides the necessary checksum metadata (algorithm, value, etc.).	2025-12-03 01:32:18 +04:00
niksis02	1d0a1d8261	fix: fixes the panic in GetBucketVersioning in s3 proxy Fixes #1649 `GetBucketVersioning` used to cause a panic in the s3 proxy backend because of improper error handling. Now the error returned from the SDK method is explicitly checked before returning the response.	2025-11-17 20:13:34 +04:00
Ben McClelland	3c3e9dd8b1	feat: add project id support for scoutfs backend The scoutfs filesystem allows setting project IDs on files and directories for project level accounting tracking. This adds the option to set the project id for the following: create bucket, put object, put part, complete multipart upload. The project id will only be set if all of the following are true: - set project id option enabled - filesystem format version supports projects (version >1) - account project id > 0	2025-11-14 15:36:10 -08:00
niksis02	8bb4bcba63	fix: fixes NoSuchVersion errors for some actions in posix Fixes #1616 Some object-level actions in the gateway that work with object versions used to return `InvalidVersionId` when the specified object version did not exist. The logic has now been fixed, and they correctly return `NoSuchVersion`. These actions include: `HeadObject`, `GetObject`, `PutObjectLegalHold`, `GetObjectLegalHold`, `PutObjectRetention`, and `GetObjectRetention`.	2025-11-10 19:44:20 +04:00
niksis02	77459720ba	feat: adds x-amz-tagging-count support for HeadObject Closes #1346 `GetObject` and `HeadObject` return the `x-amz-tagging-count` header in the response, which specifies the number of tags associated with the object. This was already supported for `GetObject`, but missing for `HeadObject`. This implementation adds support for `HeadObject` in `azure` and `posix` and updates the integration tests to cover this functionality for `GetObject`.	2025-11-05 20:30:50 +04:00
niksis02	8d2eeebce3	feat: adds tagging support for object versions in posix Closes #1343 Object version tagging support was previously missing in the gateway. The support is added with this PR. If versioning is not enabled at the gateway level and a user attempts to put, get, or delete object version tags, the gateway returns an `InvalidArgument` (Invalid versionId) error.	2025-11-04 23:51:22 +04:00
niksis02	9bde1ddb3a	feat: implements tagging support for CreateBucket Closes #1595 This implementation diverges from AWS S3 behavior. The `CreateBucket` request body is no longer ignored. Based on the S3 request body schema, the gateway parses only the `LocationConstraint` and `Tags` fields. If the `LocationConstraint` does not match the gateway’s region, it returns an `InvalidLocationConstraint` error. In AWS S3, tagging during bucket creation is supported only for directory buckets. The gateway extends this support to general-purpose buckets. If the request body is malformed, the gateway returns a `MalformedXML` error.	2025-10-31 00:59:56 +04:00
Ben McClelland and GitHub	69a3483269	Merge pull request #1592 from versity/sis/bucket-object-tag-validation fix: fixes the bucket/object tagging key/value name validation	2025-10-20 12:21:01 -07:00
Ben McClelland and GitHub	d256ea5929	Merge pull request #1589 from versity/sis/complete-mp-composite-checksum fix: fixes the composite checksums in CompleteMultipartUpload	2025-10-20 09:25:17 -07:00
niksis02	ebf7a030cc	fix: fixes the bucket/object tagging key/value name validation Fixes #1579 S3 enforces a specific rule for validating bucket and object tag key/value names. This PR integrates the regexp pattern used by S3 for tag validation. Official S3 documentation for tag validation rules: [AWS S3 Tag](https://docs.aws.amazon.com/AmazonS3/latest/API/API_control_Tag.html) There are two types of tagging inputs for buckets and objects: 1. On existing buckets/objects — used in the `PutObjectTagging` and `PutBucketTagging` actions, where tags are provided in the request body. 2. On object creation — used in the `PutObject`, `CreateMultipartUpload`, and `CopyObject` actions, where tags are provided in the request headers and must be URL-encoded. This implementation ensures correct validation for both types of tag inputs.	2025-10-20 15:19:38 +04:00
niksis02	932f1c9da7	fix: sets crc64nvme as default checksum for complete mp action Fixes #1547 When no checksum is specified during multipart upload initialization, the complete multipart upload request should default to CRC64NVME FULL_OBJECT. The checksum will not be stored in the final object metadata, as it is used solely for data integrity verification. Note that although CRC64NVME is composable, it is calculated using the standard hash reader, since the part checksums are missing and the final checksum calculation is instead based directly on the parts data.	2025-10-17 17:18:29 +04:00
niksis02	24679a82ac	fix: fixes the composite checksums in CompleteMultipartUpload Fixes #1359 The composite checksums in CompleteMultipartUpload generally follow the format `checksum-<number_of_parts>`. Previously, the gateway treated composite checksums as regular checksums without distinguishing between the two formats. In S3, the `x-amz-checksum-*` headers accept both plain checksum values and the `checksum-<number_of_parts>` format. However, after a successful `CompleteMultipartUpload` request, the final checksum is always stored with the part number included. This implementation adds support for parsing both formats: checksums with and without the part number. From now on, composite checksums are consistently stored with the part number included. Additionally, two integration tests are added: * One verifies the final composite checksum with part numbers. * Another ensures invalid composite checksums are correctly rejected.	2025-10-17 16:45:07 +04:00
Ben McClelland	40da4a31d3	chore: cleanup unused constants We have some leftover constants from some previous changes. This just cleans up all that are no longer needed.	2025-10-09 12:19:00 -07:00
Ben McClelland	4c3965d87e	feat: add option to disable strict bucket name checks Some systems may choose to allow non-AWS-compliant bucket names and/or handle the bucket name validation in the backend instead. This adds the option to turn off the strict bucket name validation checks in the frontend API handlers. When frontend bucket name validation is disabled, we need to do sanity checks for posix compliant names in the posix/scoutfs backends. This is automatically enabled when strict bucket name validation is disabled. Fixes #1564	2025-10-08 14:34:52 -07:00
niksis02	a606e57bbd	fix: correct a few object lock behaviors Fixes #1565 Fixes #1561 Fixes #1300 This PR focuses on three main changes: 1. Prioritizing object-level lock configuration over bucket-level default retention. When an object is uploaded with a specific retention configuration, it takes precedence over the bucket’s default retention set via `PutObjectLockConfiguration`. If the object’s retention expires, the object must become available for write operations, even if the bucket-level default retention is still active. 2. Preventing object lock configuration from being disabled once enabled. To align with AWS S3 behavior, once object lock is enabled for a bucket, it can no longer be disabled. Previously, sending an empty `Enabled` field in the payload would disable object lock. Now, this behavior is removed: an empty `Enabled` field will result in a `MalformedXML` error. This creates a challenge for integration tests that need to clean up locked objects in order to delete the bucket. To handle this, a method has been implemented that: * Removes any legal hold if present. * Applies a temporary retention with a "retain until" date set 3 seconds ahead. * Waits for 3 seconds before deleting the object and bucket. 3. Allowing object lock to be enabled on existing buckets via `PutObjectLockConfiguration`. Object lock can now be enabled on an existing bucket if it wasn’t enabled at creation time. * If versioning is enabled at the gateway level, the behavior matches AWS S3: object lock can only be enabled when bucket versioning status is `Enabled`. * If versioning is not enabled at the gateway level, object lock can always be enabled on existing buckets via `PutObjectLockConfiguration`. * In Azure (which does not support bucket versioning), enabling object lock is always allowed. This change also fixes the error message returned in this scenario for better clarity.	2025-10-03 00:18:46 +04:00

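
The quote-insensitive ETag comparison described in commits 89aa822a40 and 5aa2a822e8 amounts to trimming quotes from both sides before comparing. A minimal sketch, assuming a hypothetical helper named `etagsMatch` (the gateway's own function names are not shown in the log):

```go
package main

import (
	"fmt"
	"strings"
)

// etagsMatch is a hypothetical helper: it strips surrounding double quotes
// from both the client-supplied ETag and the stored ETag, then compares the
// remaining values.
func etagsMatch(clientETag, serverETag string) bool {
	return strings.Trim(clientETag, `"`) == strings.Trim(serverETag, `"`)
}

func main() {
	fmt.Println(etagsMatch(`abc123`, `"abc123"`))   // prints true
	fmt.Println(etagsMatch(`"abc123"`, `"abc123"`)) // prints true
	fmt.Println(etagsMatch(`abc123`, `"def456"`))   // prints false
}
```

Trimming both operands, rather than adding quotes to the client value, keeps the comparison correct regardless of how a backend happens to store the ETag.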