versitygw

mirror of https://github.com/versity/versitygw.git synced 2026-01-09 13:03:09 +00:00

Author	SHA1	Message	Date
niksis02	394675a5a8	feat: implements unit tests for controller utilities	2025-07-22 20:55:23 +04:00
niksis02	ba76aea17a	feat: adds unit tests for the object HEAD and GET controllers.	2025-07-22 20:55:22 +04:00
niksis02	d2038ca973	feat: implements advanced routing for HeadObject and bucket PUT operations.	2025-07-22 20:55:22 +04:00
niksis02	b8456bc5ab	feat: implements advanced routing system for the bucket get operations. Closes #908 This PR introduces a new routing system integrated with Fiber. It matches each S3 action to a route using middleware utility functions (e.g., URL query match, request header match). Each S3 action is mapped to a dedicated route in the Fiber router. This functionality cannot be achieved using standard Fiber methods, as Fiber lacks the necessary tooling for such dynamic routing. Additionally, this PR implements a generic response handler to manage responses from the backend. This abstraction helps isolate the controller from the data layer and Fiber-specific response logic. With this approach, controller unit testing becomes simpler and more effective.	2025-07-22 20:55:22 +04:00
Ben McClelland	003bf5db0b	fix: convert deprecated fasthttp VisitAll() to All() An update to fasthttp has deprecated the VisitAll() method for an iterator function All() that can be used to range over all headers. This should fix the staticcheck warnings for calling the deprecated function.	2025-07-07 22:34:01 -07:00
Ben McClelland	7b8b483dfc	feat: calculate full object crc for multi-part uploads for compatible checksums The CRC32, CRC32c, and CRC64NVME data integrity checksums support calculating the composite full object values for multi-part uploads using the checksum and length of the individual parts. Previously, we were reading all of the part data to recalculate the full object checksum values during the complete multipart upload call. This disabled the optimized copy_file_range() for certain filesystems such as XFS because the part data was being read. If the data is not read, and the file handle is passed directly to io.Copy(), then the filesystem is allowed to optimize the copying of the data from the source to destination files. This now allows both the optimized copy_file_range() optimizations as well as the data integrity features enabled for support composite checksum types.	2025-07-03 19:58:53 -07:00
niksis02	458db64e2d	feat: implements public bucket access. This implementation introduces public buckets, which are accessible without signature-based authentication. There are two ways to grant public access to a bucket: * Bucket ACLs * Bucket Policies Only `Get` and `List` operations are permitted on public buckets. All write operations require authentication, regardless of whether public access is granted through an ACL or a policy. The implementation includes an `AuthorizePublicBucketAccess` middleware, which checks if public access has been granted to the bucket. If so, authentication middlewares are skipped. For unauthenticated requests, appropriate errors are returned based on the specific S3 action. --- 1. Bucket-Level Operations: ```json { "Version": "2012-10-17", "Statement": [ { "Effect": "Allow", "Principal": "", "Action": "s3:", "Resource": "arn:aws:s3:::test" } ] } ``` 2. Object-Level Operations: ```json { "Version": "2012-10-17", "Statement": [ { "Effect": "Allow", "Principal": "", "Action": "s3:", "Resource": "arn:aws:s3:::test/" } ] } ``` 3. Both Bucket and Object-Level Operations:* ```json { "Version": "2012-10-17", "Statement": [ { "Effect": "Allow", "Principal": "", "Action": "s3:", "Resource": "arn:aws:s3:::test" }, { "Effect": "Allow", "Principal": "", "Action": "s3:", "Resource": "arn:aws:s3:::test/*" } ] } ``` --- ```sh aws s3api create-bucket --bucket test --object-ownership BucketOwnerPreferred aws s3api put-bucket-acl --bucket test --acl public-read ```	2025-07-02 00:11:10 +04:00
Ben McClelland	7260854cd0	fix: add object path validation util This adds an object name validation util to check if the object path would resolve to a path outside of the bucket directory. S3 returns Bad Request for these type of paths: % aws s3api put-object --bucket mybucket --key test/../../hello An error occurred (400) when calling the PutObject operation: Bad Request	2025-07-01 09:24:29 -07:00
niksis02	dbc710da2d	feat: implements host-style bucket addressing in the gateway. Closes #803 Implements host-style bucket addressing in the gateway. This feature can be enabled by running the gateway with the `--virtual-domain` flag and specifying a virtual domain name. Example: ```bash ./versitygw -a user -s secret --virtual-domain localhost:7070 posix /tmp/vgw ``` The implementation follows this approach: it introduces a middleware (`HostStyleParser`) that parses the bucket name from the `Host` header and appends it to the URL path. This effectively transforms the request into a path-style bucket addressing format, which the gateway already supports. With this design, the gateway can handle both path-style and host-style requests when running in host-style mode. For local testing, one can either set up a local DNS server to wildcard-match all subdomains of a specified domain and resolve them to the local IP address, or manually add entries to `/etc/hosts` to resolve bucket-prefixed hosts to the server IP (e.g., `127.0.0.1`).	2025-05-22 00:36:45 +04:00
niksis02	23b5e60854	feat: adds debug logging for chunk readers. Closes #1221 Adds debug logging for `signed`/`unsigned` chunk readers. Adds the `debuglogger.Infof` log method, which prints out green info logs with `[INFO]:` prefix. The debug logging inclues some chunk details: size, signature, trailers. It also prints out stash/release stash operations. The error cases are logged with standart yellow `[DEBUG]:` prefix. The `String to sign` block in signed chunk reader is logged in purple horizontal borders with title.	2025-05-08 00:22:01 +04:00
Ben McClelland	a9fcf63063	feat: cleanup calling of debuglogger with managed debug setting	2025-05-02 17:05:59 -07:00
niksis02	2b1e1af89b	fix: Adds validation for Content-Length in upload operations. Fixes #961 Fixes #1248 The gateway should return a `MissingContentLength` error if the `Content-Length` HTTP header is missing for upload operations (`PutObject`, `UploadPart`). The second fix involves enforcing a maximum object size limit of `5 * 1024 * 1024 * 1024` bytes (5 GB) by validating the value of the `Content-Length` header. If the value exceeds this limit, the gateway should return an `EntityTooLarge` error.	2025-04-30 14:20:28 +04:00
niksis02	6fa58db6ab	fix: fixes the signed chunk encoding reader stashing. Fixes #1238 The signed chunk reader stashes the header bytes if it can't fully parse the chunk header. On the next `io.Reader` call, the stash is combined with the new buffer data to attempt parsing the header again. The stashing logic was broken due to the premature removal of the first two header bytes (`\r\n`). As a result, the stash was incomplete, leading to parsing issues on subsequent calls. These changes fix the stashing logic and correct the buffer offset calculation in `parseChunkHeaderBytes`.	2025-04-25 19:06:57 +04:00
niksis02	f831578d51	fix: handles tag parsing error cases for PutBucketTagging and PutObjectTagging Fixes #1214 Fixes #1231 Fixes #1232 Implements `utils.ParseTagging` which is a generic implementation of parsing tags for both `PutObjectTagging` and `PutBucketTagging`. - The actions now return `MalformedXML` if the provided request body is invalid. - Adds validation to return `InvalidTag` if duplicate keys are present in tagging. - For invalid tag keys, it creates a new error: `ErrInvalidTagKey`.	2025-04-23 20:35:19 +04:00
niksis02	bbb5a22c89	feat: makes debug loggin prettier. Adds missing logs in FE and utility functions Added missing debug logs in the `front-end` and `utility` functions. Enhanced debug logging with the following improvements: - Each debug message is now prefixed with [DEBUG] and appears in color. - The full request URL is printed at the beginning of each debug log block. - Request/response details are wrapped in framed sections for better readability. - Headers are displayed in a colored box. - XML request/response bodies are pretty-printed with indentation and color.	2025-04-17 22:46:05 +04:00
Ben McClelland	7866c326e3	Merge pull request #1190 from versity/sis/empty-checksum-headers fix: Handles the error cases for empty checksum headers for PutObject and UploadPart	2025-04-08 10:19:34 -07:00
niksis02	5560e5ebc4	fix: Returns ErrSignatureVersionNotSupported for sigV2 signed requests. Fixes #1171 As signature v2 is depracated the gateway doesn't support it. AWS S3 supports signature version 2 in some regions. For some regions the request fails with error: ``` InvalidRequest: The authorization mechanism you have provided is not supported. Please use AWS4-HMAC-SHA256. ``` The PR makes this change to return unsupported authorization mechanism for `sigV2` requests.	2025-04-08 01:05:59 +04:00
niksis02	ed44fe1969	fix: Handles the error cases for empty checksum headers for PutObject and UploadPart Fixes #1186 Fixes #1188 Fixes #1189 If multiple checksum headers are provided, no matter if they are empty or not, the gateway should return `(InvalidRequest): Expecting a single x-amz-checksum- header. Multiple checksum Types are not allowed.` An empty checksum header is considered as invalid, because it's not valid crc32, crc32c ...	2025-04-04 23:17:22 +04:00
niksis02	7d4076b944	fix: Fixes the signed chunk trailer encoding to return proper api errors for invalid and incorrect checksums. Fixes #1165 The signed chunk encoding with trailers should return api error for: 1. Invalid checksum - `(InvalidRequest) Value for x-amz-checksum-x trailing header is invalid.` 2. Incorrect checksum - `(BadDigest) The x you specified did not match the calculated checksum.` Where `x` could be crc32, crc32c, sha1 ...	2025-03-29 01:46:45 +04:00
niksis02	832371afb1	fix: Fixes the case for GetObjectAttributes to return InvalidArgument if a single invalid object attribute is provided. Fixes #1000 `GetObjectAttributes` returned `InvalidRequest` instead of `InvalidArgument` with description `Invalid attribute name specified.`. Fixes the logic in `ParseObjectAttributes` to ignore empty values for `X-Amz-Object-Attributes` headers to return `InvalidArgument` if all the specified object attributes are invalid.	2025-03-28 07:27:35 +04:00
niksis02	a4b7feaa70	feat: Implements signed chunk encoding with trailer in the gateway. Closes #1159 Fixes #1161 Implements signed chunk encoding with trailers in the gateway. The signed encoding (both with and without trailers) is now handled by the `ChunkReader`. Fixes the `ChunkReader` implementation to validate encoding headers byte by byte. The chunk encoding with trailers follows the general signed chunk encoding pattern, but the final chunk includes the trailing signature (`x-amz-trailing-signature`) and the checksum header (`x-amz-checksum-x`, where `x` can be `crc32`, `crc32c`, `sha1`, `sha256`, or `crc64nvme`). Adds validation for the `X-Amz-Trailer` header.	2025-03-27 20:56:49 +04:00
niksis02	4496711695	fix: Adds check for the final chunk signature in signed chunk encoding without trailing headers. Fixes #1147 The final chunk header with 0 length, contains the last signature in signed chunk encoding implementation. Added this last signature verification in the signed chunk encoding without trailers.	2025-03-20 18:19:14 +04:00
niksis02	9e0f56f807	fix: Fixes the returned error type for object legal hold status and object lock mode in PutObject, CopyObject and CreateMultipartUpload. Fixes #1141 Fixes #1142 Changes the error type to `InvalidArgument` for `x-amz-object-lock-legal-hold` and `x-amz-object-lock-mode` headers invalid values.	2025-03-18 13:58:49 +04:00
niksis02	7d6505ec06	fix: Adds validation for x-amz-checksum- headers. Makes x-amz-sdk-checksum-algorithm header case insensitive	2025-03-05 22:06:20 +04:00
Ben McClelland	85ba390ebd	fix: utils StreamResponseBody() memory use for large get requests The StreamResponseBody() called ctx.Write() in a loop with a small buffer in an attempt to stream data back to client. But the ctx.Write() was just calling append buffer to the response instead of streaming the data back to the client. The correct way to stream the response back is to use (ctx *fasthttp.RequestCtx).SetBodyStream() to set the body stream reader, and the response will automatically get streamed back using the reader. This will also call Close() on our body since we are providing an io.ReadCloser. Testing this should be done with single large get requests such as aws s3api get-object --bucket bucket --key file /tmp/data for very large objects. The testing shows significantly reduced memory usage for large objects once the streaming is enabled. Fixes #1082	2025-02-26 11:20:41 -08:00
niksis02	e7b9db1a1f	fix: Reconstructes the authentication handlers: initializes the chunk reader based on x-amz-content-sha256 header value. Adds the MissingContentHeader error if x-amz-decoded-content-length header is missing for chunk uploads	2025-02-25 21:17:04 +04:00
niksis02	e5811e4ce7	fix: Fixes the entity limiter validation for ListObjects(V2), ListParts, ListMultipartUploads, ListBuckets actions	2025-02-20 15:45:42 +04:00
niksis02	132d0ae631	feat: Adds the CRC64NVME checksum support in the gateway. Adds checksum-type support for the checksum implementation	2025-02-16 17:10:06 +04:00
niksis02	6956757557	feat: Integrates object integrity checksums(CRC32, CRC32C, SHA1, SHA256) into the gateway	2025-02-14 14:14:00 +04:00
Ben McClelland	3b1fcf2f08	fix: chunk encoding with incorrect chunk signature We were getting errors such as: 2025/02/07 19:24:28 Internal Error, write object data: write exceeds content length 87 whenever the chunk encoding did not have the correct chunk signatures. The issue was that the chunk encoding reader was reading from the underlying reader and then passing the full buffer read back to the caller if the underlying reader returned an error. This meant that we were not processing the chunk headers within the buffer due to the higher level error, and would possibly hand back the longer unprocessed chunk encoded stream to the caller that was in turn trying to write to the object file exceeding the content length limit. Fixes #1056	2025-02-07 19:33:10 -08:00
Ben McClelland	748912fb3d	fix: prevent panic with malformed chunk encoding An invalid chunk encoding, or parse errors leading to parsing invalid data can lead to a server panic if the chunk header remaining is determined to be larger than the max buffer size. This was previously seen when the chunk trailer checksums were used by the client without the support from the server side for this encoding. Example panic: panic: runtime error: slice bounds out of range [4088:1024] goroutine 5 [running]: github.com/versity/versitygw/s3api/utils.(ChunkReader).parseChunkHeaderBytes(0xc0003c4280, {0xc0000e6000?, 0x3000?, 0x423525?}) /home/tester/s3api/utils/chunk-reader.go:242 +0x492 github.com/versity/versitygw/s3api/utils.(ChunkReader).parseAndRemoveChunkInfo(0xc0003c4280, {0xc0000e6000, 0x3000, 0x8000}) /home/tester/s3api/utils/chunk-reader.go:170 +0x20b github.com/versity/versitygw/s3api/utils.(*ChunkReader).Read(0xc0003c4280, {0xc0000e6000, 0xc0000b41e0?, 0x8000}) /home/tester/s3api/utils/chunk-reader.go:91 +0x11e This fix will validate the data length before copying into the temporary buffer to prevent a panic and instead just return an error.	2025-01-31 16:22:25 -08:00
niksis02	da3c6211bd	feat: Implements streaming unsigned payload reader with trailers	2025-01-31 13:29:34 -08:00
niksis02	c094086d83	fix: Fixes the response body streaming for GetObject, implementing a chunk streamer	2025-01-15 23:11:04 +04:00
jonaustin09	66c13ef982	fix: Adds a check to ensure the x-amz-object-attributes header is set and non-empty.	2024-10-31 17:05:54 -04:00
jonaustin09	06e2f2183d	fix: Changes GetObjectAttributes action xml encoding root element to GetObjectAttributesResponse. Adds input validation for x-amz-object-attributes header. Adds x-amz-delete-marker and x-maz-version-id headers for GetObjectAttributes action. Adds VersionId in HeadObject response, if it's not specified in the request	2024-10-30 15:42:15 -04:00
jonaustin09	3b903f6044	fix: Fixes max-parts, max-keys, max-uploads validation defaulting to 1000	2024-10-22 14:28:50 -04:00
jonaustin09	600aca8bdc	fix: Fixed the request uri path escape to support object key special characters	2024-09-17 13:28:30 -04:00
Jon Austin	d79f978df9	feat: Added the standard storage class to all the available get/list actions responses in posix. (#765 )	2024-08-27 15:28:40 -07:00
jonaustin09	7545e6236c	feat: Implement bucket ownership controls Bucket ACLs are now disabled by default the same as AWS. By default the object ownership is BucketOwnerEnforced which means that bucket ACLs are disabled. If one attempts to set bucket ACL the following error is returned both in the gateway and on AWS: ErrAclNotSupported: { Code: "AccessControlListNotSupported", Description: "The bucket does not allow ACLs", HTTPStatusCode: http.StatusBadRequest, }, ACls can be enabled with PutBucketOwnershipControls Changed bucket canned ACL translation New backend interface methods: PutBucketOwnershipControls GetBucketOwnershipControls DeleteBucketOwnershipControls Added these to metrics	2024-06-28 21:03:09 -07:00
jonaustin09	43f509d971	fix: Added missing properties support for CreateMultipartUpload action: ContentType, ObjectLock, Tagging, Metadata	2024-05-22 12:16:55 -07:00
Ben McClelland	f722f515ae	chore: add missing copyright headers to files	2024-05-06 16:16:31 -07:00
Ben McClelland	aba8d03ddf	Merge pull request #544 from versity/ben/request_time_skewed Ben/request time skewed	2024-05-02 10:21:17 -07:00
Ben McClelland	2c165a632c	fix: int overflow check in chunk reader Make the code scanners happy with a bounds check before we do the integer conversion from int64 to int, since this can overflow on 32 bit platforms. Best error to return here is a signature error since this is a client problem and the chunk headers are considered part of the request signature.	2024-05-01 21:27:17 -07:00
Ben McClelland	3fc8956baf	fix: increase valid timestampe window from 1 to 15 minutes According to: https://docs.aws.amazon.com/AmazonS3/latest/userguide/RESTAuthentication.html#RESTAuthenticationTimeStamp The valid time wondow for authenticated requests is 15 minutes, and when outside of that window should return RequestTimeTooSkewed.	2024-05-01 13:56:34 -07:00
jonaustin09	0c3771ae2d	feat: Added GetObjectAttributes actions implementation in posix, azure and s3 backends. Added integration tests for GetObjectAttributes action	2024-04-29 15:31:53 -04:00
jonaustin09	fbaba0b944	feat: Added object WORM protection by object-lock feature from AWS with the following actions support: PutObjectLockConfiguration, GetObjectLockConfiguration, PutObjectRetention, GetObjectRetention, PutObjectLegalHold, GetObjectLegalHold	2024-04-22 13:13:40 -07:00
Ben McClelland	07b01a738a	fix: chunkreader invalid signature when header crossed read buffers Fixes #512. For chunked uploads, we parse the chunk headers in place and then move the data payload up on the buffer to overwrite the chunk headers for the real data stream. For the special case where the chunk header was truncated in the current read buffer, the partial header is stashed in a temporary byte slice. The following read will contain the remainder of the header that we can put together and parse. We were correctly parsing this, but we forgot that the data offset is calculated based on the start of the header. But the special case where part of the header was stashed means we were incorrectly calculating the data offset into the read buffer. Easy fix to just remove the stash size from the data offset return value.	2024-04-16 23:08:25 -07:00
Ben McClelland	dac69caac3	fix: escape path and query for presign signature validation fixes #462	2024-03-18 15:16:17 -07:00
Ben McClelland	b555c92940	fix: include all request signed headers in signature canonical string Fixes #457. There are some buggy clients that include headers not actually set on the request in the signed headers list. For these we need to include them in the signature canoncal string with empty values.	2024-03-14 09:56:36 -07:00
Ben McClelland	d422aced17	fix: 0 len content-len header missing in signed headers This fixes the case where clients can include the content-length header in the signed headers for a 0 length file (like s3cmd). Since we had to hoist the aws code into versitygw, we can also remove the hack for the "User-Agent" header in the hard coded excludes list and just remove it from the excludes list.	2024-03-02 21:52:22 -08:00

1 2

81 Commits