scylladb

Author	SHA1	Message	Date
Avi Kivity	7cb1c10fed	treewide: replace seastar::future::get0() with seastar::future::get() get0() dates back from the days where Seastar futures carried tuples, and get0() was a way to get the first (and usually only) element. Now it's a distraction, and Seastar is likely to deprecate and remove it. Replace with seastar::future::get(), which does the same thing.	2024-02-02 22:12:57 +08:00
Kefu Chai	207fe93b90	utils: add formatter for rjson::value before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for rjson::value, and drop its operator<<(). Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16956	2024-01-24 10:30:52 +02:00
Michael Huang	75109e9519	cql3: Fix invalid JSON parsing for JSON objects with ASCII keys For JSON objects represented as map<ascii, int>, don't treat ASCII keys as a nested JSON string. We were doing that prior to the patch, which led to parsing errors. Included the error offset where JSON parsing failed for rjson::parse related functions to help identify parsing errors better. Fixes: #7949 Signed-off-by: Michael Huang <michaelhly@gmail.com> Closes scylladb/scylladb#15499	2023-10-05 22:26:08 +03:00
Marcin Maliszkiewicz	f96ed4dba5	utils: yield when streaming json in print() - removed buffer reuse to simplify the code - added co_await suspention point on each send() making it yield	2023-01-23 13:46:06 +01:00
Piotr Grabowski	0544973b15	utils/rjson.cc: ignore buggy GCC warning When compiling utils/rjson.cc on GCC, the compilation triggers the following warning (which becomes a compilation error): utils/rjson.cc: In function ‘seastar::future<> rjson::print(const value&, seastar::output_stream<char>&, size_t)’: utils/rjson.cc:239:15: error: typedef ‘using Ch = char’ locally defined but not used [-Werror=unused-local-typedefs] 239 \| using Ch = char; \| ^~ This warning is a false positive. 'using Ch' is actually used internally by rapidjson::Writer. This is a known GCC bug (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=61596), which has not been fixed since 2014. I disabled this warning only locally as other code is not affected by this warning and no other code already disables this warning. Note that there are some GCC compilation problems still left apart from this one. Closes #10158	2022-03-02 19:10:58 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Calle Wilund	e2d7225df8	rjson: Add print to stream of rjson::value Allows direct stream of object to seastar::stream. While not 100% efficient, it has the advantage of avoiding large allocations (long string) for huge result messages.	2022-01-12 13:34:49 +00:00
Nadav Har'El	5e52858295	rjson, alternator: rename set() functions add() The rjson::set() sounds like it can set any member of a JSON object (i.e., map), but that's not true :-( It calls the RapidJson function AddMember() so it can only add a member to an object which doesn't have a member with the same name (i.e., key). If it is called with a key that already has a value, the result may have two values for the same key, which is ill-formed and can cause bugs like issue #9542. So in this patch we begin by renaming rjson::set() and its variant to rjson::add() - to suggest to its user that this function only adds members, without checking if they already exist. After this rename, I was left with dozens of calls to the set() functions that need to changed to either add() - if we're sure that the object cannot already have a member with the same name - or to replace() if it might. The vast majority of the set() calls were starting with an empty item and adding members with fixed (string constant) names, so these can be trivially changed to add(). It turns out that all other set() calls - except the one fixed in issue #9542 - can also use add() because there are various "excuses" why we know the member names will be unique. A typical example is a map with column-name keys, where we know that the column names are unique. I added comments in front of such non-obvious uses of add() which are safe. Almost all uses of rjson except a handful are in Alternator, so I verified that all Alternator test cases continue to pass after this patch. Fixes #9583 Refs #9542 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211104152540.48900-1-nyh@scylladb.com>	2021-11-04 16:35:38 +01:00
Nadav Har'El	b95e431228	alternator: fix bug in ReturnValues=ALL_NEW This patch fixes a bug in UpdateItem's ReturnValues=ALL_NEW, which in some cases returned the OLD (pre-modification) value of some of the attributes, instead of its NEW value. The bug was caused by a confusion in our JSON utility function, rjson::set(), which sounds like it can set any member of a map, but in fact may only be used to add a new member - if a member with the same name (key) already existed, the result is undefined (two values for the same key). In ReturnValues=ALL_NEW we did exactly this: we started with a copy of the original item, and then used set() to override some of the members. This is not allowed. So in this patch, we introduce a new function, rjson::replace(), which does what we previously thought that rjson::set() does - i.e., replace a member if it exists, or if not, add it. We call this function in the ReturnValues=ALL_NEW code. This patch also adds a test case that reproduces the incorrect ALL_NEW results - and gets fixed by this patch. In an upcoming patch, we should rename the confusingly-named set() functions and audit all their uses. But we don't do this in this patch yet. We just add some comments to clarify what set() does - but don't change it, and just add one new function for replace(). Fixes #9542 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211104134937.40797-1-nyh@scylladb.com>	2021-11-04 16:34:58 +01:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Piotr Sarna	223a59c09c	test: make rjson allocator test working in sanitize mode Following Nadav's advice, instead of ignoring the test in sanitize/debug modes, the allocator simply has a special path of failing sufficiently large allocation requests. With that, a problem with the address sanitizer is bypassed and other debug mode sanitizers can inspect and check if there are no more problems related to wrapping the original rapidjson allocator. Closes #8539	2021-05-20 00:42:47 +03:00
Piotr Sarna	45d7144529	rjson: add a throwing allocator The default rapidjson allocator returns nullptr from a failed allocation or reallocation. It's not a bug by itself, but rapidjson internals usually don't check for these return values and happily use nullptr as a valid pointer, which leads to segmentation faults and memory corruptions. In order to prevent these bugs, the default allocator is wrapped with a class which simply throws once it fails to allocate or reallocate memory, thus preventing the use of nullptr in the code. One exception is Malloc/Realloc with size 0, which is expected to return nullptr by rapidjson code.	2021-04-21 14:26:38 +02:00
Piotr Sarna	ec750e5f49	rjson: make the max nested level configurable Back when rjson was only part of alternator, there was a hardcoded limit of nested levels - 78. The number was calculated as: - taking the DynamoDB limit (32) - adding 7 to it to make alternator support more cases - doubling it because rjson internals bump the level twice for each alternator object (because the alternator object is represented as a 2-level JSON object). Since rjson is no longer specific to alternator, this limit is now configurable, and the original default value is explained in a comment. Message-Id: <51952951a7cd17f2f06ab36211f74086e1b60d2d.1618916299.git.sarna@scylladb.com>	2021-04-20 14:05:03 +03:00
Nadav Har'El	f41dac2a3a	alternator: avoid large contiguous allocation for request body Alternator request sizes can be up to 16 MB, but the current implementation had the Seastar HTTP server read the entire request as a contiguous string, and then processed it. We can't avoid reading the entire request up-front - we want to verify its integrity before doing any additional processing on it. But there is no reason why the entire request needs to be stored in one big contiguous allocation. This always a bad idea. We should use a non- contiguous buffer, and that's the goal of this patch. We use a new Seastar HTTPD feature where we can ask for an input stream, instead of a string, for the request's body. We then begin the request handling by reading lthe content of this stream into a vector<temporary_buffer<char>> (which we alias "chunked_content"). We then use this non-contiguous buffer to verify the request's signature and if successful - parse the request JSON and finally execute it. Beyond avoiding contiguous allocations, another benefit of this patch is that while parsing a long request composed of chunks, we free each chunk as soon as its parsing completed. This reduces the peak amount of memory used by the query - we no longer need to store both unparsed and parsed versions of the request at the same time. Although we already had tests with requests of different lengths, most of them were short enough to only have one chunk, and only a few had 2 or 3 chunks. So we also add a test which makes a much longer request (a BatchWriteItem with large items), which in my experiment had 17 chunks. The goal of this test is to verify that the new signature and JSON parsing code which needs to cross chunk boundaries work as expected. Fixes #7213. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210309222525.1628234-1-nyh@scylladb.com>	2021-03-10 09:22:34 +01:00
Calle Wilund	699c4d2c7e	rjson: Add templated get/set overloads and optional get<T> To allow immediate json value conversion for types we have TypeHelper<...>:s for. Typed opt-get to get both automatic type conversion, _and_ find functionality in one call.	2020-07-15 08:10:23 +00:00
Calle Wilund	72ec525045	rjson: Add exception overloads To avoid copying error message composing, as well as forcing said code info rjson.cc. Also helps caller to determine fault by catch type.	2020-07-15 08:10:23 +00:00
Piotr Sarna	1b37517aab	rjson: move quote_json_string to rjson This utility function is used for type serialization, but it also has a dedicated unit test, so it needs to be globally reachable.	2020-07-03 10:27:23 +02:00
Piotr Sarna	f568fe869f	rjson: add non-throwing parsing Returning a disengaged optional instead of throwing an error can be useful when the input string is expected not to be a valid JSON in certain cases.	2020-07-03 10:27:23 +02:00
Piotr Sarna	3fda9908f2	rjson: add from_string_map function This legacy function is needed because the existing implementation relies on being able to parse flat JSON documents to and from maps of strings.	2020-07-03 10:27:23 +02:00
Piotr Sarna	4de23d256e	alternator,utils: move rjson.hh to utils/ rjson is going to replace libjsoncpp, so it's moved from alternator to the common utils/ directory.	2020-07-03 08:30:01 +02:00

20 Commits