scylladb

Author	SHA1	Message	Date
Lakshmi Narayanan Sreethar	279253ffd0	utils/big_decimal: fix scale overflow when parsing values with large exponents The exponent of a big decimal string is parsed as an int32, adjusted for the removed fractional part, and stored as an int32. When parsing values like `1.23E-2147483647`, the unscaled value becomes `123`, and the scale is adjusted to `2147483647 + 2 = 2147483649`. This exceeds the int32 limit, and since the scale is stored as an int32, it overflows and wraps around, losing the value. This patch fixes that the by parsing the exponent as an int64 value and then adjusting it for the fractional part. The adjusted scale is then checked to see if it is still within int32 limits before storing. An exception is thrown if it is not within the int32 limits. Note that strings with exponents that exceed the int32 range, like `0.01E2147483650`, were previously not parseable as a big decimal. They are now accepted if the final adjusted scale fits within int32 limits. For the above value, unscaled_value = 1 and scale = -2147483648, so it is now accepted. This is in line with how Java's `BigDecimal` parses strings. Fixes: #24581 Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com> Closes scylladb/scylladb#24640	2025-06-26 15:29:28 +03:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Botond Dénes	b6a9c79af3	utils/big_decimal: add fast paths to operator <=> Currently, the tri-compare operator for big_decimal (operator <=>), uses a precise but potentially very expensive algorithm for comparing the numbers: it first brings them to the same scale, then compares the normalized unscaled values. big_decimal has abritrary precisions, therefore the stored numbers can be arbitrarily large. In extreme cases, comparing two numbers can result in huge amount of memory allocated and stalls. If this type is used int he primary key of a table, these comparisons can make the node completely unresponsive. This patch adds the following fast-paths to operator <=>: * An early return for the case of equal scales. * An early return for different signs. * An early return for the case where one or both of the numbers are 0. * A fast algorithm for detecting the case where the there is a big difference between the two numbers. This algorithm works only with the scales and is able to compare the two numbers by using only one division and some additions and substractions. This algorithm is imprecise and when the numbers are closer than its confidence window, it will fall-back to the current slow but precise tri-compare. All but the last case should have been fast before as well, but the scale-compare algorithm makes a huge difference. Numbers, which would previously make the node unresponsive, now compare in constant-time. Fixes: scylladb/scylladb#21716 Closes scylladb/scylladb#21715	2024-12-03 14:56:51 +02:00
Nadav Har'El	e639434a89	change remaining sstring_view to std::string_view Our "sstring_view" is an historic alias for the standard std::string_view. The patch changes the last remaining random uses of this old alias across our source directory to the standard type name. After this patch, there are no more uses of the "sstring_view" alias. It will be removed in the following patch. Refs #4062. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-11-18 16:48:57 +02:00
Kefu Chai	00810e6a01	treewide: include seastar/core/format.hh instead of seastar/core/print.hh The later includes the former and in addition to `seastar::format()`, `print.hh` also provides helpers like `seastar::fprint()` and `seastar::print()`, which are deprecated and not used by scylladb. Previously, we include `seastar/core/print.hh` for using `seastar::format()`. and in seastar 5b04939e, we extracted `seastar::format()` into `seastar/core/format.hh`. this allows us to include a much smaller header. In this change, we just include `seastar/core/format.hh` in place of `seastar/core/print.hh`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21574	2024-11-14 17:45:07 +02:00
Kefu Chai	3e84d43f93	treewide: use seastar::format() or fmt::format() explicitly before this change, we rely on `using namespace seastar` to use `seastar::format()` without qualifying the `format()` with its namespace. this works fine until we changed the parameter type of format string `seastar::format()` from `const char*` to `fmt::format_string<...>`. this change practically invited `seastar::format()` to the club of `std::format()` and `fmt::format()`, where all members accept a templated parameter as its `fmt` parameter. and `seastar::format()` is not the best candidate anymore. despite that argument-dependent lookup (ADT for short) favors the function which is in the same namespace as its parameter, but `using namespace` makes `seastar::format()` more competitive, so both `std::format()` and `seastar::format()` are considered as the condidates. that is what is happening scylladb in quite a few caller sites of `format()`, hence ADT is not able to tell which function the winner in the name lookup: ``` /__w/scylladb/scylladb/mutation/mutation_fragment_stream_validator.cc:265:12: error: call to 'format' is ambiguous 265 \| return format("{} ({}.{} {})", _name_view, s.ks_name(), s.cf_name(), s.id()); \| ^~~~~~ /usr/bin/../lib/gcc/x86_64-redhat-linux/14/../../../../include/c++/14/format:4290:5: note: candidate function [with _Args = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 4290 \| format(format_string<_Args...> __fmt, _Args&&... __args) \| ^ /__w/scylladb/scylladb/seastar/include/seastar/core/print.hh:143:1: note: candidate function [with A = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 143 \| format(fmt::format_string<A...> fmt, A&&... a) { \| ^ ``` in this change, we change all `format()` to either `fmt::format()` or `seastar::format()` with following rules: - if the caller expects an `sstring` or `std::string_view`, change to `seastar::format()` - if the caller expects an `std::string`, change to `fmt::format()`. because, `sstring::operator std::basic_string` would incur a deep copy. we will need another change to enable scylladb to compile with the latest seastar. namely, to pass the format string as a templated parameter down to helper functions which format their parameters. to miminize the scope of this change, let's include that change when bumping up the seastar submodule. as that change will depend on the seastar change. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-09-11 23:21:40 +03:00
Avi Kivity	aa1270a00c	treewide: change assert() to SCYLLA_ASSERT() assert() is traditionally disabled in release builds, but not in scylladb. This hasn't caused problems so far, but the latest abseil release includes a commit [1] that causes a 1000 insn/op regression when NDEBUG is not defined. Clearly, we must move towards a build system where NDEBUG is defined in release builds. But we can't just define it blindly without vetting all the assert() calls, as some were written with the expectation that they are enabled in release mode. To solve the conundrum, change all assert() calls to a new SCYLLA_ASSERT() macro in utils/assert.hh. This macro is always defined and is not conditional on NDEBUG, so we can later (after vetting Seastar) enable NDEBUG in release mode. [1] `66ef711d68` Closes scylladb/scylladb#20006	2024-08-05 08:23:35 +03:00
Kefu Chai	5fa459bd1a	treewide: do not include unused header since #13452, we switched most of the caller sites from std::regex to boost::regex. in this change, all occurences of `#include <regex>` are dropped unless std::regex is used in the same source file. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13765	2023-05-07 19:01:29 +03:00
Kefu Chai	6bb32efac0	utils: big_decimal: replace compare() with <=> operator now that we are using C++20, it'd be more convenient if we can use the <=> operator for comparing. the compiler creates the 6 other operators for us if the <=> operator is defined. so the code is more compacted. in this change, `big_decimal::compare()` is replaced with `operator<=>`, and its caller is updated accordingly. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-15 12:52:30 +08:00
Kefu Chai	e991e6087e	utils: big_decimal: optimize big_decimal::compare() before this change in the worst case, the underlying `number::compare()` gets called twice. as it is used by Boost::multiprecision to implement the comparing operators of `number`. but since we can have the result in one go, there is no need to to perform the comparison multiple times. so, in this change, we just call `number::compare()` explicitly, and use it to implement `compare()`. this should save a call of `number::compare()`. also, the chained ternary expression is replaced using if-else statement for better readability. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-15 12:52:30 +08:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	89bd7737f3	utils: big_decimal: change to std::strong_ordering Ref #1449.	2021-07-28 13:28:21 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Avi Kivity	dfffa4dc71	utils: big_decimal: work around clang difficulty with boost::cpp_int(string_view) constructor Clang has some difficulty with the boost::cpp_int constructor from string_view. In fact it is a mess of enable_if<>s so a human would have trouble too. Work around it by converting to std::string. This is bad for performance, but this constructor is not going to be fast in any case. Hopefully a fix will arrive in clang or boost. Closes #7389	2020-10-11 22:09:19 +03:00
Rafael Ávila de Espíndola	684f32c862	big_decimal: Correctly handle negative scales A negative scale was being passed an a positive value to boost::multiprecision::pow, which would never finish. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-25 15:34:10 -07:00
Rafael Ávila de Espíndola	bac0f3a9ee	big_decimal: Add a as_rational member function This just refactors some duplicated code so that it can be fixed in one place. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-25 15:33:31 -07:00
Rafael Ávila de Espíndola	77725ce1a4	big_decimal: Move constructors out of line Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-25 15:33:01 -07:00
Piotr Sarna	7b5db478ed	big_decimal: migrate to string views Big decimals are, among other use cases, used as a main number type for alternator, and as such can appear on the fast path. Parsing big decimals was performed via std::regex, which is not precisely famous for its speeds, and also enforces unnecessary string copying. Therefore, the implementation is replaced with an open-coded version based on string_views. One previous iteration of this series also included a hand-coded state machine implementation, but it proved to be slower than the slightly naive string_view one. Overall, execution time is reduced by 61.6% according to microbenchmarks, which sounds like a promising improvement. Perf results: test iterations median mad min max Regex (original): big_decimal_test.from_string 88895 11.228us 25.891ns 11.202us 11.510us String view (new): big_decimal_test.from_string 232334 4.303us 21.660ns 4.282us 4.736us State machine (experimental, ditched): big_decimal_test.from_string 148318 6.723us 51.896ns 6.672us 6.877us Tests: unit(dev + release(big_decimal_test))	2020-06-01 16:11:49 +02:00
Avi Kivity	3c772757c0	treewide: use utils::multiprecision_int for varint implementation The goal is to forward-declare utils::multiprecision_int, something beyond my capabilities for boost::multiprecision::cpp_int, to reduce compile time bloat. The patch is mostly search-and-replace, with a few casts added to disambiguate conversions the compiler had trouble with.	2020-03-04 13:28:16 +02:00
Rafael Ávila de Espíndola	3d641d4062	lua: Use existing cpp_int cast logic Different versions of boost have different rules for what conversions from cpp_int to smaller intergers are allowed. We already had a function that worked with all supported versions, but it was not being use by lua. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200104041028.215153-1-espindola@scylladb.com>	2020-01-05 12:10:54 +02:00
Piotr Sarna	a5e41408ec	utils: add operators to big_decimal For convenience, operators -=, + and - are implemented on top of +=.	2019-07-04 11:32:53 +02:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Avi Kivity	be99101f36	utils: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Vladimir Krivopalov	61b1988aa1	Use meaningful error messages when throwing a marshal_exception Fixes #2977 Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <20171121005108.23074-1-vladimir@scylladb.com>	2017-11-21 16:05:43 +02:00
Daniel Fiala	21ea05ada1	utils/big_decimal: Fix compilation issue with converion of cpp_int to uint64_t. Signed-off-by: Daniel Fiala <daniel@scylladb.com> Message-Id: <20171121134854.16278-1-daniel@scylladb.com>	2017-11-21 15:51:29 +02:00
Daniel Fiala	ce2f010859	utils/big_decimal: Added necessary operators and methods for aggregate functions. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-11-12 15:51:29 +01:00
Avi Kivity	7234f0f0a0	utils: remove dependency on types.hh Replace with dependency on much smaller marshal_exception.hh.	2017-08-27 15:16:21 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Paweł Dziepak	026ecdb50f	utils: add big_decimal class Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-09-08 16:04:30 +02:00

31 Commits