scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 04:06:59 +00:00

Author	SHA1	Message	Date
Takuya ASADA	ef05ea8e91	node_exporter_install: stop service before force installing Stop node-exporter.service before re-installing it, to avoid 'Text file busy' error. Fixes #6782	2020-07-07 18:27:16 +03:00
Takuya ASADA	f34001ff14	debian: use symlink copying files to build/debian/debian/ Instead of running shutil.copy() for each *.{service,default}, create symlink for these files. Python will copy original file when copying debian directory.	2020-07-07 18:27:16 +03:00
Asias He	0929a5e82b	repair: Fix inaccurate exception message in check_failed_ranges The reason for the failure can be other reasons than failure of checksum. Fixes #6785	2020-07-07 18:27:16 +03:00
Asias He	6e6e554944	repair: Use warn level for logs with recoverable failures Those failures are not fatal and are recoverable. We should log them at warn level instead of info level. Fixes #5612	2020-07-07 18:27:16 +03:00
Botond Dénes	5ebe2c28d1	db/view: view_update_generator: re-balance wait/signal on the register semaphore The view update generator has a semaphore to limit concurrency. This semaphore is waited on in `register_staging_sstable()` and later the unit is returned after the sstable is processed in the loop inside `start()`. This was broken by `4e64002`, which changed the loop inside `start()` to process sstables in per table batches, however didn't change the `signal()` call to return the amount of units according to the number of sstables processed. This can cause the semaphore units to dry up, as the loop can process multiple sstables per table but return just a single unit. This can also block callers of `register_staging_sstable()` indefinitely as some waiters will never be released as under the right circumstances the units on the semaphore can permanently go below 0. In addition to this, `4e64002` introduced another bug: table entries from the `_sstables_with_tables` are never removed, so they are processed every turn. If the sstable list is empty, there won't be any update generated but due to the unconditional `signal()` described above, this can cause the units on the semaphore to grow to infinity, allowing future staging sstables producers to register a huge amount of sstables, causing memory problems due to the amount of sstable readers that have to be opened (#6603, #6707). Both outcomes are equally bad. This patch fixes both issues and modifies the `test_view_update_generator` unit test to reproduce them and hence to verify that this doesn't happen in the future. Fixes: #6774 Refs: #6707 Refs: #6603 Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200706135108.116134-1-bdenes@scylladb.com>	2020-07-07 08:53:00 +02:00
Wojciech Mitros	76038b8d8e	view: differentiate identical error messages and change them to warnings Modified log message in view_builder::calculate_shard_build_step to make it distinct from the one in view_builder::execute, changed their logging level to warning, since we're continuing even if we handle an exception. Fixes #4600	2020-07-06 20:50:34 +03:00
Dejan Mircevski	921dbd0978	cql/restrictions: Handle `WHERE a>0 AND a<0` WHERE clauses with start point above the end point were handled incorrectly. When the slice bounds are transformed to interval bounds, the resulting interval is interpreted as wrap-around (because start > end), so it contains all values above 0 and all values below 0. This is clearly incorrect, as the user's intent was to filter out all possible values of a. Fix it by explicitly short-circuiting to false when start > end. Add a test case. Fixes #5799. Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-07-06 19:11:20 +03:00
Piotr Sarna	e4b74356bb	Merge 'view_update_generator: use partitioned sstable set' from Botond. Recently it was observed (#6603) that since 4e6400293ea, the staging reader is reading from a lot of sstables (200+). This consumes a lot of memory, and after this reaches a certain threshold -- the entire memory amount of the streaming reader concurrency semaphore -- it can cause a deadlock within the view update generation. To reduce this memory usage, we exploit the fact that the staging sstables are usually disjoint, and use the partitioned sstable set to create the staging reader. This should ensure that only the minimum number of sstable readers will be opened at any time. Refs: #6603 Fixes: #6707 Tests: unit(dev) * 'view-update-generator-use-partitioned-set/v1' of https://github.com/denesb/scylla: db/view: view_update_generator: use partitioned sstable set sstables: make_partitioned_sstable_set(): return an sstable_set	2020-07-06 14:36:08 +02:00
Botond Dénes	62c6859b69	db/view: view_update_generator: use partitioned sstable set And pass it to `make_range_sstable_reader()` when creating the reader, thus allowing the incremental selector created therein to exploit the fact that staging sstables are disjoint (in the case of repair and streaming at least). This should reduce the memory consumption of the staging reader considerably when reading from a lot of sstables.	2020-07-06 13:38:23 +03:00
Botond Dénes	84b5d6d6d0	sstables: make_partitioned_sstable_set(): return an sstable_set Instead of an `std::unique_ptr<sstable_set_impl>`. The latter doesn't have a publicly available destructor, so it can only be called from within `sstables/compaction_strategy.cc` where its definition resides. Thus it is not really usable as a public function in its current form, which shows as it has no users either. This patch makes it usable by returning an `sstable_set`. That is what potential callers would want anyway. In fact this patch prepares the ground for the next one, which wishes to use this function for just that but can't in its current form.	2020-07-06 13:38:23 +03:00
Takuya ASADA	2d63acdd6a	scylla_util.py: use correct ID value for distro.id() It seems distro.id() does NOT always return the same value as ID in /etc/os-release. We need to replace "ol" with "oracle" and "amzn" with "amazon". Fixes #6761	2020-07-06 11:40:00 +03:00
Asias He	a19917eb91	gossiper: Drop replacement_quarantine It is not used any more after "gossiper: Drop unused replaced_endpoint". Refs #5482	2020-07-06 11:27:55 +03:00
Asias He	2bc73ad290	gossiper: Drop unused replaced_endpoint It is not used any more after `75cf1d18b5` (storage_service: Unify handling of replaced node removal from gossip) in the "Make replacing node take writes" series. Refs #5482	2020-07-06 11:27:55 +03:00
Piotr Sarna	446b89f408	test: move json tests from manual/ to boost/ Manual tests are, as the name suggests, not run automatically, which makes them more prone to regressions. JSON tests are fast and correct, so there's no reason for them to be marked as manual. Message-Id: <dea75b0a0d1c238d12382a28840978884ac6ec2c.1594023481.git.sarna@scylladb.com>	2020-07-06 11:24:12 +03:00
Avi Kivity	058b30b891	Merge "scylla-gdb.py: scylla_fiber: protect against reference loops" from Botond " This mini-series adds protection against reference loops between tasks, preventing infinite recursion in this case. It also contains some other improvements, like updating the task whitelist as well as the task identification mechanism w.r.t. recent changes in seastar. It also improves verbose logging, which was found to not work well while investigating the other issues fixed herein. " * 'scylla-gdb.py-scylla-fiber-update/v1' of https://github.com/denesb/scylla: scylla-gdb.py: scylla_fiber: add protection against reference loops scylla-gdb.py: scylla_fiber: relax requirement w.r.t. what object qualifies as task scylla-gdb.py: scylla_fiber: update whitelist scylla-gdb.py: scylla_fiber: improve verbose log output	2020-07-06 10:34:13 +03:00
Piotr Sarna	83ab41c76d	test: add json test for parsing from map Our JSON legacy helper functions for parsing documents to/from string maps are indirectly tested by several unit tests, e.g. caching_options_test.cc. They however lacked one corner case detected only by dtest - parsing an empty map from a null JSON document. This case is hereby added in order to prevent future regressions. Message-Id: <df8243bd083b2ba198df665aeb944c8710834736.1594020411.git.sarna@scylladb.com>	2020-07-06 10:28:55 +03:00
Avi Kivity	cc7a906149	Merge "random_access_reader: futurize seek" from Benny " Rather than relying on a gate to serialize seek's background work with close(), change seek() to return a future<> and wait on it. Also, now random_access_reader read_exactly(), seek(), and close() are made noexcept. This will be followed up by making sstable parse methods noexcept. Test: unit(dev) " * tag 'random_access_reader-v4' of github.com:bhalevy/scylla: sstables: random_access_reader: make methods noexcept sstables: random_access_reader: futurize seek sstables: random_access_reader: unify input stream close code sstables: random_access_reader: let file_random_access_reader set the input stream sstables: random_access_reader: move functions out of line	2020-07-06 10:16:18 +03:00
Botond Dénes	54bb9ddaae	docs/debugging.md: drop --privileged from dbuild start instructions Instead, label the mapped volume by passing `:z` options to `-v` argument, like we do for other mapped volumes in the `dbuild` script. Passing the `--privileged` flag doesn't work after the most recent Fedora update and anyway, using `:z` is the proper way to make sure the mounted volume is accessible. Historically it was needed to be able to open cores as well, but since `5b08e91bd` this is not necessary as the container is created with SYS_PTRACE capability. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200703072703.10355-1-bdenes@scylladb.com>	2020-07-06 08:09:58 +02:00
Benny Halevy	fc89018146	sstables: random_access_reader: make methods noexcept handle all exceptions in read_exactly, seek, and close and specify them as noexcept. Also, specify eof() as noexcept as it trivially is. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-07-05 19:40:48 +03:00
Benny Halevy	94460f3199	sstables: random_access_reader: futurize seek And adjust its callers to wait on the returned future. With this, there is no need for a gate to serialize close() with the background work seek() used to leave behind. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-07-05 19:40:26 +03:00
Benny Halevy	765c5752c2	sstables: random_access_reader: unify input stream close code Define a close_if_needed() helper function, to be called from seek() and close(). A future patch will call it with a possibly disengaged `_in` so it will close it only if it was engaged. close_if_needed() captures the input stream unique ptr so it will remain valid throughout close. This was missing from close(). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-07-05 19:37:39 +03:00
Benny Halevy	e7fdadd748	sstables: random_access_reader: let file_random_access_reader set the input stream Allow file_random_access_reader constructor to set the input stream to prepare for futurizing seek() by adding a protected set() method. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-07-05 19:37:36 +03:00
Benny Halevy	0bb1c0f37d	sstables: random_access_reader: move functions out of line These are not good candidates for inlining. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-07-05 18:47:04 +03:00
Avi Kivity	36b6ee7b11	Merge 'python3: simplified .rpm/.deb build process' from Takuya " Following the scylla-server package changes, simplify the .rpm/.deb build process by merging the build scripts into a single script. " * syuu1228-python3_simplified_pkg_scripts: python3: simplified .deb build process python3: simplified .rpm build process	2020-07-05 18:09:17 +03:00
Avi Kivity	cc891a5de8	Merge "Convert a few uses of sstring to std::string_view" from Rafael " This series converts an API to use std::string_view and then converts a few sstring variables to be constexpr std::string_view. This has the advantage that constexpr variables cannot be part of any initialization order problem. " * 'espindola/convert-to-constexpr' of https://github.com/espindola/scylla: auth: Convert sstring variables in common.hh to constexpr std::string_view auth: Convert sstring variables in default_authorizer to constexpr std::string_view cql_test_env: Make ks_name a constexpr std::string_view class_registry: Use std::string_view in (un)?qualified_name	2020-07-05 17:08:54 +03:00
Dmitry Kropachev	de82b3efae	dist/common/scripts/scylla-housekeeping: wrap urllib.request with try ... except We could hit "cannot serialize '_io.BufferedReader' object" when the request gets a 404 error from the server. Now you get a proper error message in that case. Fixes #6690	2020-07-05 16:33:11 +03:00
Takuya ASADA	d94fe346ee	scylla_coredump_setup: detect missing coredump file Print an error message and exit with non-zero status under any of the following conditions: - coredumpctl says the coredump file is inaccessible - failed to detect coredump file path from 'coredumpctl info <pid>' - deleting coredump file failed because the file is missing Fixes #6654	2020-07-05 14:24:51 +03:00
Takuya ASADA	d65b15f3b2	dist/debian/python3: apply version number fixup on scylla-python3 Sync version number fixup from main package, contains #6546 and #6752 fixes. Note that scylla-python3 likely does not affect this versioning issue, since it uses python3 version, which normally does not contain 'rcX'.	2020-07-05 14:21:18 +03:00
Takuya ASADA	8750c5ccf3	python3: simplified .deb build process We don't really need to have two build_deb.sh, merge it to reloc.	2020-07-04 23:41:33 +09:00
Takuya ASADA	fc320ac49d	python3: simplified .rpm build process We don't really need to have two build_rpm.sh, merge it to reloc.	2020-07-04 23:41:22 +09:00
Rafael Ávila de Espíndola	400212e81f	auth: Convert sstring variables in common.hh to constexpr std::string_view This converts the following variables: DEFAULT_SUPERUSER_NAME AUTH_KS USERS_CF AUTH_PACKAGE_NAME Since they are now constexpr they will not be part of any initialization order problems. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-03 12:35:58 -07:00
Rafael Ávila de Espíndola	53ed39e64a	auth: Convert sstring variables in default_authorizer to constexpr std::string_view This converts the following variables: ROLE_NAME RESOURCE_NAME PERMISSIONS_NAME PERMISSIONS_CF Since they are now constexpr they will not be part of any initialization order problems. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-03 12:33:33 -07:00
Rafael Ávila de Espíndola	33af0c293f	cql_test_env: Make ks_name a constexpr std::string_view Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-03 12:28:20 -07:00
Rafael Ávila de Espíndola	a2110e413f	class_registry: Use std::string_view in (un)?qualified_name This gives more flexibility for constructing a qualified_name or unqualified_name. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-03 12:28:14 -07:00
Nadav Har'El	8e3ecc30a9	merge: Migrate from libjsoncpp to rjson Merged patch series by Piotr Sarna: The alternator project was in need of a more optimized JSON library, which resulted in creating "rjson" helper functions. Scylla generally used libjsoncpp for its JSON handling, but in order to reduce the dependency hell, the usage is now migrated to rjson, which is faster and offers the same functionality. The original plan was to be able to drop the dependency on libjsoncpp-lib altogether and remove it from install-dependencies.sh, but one last usage of it remains in our test suite, namely cql_repl. The tool compares its output JSON textually, so it depends on how a library presents JSON - what are the delimiters, indentation, etc. It's possible to provide a layer of translation to force rjson to print in an identical format, but the other issue is that libjsoncpp keeps subobjects sorted by their name, while rjson uses an unordered structure. There are two possible solutions for the last remaining usage of libjsoncpp: 1. change our test suite to compare JSON documents with a JSON parser, so that we don't rely on internal library details 2. provide a layer of translation which forces rjson to print its objects in a format identical to libjsoncpp. (1.) would be preferred, since now we're also vulnerable to changes inside libjsoncpp itself - if they change anything in their output format, tests would start failing. The issue is not critical however, so it's left for later. 
Tests: unit(dev), manual(json_test), dtest(partitioner_tests.TestPartitioner.murmur3_partitioner_test) Piotr Sarna (8): alternator,utils: move rjson.hh to utils/ alternator: remove ambiguous string overloads in rjson rjson: add parse_to_map helper function rjson: add from_string_map function rjson: add non-throwing parsing rjson: move quote_json_string to rjson treewide: replace libjsoncpp usage with rjson configure: drop json.cc and json.hh helpers alternator/base64.hh \| 2 +- alternator/conditions.cc \| 2 +- alternator/executor.hh \| 2 +- alternator/expressions.hh \| 2 +- alternator/expressions_types.hh \| 2 +- alternator/rmw_operation.hh \| 2 +- alternator/serialization.cc \| 2 +- alternator/serialization.hh \| 2 +- alternator/server.cc \| 2 +- caching_options.hh \| 9 +- cdc/log.cc \| 4 +- column_computation.hh \| 5 +- configure.py \| 3 +- cql3/functions/functions.cc \| 4 +- cql3/statements/update_statement.cc \| 24 ++-- cql3/type_json.cc \| 212 ++++++++++++++++++---------- cql3/type_json.hh \| 7 +- db/legacy_schema_migrator.cc \| 12 +- db/schema_tables.cc \| 1 - flat_mutation_reader.cc \| 1 + index/secondary_index.cc \| 80 +++++------ json.cc \| 80 ----------- json.hh \| 113 --------------- schema.cc \| 25 ++-- test/boost/cql_query_test.cc \| 9 +- test/manual/json_test.cc \| 4 +- test/tools/cql_repl.cc \| 1 + {alternator => utils}/rjson.cc \| 75 +++++++++- {alternator => utils}/rjson.hh \| 40 +++++- 29 files changed, 344 insertions(+), 383 deletions(-) delete mode 100644 json.cc delete mode 100644 json.hh rename {alternator => utils}/rjson.cc (86%) rename {alternator => utils}/rjson.hh (81%)	2020-07-03 18:23:56 +02:00
Piotr Sarna	449e72826f	configure: drop json.cc and json.hh helpers Now that only rjson is used in the code, the old helper is not used anywhere in the code, so it can be dropped.	2020-07-03 10:27:23 +02:00
Piotr Sarna	4cb79f04b0	treewide: replace libjsoncpp usage with rjson In order to eventually switch to a single JSON library, most of the libjsoncpp usage is dropped in favor of rjson. Unfortunately, one usage still remains: test/utils/test_repl utility heavily depends on the exact textual format of its output JSON files, so replacing a library results in all tests failing because of differences in formatting. It is possible to force rjson to print its documents in the exact matching format, but that's left for later, since the issue is not critical. It would be nice though if our test suite compared JSON documents with a real JSON parser, since there are more differences - e.g. libjsoncpp keeps children of the object sorted, while rapidjson uses an unordered data structure. This change should cause no change in semantics, it strives just to replace all usage of libjsoncpp with rjson.	2020-07-03 10:27:23 +02:00
Piotr Sarna	1b37517aab	rjson: move quote_json_string to rjson This utility function is used for type serialization, but it also has a dedicated unit test, so it needs to be globally reachable.	2020-07-03 10:27:23 +02:00
Piotr Sarna	f568fe869f	rjson: add non-throwing parsing Returning a disengaged optional instead of throwing an error can be useful when the input string is expected not to be a valid JSON in certain cases.	2020-07-03 10:27:23 +02:00
Piotr Sarna	3fda9908f2	rjson: add from_string_map function This legacy function is needed because the existing implementation relies on being able to parse flat JSON documents to and from maps of strings.	2020-07-03 10:27:23 +02:00
Piotr Sarna	39b5408a84	rjson: add parse_to_map helper function Existing infrastructure relies on being able to parse a JSON string straight into a map of strings. In order to make rjson a drop-in replacement(tm) for libjsoncpp, a similar helper function is provided.	2020-07-03 10:27:23 +02:00
Piotr Sarna	1df6d98b1a	alternator: remove ambiguous string overloads in rjson It's redundant to provide function overloads for both string_view and const string&, since both of them can be implicitly created from const char*. Thus, only string_view overloads are kept. Example code which was ambiguous before the patch, but compiles fine after it: rjson::from_string("hello"); Without the patch, one had to explicitly state the type, e.g.: rjson::from_string(std::string_view("hello")); which is excessive.	2020-07-03 08:30:01 +02:00
Piotr Sarna	4de23d256e	alternator,utils: move rjson.hh to utils/ rjson is going to replace libjsoncpp, so it's moved from alternator to the common utils/ directory.	2020-07-03 08:30:01 +02:00
Takuya ASADA	a107f086bc	dist/debian: apply generated package version for .orig.tar.gz file We are currently not able to apply the version number fixup to the .orig.tar.gz file, even though we applied the correct fixup to debian/changelog, because it just reads SCYLLA-VERSION-FILE. We should parse debian/{changelog,control} instead. Fixes #6736	2020-07-03 08:24:41 +02:00
Takuya ASADA	4769f30a11	python3: fix incorrect variable name builddir should be BUILDDIR.	2020-07-03 08:24:41 +02:00
Avi Kivity	a3dd1ba76f	build: thrift: avoid rebuild if cassandra.thrift is touched but not modified Thrift 0.12 includes a change [1] that avoids writing the generated output if it has not changed. As a result, if you touch cassandra.thrift (but not change it), the generated files will not update, and as a result ninja will try to rebuild them every time. The compilation of thrift files will be fast due to ccache, but still we will re-link everything. This touching of cassandra.thrift can happen naturally when switching to a different git branch and then switching back. The net result is that cassandra.thrift's contents have not changed, but its timestamp has. Fix by adding the "restat" option to the thrift rule. This instructs ninja to check whether the output has changed as expected or not, and to avoid unneeded rebuilds if it has not. [1] https://issues.apache.org/jira/browse/THRIFT-4532	2020-07-03 08:24:41 +02:00
Rafael Ávila de Espíndola	6fe7706fce	mutation_reader_test: Wait for a future Nothing was waiting for this future. Found while testing another patch. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200630183929.1704908-1-espindola@scylladb.com>	2020-07-03 08:24:41 +02:00
Rafael Ávila de Espíndola	b7f5e2e0dd	big_decimal: Add more tests It looks like an older version of my patch series was merged. The only difference is that the new one had more tests. This patch adds the missing ones. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200630141150.1286893-1-espindola@scylladb.com>	2020-07-03 08:24:41 +02:00
Botond Dénes	b91cb8cc60	scylla-gdb.py: scylla_fiber: add protection against reference loops Remember all previously visited tasks and stop if one of them is seen again. The walk algorithm is converted from recursive to iterative to facilitate this.	2020-07-01 16:37:47 +03:00
Botond Dénes	427dae61f8	scylla-gdb.py: scylla_fiber: relax requirement w.r.t. what object qualifies as task Don't require that the object is located at the start of the allocation block. Some tasks, like `seastar::internal::when_all_state_component` might not.	2020-07-01 16:34:36 +03:00
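
Commit b91cb8cc60 above describes remembering previously visited tasks and converting the fiber walk from recursive to iterative. The following is only a minimal sketch of that idea in plain Python, not the actual scylla-gdb.py code: `walk_fiber`, `start` and `next_task` are hypothetical names, and tasks are modelled as plain integers rather than gdb values.

```python
def walk_fiber(start, next_task):
    """Follow a chain of tasks iteratively, stopping on a reference loop.

    Hypothetical helper: `next_task(task)` is assumed to return the task
    that `task` continues into, or None at the end of the chain.
    """
    visited = set()   # every task seen so far
    chain = []        # tasks in the order they were walked
    task = start
    while task is not None:
        if task in visited:
            # This task was already walked: a reference loop, so stop
            # instead of following the chain forever.
            break
        visited.add(task)
        chain.append(task)
        task = next_task(task)
    return chain


# Illustration only: links between tasks stored in a dict.
links = {1: 2, 2: 3, 3: 1}  # task 3 refers back to task 1
print(walk_fiber(1, links.get))
```

With a loop instead of recursion, the visited set is a single local variable and a cycle ends the walk cleanly rather than exhausting the stack.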


22673 Commits