scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Author	SHA1	Message	Date
Gleb Natapov	306f5b99b5	cache_hitrate_calculator: fix use after free in non_system_filter lambda non_system_filter lambda is defined static which means it is initialized only once, so the 'this' that is will capture will belong to a shard where the function runs first. During service destruction the function may run on different shard and access already other's shard service that may be already freed. Fixed #4425 Message-Id: <20190421152139.GN21208@scylladb.com>	2019-04-21 18:22:31 +03:00
Amnon Heiman	9ad63efcfe	Adding node_exporter to docker This patch add the node_exporter to the docker image. It install it create and run a service with it. After this patch node_exporter will run and will be part of scylla Docker image. Fixes #4300 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20190421130643.6837-1-amnon@scylladb.com>	2019-04-21 18:12:58 +03:00
Benny Halevy	0c9aaef673	sstables: make lamdas that std:move mutable As noticed by Rafael Ávila de Espíndola <espindola@scylladb.com> regarding commit `5a99023d4a`: Without the lambda being mutable, the second std::move actually doesn't move anything. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190421150422.19304-1-bhalevy@scylladb.com>	2019-04-21 18:11:42 +03:00
Benny Halevy	5a99023d4a	treewide: use lambda for io_check of *touch_directory To prepare for a seastar change that adds an optional file_permissions parameter to touch_directory and recursive_touch_directory. This change messes up the call to io_check since the compiler can't derive the Func&& argument. Therefore, use a lambda function instead to wrap the call to {recursive_,}touch_directory. Ref #4395 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190421085502.24729-1-bhalevy@scylladb.com>	2019-04-21 12:04:39 +03:00
Tomasz Grabiec	f092decd90	lsa: Fix potential bad_alloc even though evictable memory exists When we start the LSA reclamation it can be that segment_pool::_free_segments is 0 under some conditions and segment_pool::_current_emergency_reserve_goal is set to 1. The reclamation step is 1 segment, and compact_and_evict_locked() frees 1 segment back into the segment_pool. However, segment_pool::reclaim_segments() doesn't free anything to the standard allocator because the condition _free_segments > _current_emergency_reserve_goal is false. As a result, tracker::impl::reclaim() returns 0 as the amount of released memory, tracker::reclaim() returns memory::reclaiming_result::reclaimed_nothing and the seastar allocator thinks it's a real OOM and throws std::bad_alloc. The fix is to change compact_and_evict() to make sure that reserves are met, by releasing more if they're not met at entry. This change also allows us to drop the variant of allocate_segment() which accepts the reclamation step as a means to refill reserves faster. This is now not needed, because compact_and_evict() will look at the reserve deficit to increase the amount of memory to reclaim. Fixes #4445 Message-Id: <1555671713-16530-1-git-send-email-tgrabiec@scylladb.com>	2019-04-20 09:17:49 +03:00
Avi Kivity	704600f829	Update seastar submodule * seastar eb03ba5cd...e84d2647c (14): > Fix hardcoded python paths in shebang line > Disable -Wmaybe-uninitialized everywhere > app_template: allow opting out of automatic SIGINT/SIGTERM handling > build: Restore DPDK machine inference from cflags > http: capture request content for POST requests > Merge "Simplify future_state and promise" from Rafael > temporary_buffer: fix memleak on fast path > perftune.py: allow explicitly giving a CPU mask to be used for binding IRQs > perftune.py: fix the sanity check for args.tune > perftune.py: identify fast-path hardware queues IRQs of Mellanox NICs > memory: malloc_allocator should be always available > Merge "Using custom allocator in the posix network stack" from Elazar > memory: Tell reclaimers how much should be reclaimed > net/ipv4_addr: add std::hash & operator== overloads	2019-04-20 09:16:53 +03:00
Avi Kivity	d485facea2	Revert "tools: toolchain: improve dbuild signal handing" This reverts commit `6c672e674b`. It loses build logs, and the patch that restores logs causes build failures, so the whole thing needs to be revisited.	2019-04-19 15:16:42 +03:00
Takuya ASADA	0a874f1897	dist/docker/redhat: prioritize /opt/scylladb/python3/bin on $PATH To prevent running entrypoint script in another python3 package like python36 in EPEL, move /opt/scylladb/python3/bin to top of $PATH. It won't happen on this container image, but may occurs when user tries to extend the image. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190417165806.12212-1-syuu@scylladb.com>	2019-04-19 11:47:40 +03:00
Takuya ASADA	c3dae6673f	dist/common/scripts: use out() to run perftune.py perftune.py executes hwloc-calc, the command is now provided as relocatable binary, placed under /opt/scylladb/bin. So we need to add the directory to PATH when calling subprocess.check_output(), but our utility function already do that, switch to it. Fixes #4443 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190418124345.24973-1-syuu@scylladb.com>	2019-04-19 11:47:40 +03:00
Benny Halevy	9785754e0d	distributed_loader: do not follow symlinks when verifying mode and owner We allow only regular files and directotries so to detect symlinks we must not follow them. Fixes #4375 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190418051627.9298-1-bhalevy@scylladb.com>	2019-04-19 11:47:40 +03:00
Takuya ASADA	214c74a71d	dist: merge product name parameter on single place When we add product name customization, we mistakenly defined the parameter on each package build script. Number of script is increasing since we recently added relocatable python3 package, we should merge it in single place. Also we should save the parameter on relocatable package, just like version-release parameters. So move the definition to SCYLLA-VERSION-GEN, save it to build/SCYLLA-PRODUCT-FILE then archive it to relocatable package. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190417163335.10191-1-syuu@scylladb.com>	2019-04-19 11:47:40 +03:00
Paweł Dziepak	d47ea66ec6	messaging_service: add lz4_fragmented RPC compressor Seastar now supports two RPC compression algorithm: the original LZ4 one and LZ4_FRAGMENTED. The latter uses lz4 stream interface which allows it to process large messages without fully linearising them. Since, RPC requests used by Scylla often contain user-provided data that potentially could be very large, LZ4_FRAGMENTED is a better choice for the default compression algorithm. Message-Id: <20190417144318.27701-1-pdziepak@scylladb.com>	2019-04-18 19:07:14 +03:00
Takuya ASADA	592fec32a0	dist/common/scripts: use /etc/os-release to detect distributions Since we moved relocatable .rpm now Scylla able to run on Amazon Linux 2. However, is_redhat_variant() on scylla_util.py does not works on Amazon Linux 2, since it does not have /etc/redhat-release. So we need to switch to /etc/os-release, use ID_LIKE to detect Redhat variants/Debian variants. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190417115634.9635-1-syuu@scylladb.com>	2019-04-18 19:07:14 +03:00
Takuya ASADA	3cf7cf015a	dist/docker/redhat: use relocatable python3 on docker-entrypoint.py Switch to relocatable python3 instead of EPEL's python3 on docker-entrypoint.py. Also drop uneeded dependencies, since we switched to relocatable scylla image. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190417111024.6604-1-syuu@scylladb.com>	2019-04-18 19:07:14 +03:00
Paweł Dziepak	85409c1a16	Merge "Validate elements of collections" from Piotr " Previously we weren't validating elements of collections so it was possible to add non-UTF-8 string to a column with type list<text>. Tests: unit(release) Fixes #4009 " * 'haaawk/4009/v5' of github.com:scylladb/seastar-dev: types: Test correct map validation types: Test correct in clause validation types: Test correct tuple validation types: Test correct set validation types: Test correct list validation types: Add test_tuple_elements_validation types: Add test_in_clause_validation types: Add test_map_elements_validation types: Add test_set_elements_validation types: Add test_list_elements_validation types: Validate input when tuples types: Validate input when parsing a set types: Validate input when parsing a map types: Validate input when parsing a list types: Implement validation for tuple types: Implement validation for set types: Implement validation for map types: Implement validation for list types: Add cql_serialization_format parameter to validate	2019-04-18 19:07:14 +03:00
Botond Dénes	6e85d1e8c1	date_type_impl: add notice explaining why its not used And why is it still in the code. The note has been copied from Origin. Refs: #4419 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <c7790a898c331a7f58014d82a10cbc9ee7ad3265.1555483620.git.bdenes@scylladb.com>	2019-04-18 19:07:14 +03:00
Piotr Jastrzebski	134b59a425	table_helper: take insert function arguments by value Previous version wasn't working correctly with r-values. Fixes #4438 Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <5017b04901c47bd826b2e411e603ce01e42a83a5.1555424512.git.piotr@scylladb.com>	2019-04-16 17:34:35 +03:00
Tomasz Grabiec	5dc3f5ea33	Merge "Properly enable MC format on the cluster" from Piotr 1. All nodes in the cluster have to support MC_SSTABLE_FEATURE 2. When a node observes that whole cluster supports MC_SSTABLE_FEATURE then it should start using MC format. 3. Once all shards start to use MC then a node should broadcast that unbounded range tombstones are now supported by the cluster. 4. Once whole cluster supports unbounded range tombstones we can start accepting them on CQL level. tests: unit(release) Fixes #4205 Fixes #4113 * seastar-dev.git dev/haaawk/enable_mc/v11: system_keyspace: Add scylla_local system_keyspace: add accessors for SCYLLA_LOCAL storage_service: add _sstables_format field feature: add when_enabled callbacks system_keyspace: add storage_service param to setup Add sstable format helper methods Register feature listeners in storage_service Add service::read_sstables_format Use read_sstables_format in main.cc Use _sstables_format to determine current format Add _unbounded_range_tombstones_feature Update supported features on format change	2019-04-16 14:07:05 +02:00
Avi Kivity	6c672e674b	tools: toolchain: improve dbuild signal handing Currently, we use --sig-proxy to forward signals to the container. However, this requires the container's co-operation, which usually doesn't exist. For example, docker run --sig-proxy fedora:29 bash -c "sleep 5" Does not respond to ctrl-C. This is a problem for continuous integration. If a build is aborted, Jenkins will first attempt to gracefully terminate the processes (SIGINT/SIGTERM) and then give up and use SIGKILL. If the graceful termination doesn't work, we end up with an orphan container running on the node, which can then consume enough memory and CPU to harm the following jobs. To fix this, trap signals and handle them by killing the container. Also trap shell exit, and even kill the container unconditionally, since if Jenkins happens to kill the "docker wait" process the regular paths will not be taken. Message-Id: <20190415084040.12352-1-avi@scylladb.com>	2019-04-16 14:07:05 +02:00
Tomasz Grabiec	ac0d435c3e	Merge "hinted handoff: don't reuse_segments and discard corrupted segments" from Vlad This series addresses two issues in the hinted handoff that should complete fixing the infamous #4231. In particular the second patch removes the requirement to manually delete hints files after upgrading to 3.0.4. Tested with manual unit testing. * https://github.com/vladzcloudius/scylla.git hinted_handoff_drop_broken_segments-v3: hinted handoff: disable "reuse_segments" commitlog: introduce a segment_error hinted handoff: discard corrupted segments	2019-04-16 14:07:05 +02:00
Avi Kivity	643bddbecc	Update seastar submodule * seastar 6f73675...eb03ba5 (11): > tests: tests C++14 dialect in continuous integration > rpc/compressor/lz4: fix std:variant related compiler errors > tests: futures_test: allow project to compile with C++14 > Merge "io_queue: make io_priority_class namespace global" from Benny > future::then_wrapped: use std::terminate instead of abort > reactor: make metric about task quota violations less sensitive > Merge "Add LZ4_FRAGMENTED compressor for RPC" from Paweł > Fix build issues with Clang 7 > Merge "file_stat follow_symlink option and related fixes" from Benny > doc/tutorial.md: reword mention of seastar::thread premption on get() > tests: semaphore_test: relax timeouts Fixes #4272.	2019-04-16 14:34:32 +03:00
Raphael S. Carvalho	52e1125b52	sstables: do not destroy sstable runs after resharding Resharding wasn't preserving the sstable run structure, which depends on all fragments sharing the same run identifier. So let's make resharding run aware, meaning that a run will be created for each shard involved. tests: release mode. Fixes #4428. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20190415193556.16435-1-raphaelsc@scylladb.com>	2019-04-16 10:34:49 +03:00
Tomasz Grabiec	ff66b27754	gdb: heapprof: Coalesce parents in the flamegraph mode This change drops the hit count from the name of the node, because it prevents coalescing of nodes which are shared parents for paths with different counts. This lack of coalescing makes the flamegraph a lot less useful. Message-Id: <1555348576-26382-1-git-send-email-tgrabiec@scylladb.com>	2019-04-15 21:05:08 +03:00
Tomasz Grabiec	3fd82021b1	schema_tables: Serialize schema merges fairly All schema changes made to the node locally are serialized on a semaphore which lives on shard 0. For historical reasons, they don't queue but rather try to take the lock without blocking and retry on failure with a random delay from the range [0, 100 us]. Contenders which do not originate on shard 0 will have an extra disadvantage as each lock attempt will be longer by the across-shard round trip latency. If there is constant contention on shard 0, contenders originating from other shards may keep loosing to take the lock. Schema merge executed on behalf of a DDL statement may originate on any shard. Same for the schema merge which is coming from a push notification. Schema merge executed as part of the background schema pull will originate on shard 0 only, where the application state change listeners run. So if there are constant schema pulls, DDL statements may take a long time to get through. The fix is to serialize merge requests fairly, by using the blocking semaphore::wait(), which is fair. We don't have to back-off any more, since submit_to() no longer has a global concurrency limit. Fixes #4436. Message-Id: <1555349915-27703-1-git-send-email-tgrabiec@scylladb.com>	2019-04-15 20:40:38 +03:00
Botond Dénes	c6314e422f	tests/mutation_source_test: use a single random seed Currently, each instanciation of `random_mutation_generator::impl` will generate a new random seed for itself. Altough these are printed, mapping back all the printed seeds to the exact source location where it has to be substituted in is non-trivial. This makes reproducing random test failures very hard. To solve this problem, use `tests::random::get_int()` to produce the random seed of the `random_mutation_generator::impl` instances. This way the seed of all the mutation generator will be derived from a single "master" seed that is easily replaced after a test failure, hopefully also leading to easily reproducible random test failures. I checked that after substituting in a previously generated master random seed, all derived seeds were exactly the same. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <0471415938fc27485975ef9213d37d94bff20fd5.1555329062.git.bdenes@scylladb.com>	2019-04-15 17:37:31 +03:00
Avi Kivity	3afbe219cd	Merge "UDF/UDA related cleanups and refactoring" from Rafael " These are patches I wrote while working on UDF/UDA, but IMHO they are independent improvements and are ready for review. Tests: unit (debug) dtest (release) I checked that all tests in nosetests -v user_types_test.py sstabledump_test.py cqlsh_tests/cqlsh_tests.py now pass. " * 'espindola/udf-uda-refactoring-v3' of https://github.com/espindola/scylla: Refactor user type merging cql_type_parser::raw_builder: Allow building types incrementally cql3: delete dead code Include missing header return a const reference from return_type delete unused var Add a test on nested user types.	2019-04-15 16:52:13 +03:00
Glauber Costa	c01ed239a3	fix typo in create table statement error message specifed -> specified Fixes #4434 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190415125206.2993-1-glauber@scylladb.com>	2019-04-15 16:51:13 +03:00
Benny Halevy	b543ab4c76	sstables: remove_temp_dir: do not return then_wrapped future f.get_exception makes the future invalid so it must not be returned. Instead, make_exception_future<> with the exception ptr. Fixes #4435. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190415111909.30499-1-bhalevy@scylladb.com>	2019-04-15 16:42:49 +03:00
Glauber Costa	b9327f81cf	conf: stop telling people to run auto_bootstrap: false auto_bootstrap: false provide negligible gains for new clusters and it is extremely dangerous everywhere else. We have seen a couple of times in which users, confused by this, added this flag by mistake and added nodes with it. While they were pleased by the extremely fast times to add nodes, they were later displeased to find their data missing. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190414012028.20767-1-glauber@scylladb.com>	2019-04-14 10:42:25 +03:00
Piotr Jastrzebski	2c599122e1	Update supported features on format change Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 10:38:31 +02:00
Piotr Jastrzebski	9c7e3dd470	Add _unbounded_range_tombstones_feature This requires introduction of storage_service::get_known_features and using it with check_knows_remote_features. Otherwise a node joining the existing cluster won't be able to join because it does not support unbounded range tombstones yet. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 10:37:12 +02:00
Piotr Jastrzebski	96ad8f7df9	Use _sstables_format to determine current format Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 10:37:12 +02:00
Piotr Jastrzebski	da1eba5bdb	Use read_sstables_format in main.cc Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 10:37:12 +02:00
Piotr Jastrzebski	7339e9de30	Add service::read_sstables_format Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 10:37:12 +02:00
Piotr Jastrzebski	9934740c39	Register feature listeners in storage_service Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 10:36:58 +02:00
Piotr Jastrzebski	7a62235259	Add sstable format helper methods Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 09:33:40 +02:00
Piotr Jastrzebski	caa6798f2c	system_keyspace: add storage_service param to setup Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 09:33:40 +02:00
Piotr Jastrzebski	460fb260cb	feature: add when_enabled callbacks Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 09:33:40 +02:00
Piotr Jastrzebski	081542cf00	storage_service: add _sstables_format field Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 09:33:40 +02:00
Piotr Jastrzebski	0211541d84	system_keyspace: add accessors for SCYLLA_LOCAL Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 09:33:40 +02:00
Piotr Jastrzebski	4c205b733a	system_keyspace: Add scylla_local Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-12 09:33:40 +02:00
Benny Halevy	adf539fb2c	tests: sstable_test_env::do_with_async: wait_for_background_jobs To solve memory leak seen in sstable_datafile_test -t test_old_format_non_compound_range_tombstone_is_read Refs #4376 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190411154621.9716-1-bhalevy@scylladb.com>	2019-04-11 18:50:42 +03:00
Takuya ASADA	4636284856	dist/ami: drop EPEL, convert scylla_install_ami script to python2 We have to run this script in python2, since we dropped EPEL from dependencies, and the script is installer for rpms so we cannot use relocatable python3 for it. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190411151858.2292-1-syuu@scylladb.com>	2019-04-11 18:21:48 +03:00
Glauber Costa	f3a24b6c22	dist: remove curl dependency to simplify dependency list further Although curl is widely available, there is no reason to depend on it. There are mainly two users, as indicated by grep: 1) scylla-housekeeping 2) scripts within the AMI 3) docker image The AMI has its own RPM and it already depends on curl. While we could get rid of the curl dependency there too, we can do that later. Docker is its own thing and it only needs it at build time anyway. For the main scylla repo, this patch changes scylla-housekeeping so as not to depend on the curl binary and use urllib directly instead. We can then remove curl from our dependency list. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190411125642.9754-1-glauber@scylladb.com>	2019-04-11 16:12:36 +03:00
Benny Halevy	8181acd83b	test.py: fail if given test name not found Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190411092041.24712-1-bhalevy@scylladb.com>	2019-04-11 12:31:23 +03:00
Tzach Livyatan	f444c949bd	Fix the Dockerhub documentation for listen-address Fix listen-address documention: it is used for internal communication, not for external clients Signed-off-by: Tzach Livyatan <tzach@scylladb.com> Message-Id: <20190410181409.16078-1-tzach@scylladb.com>	2019-04-11 11:53:40 +03:00
Botond Dénes	f201f8abab	types: fix date_type_impl::less() (timestamp cql type) date_type_impl::less() invokes `compare_unsigned()` to compare the underlying raw byte values. `compared_unsigned()` is a tri comparator, however `date_type_impl::less()` implicitely converted the returned value to bool. In effect, `date_type_impl::less()` would always return `true` when the two compared values were not equal. Found while working on a unit test which empoly a randomly generated schema to test a component. Fixes #4419. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <8a17c81bad586b3772bf3d1d1dae0e3dc3524e2d.1554907100.git.bdenes@scylladb.com>	2019-04-10 21:01:25 +03:00
Botond Dénes	90721468f0	tests/mutation_diff: remove false-positive diff of the partition header Currently the partition header will always be reported as different when comparing two mutations. This is because they are prepended with the "expected: " and "... but got: " texts. This generates unnecessary noise. Inject a new line between the prefix and the partition-header proper. This way the partition header will only show up in the diff when there is an actual difference. The "expected: " and "... but got: " phrases are still shown as different on the top of the diff but this is fine as one can immediately see that they are not part of the data and additionaly they help the reader in determining which part of the diff is the expected one and which is the actual one. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <29e0f413d248048d7db032224a3fd4180bf1b319.1554909144.git.bdenes@scylladb.com>	2019-04-10 18:05:36 +02:00
Raphael S. Carvalho	8a117c338a	compaction: fix use-after-free when calculating backlog after schema change The problem happens after a schema change because we fail to properly remove ongoing compaction, which stopped being tracked, from list that is used to calculate backlog, so it may happen that a compaction read monitor (ceases to exist after compaction ends) is used after freed. Fixes #4410. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20190409024936.23775-1-raphaelsc@scylladb.com>	2019-04-10 15:54:39 +03:00
Vlad Zolotarov	db2ba0df61	hinted handoff: discard corrupted segments If we discover that a current segment is corrupted there is nothing we can do about it. This patch does the following: 1) Drops the corrupted segment and moves to the next one. 2) Logs such events as ERRORs. 3) Introduces a new metrics that accounts such event. Fixes #4364 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2019-04-09 15:54:20 -04:00

1 2 3 4 5 ...

18519 Commits