scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-01 21:55:50 +00:00

Author	SHA1	Message	Date
Paweł Dziepak	a8a45a243b	tests/cql_test_env: don't override tmpdir::path The interface tmpdir::path isn't properly encapsulated and its users can modify the path even though they really shouldn't. This can happen accidentally, in cql_test_env a reference to tmpdir::path was created and later assigned to in one of the code paths. This caused tmpdir destructor to remove wrong directory at program exit. This patch solves the problem by avoiding referencing tmpdir::path, a copy is perfectly acceptable considering that this is tests-only code. Message-Id: <20190206173046.26801-1-pdziepak@scylladb.com>	2019-02-06 20:55:40 +02:00
Takuya ASADA	96b1cb97ba	dist/ami: don't cleanup build dir rm -rf build/* was to start rpm building on clean state, but it also delete scylla built binaries so it was not good idea. Instead of rm -rf build/*, we can check file existance on cloned directory, if it seems good we can reuse it. Also we need to run git pull on each package repo since it may not included latest commit. Fixes #4189 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190206101755.2056-1-syuu@scylladb.com>	2019-02-06 15:33:09 +02:00
Nadav Har'El	3e7dc7230d	build_deb.sh: fix error message The error message was apparently copied from the RPM script. Fix it. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190205162148.20698-1-nyh@scylladb.com>	2019-02-05 18:22:36 +02:00
Avi Kivity	54748ad15b	Merge "Allow non-key IN restrictions" from Piotr " Fixes #4193 Fixes #3795 This series enables handling IN restrictions for regular columns, which is needed by both filtering and indexing mechanisms. Tests: unit (release) " * 'allow_non_key_in_restrictions' of https://github.com/psarna/scylla: tests: add filtering with IN restriction test cql3: remove unused can_have_only_one_value function cql3: allow non-key IN restrictions	2019-02-05 17:30:35 +02:00
Piotr Sarna	45db5da51b	tests: add filtering with IN restriction test Test case for filtering regular columns with IN restriction is added.	2019-02-05 16:04:17 +01:00
Piotr Sarna	36609d1376	cql3: remove unused can_have_only_one_value function	2019-02-05 16:04:17 +01:00
Piotr Sarna	c178ed8b16	cql3: allow non-key IN restrictions Restricting a regular column with IN restriction is a perfectly valid case for filtering and indexing, so it should be allowed. Fixes #4193 Fixes #3795	2019-02-05 15:50:17 +01:00
Rafael Ávila de Espíndola	84542dadfa	sstables: delete_atomically: don't drop futures We still allow the delete of rows from system.large_partition to run in parallel with the sstable deletion, but now we return a future that waits for both. Tests: unit (release) Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190205001526.68774-1-espindola@scylladb.com>	2019-02-05 16:47:58 +02:00
Calle Wilund	ba6a8ef35b	tls: Use a default prio string disabling TLS1.0 forcing min 128bits Fixes #4010 Unless user sets this explicitly, we should try explicitly avoid deprecated protocol versions. While gnutls should do this for connections initiated thusly, clients such as drivers etc might use obsolete versions. Message-Id: <20190107131513.30197-1-calle@scylladb.com>	2019-02-05 15:34:18 +02:00
Avi Kivity	6c71eae63f	Merge "API: Stream compaction history records" from Amnon " get_compaction_history can return a lot of records which will add up to a big http reply. This series makes sure it will not create large allocations when returning the results. It adds an api to the query_processor to use paged queries with a consumer function that returns a future, this way we can use the http stream after each record. This implementation will prevent large allocations and stalls. Fixes #4152 " * 'amnon/compaction_history_stream_v7' of github.com:scylladb/seastar-dev: tests/query_processor_test: add query_with_consumer_test system_keyspace, api: stream get_compaction_history query_processor: query and for_each_cql_result with future	2019-02-05 14:16:36 +02:00
Avi Kivity	ebf179318c	Merge "SI: Add virtual columns to underlying MV" from Duarte " Virtual columns are MV-specific columns that contribute to the liveness of view rows. However, we were not adding those columns when creating an index's underlying MV, causing indexes to miss base rows. Fixes #4144 Branches: master, branch-3.0 " Reviewed-by: Nadav Har'El <nyh@scylladb.com> * 'sec-index/virtual-columns/v1' of https://github.com/duarten/scylla: tests/secondary_index_test: Add reproducer for #4144 index/secondary_index_manager: Add virtual columns to MV	2019-02-05 13:26:45 +02:00
Avi Kivity	367ef8d318	Merge "provide our own, relocatable, python3 interpreter" from Glauber " We would like to deploy Scylla in constrained environments where internet access is not permitted. In those environments it is not possible to acquire the dependencies of Scylla from external repos and the packages have to be sent alongside with its dependencies. In older distributions, like CentOS7 there isn't a python3 interpreter available. And while we can package one from EPEL this tends to break in practice when installing the software in older patchlevels (for instance, installing into RHEL7.3 when the latest is RHEL7.5). The reason for that, as we saw in practice, is that EPEL may not respect RHEL patchlevels and have the python interpreter depending on newer versions of some system libraries. virtualenv can be used to create isolated python enviornments, but it is not designed for full isolation and I hit at least two roadblocks in practice: 1) It doesn't copy the files, linking some instead. There is an --always-copy option but it is broken (for years) in some distributions. 2) Even when the above works, it still doesn't copy some files, relying on the system files instead (one sad example was the subprocess module that was just kept in the system and not moved to the virtualenv) This patch solves that problem by creating a python3 environment in a directory with the modules that Scylla uses, and no other else. It is essentially doing what vitualenv should do but doesn't. Once this environment is assembled the binaries are then made relocatable the same way the Scylla binary is. One difference (for now) between the Scylla binary relocation process and ours is that we steer away from LD_LIBRARY_PATH: the environment variable is inherited by any child process steming from the caller, which means that we are unable to use the subprocess module to call system binaries like mkfs (which our scripts do a lot). Instead, we rely on RUNPATH to tell the binary where to search for its libraries. Once we generate an archive with the python3 interpreter, we then package it as an rpm with bare any dependencies. The dependencies listed are: $ rpm -qpR scylla-relocatable-python3-3.6.7-1.el7.x86_64.rpm rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PartialHardlinkSets) <= 4.0.4-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 rpmlib(PayloadIsXz) <= 5.2-1 And the total size of that rpm, with all modules scylla needs is 20MB. The Scylla rpm now have a way more modest dependency list: $ rpm -qpR scylla-server-666.development-0.20190121.80b7c7953.el7.x86_64.rpm \| sort \| uniq /bin/sh curl file hwloc kernel >= 3.10.0-514 mdadm pciutils rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 rpmlib(PayloadIsXz) <= 5.2-1 scylla-conf scylla-relocatable-python3 <== our python3 package. systemd-libs util-linux xfsprogs I have tested this end to end by generating RPMs from our master branch, then installing them in a clean CentOS7.3 installation without even using yum, just rpm -Uhv <package_list> Then I called scylla_setup to make sure all python scripts were working and started Scylla successfully. " * 'scylla-python3-v5' of github.com:glommer/scylla: Create a relocatable python3 interpreter spec file: fix python3 dependency list. fixup scripts before installing them to their final location automatically relocate python scripts make scyllatop relocatable use relative paths for installing scylla and iotune binaries	2019-02-05 12:53:34 +02:00
Amnon Heiman	c96c3ce9e8	tests/query_processor_test: add query_with_consumer_test This patch adds a unit test for querying with a consumer function. query with consumer uses paging, the tests covers the scenarios where the number of rows bellow and above the page size, it also test the option to stop in the middle of reading. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-02-05 12:35:53 +02:00
Amnon Heiman	6c7742d616	system_keyspace, api: stream get_compaction_history get_compaciton_history can return big chunk of data. To prevent large memory allocation, the get_compaction_history now read each compaction_history record and use the http stream to send it. Fixes #4152 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-02-05 11:14:53 +02:00
Amnon Heiman	c0e3b7673d	query_processor: query and for_each_cql_result with future query and for_each_cql_result accept a function that reads a row and return a stop_iterator. This implementation of those functions gets a function that returns a future stop_iterator allowing preemption between calls. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-02-05 11:14:53 +02:00
Glauber Costa	afed2cddae	Create a relocatable python3 interpreter We would like to deploy Scylla in constrained environments where internet access is not permitted. In those environments it is not possible to acquire the dependencies of Scylla from external repos and the packages have to be sent alongside with its dependencies. In older distributions, like CentOS7 there isn't a python3 interpreter available. And while we can package one from EPEL this tends to break in practice when installing the software in older patchlevels (for instance, installing into RHEL7.3 when the latest is RHEL7.5). The reason for that, as we saw in practice, is that EPEL may not respect RHEL patchlevels and have the python interpreter depending on newer versions of some system libraries. virtualenv can be used to create isolated python enviornments, but it is not designed for full isolation and I hit at least two roadblocks in practice: 1) It doesn't copy the files, linking some instead. There is an --always-copy option but it is broken (for years) in some distributions. 2) Even when the above works, it still doesn't copy some files, relying on the system files instead (one sad example was the subprocess module that was just kept in the system and not moved to the virtualenv) This patch solves that problem by creating a python3 environment in a directory with the modules that Scylla uses, and no other else. It is essentially doing what vitualenv should do but doesn't. Once this environment is assembled the binaries are then made relocatable the same way the Scylla binary is. One difference (for now) between the Scylla binary relocation process and ours is that we steer away from LD_LIBRARY_PATH: the environment variable is inherited by any child process steming from the caller, which means that we are unable to use the subprocess module to call system binaries like mkfs (which our scripts do a lot). Instead, we rely on RUNPATH to tell the binary where to search for its libraries. In terms of the python interpreter, PYTHONPATH does not need to be set for this to work as the python interpreter will include the lib directory in its PYTHONPATH. To confirm this, we executed the following code: bin/python3 -c "import sys; print('\n'.join(sys.path))" with the interpreter unpacked to both /home/centos/glaubertmp/test/ and /tmp. It yields respectively: /home/centos/glaubertmp/test/lib64/python36.zip /home/centos/glaubertmp/test/lib64/python3.6 /home/centos/glaubertmp/test/lib64/python3.6/lib-dynload /home/centos/glaubertmp/test/lib64/python3.6/site-packages and /tmp/python/lib64/python36.zip /tmp/python/lib64/python3.6 /tmp/python/lib64/python3.6/lib-dynload /tmp/python/lib64/python3.6/site-packages This was tested by moving the .tar.gz generated on my Fedora28 laptop to a CentOS machine without python3 installed. I could then invoke ./scylla_python_env/python3 and use the interpreter to call 'ls' through the subprocess module. I have also tested that we can successfully import all the modules we listed for installation and that we can read a sample yaml file (since PyYAML depends on the system's libyaml, we know that this works) Time to build: real 0m15.935s user 0m15.198s sys 0m0.382s Final archive size (uncompressed): 81MB Final archive sie (compressed) : 25MB Signed-off-by: Glauber Costa <glauber@scylladb.com> -- v3: - rewrite in python3 - do not use temporary directories, add directly to the archive. Only the python binary have to be materialized - Use --cacheonly for repoquery, and also repoquery --list in a second step to grab the file list v2: - do not use yum, resolve dependencies from installed packages instead - move to scripts as Avi wants this not only for old offline CentOS	2019-02-04 18:02:40 -05:00
Glauber Costa	f757b42ba7	spec file: fix python3 dependency list. The dependency list as it was did not reflect the fact that scyllatop is now written in python3. Some packages, like urwid, should use the python3 version. CentOS doesn't really have an urwid package for python3, not even in EPEL. So this officially marks the point in which we can't build packages that will install in CentOS7 anyway. Luckily, we will soon be providing our own python3 interpreter. But for now, as a first step, simplify the dependency list by removing the CentOS/Fedora conditional and listing the full python3 list Signed-off-by: Glauber Costa <glauber@scylladb.com>	2019-02-04 18:02:40 -05:00
Glauber Costa	7052028752	fixup scripts before installing them to their final location Before installing python files to their final location in install.sh, replace them with a thunk so that they can work with our python3 interpreter. The way the thunk works, they will also work without our python3 interpreter so unconditionally fixing them up is always safe. I opt in this patch for fixing up just at install time to simplify developer's life, who won't have to worry about this at all. Note about the rpm .spec file: since we are relying on specific format for the shebangs, we shouldn't let rpmbuild mess with them. Therefore, we need to disable a global variable that controls that behavior (by definition, Fedora rpmbuild will rewrite all shebangs to /usr/bin/python3) Signed-off-by: Glauber Costa <glauber@scylladb.com>	2019-02-04 18:02:40 -05:00
Glauber Costa	3869628429	automatically relocate python scripts Given a python script at $DIR/script.py, this copies the script to $DIR/libexec/script.py.bin, fixes its shebang to use /usr/bin/env instead of an absolute path for the interpreter and replaces the original script with a thunk that calls into that script. PYTHONPATH is adjusted so that the original directory containing the script can also serve as a source of modules, as would be originally intended. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2019-02-04 18:02:39 -05:00
Glauber Costa	1bb65a0888	make scyllatop relocatable Right now the binary we distribute with scyllatop calls into /usr/lib/scylla/scyllatop/scyllatop.py unconditionally. Calling that is all that this binary does. This poses a problem to our relocatable process, since we don't want to be referring to absolute paths (And moreover, that is calling python whereas it should be calling python3) The scyllatop.py files includes a python3 shebang and is executable. Therefore, it is best to just create a link to that file and execute it directly Signed-off-by: Glauber Costa <glauber@scylladb.com>	2019-02-04 16:12:46 -05:00
Glauber Costa	e890b8af09	use relative paths for installing scylla and iotune binaries The answer is yes: if we install them in $root/opt, we should link to $root/opt Signed-off-by: Glauber Costa <glauber@scylladb.com>	2019-02-04 14:33:51 -05:00
Piotr Jastrzebski	834bec5cc9	Read shard awareness columns as dropped Without this new version of Scylla won't be able to start with system tables inherited after older version that had shard awareness columns. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <cb62f20fc0c98f532c6f4ad5e08b3794951e85bd.1549289050.git.piotr@scylladb.com>	2019-02-04 18:43:11 +02:00
Rafael Ávila de Espíndola	bbd9dfcba7	Add a --split-dwarf option to configure.py It is off by default as it conflicts with distcc. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190204002706.15540-1-espindola@scylladb.com>	2019-02-04 18:42:16 +02:00
Benny Halevy	a9e1e0233a	Add a dev build mode to test.py Message-Id: <20190204162112.7471-2-espindola@scylladb.com>	2019-02-04 18:38:23 +02:00
Rafael Ávila de Espíndola	6243443591	Add a dev build mode The build times I got with a clean ccache were: ninja dev 10806.89s user 678.29s system 2805% cpu 6:49.33 total ninja release 28906.37s user 1094.53s system 2378% cpu 21:01.27 total ninja debug 18611.17s user 1405.66s system 2310% cpu 14:26.52 total With this version -gz is not passed to seastar's configure. It should probably be seastar's configure responsibility to do that and I will send a separate patch to do it. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190204162112.7471-1-espindola@scylladb.com>	2019-02-04 18:38:22 +02:00
Calle Wilund	9cadbaa96f	commitlog_replayer: Bugfix: finding truncation positions uses local var ref "uuid" was ref:ed in a continuation. Works 99.9% of the time because the continuation is not actually delayed (and assuming we begin the checks with non-truncated (system) cf:s it works). But if we do delay continuation, the resulting cf map will be borked. Fixes #4187. Message-Id: <20190204141831.3387-1-calle@scylladb.com>	2019-02-04 16:51:13 +02:00
Rafael Ávila de Espíndola	15a515a39b	build: Don't link utils/gz/gen_crc_combine_table with seastar It doesn't use seastar, so there is no point in linking with it. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190203214145.43009-1-espindola@scylladb.com>	2019-02-04 15:43:16 +02:00
Botond Dénes	2a67355ded	multishard_combining_reader: better shard selection algorithm The multishard reader has to combine the output of all shards into a single fragment stream. To do that, each time a `partition_start` is read it has to check if there is another partition, from another shard, that has to be emitted before this partition. Currently for this it uses the partitioner. At every partition start fragment it checks if the token falls into the current shard sub-range. The shard sub-range is the continuous range of tokens, where each token belongs to the same shard. If the partition doesn't belong to the current shard sub-range the multishard reader assumes the following shard sub-range of the next shard will have data and move over to it. This assumption will however only stand on very dense tables, and will fail miserably on less dense tables, resulting in the multishard reader effectively iterating over the shard sub-ranges (4096 in the worst case), only to find data in just a few of them. This resulted in high user-perceived latency when scanning a sparse table. This patch replaces this algorithm with one based on a shard heap. The shards are now organized into a min-heap, by the next token they have data for. When a partition start fragment is read from the current shard, its token is compared to the smallest token in the shard heap. If smaller, we continue to read from the current shard. Otherwise we move to the shard with the smallest token. When constructing the reader, or after fast-forwarding we don't know what first token each reader will produce. To avoid reading in a partition from each reader, we assume each reader will produce the first token from the first shard sub-range that overlaps with the query range. This algorithm performs much better on sparse tables, while also being slightly better on dense tables. I did only a very rough measurement using CQL tracing. I populated a table with four rows on a 64 shards machine, then scanned the entire table. Time to scan the table (microseconds): before 27'846 after 5'248 Fixes: #4125 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <d559f887b650ab8caa79ad4d45fa2b7adc39462d.1548846019.git.bdenes@scylladb.com>	2019-02-04 14:10:23 +02:00
Piotr Sarna	11e6d88ca7	tests: supplement filtering collections with more cases Filtering test cases for collections are supplemented with checking whether CONTAINS works correctly for sets and maps. Message-Id: <4a684152cdcdb65e1415ba5859699cb324312c2b.1548837150.git.sarna@scylladb.com>	2019-02-03 17:19:30 +02:00
Avi Kivity	468f8c7ee7	Merge "Print a warning if a row is too large" from Rafael " This is a first step in fixing #3988. " * 'espindola/large-row-warn-only-v4' of https://github.com/espindola/scylla: Rename large_partition_handler Print a warning if a row is too large Remove defaut parameter value Rename _threshold_bytes to _partition_threshold_bytes keys: add schema-aware printing for clustering_key_prefix	2019-02-03 13:57:42 +02:00
Nadav Har'El	5a695b8029	Materialized views: fix three error messages Three error messages were supposed to include a column name, but a "{}" was missing in the format so the given column name didn't actually appear in the error message. So this patch adds the missing {}'s. Fixes #4183. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190203112100.13031-1-nyh@scylladb.com>	2019-02-03 12:23:29 +01:00
Tomasz Grabiec	72dd6f54e3	gdb: Print total amount of memory used by small and large allocations Message-Id: <1548956406-7601-2-git-send-email-tgrabiec@scylladb.com>	2019-02-01 13:18:16 +00:00
Tomasz Grabiec	f48fa542fc	gdb: Extend 'scylla memory' to show memory used by large allocations Adds new columns to the "Page spans" table named "large [B]" and "[spans]", which shows how much memory is allocated in spans of given size. Excludes spans used by small pools. Useful in determining what is the size of large allocations which consume the memory. Example output: Page spans: index size [B] free [B] large [B] [spans] 0 4096 4096 4096 1 1 8192 32768 0 0 2 16384 16384 0 0 3 32768 98304 2785280 85 4 65536 65536 1900544 29 5 131072 524288 471597056 3598 ... 31 8796093022208 0 0 0 Large allocations: 484675584 [B] Message-Id: <1548956406-7601-1-git-send-email-tgrabiec@scylladb.com>	2019-02-01 13:18:01 +00:00
Asias He	28d6d117d2	migration_manager: Fix nullptr dereference in maybe_schedule_schema_pull Commit `976324bbb8` changed to use get_application_state_ptr to get a pointer of the application_state. It may return nullptr that is dereferenced unconditionally. In resharding_test.py:ReshardingTest_nodes4_with_SizeTieredCompactionStrategy.resharding_by_smp_increase_test, we saw: 4 nodes in the tests n1, n2, n3, n4 are started n1 is stopped n1 is changed to use different shard config n1 is restarted ( 2019-01-27 04:56:00,377 ) The backtrace happened on n2 right fater n1 restarts: 0 INFO 2019-01-27 04:56:05,175 [shard 0] gossip - Feature STREAM_WITH_RPC_STREAM is enabled 1 INFO 2019-01-27 04:56:05,175 [shard 0] gossip - Feature WRITE_FAILURE_REPLY is enabled 2 INFO 2019-01-27 04:56:05,175 [shard 0] gossip - Feature XXHASH is enabled 3 WARN 2019-01-27 04:56:05,177 [shard 0] gossip - Fail to send EchoMessage to 127.0.58.1: seastar::rpc::closed_error (connection is closed) 4 INFO 2019-01-27 04:56:05,205 [shard 0] gossip - InetAddress 127.0.58.1 is now UP, status = 5 Segmentation fault on shard 0. 6 Backtrace: 7 0x00000000041c0782 8 0x00000000040d9a8c 9 0x00000000040d9d35 10 0x00000000040d9d83 11 /lib64/libpthread.so.0+0x00000000000121af 12 0x0000000001a8ac0e 13 0x00000000040ba39e 14 0x00000000040ba561 15 0x000000000418c247 16 0x0000000004265437 17 0x000000000054766e 18 /lib64/libc.so.6+0x0000000000020f29 19 0x00000000005b17d9 We do not know when this backtrace happened, but according to log from n3 an n4: INFO 2019-01-27 04:56:22,154 [shard 0] gossip - InetAddress 127.0.58.2 is now DOWN, status = NORMAL INFO 2019-01-27 04:56:21,594 [shard 0] gossip - InetAddress 127.0.58.2 is now DOWN, status = NORMAL We can be sure the backtrace on n2 happened before 04:56:21 - 19 seconds (the delay the gossip notice a peer is down), so the abort time is around 04:56:0X. The migration_manager::maybe_schedule_schema_pull that triggers the backtrace must be scheduled before n1 is restarted, because it dereference application_state pointer after it sleeps 60 seconds, so the time maybe_schedule_schema_pull is called is around 04:55:0X which is before n1 is restarted. So my theory is: migration_manager::maybe_schedule_schema_pull is scheduled, at this time n1 has SCHEMA application_state, when n1 restarts, n2 gets new application state from n1 which does not have SCHEMA yet, when migration_manager::maybe_schedule wakes up from the 60 sleep, n1 has non-empty endpoint_state but empty application_state for SCHEMA. We dereference the nullptr application_state and abort. Fixes: #4148 Tests: resharding_test.py:ReshardingTest_nodes4_with_SizeTieredCompactionStrategy.resharding_by_smp_increase_test Message-Id: <9ef33277483ae193a49c5f441486ee6e045d766b.1548896554.git.asias@scylladb.com>	2019-02-01 09:01:08 +02:00
Piotr Jastrzebski	ad217bbdc7	Revert "system_keyspace: add sharding information to local table" This reverts commit `bdce561ada`. Those columns are not used and cause problems with tools. Refs #4112 Message-Id: <c772ebc0ebc001e5bdf229424c6d51dc58cd5d2e.1548945023.git.piotr@scylladb.com>	2019-01-31 19:06:55 +01:00
Avi Kivity	9adf46b50e	Update seastar submodule * seastar 2f35731...c3be06d (1): > rpc: support closing streaming when only sink or source was created Ref #4124.	2019-01-31 12:39:02 +02:00
Nadav Har'El	7b9b7f8ebc	docs/metrics.md: document syntax for choosing specific instance/shard As another useful example of Prometheus syntax, show the syntax of plotting a graph for one particular node or shard. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Reviewed-by: Botond Denes <bdenes@scylladb.com> Message-Id: <20190129221607.11813-1-nyh@scylladb.com>	2019-01-31 12:37:30 +02:00
Asias He	9d9ecda619	repair: Log keyspace and table name in repair_cf_range When a repair failed, we saw logs like: repair - Checksum of range (8235770168569320790, 8235957818553794560] on 127.0.0.1 failed: std::bad_alloc (std::bad_alloc) It is hard to tell which keyspace and table has failed. To fix, log the keyspace and table name. It is useful to know when debugging. Fixes #4166 Message-Id: <8424d314125b88bf5378ea02a703b0f82c2daeda.1548818669.git.asias@scylladb.com>	2019-01-31 12:36:46 +02:00
Gleb Natapov	a70374d982	messaging_service: do not forget to close stream when sending it to another side failed Fixes #4124 Message-Id: <20190131091857.GC3172@scylladb.com>	2019-01-31 12:01:56 +02:00
Piotr Jastrzebski	4b47094f30	Prevent undefined behaviour while writing range tombstones in LA/KA Stop calling .remove_suffix on empty string_view. ck_bview can be empty because this function can be called for a half open range tombstone. It is impossible to write such range tombstones to LA/KA SSTables so we should throw a proper exception instead of allowing an undefined behaviour. Refs #4113 Tests: unit(release) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <c3738916953e4b10812aed95e645c739b4c29462.1548777086.git.piotr@scylladb.com>	2019-01-31 10:58:19 +01:00
Glauber Costa	94ead559f7	move scylla-housekeeping to dist/common/scripts All of our python scripts are there and they are all installed automatically into /usr/lib/scylla. By keeping scylla-housekeeping separately we are just complicating our build process. This would be just a minor annoyance but this broke the new relocatable process for python3 that I am trying to put together because I forgot to add the new location as a source for the scripts. Therefore, I propose we start being more diligent with this and keeping all scripts together for the future. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190123191732.32126-2-glauber@scylladb.com>	2019-01-31 11:44:34 +02:00
Jesse Haber-Kucharsky	c37aa258c5	build: Fix incremental builds when Seastar changes When a file in the `seastar` directory changes, we want to minimize the amount of Scylla artifacts that are re-built while ensuring that all changes in Seastar are reflected in Scylla correctly. For compiling object files, we change Seastar to be an "order only" dependency so that changes to Seastar don't trigger unnecessary builds. For linking, we add an "implicit" dependency on Seastar so that Scylla is re-linked when Seastar changes. With these changes, modifying a Seastar header file will trigger the recompilation of the affected Scylla object files, and modifying a Seastar source file will trigger linking only. Fixes #4171 Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <0ab43d79ce0d41348238465d1819d4c937ac6414.1548906335.git.jhaberku@scylladb.com>	2019-01-31 11:00:40 +02:00
Raphael S. Carvalho	930f8caff9	sstables/compaction: Fix segfault when replacing expired sstable in incremental compaction Fully expired sstable is not added to compacting set, meaning it's not actually compacted, but it's kept in a list of sstables which incremental compaction uses to check if any sstable can be replaced. Incremental compaction was unconditionally removing expired sstable from compacting set, which led to segfault because end iterator was given. The fix is about changing sstable_set::erase() behavior to follow standard one for erase functions which will works if the target element is not present. Fixes #4085. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20190130163100.5824-1-raphaelsc@scylladb.com>	2019-01-30 16:32:45 +00:00
Avi Kivity	056b6a4439	Update seastar submodule * seastar 07e1ed3...2f35731 (1): > Merge " Initial seastar ipv6 support" from Calle	2019-01-30 17:41:39 +02:00
Avi Kivity	1224cde871	Merge "Make perf_simple_query produce JSON results" from Paweł " This series enhances perf_simple_query error reporting by adding an option of producing a json file containing the results. The format of that file is very similar to the results produces by perf_fast_forward in order to ease integration with any tools that may want to interpret them. In addition to that perf_simple_query now prints to the standard output median, median absolute deviation, minimum and maximum of the partial results, so that there is no need for external scripts to compute those values. " * tag 'perf_simple_query-json/v1' of https://github.com/pdziepak/scylla: perf_simple_query: produce json results perf_simple_query: calculate and print statistics perf: time_parallel: return results of each iteration perf_simple_query: take advantage of threads in main()	2019-01-30 17:39:19 +02:00
Paweł Dziepak	6a0ee5dbbf	Merge "Simpler fix for the memtable reader's fragment monotonicity violation" from Botond " Recently it was discovered that the memtable reader (partition_snapshot_reader to be more precise) can violate mutation fragment monotonicity, by remitting range tombstones when those overlap with more than one ck range of the partition slice. This was fixed by `7049cd9`, however after that fix was merged a much simpler fix was proposed by Tomek, one that doesn't involve nearly as much changes to the partition snapshot reader and hences poses less risk of breaking it. This mini-series reverts the previous fix, then applies the new, simpler one. Refs: #4104 " * 'partition-snapshot-reader-simpler-fix/v2' of https://github.com/denesb/scylla: partition_snapshot_reader: don't re-emit range tombstones overlapping multiple ck ranges Revert "partition_snapshot_reader: don't re-emit range tombstones overlapping multiple ck ranges"	2019-01-30 15:24:31 +00:00
Jesse Haber-Kucharsky	b39eac653d	Switch to the the CMake-ified Seastar Committer: Avi Kivity <avi@scylladb.com> Branch: next Switch to the the CMake-ified Seastar This change allows Scylla to be compiled against the `master` branch of Seastar. The necessary changes: - Add `-Wno-error` to prevent a Seastar warning from terminating the build - The new Seastar build system generates the pkg-config files (for example, `seastar.pc`) at configure time, so we don't need to invoke Ninja to generate them - The `-march` argument is no longer inherited from Seastar (correctly), so it needs to be provided independently - Define `SEASTAR_TESTING_MAIN` so that the definition of an entry point is included for all unit test compilation units - Independently link Scylla against Seastar's compiled copy of fmt in its build directory - All test files use the (now public) Seastar testing headers - Add some missing Seastar headers to source files [avi: regenerate frozen toolchain, adjust seastar submoule] Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <02141f2e1ecff5cbcd56b32768356c3bf62750c4.1548820547.git.jhaberku@scylladb.com>	2019-01-30 11:17:38 +02:00
Botond Dénes	8d59c36165	partition_snapshot_reader: don't re-emit range tombstones overlapping multiple ck ranges When entering a new ck range (of the partition-slice), the partition snapshot reader will apply to its range tombstones stream all the tombstones that are relevant to the new ck range. When the partition has range tombstones that overlap with multiple ck ranges, these will be applied to the range tombstone stream when entering any of the ck ranges they overlap with. This will result in the violation of the monotonicity of the mutation fragments emitted by the reader, as these range tombstones will be re-emitted on each ck range, if the ck range has at least one clustering row they apply to. For example, given the following partition: rt{[1,10]}, cr{1}, cr{2}, cr{3}... And a partition-slice with the following ck ranges: [1,2], [3, 4] The reader will emit the following fragment stream: rt{[1,10]}, cr{1}, cr{2}, rt{[1,10]}, cr{3}, ... Note how the range tombstone is emitted twice. In addition to violating the monotonicity guarantee, this can also result in an explosion of the number of emitted range tombstones. Fix by trimming range tombstones to the start of the current ck range, thus ensuring that they will not violate mutation fragment monotonicity guarantees. Refs: #4104 This is a much simpler fix for the above issue, than the already committed one (7049cd937A). The latter is reverted by the previous patch and this patch applies the simpler fix.	2019-01-30 10:01:13 +02:00
Nadav Har'El	9dd3c59c77	docs/metrics.md: explain Prometheus and Grafana docs/metrics.md so far explained just the REST API for retrieving current metrics from a single Scylla node. In this patch, I add basic explanations on how to use the Prometheus and Grafana tools included in the "scylla-grafana-monitoring" project. It is true that technically, what is being explained here doesn't come with the Scylla project and requires the separate scylla-grafana-monitoring to be installed as well. Nevertheless, most Scylla developers will need this knowledge eventually and suprisingly it appears it was never documented anywhere accessible to newbie developers, and I think metrics.md is the right place to introduce it. In fact, I myself wasn't aware until today that Prometheus actually had its own Web UI on port 9090, and that it is probably more useful for developers than Grafana is. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Reviewed-by: Botond Denes <bdenes@scylladb.com> Message-Id: <20190129114214.17786-1-nyh@scylladb.com>	2019-01-29 15:46:06 +02:00
Duarte Nunes	35c03f41a4	Merge 'Fix multiple contains for one column' from Piotr " An error in validating CONTAINS restrictions against collections caused only the first restriction to be taken into account due to returning prematurely. This miniseries provides a fix for that as well as a matching test case. Tests: unit (release) Fixes #4161 " * 'fix_multiple_contains_for_one_column' of https://github.com/psarna/scylla: tests: enable CONTAINS tests for filtering cql3: remove premature return from is_satisfied_by cql3: restore indentation	2019-01-29 11:10:13 +00:00

1 2 3 4 5 ...

17900 Commits