scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-24 18:40:38 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	c422bfc2c5	tests: perf_fast_forward: Store results for each dataset in separate sub-directory Otherwise read test results for subsequent datasets will override each other. Also, rename population test case to not include dataset name, which is now redundant. Message-Id: <1547822942-9690-1-git-send-email-tgrabiec@scylladb.com>	2019-01-20 15:38:46 +02:00
Botond Dénes	7049cd9374	partition_snapshot_reader: don't re-emit range tombstones overlapping multiple ck ranges When entering a new ck range (of the partition-slice), the partition snapshot reader will apply to its range tombstones stream all the tombstones that are relevant to the new ck range. When the partition has range tombstones that overlap with multiple ck ranges, these will be applied to the range tombstone stream when entering any of the ck ranges they overlap with. This will result in the violation of the monotonicity of the mutation fragments emitted by the reader, as these range tombstones will be re-emitted on each ck range, if the ck range has at least one clustering row they apply to. For example, given the following partition: rt{[1,10]}, cr{1}, cr{2}, cr{3}... And a partition-slice with the following ck ranges: [1,2], [3, 4] The reader will emit the following fragment stream: rt{[1,10]}, cr{1}, cr{2}, rt{[1,10]}, cr{3}, ... Note how the range tombstone is emitted twice. In addition to violating the monotonicity guarantee, this can also result in an explosion of the number of emitted range tombstones. Fix by applying only those range tombstones to the range tombstone stream, that have a position strictly greater than that of the last emitted clustering row (or range tombstone), when entering a new ck range. Fixes: #4104 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <e047af76df75972acb3c32c7ef9bb5d65d804c82.1547916701.git.bdenes@scylladb.com>	2019-01-20 15:38:04 +02:00
Paweł Dziepak	14757d8a83	types: collection_type: drop tombstone if covered by higher-level one At the moment are inefficiencies in how collection_type_impl::mutation::compact_and_expire( handles tombstones. If there is a higher-level tombstone that covers the collection one (including cases where there is no collection tombstone) it will be applied to the collection tombstone and present in the compaction output. This also means that the collection tombstone is never dropped if fully covered by a higher-level one. This patch fixes both those problems. After the compaction the collection tombstone is either unchanged or removed if covered by a higher-level one. Fixes #4092. Message-Id: <20190118174244.15880-1-pdziepak@scylladb.com>	2019-01-20 15:32:34 +02:00
Avi Kivity	e51ef95868	Update seastar submodule * seastar af6b797...7d620e1 (1): > perftune.py: don't let any exception out when connecting to AWS meta server Fixes #4102.	2019-01-20 13:59:09 +02:00
Avi Kivity	6e6372e8d2	Revert "Merge "Type-eaese gratuitous templates with functions" from Avi" This reverts commit `31c6a794e9`, reversing changes made to `4537ec7426`. It causes bad_function_calls in some situations: INFO 2019-01-20 01:41:12,164 [shard 0] database - Keyspace system: Reading CF sstable_activity id=5a1ff267-ace0-3f12-8563-cfae6103c65e version=d69820df-9d03-3cd0-91b0-c078c030b708 INFO 2019-01-20 01:41:13,952 [shard 0] legacy_schema_migrator - Moving 0 keyspaces from legacy schema tables to the new schema keyspace (system_schema) INFO 2019-01-20 01:41:13,958 [shard 0] legacy_schema_migrator - Dropping legacy schema tables INFO 2019-01-20 01:41:14,702 [shard 0] legacy_schema_migrator - Completed migration of legacy schema tables ERROR 2019-01-20 01:41:14,999 [shard 0] seastar - Exiting on unhandled exception: std::bad_function_call (bad_function_call)	2019-01-20 11:32:14 +02:00
Paweł Dziepak	e212d37a8a	utils/small_vector: fix leak in copy assignment slow path Fixes #4105. Message-Id: <20190118153936.5039-1-pdziepak@scylladb.com>	2019-01-18 17:49:46 +02:00
Paweł Dziepak	23cfb29fea	Merge "compaction: mc: re-calculate encoding_stats" from Benny " Use input sstables stats metadata to re-calculate encoding_stats. Fixes #3971. " * 'projects/compaction-encoding-stats/v3' of https://github.com/bhalevy/scylla: compaction: mc: re-calculate encoding_stats based on column stats memtable: extract encoding_stats_collector base class to encoding_stats header file	2019-01-18 14:36:17 +00:00
Tomasz Grabiec	7308effb45	tests: flat_mutation_reader_test: Drop unneeded includes Message-Id: <1547819118-4645-1-git-send-email-tgrabiec@scylladb.com>	2019-01-18 13:58:05 +00:00
Tomasz Grabiec	6461e085fe	managed_bytes: Fix compilation on gcc 8.2 The compilation fails on -Warray-bounds, even though the branch is never taken: inlined from ‘managed_bytes::managed_bytes(bytes_view)’ at ./utils/managed_bytes.hh:195:22, inlined from ‘managed_bytes::managed_bytes(const bytes&)’ at ./utils/managed_bytes.hh:162:77, inlined from ‘dht::token dht::bytes_to_token(bytes)’ at dht/random_partitioner.cc:68:57, inlined from ‘dht::token dht::random_partitioner::get_token(bytes)’ at dht/random_partitioner.cc:85:39: /usr/include/c++/8/bits/stl_algobase.h:368:23: error: ‘void* __builtin_memmove(void, const void, long unsigned int)’ offset 16 from the object at ‘<anonymous>’ is out of the bounds of referenced subobject ‘managed_bytes::small_blob::data’ with type ‘signed char [15]’ at offset 0 [-Werror=array-bounds] __builtin_memmove(__result, __first, sizeof(_Tp) * _Num); ~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Work around by disabling the diagnostic locally. Message-Id: <1547205350-30225-1-git-send-email-tgrabiec@scylladb.com>	2019-01-18 13:48:05 +00:00
Tomasz Grabiec	31c6a794e9	Merge "Type-eaese gratuitous templates with functions" from Avi Many area of the code are splattered with unneeded templates. This patchset replaces some of them, where the template parameter is a function object, with an std::function or noncopyable_function (with a preference towards the latter; but it is not always possible). As the template is compiled for each instantiation (if the function object is a lambda) while a function is compiled only once, there are significant savings in compile time and bloat. text data bss dec hex filename 85160690 42120 284910 85487720 5187068 scylla.before 84824762 42120 284910 85151792 5135030 scylla.after * https://github.com/avikivity/scylla detemplate/v1: api/commitlog: de-template acquire_cl_metric() database: de-template do_parse_schema_tables database: merge for_all_partitions and for_all_partitions_slow hints: de-template scan_for_hints_dirs() schema_tables: partially de-template make_map_mutation() distributed_loader: de-template tests: commitlog_test: de-template tests: cql_auth_query_test: de-template test: de-template eventually() and eventually_true() tests: flush_queue_test: de-template hint_test: de-template tests: mutation_fragment_test: de-template test: mutation_test: de-template	2019-01-18 11:42:01 +01:00
Avi Kivity	089931fb56	test: mutation_test: de-template Replace the with_column_family helper template with an ordinary funciton, to reduce code bloat.	2019-01-17 19:06:42 +02:00
Avi Kivity	53a3db9446	tests: mutation_fragment_test: de-template The for_each_target() template is called four times, so making it a normal function reduces a lot of code generation.	2019-01-17 19:05:48 +02:00
Avi Kivity	4a21de4592	hint_test: de-template While cl_test is duplicated with commitlog_test, at least deduplicate it internally by converting it to an ordinary function.	2019-01-17 19:03:31 +02:00
Avi Kivity	1f02fd3ff6	tests: flush_queue_test: de-template The internal test_propagation template is instantiated many times. Replace with an oridinary function to reduce bloat. Call sites adjusted to have a uniform signature.	2019-01-17 19:02:26 +02:00
Avi Kivity	63077501ed	test: de-template eventually() and eventually_true() These templates are not trivial and called many times. De-template them to reduce code bloat.	2019-01-17 19:00:55 +02:00
Avi Kivity	a5d3254ed3	tests: cql_auth_query_test: de-template Replace the with_user() and verify_unauthorized_then_ok() templates with functions. Some adjustments made to the call site to unify the signatures.	2019-01-17 18:59:30 +02:00
Avi Kivity	8c05debecb	tests: commitlog_test: de-template The cl_test function is called many times, so its contents are bloat. De-template it so it is compiled only once.	2019-01-17 18:57:35 +02:00
Avi Kivity	b6239134c2	distributed_loader: de-template distributed_loader has several large templates that can be converted to normal function with the help of noncopyable_function<>, reducing code bloat.	2019-01-17 18:56:22 +02:00
Avi Kivity	2407c35cc1	schema_tables: partially de-template make_map_mutation() make_map_mutation() is called several times, hopfully with the same Map type parameter. Replace the Func parameter with a noncopyable_function<>.	2019-01-17 18:54:43 +02:00
Avi Kivity	81d004b2c0	hints: de-template scan_for_hints_dirs() This function is called twice, and is not doing anything performance critical, so replace the template parameter Func with std::function<>.x	2019-01-17 18:51:46 +02:00
Avi Kivity	f61dbc9855	database: merge for_all_partitions and for_all_partitions_slow for_all_partitions is only used in the implementation of for_all_partitions_slow, so merge them and get rid of a template.	2019-01-17 18:50:36 +02:00
Avi Kivity	4568a4e4b0	database: de-template do_parse_schema_tables This long slow-path function is called four times, so de-templating it is an easy win.	2019-01-17 18:48:57 +02:00
Avi Kivity	08bd28942b	api/commitlog: de-template acquire_cl_metric() Use noncopyable_function instead of a template parameter. Likely doesn't gain anyting, because the template was always instantiated with the same type (the result of std::bind() with the same signatures), but still good practice.	2019-01-17 18:45:14 +02:00
Botond Dénes	4537ec7426	mutlishard_mutation_query(): use correct reader concurrency semaphore The multishard mutation query used the semaphore obtained from `database::user_read_concurrency_sem()` to pause-resume shard readers. This presented a problem when `multishard_mutation_query()` was reading from system tables. In this case the readers themselves would obtain their permits from the system read concurrency semaphore. Since the pausing of shard readers used the user read semaphore, pausing failed to fulfill its objective of alleviating pressure on the semaphore the reads obtained their permits from. In some cases this lead to a deadlock during system reads. To ensure the correct semaphore is used for pausing-resuming readers, obtain the semaphore from the `table` object. To avoid looking up the table on every pause or resume call, cache the semaphores when readers are created. Fixes: #4096 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <c784a3cd525ce29642d7216fbe92638fa7884e88.1547729119.git.bdenes@scylladb.com>	2019-01-17 15:19:59 +02:00
Avi Kivity	8e9989685d	scyllatop: complete conversion to python3 `d2dbbba139` converted scyllatop's interperter to Python 3, but neglected to do the actual conversion. This patch does so, by running 2to3 over allfiles and adding an additional bytes->string decode step in prometheus.py. Superfluous 2to3 changes to print() calls were removed. Message-Id: <20190117124121.7409-1-avi@scylladb.com>	2019-01-17 12:50:25 +00:00
Duarte Nunes	7505815013	Merge 'Fix filtering with LIMIT and paging' from Piotr " Before this series the limit was applied per page instead of globally, which might have resulted in returning too many rows. To fix that: 1. restrictions filter now has a 'remaining' parameter in order to stop accepting rows after enough of them have already been accepted 2. pager passes its row limit to restrictions filter, so no more rows than necessary will be served to the client 3. results no longer need to be trimmed on select_statement level Tests: unit (release) " * 'fix_filtering_limit_with_paging_3' of https://github.com/psarna/scylla: tests: add filtering+limit+paging test case tests: allow null paging state in filtering tests cql3: fix filtering with LIMIT with regard to paging	2019-01-17 12:50:00 +00:00
Piotr Sarna	ed7328613f	tests: add filtering+limit+paging test case A test case that checks whether a combination of paging and LIMIT clause for filtering queries doesn't return with too many rows. Refs #4100	2019-01-17 13:25:10 +01:00
Piotr Sarna	7d4f994e98	tests: allow null paging state in filtering tests Previously the utility to extract paging state asserted that the state exists, but in future tests it would be useful to be able to call this function even if it would return null.	2019-01-17 13:25:10 +01:00
Piotr Sarna	87c23372fb	cql3: fix filtering with LIMIT with regard to paging Previously the limit was erroneously applied per page instead of being accumulated, which might have caused returning too many rows. As of now, LIMIT is handled properly inside restrictions filter. Fixes #4100	2019-01-17 13:25:09 +01:00
Piotr Sarna	02d88de082	db,view: add consuming units in staging table registration View update generator service can accept sstables even before it starts, but it should still acknowledge the number of waiters in the semaphore. Reported-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <fcaa0f2884ebb4d34d1716e9e1cfed0642b4b85d.1547661048.git.sarna@scylladb.com>	2019-01-16 18:05:17 +00:00
Benny Halevy	1d483bc424	compaction: mc: re-calculate encoding_stats based on column stats When compacting several sstables, get and merge their encoding_stats for encoding the result. Introduce sstable::get_encoding_stats_for_compaction to return encoding_stats based on the sstable's column stats. Use encoding_stats_collector to keep track of the minimum encoding_stats values of all input sstables. Fixes #3971 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-01-16 17:59:59 +02:00
Benny Halevy	e2c4d2d60a	memtable: extract encoding_stats_collector base class to encoding_stats header file To be used also by compaction. Refs #3971 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-01-16 17:59:58 +02:00
Asias He	4b9e1a9f1d	repair: Add row level metrics Number of rows sent and received - tx_row_nr - rx_row_nr Bytes of rows sent and received - tx_row_bytes - rx_row_bytes Number of row hashes sent and received - tx_hashes_nr - rx_hashes_nr Number of rows read from disk - row_from_disk_nr Bytes of rows read from disk - row_from_disk_bytes Message-Id: <d1ee6b8ae8370857fe45f88b6c13087ea217d381.1547603905.git.asias@scylladb.com>	2019-01-16 14:04:57 +02:00
Duarte Nunes	04a14b27e4	Merge 'Add handling staging sstables to /upload dir' from Piotr " This series adds generating view updates from sstables added through /upload directory if their tables have accompanying materialized views. Said sstables are left in /upload directory until updates are generated from them and are treated just like staging sstables from /staging dir. If there are no views for a given tables, sstables are simply moved from /upload dir to datadir without any changes. Tests: unit (release) " * 'add_handling_staging_sstables_to_upload_dir_5' of https://github.com/psarna/scylla: all: rename view_update_from_staging_generator distributed_loader: fix indentation service: add generating view updates from uploaded sstables init: pass view update generator to storage service sstables: treat sstables in upload dir as needing view build sstables,table: rename is_staging to requires_view_building distributed_loader: use proper directory for opening SSTable db,view: make throttling optional for view_update_generator	2019-01-15 18:19:27 +00:00
Duarte Nunes	9b79f0f58b	Merge 'Add stream phasing' from Piotr " This series addresses the problem mentioned in issue 4032, which is a race between creating a view and streaming sstables to a node. Before this patch the following scenario is possible: - sstable X arrives from a streaming session - we decide that view updates won't be generated from an sstable X by the view builder - new view is created for the table that owns sstable X - view builder doesn't generate updates from sstable X, even though the table has accompanying views - which is an inconsistency This race is fixed by making the view builder wait for all ongoing streams, just like it does for reads and writes. It's implemented with a phaser. Tests: unit (release) dtest(not merged yet: materialized_views_test.TestMaterializedViews.stream_from_repair_during_build_process_test) " * 'add_stream_phasing_2' of https://github.com/psarna/scylla: repair: add stream phasing to row level repair streaming: add phasing incoming streams multishard_writer: add phaser operation parameter view: wait for stream sessions to finish before view building table: wait for pending streams on table::stop database: add pending streams phaser	2019-01-15 18:18:40 +00:00
Piotr Sarna	0eb703dc80	all: rename view_update_from_staging_generator The new name, view_update_generator, is both more concise and correct, since we now generate from directories other than "/staging".	2019-01-15 17:31:47 +01:00
Piotr Sarna	a5d24e40e0	distributed_loader: fix indentation Bad indentation was introduced in the previous commit.	2019-01-15 17:31:37 +01:00
Piotr Sarna	13c8c84045	service: add generating view updates from uploaded sstables SSTables loaded to the system via /upload dir may sometimes be needed to generate view updates from them (if their table has accompanying views). Fixes #4047	2019-01-15 17:31:37 +01:00
Piotr Sarna	46305861c3	init: pass view update generator to storage service Storage service needs to access view update generator in order to register staging sstables from /upload directory.	2019-01-15 17:31:36 +01:00
Piotr Sarna	13f6453350	sstables: treat sstables in upload dir as needing view build In some cases, sstables put in the upload dir should have view updates generated from them. In order to avoid moving them across directories (which then involves handling failure paths), upload dir will also be treated as a valid directory where staging sstables reside. Regular sstables that are not needed for view updates will be immediately moved from upload/ dir as before.	2019-01-15 16:47:01 +01:00
Piotr Sarna	09401e0e71	sstables,table: rename is_staging to requires_view_building A generalized name will be more fitting once we treat uploaded sstables as requiring view building too.	2019-01-15 16:47:01 +01:00
Piotr Sarna	76616f6803	distributed_loader: use proper directory for opening SSTable Previous implementation assumes that each SSTable resides directly in table::datadir directory, while what should actually be used is directory path from SSTable descriptor. This patch prevents a regression when adding staging sstables support for upload/ dir.	2019-01-15 16:47:01 +01:00
Piotr Sarna	beb4836726	db,view: make throttling optional for view_update_generator Currently registering new view updates is throttled by a semaphore, which makes sense during stream sessions in order to avoid overloading the queue. Still, registration also occurs during initialization, where it makes little sense to wait on a semaphore, since view update generator might not have started at all yet.	2019-01-15 16:47:01 +01:00
Paweł Dziepak	635873639b	Merge "Encoding stats enhancements" from Benny " Cleanup various cases related to updating of metatdata stats and encoding stats updating in preparation for 64-bit gc_clock (#3353). Fixes #4026 Fixes #4033 Fixes #4035 Fixes #4041 Refs #3353 " * 'projects/encoding-stats-fixes/v6' of https://github.com/bhalevy/scylla: sstables: remove duplicated code in data_consume_rows_context CELL_VALUE_BYTES sstables: mc: use api::timestamp_type in write_liveness_info sstables: mc: sstable_write encoding_stats are const mp_row_consumer_k_l::consume_deleted_cell rename ttl param to local_deletion_time memtable: don't use encoding_stats epochs as default memtable: mc: udpate min_ttl encoding stats for dead row marker memtable: mc: add comment regarding updating encoding stats of collection tombstones sstables: metadata_collector: add update tombstone stats sstables: assert that delete_time is not live when updating stats sstables: move update_deletion_time_stats to metadata collector sstables: metadata_collector: introduce update_local_deletion_time_and_tombstone_histogram sstables: mc: write_liveness_info and write_collection should update tombstone_histogram sstables: update_local_deletion_time for row marker deletion_time and expiration	2019-01-15 16:53:36 +02:00
Tomasz Grabiec	32f711ce56	row_cache: Fix crash on memtable flush with LCS Presence checker is constructed and destroyed in the standard allocator context, but the presence check was invoked in the LSA context. If the presence checker allocates and caches some managed objects, there will be alloc-dealloc mismatch. That is the case with LeveledCompactionStrategy, which uses incremental_selector. Fix by invoking the presence check in the standard allocator context. Fixes #4063. Message-Id: <1547547700-16599-1-git-send-email-tgrabiec@scylladb.com>	2019-01-15 16:53:36 +02:00
Piotr Sarna	08a42d47a5	repair: add stream phasing to row level repair In order to allow other services to wait for incoming streams to finish, row level repair uses stream phasing when creating new sstables from incoming data. Fixes scylladb#4032	2019-01-15 10:28:21 +01:00
Piotr Sarna	7e61f02365	streaming: add phasing incoming streams Incoming streams are now phased, which can be leveraged later to wait for all ongoing streams to finish. Refs #4032	2019-01-15 10:28:15 +01:00
Asias He	1cc7e45f44	database: Make log max_vector_size and internal_count debug level It is useful for developers but not useful for users. Make it debug level. Message-Id: <775ce22d6f8088a44d35601509622a7e73ddeb9b.1547524976.git.asias@scylladb.com>	2019-01-15 11:02:30 +02:00
Piotr Sarna	238003b773	multishard_writer: add phaser operation parameter Multishard writer can now accept a phaser operation parameter in order to sustain a phased operation (e.g. a streaming session).	2019-01-15 10:02:22 +01:00
Piotr Sarna	b9203ec4f8	view: wait for stream sessions to finish before view building During streaming, there's a race between streamed sstables and view creation, which might result in some tables not being used to generate view updates, even though they should. That happens when the decision about view update path for a table is done before view creation, but after already receiving some sstables via streaming. These will not be used in view building even though they should. Hence, a phaser is used to make the view builder wait for all ongoing stream sessions for a table to finish before proceeding with build steps. Refs #4032	2019-01-15 09:36:55 +01:00

1 2 3 4 5 ...

17729 Commits