scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-09 00:13:31 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	22019577cc	cache_streamed_mutation: Call fast_forward_to() outside allocating section On bad_alloc the section is retried. If the exception happened inside fast_forward_to() on the underlying reader, that call will be retried. However, the reader should not be used after exception is thrown, since it is in unspecified state. Also, calling fast_forward_to() with cache region locked increases the chances of it failing to allocate. We shouldn't call fast_forward_to() with the cache region locked. Fixes #2791.	2017-09-15 15:41:55 +02:00
Tomasz Grabiec	3b790a1e80	cache_streamed_mutation: Switch from flags to explicit state machine We're in one state at a time, so it's better to express it as a single variable rather than N independent flags. In preparation before adding more states.	2017-09-15 15:41:55 +02:00
Glauber Costa	eb93d5f8ad	database: pass a monitor as a parameter to memtable writer Right now we pass a permit to the memtable writer and that permit is used insite write_memtable_to_sstable to compose a write_monitor. We would like to extend the write_monitor to include other things, that right now are not available as parameters to write_memtable_to_sstable - and which are possibly too specialized to be. The solution for that is to pass the write_monitor instead of the permit to the writer. Conceptually, that also makes sense because the write_monitor is something the sstable writer is aware of. Permits, on the other hand, are a database concept that is alien to the sstable writer. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20170915032836.21154-1-glauber@scylladb.com>	2017-09-15 12:26:56 +02:00
Duarte Nunes	8378fe190a	Merge 'Fix schema version mismatch during rolling upgrade from 1.7' from Tomasz "When there are at least 2 nodes upgraded to 2.0, and the two exchanged schema for some reason, reads or writes which involve both 1.7 and 2.0 nodes may start to fail with the following error logged: storage_proxy - Exception when communicating with 127.0.0.3: Failed to load schema version 58fc9b89-74ab-37ca-8640-8b38a1204f8d The situation should heal after whole cluster is upgraded. Table schema versions are calculated by 2.0 nodes differently than 1.7 nodes due to change in the schema tables format. Mismatch is meant to be avoided by having 2.0 nodes calculate the old digest on schema migration during upgrade, and use that version until next time the table is altered. It is thus not allowed to alter tables during the rolling upgrade. Two 2.0 nodes may exchange schema, if they detect through gossip that their schema versions don't match. They may not match temporarily during boot, until the upgraded node completes the bootstrap and propagates its new schema through gossip. One source of such temporary mismatch is construction of new tracing tables, which didn't exist on 1.7. Such schema pull will result in a schema merge, which cause all tables to be altered and their schema version to be recalculated. The new schema will not match the one used by 1.7 nodes, causing reads and writes to fail, because schema requesting won't work during rolling upgrade from 1.7 to 2.0. The main fix employed here is to hold schema pulls, even among 2.0 nodes, until rolling upgrade is complete." * 'tgrabiec/fix-schema-mismatch' of github.com:scylladb/seastar-dev: tests: schema_change_test: Add test_merging_does_not_alter_tables_which_didnt_change test case tests: cql_test_env: Enable all features in tests schema_tables: Make make_scylla_tables_mutation() visible migration_manager: Disable pulls during rolling upgrade from 1.7 storage_service: Introduce SCHEMA_TABLES_V3 feature schema_tables: Don't alter tables which differ only in version schema_mutations: Use mutation_opt instead of stdx::optional<mutation>	2017-09-15 10:27:47 +02:00
Tomasz Grabiec	c657eec4cf	tests: schema_change_test: Add test_merging_does_not_alter_tables_which_didnt_change test case	2017-09-14 20:26:31 +02:00
Tomasz Grabiec	f0fdf75e7c	tests: cql_test_env: Enable all features in tests	2017-09-14 20:26:31 +02:00
Tomasz Grabiec	571cac95ed	schema_tables: Make make_scylla_tables_mutation() visible For tests.	2017-09-14 20:26:31 +02:00
Tomasz Grabiec	5a92c18e63	migration_manager: Disable pulls during rolling upgrade from 1.7 If there is a schema pull during rolling upgrade among a two 2.0 nodes, then schema merge will delete the persisted schema version. When the node loads that table again, e.g. on restart, it will generate a version which is different than the one which 1.7 nodes use. This will cause reads and writes to fail. To avoid this, disable pulls until all nodes are upgraded. Fixes #2802.	2017-09-14 20:26:31 +02:00
Tomasz Grabiec	713d75fd51	storage_service: Introduce SCHEMA_TABLES_V3 feature	2017-09-14 20:26:31 +02:00
Tomasz Grabiec	f943d2efbf	schema_tables: Don't alter tables which differ only in version We apply deletion of scylla_tables.version to the incoming schema mutations so that table schema version is recalculated after merge. The mutations which we read from local schema tables may not have it deleted in which case all tables would be considered as differing on the presence of the version field. Avoid this by deleting the field from old mutations as well.	2017-09-14 20:26:31 +02:00
Tomasz Grabiec	99272087e6	schema_mutations: Use mutation_opt instead of stdx::optional<mutation>	2017-09-14 20:26:31 +02:00
Takuya ASADA	7662271fc9	dist/ami: show correct message when scylla-ami-setup.service failed After the service started, a state of the service may become "failed", "active" or "activating". But our script does not accept scylla-ami-setup.service become "failed" state, in result the script shows up wrong message. So we handle these three types of state correctly. Fixes #2759 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1504589079-1986-1-git-send-email-syuu@scylladb.com>	2017-09-14 12:40:05 +03:00
Tomasz Grabiec	0911fbbdef	row_cache: Fix row_cache::update_invalidating() evict() doesn't guarantee that the whole partition is discontinuous. In particular, partition tombstone cannot be marked as discontinuous. The parts which are still continuous must be updated. Broken after `c78047fa5b`. Message-Id: <1505375684-28574-1-git-send-email-tgrabiec@scylladb.com>	2017-09-14 10:58:25 +03:00
Asias He	0ec574610d	locator: Get rid of assert in token_metadata In commit `69c81bcc87` (repair: Do not allow repair until node is in NORMAL status), we saw a coredump due to an assert in token_metadata::first_token_index. Throw an exception instead of abort the whole scylla process. Message-Id: <c110645cee1ee3897e30a3ae1b7ab3f49c97412c.1504752890.git.asias@scylladb.com>	2017-09-14 10:33:02 +03:00
Gleb Natapov	31e803a36c	storage_proxy: wire up percentile speculative read properly Collect coordinator side read statistic per CF and use them in percentile speculative read executor. Getting percentile from estimated_histogram object is rather expensive, so cache it and recalculate only once per second (or if requested percentile changes). Fixes #2757 Message-Id: <20170911131752.27369-3-gleb@scylladb.com>	2017-09-14 10:31:26 +03:00
Gleb Natapov	0842faecef	estimated_histogram: fix overflow error handling Currently overflow values are stored in incorrect bucket (last one instead of special "overflow" one) and percentile() function throws if there is overflow value. The patch fixes the code to store overflow value in corespondent bucket and makes percentile() to take it into account instead of throwing. Message-Id: <20170911131752.27369-2-gleb@scylladb.com>	2017-09-14 10:31:21 +03:00
Asias He	5ff0b113c9	gossip: Fix indentation in apply_state_locally Message-Id: <2bdefa8d982ad8da7452b41e894f41d865b83b0b.1505356245.git.asias@scylladb.com>	2017-09-14 10:09:50 +03:00
Tomasz Grabiec	65ca8eebb8	mutation_partition: Print rows_entry's position instead of key For dummy rows, _key doesn't reflect the right position. Message-Id: <1505317040-6783-1-git-send-email-tgrabiec@scylladb.com>	2017-09-13 20:49:28 +03:00
Avi Kivity	ca8e3c4a78	Merge "Evict from partition snapshots in cache" from Tomasz "This series fixes the problem of active reads causing OOM due to the fact that partition snapshots they hold are not evictable. In particular, a single scan of a partition larger than memory will bad_alloc due to itself. After this, when partition entry is evicted from cache, data in all the snapshots is also evicted. We still don't have row-level eviction, but this series lays some grounds for it by making cache readers prepared for the possibility of rows being evicted. Fixes #2775. Fixes #2730." * tag 'tgrabiec/snapshot-evicition-in-cache-v1' of github.com:scylladb/seastar-dev: tests: Add test for partition_entry::evict() mutation_partition: Introduce range continuity checking methods mutation_partition: Enable rows_entry::compare() on position_in_partition_views tests: Extract mvcc tests to separate file tests: row_cache: Add evicition tests tests: simple_schema: Add new_tombstone() helper tests: streamed_mutation_assertions: Introduce produces(mutation&) streamed_mutation: Allow setting buffer capacity row_cache: Evict partition snapshots mvcc: Introduce partition_entry::evict() row_cache: Handle eviction in partition reader tests: row_cache_test: Don't assume mvcc snapshots are not evictable row_cache: Reuse allocation_strategy::invalidate_references() row_cache: Don't invalidate references on insertion lsa: Move reclaim counter concept to allocation_strategy level mvcc: Ensure partition_snapshot always destroys versions using proper allocator mvcc: Encapsulate reference stability check in partition_snapshot mvcc: Store LSA region reference in partition_snapshot	2017-09-13 20:48:33 +03:00
Tomasz Grabiec	a45b1ef4bc	sstables: Make atomic_deletion_manager logger static So that it's visible to the framework at boot and --logger-log-level can be used on it. Message-Id: <1505286578-21904-1-git-send-email-tgrabiec@scylladb.com>	2017-09-13 20:35:41 +03:00
Tomasz Grabiec	b8f62e86de	tests: Add test for partition_entry::evict()	2017-09-13 17:47:04 +02:00
Tomasz Grabiec	455a1b0d24	mutation_partition: Introduce range continuity checking methods	2017-09-13 17:47:04 +02:00
Tomasz Grabiec	abc489e99d	mutation_partition: Enable rows_entry::compare() on position_in_partition_views For full symmetry with existing overloads.	2017-09-13 17:47:04 +02:00
Tomasz Grabiec	d76b141b34	tests: Extract mvcc tests to separate file	2017-09-13 17:47:04 +02:00
Tomasz Grabiec	2dfb3b95a5	tests: row_cache: Add evicition tests	2017-09-13 17:47:03 +02:00
Tomasz Grabiec	204ec9c673	tests: simple_schema: Add new_tombstone() helper	2017-09-13 17:47:03 +02:00
Tomasz Grabiec	5b1adfa542	tests: streamed_mutation_assertions: Introduce produces(mutation&)	2017-09-13 17:47:03 +02:00
Tomasz Grabiec	cb16b038ef	streamed_mutation: Allow setting buffer capacity Needed in tests to limit amount of prefetching done by readers, so that it's easier to test interleaving of various events.	2017-09-13 17:47:03 +02:00
Tomasz Grabiec	c78047fa5b	row_cache: Evict partition snapshots If snapshots are not evicted, they may pin unbouned amount of memory for a long time in cache, which may lead to OOM. Evict snapshots together with the entry. Fixes #2775. Fixes #2730.	2017-09-13 17:47:03 +02:00
Tomasz Grabiec	b6ae5783cd	mvcc: Introduce partition_entry::evict() The operation frees as much memory as possible, marking affected mutation elements as discontinuous.	2017-09-13 17:47:03 +02:00
Tomasz Grabiec	fa2c26342c	row_cache: Handle eviction in partition reader	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	99aa3d1964	tests: row_cache_test: Don't assume mvcc snapshots are not evictable The test was not updating the underlying mutation source but still expecting to get the right data after calling invalidate(). If snapshots are evictable, that's not guaranteed. Apply to underlying as well, so data is read from underlying if necessary.	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	adb159d51b	row_cache: Reuse allocation_strategy::invalidate_references() Modification count in the tracker is redundant, we can rely on allocator's invalidation counter.	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	27a3b4bca9	row_cache: Don't invalidate references on insertion modification_count is currently only used to detect invalidation of references, intended to be incremented on erasure. Insertion into intrusive set doesn't invalidate references, so no need to increment the counter.	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	87be474c19	lsa: Move reclaim counter concept to allocation_strategy level So that generic code can detect invalidation of references. Also, to allow reusing the same mechanism for signalling external reference invalidation.	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	4053c801e2	mvcc: Ensure partition_snapshot always destroys versions using proper allocator partition_snapshot is managed by lw_shared_ptr. Currently it is assumed that before it dies, maybe_merge_versions() is called on it, which destroyes it in the right allocator context. It's not very safe. This patch improves safety by using the right allocator in snapshot's destructor.	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	cda86abdbc	mvcc: Encapsulate reference stability check in partition_snapshot	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	2df6f356b1	mvcc: Store LSA region reference in partition_snapshot Will be useful for improving encapsulation.	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	4c920c9891	tests: cql_test_env: Use cancel_prior_atomic_deletions() This fixes a failure in view_schema_test, which starts many instances of single_node_cql_env. cancel_atomic_deletions() causes later deletions to fail, which causes some of the test cases to fail. Message-Id: <1505311250-3118-2-git-send-email-tgrabiec@scylladb.com>	2017-09-13 17:11:34 +03:00
Tomasz Grabiec	dc0860ac70	sstables: Introduce cancel_prior_atomic_deletions() Like cancel_atomic_deletions() but doesn't fail later deletions. Message-Id: <1505311250-3118-1-git-send-email-tgrabiec@scylladb.com>	2017-09-13 17:11:33 +03:00
Tomasz Grabiec	8a425cedc6	tests: cql_test_env: Cancel pending sstable deletions on shutdown Fixes a hang on shutdown with --smp 2 in perf_fast_forward. The hang is in sstables::await_background_jobs_on_all_shards(), which is waiting on sstable deletions. Not all shards agree to delete certain sstables, because e.g. not all shards decide to compact them yet. Cancel those deletes after database is stopped on all shards, like we do in main.cc Fixes #2796. Message-Id: <1505292239-26032-1-git-send-email-tgrabiec@scylladb.com>	2017-09-13 11:56:48 +03:00
Asias He	c84dcabb8f	gossip: Use boost::copy_range in apply_state_locally boost::copy_range is better because the vector is allocated with the correct size instead of growing when the inserter is called. [avi: also crashes less] Message-Id: <b19ca92d56ad070fca1e848daa67c00c024e3a4d.1505291199.git.asias@scylladb.com>	2017-09-13 11:33:15 +03:00
Tomasz Grabiec	b3a8ba5af6	gdb: Introduce "scylla find" command Finds live objects on seastar heap of current shard which contain given value. Prints results in 'scylla ptr' format. Example: (gdb) scylla find 0x600005321900 thread 1, small (size <= 512), live (0x6000000f3800 +48) thread 1, small (size <= 56), live (0x6000008a1230 +32) Message-Id: <1505284614-19577-1-git-send-email-tgrabiec@scylladb.com>	2017-09-13 11:22:23 +03:00
Asias He	fa9d47c7f3	gossip: Fix a log message typo in compare_endpoint_startup Message-Id: <c4958950e1108082b63e08ab81ee2177edc9b232.1505286843.git.asias@scylladb.com>	2017-09-13 09:54:56 +02:00
Glauber Costa	ecad1be161	compaction_strategy: add missing header compaction_strategy.hh throws an exception, but it doesn't add the exception header. It is working in-tree because of inclusion order, but it broke one of my yet-out-of-tree changes. In any case, it is best to add the headers we will need to the files, and that is what this patch does. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20170912233326.26114-1-glauber@scylladb.com>	2017-09-13 08:40:15 +02:00
Raphael S. Carvalho	ef18b1162b	sstables/compaction_manager: rename and better explain reshard function submit doesn't properly describe the function and also improve explanation of the relationship between function itself and its job parameter. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20170912032034.23043-1-raphaelsc@scylladb.com>	2017-09-12 12:25:17 +03:00
Avi Kivity	1bd207a306	sstables: merge filter.cc into sstables.cc filter.cc has just two smallish functions, which are part of the sstable class. Move them to sstables.cc where the rest of the class members are defined. Message-Id: <20170912080541.7836-1-avi@scylladb.com>	2017-09-12 10:06:52 +02:00
Tomasz Grabiec	423142ec81	tests: row_cache_test: Fix abort in debug mode The test used apply() variant which assumed that it was invoked in a seastar thread, which is no longer the case after commit `d22fdf4`. Fix by copying outisde cache update, and use non-deferring apply() variant for cache update. Message-Id: <1505200142-3650-1-git-send-email-tgrabiec@scylladb.com>	2017-09-12 10:57:36 +03:00
Tomasz Grabiec	3f527e028d	Merge "Reduce dependencies on sstables.hh" from Avi This patchset reduces includes of sstables.hh, reducing compile time by both reducing the amount of code compiled, and the amount of needless recompiles caused by false dependencies. It does so by replacing lw_shared_ptr<sstable>, which requires a complete class, with a new custom type shared_sstable, which allows an incomplete sstable class definition. * https://github.com/avikivity/scylla deps2/v2.1 database: change truncate() to flush while compaction is disabled database: make run_with_compaction_disabled() a non-template database: add indirection to compaction_manager instance database: remove dependency on compaction.hh and compaction_manager.hh size_estimates_virtual_reader.hh: add missing include system_keyspace: add missing include main: add missing include storage_service: add missing include repair: add missing include compaction.hh: add missig include and forward declaration compaction_manager: add missing include shared_index_lists.hh: add missing include perf_fast_forward: add missing include sstable_mutation_test: add missing include sstables: extract version and format enum into a separate header file database.hh: add missing forward declaration for foreign_sstable_open_info cql_test_env: add forward declaration database: make column_family::disable_sstable_write() out-of-line sstables: introduce make_sstable() as a shortcut for make_lw_shared<sstable> treewide: use shared_sstable, make_sstable in place of lw_shared_ptr<sstable> sstables: use support for lw_shared_ptr with incomplete type for shared_sstable sstables: reduce dependencies streaming: remove unneeded includes	2017-09-12 09:56:46 +02:00
Tomasz Grabiec	ee1e7732a6	database: Create tables with continuous cache When table is created, it doesn't contain any data, so we can mark the whole data range as continuous in cache. This way reads will immediately hit, and flushes will populate. If sstables are later attached, the attaching process is supposed to invalidate affected ranges (and it does). Fixes #2536. Message-Id: <1505200269-4031-1-git-send-email-tgrabiec@scylladb.com>	2017-09-12 10:53:07 +03:00

1 2 3 4 5 ...

13142 Commits