scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-02 21:17:01 +00:00

Author	SHA1	Message	Date
Glauber Costa	8a50b027aa	summary: generate one if it is not present There are cases in which a Summary file will not be present, and imported SSTables will have just the Index and Data files. In earlier versions of Cassandra, a Summary didn't exist, so one may not be generated when migrating. In Issue #1170, we can see an example of tables generated by CQLSSTableWriter, and they lack a Summary. Cassandra is robust against this and can cope perfectly with the Summary not existing. I will argue that we should do the same. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-04-08 17:14:29 -04:00
Glauber Costa	4de26fdec8	sstables: allow read_toc to be called more than once We do that by bailing immediately if we detect that the components map is already populated. This allow us to call read_toc() earlier if we need to - for instance, to inquire about the existence of the Summary - without the need to re-read the components again later. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-04-08 17:14:29 -04:00
Glauber Costa	736e21222e	sstables: avoid passing schema unnecessarily for prepare_summary we can just pass the min interval as a parameter and avoid having the schema do yet another hop. For sealing the summary, it is completely unused and we can do away with it. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-04-08 17:14:29 -04:00
Glauber Costa	0de3a32147	index reader: make index_consumer a template parameter This is done so we can use other consumers. An example of that, is regeneration of the Summary from an existing Index. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-04-08 17:14:29 -04:00
Glauber Costa	8453ff7788	make get_sstable_key_range an instance method Because just creating an SSTable object does not generate any I/O, get_sstable_key_range should be an instance method. The main advantage of doing that is that we won't have to read the summary twice. The way we're doing it currently, if happens to be a shard-relevant table we'll call load() - which reads the summary again. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-04-08 17:14:29 -04:00
Glauber Costa	6ae601a025	do not re-read the summary There are times in which we read the Summary file twice. That actually happens every time during normal boot (it doesn't during refresh). First during get_sstable_key_range and then again during load(). Every summary will have at least one entry, so we can easily test for whether or not this is properly initialized. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-04-08 17:14:29 -04:00
Avi Kivity	db03295c8a	Merge "Fix query digest mismatch" from Tomasz "Currently data query digest includes cells and tombstones which may have expired or be covered by higher-level tombstones. This causes digest mismatch between replicas if some elements are compacted on one of the nodes and not on others. This mismatch triggers read-repair which doesn't resolve because mutations received by mutation queries are not differing, they are compacted already. The fix adds compacting step before writing and digesting query results by reusing the algorithm used by mutation query. This is not the most optimal way to fix this. The compaction step could be folded with the query writing, there is redundancy in both steps. However such change carries more risk, and thus was postponed. perf_simple_query test (cassandra-stress-like partitions) shows regression from 83k to 77k (7%) ops/s. Fixes #1165."	2016-04-08 12:13:29 +03:00
Pekka Enberg	47a904c0f6	Merge "gossip: Introduce SUPPORTED_FEATURES" from Asias "There is a need to have an ability to detect whether a feature is supported by entire cluster. The way to do it is to advertise feature availability over gossip and then each node will be able to check if all other nodes have a feature in question. The idea is to have new application state SUPPORTED_FEATURES that will contain set of strings, each string holding feature name. This series adds API to do so. The following patch on top of this series demostreates how to wait for features during boot up. FEATURE1 and FEATURE2 are introduced. We use wait_for_feature_on_all_node to wait for FEATURE1 and FEATURE2 successfully. Since FEATURE3 is not supported, the wait will not succeed, the wait will timeout. --- a/service/storage_service.cc +++ b/service/storage_service.cc @@ -95,7 +95,7 @@ sstring storage_service::get_config_supported_features() { // Add features supported by this local node. When a new feature is // introduced in scylla, update it here, e.g., // return sstring("FEATURE1,FEATURE2") - return sstring(""); + return sstring("FEATURE1,FEATURE2"); } std::set<inet_address> get_seeds() { @@ -212,6 +212,11 @@ void storage_service::prepare_to_join() { // gossip snitch infos (local DC and rack) gossip_snitch_info().get(); + gossiper.wait_for_feature_on_all_node(std::set<sstring>{sstring("FEATURE1"), sstring("FEATURE2")}, std::chrono::seconds(30)).get(); + logger.info("Wait for FEATURE1 and FEATURE2 done"); + gossiper.wait_for_feature_on_all_node(std::set<sstring>{sstring("FEATURE3")}).get(); + logger.info("Wait for FEATURE3 done"); + We can query the supported_features: cqlsh> SELECT supported_features from system.peers; supported_features -------------------- FEATURE1,FEATURE2 FEATURE1,FEATURE2 (2 rows) cqlsh> SELECT supported_features from system.local; supported_features -------------------- FEATURE1,FEATURE2 (1 rows)"	2016-04-08 09:22:50 +03:00
Benoît Canet	7c99ecf16f	scylla_setup: Check if scylla-jmx is installed Signed-of-by: Benoît Canet <benoit@scylladb.com> Fixes #1107 Message-Id: <1460045692-815-1-git-send-email-benoit@scylladb.com>	2016-04-08 09:03:38 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Tomasz Grabiec	474a35ba6b	tests: Add test for query digest calculation	2016-04-07 19:57:19 +02:00
Tomasz Grabiec	4418da77e6	tests: mutation_source: Include random mutations in generate_mutation_sets() result Probably increases coverage.	2016-04-07 19:57:19 +02:00
Tomasz Grabiec	5d768d0681	tests: mutation_test: Move mutation generator to mutation_source_test.hh So that it can be reused.	2016-04-07 19:57:19 +02:00
Tomasz Grabiec	30d25bc47a	tests: mutation_test: Add test case for querying of expired cells	2016-04-07 19:57:19 +02:00
Tomasz Grabiec	58bbd4203f	partition_slice_builder: Add new setters	2016-04-07 19:57:19 +02:00
Tomasz Grabiec	7cd8e61429	tests: result_set_assertions: Add and_only_that()	2016-04-07 19:57:19 +02:00
Tomasz Grabiec	f15c380a4f	database: Compact mutations when executing data queries Currently data query digest includes cells and tombstones which may have expired or be covered by higher-level tombstones. This causes digest mismatch between replicas if some elements are compacted on one of the nodes and not on others. This mismatch triggers read-repair which doesn't resolve because mutations received by mutation queries are not differing, they are compacted already. The fix adds compacting step before writing and digesting query results by reusing the algorithm used by mutation query. This is not the most optimal way to fix this. The compaction step could be folded with the query writing, there is redundancy in both steps. However such change carries more risk, and thus was postponed. perf_simple_query test (cassandra-stress-like partitions) shows regression from 83k to 77k (7%) ops/s. Fixes #1165.	2016-04-07 19:56:58 +02:00
Tomasz Grabiec	e4e8acc946	mutation_query: Extract main part of mutation_query() into more generic querying_reader So that it can be reused in query()	2016-04-07 19:03:04 +02:00
Takuya ASADA	ed7a3beed2	dist/ubuntu: drop unused scripts This was used when we didn't shared scripts between CentOS/Fedora and Ubuntu, but used anymore so drop them. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1459408583-13497-1-git-send-email-syuu@scylladb.com>	2016-04-06 08:21:09 +03:00
Asias He	d5dce8016b	storage_service: Advertise supported_features into cluster Advertise features supported by this node, so that other nodes can know this info. For example, on a 3 node cluster with supported_features == FEATURE1 and FEATURE2, it looks like: cqlsh> SELECT supported_features from system.peers; supported_features -------------------- FEATURE1,FEATURE2 FEATURE1,FEATURE2 (2 rows) cqlsh> SELECT supported_features from system.local; supported_features -------------------- FEATURE1,FEATURE2 (1 rows)	2016-04-06 07:12:34 +08:00
Asias He	0e1738943d	storage_service: Add supported_features into system.peers table	2016-04-06 07:12:34 +08:00
Asias He	50bcfe569a	system_keyspace: Add supported_features into system.local table	2016-04-06 07:12:34 +08:00
Asias He	b710a5f9ee	storage_service: Introduce get_config_supported_features It tells features supported by this local node. When new feature is introduced in scylla, update features returned by get_config_supported_features, e.g., return sstring("FEATURE1,FEATURE2")	2016-04-06 07:12:34 +08:00
Asias He	e0a82a1107	gossip: Add supported_features helper in versioned_value Give a supported features sstring, return a versioned_value for it.	2016-04-06 07:12:34 +08:00
Asias He	214c0f72b2	db: Add supported_features column in system.local and system.peers table	2016-04-06 07:12:34 +08:00
Asias He	04e8727793	gossip: Introduce wait_for_feature_on_{all}_node API to wait for features are available on a node or all the nodes in the cluster. $timeout specifies how long we want to wait. If the features are not availabe yet, sleep 2 seconds and retry.	2016-04-06 07:12:34 +08:00
Asias He	1e437e925c	gossip: Introduce get_supported_features - Get features supported by this particular node std::set<sstring> get_supported_features(inet_address endpoint) const; - Get features supported by all the nodes this node knows about std::set<sstring> get_supported_features() const;	2016-04-06 07:12:34 +08:00
Asias He	a6080773b3	gossip: Add SUPPORTED_FEATURES application_state It is used to negotiate cluster wide features.	2016-04-06 07:12:34 +08:00
Piotr Jastrzebski	d3f91eec61	Implement tuple_type_impl::from_string This is a fix for: https://github.com/scylladb/scylla/issues/574 It mirrors the behavior of: org.apache.cassandra.db.marshal.TupleType.java#fromString Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <24a7d6253727d0faebb1df117c2f52410523d42f.1459843091.git.piotr@scylladb.com>	2016-04-05 16:00:18 +03:00
Vlad Zolotarov	2daaa00c4f	conf: resurrect the important text related to endpoint_snitch configuration commit `d1b44cef1b` removed an important part of a comment related to an 'endpoint_snitch' configuration. This patch puts it back. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Message-Id: <1459858934-12005-1-git-send-email-vladz@cloudius-systems.com>	2016-04-05 15:23:13 +03:00
Raphael S. Carvalho	e15ce5eb4d	api: Add support to get column family compression ratio After this change, user can query compression ratio on a per column family basis with 'nodetool cfstats'. look at 'nodetool cfstats' output: ./bin/nodetool cfstats ks.test5 Keyspace: ks Read Count: 0 Read Latency: NaN ms. Write Count: 0 Write Latency: NaN ms. Pending Flushes: 0 Table: test5 SSTable count: 1 Space used (live): 4774 Space used (total): 4774 Space used by snapshots (total): 0 Off heap memory used (total): 131384 SSTable Compression Ratio: 0.833333 ... Fixes #636. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <a1bee5a23fe63787df3e387a88f2d216ba4a4134.1459802771.git.raphaelsc@scylladb.com>	2016-04-05 12:46:40 +03:00
Asias He	d1b44cef1b	conf: Drop duplicated section for endpoint_snitch endpoint_snitch is supported and it is the "Supported Parameters". Remove the duplicated section in "Unsupported parameters". Message-Id: <f8260b72558305f9186c011b8f8f452b3b91339b.1459325982.git.asias@scylladb.com>	2016-04-05 08:48:48 +03:00
Pekka Enberg	32471fcb96	Merge "Do batch log replay in decommission" from Asias "batchlog_manager is modified to allow the storage_service to initate a bachlog replay operation. Refs #1085. Tested with tests/batchlog_manager_test and batch_test.py"	2016-04-05 08:42:47 +03:00
Gleb Natapov	70575699e4	commitlog, sstables: enlarge XFS extent allocation for large files With big rows I see contention in XFS allocations which cause reactor thread to sleep. Commitlog is a main offender, so enlarge extent to commitlog segment size for big files (commitlog and sstable Data files). Message-Id: <20160404110952.GP20957@scylladb.com>	2016-04-04 14:15:00 +03:00
Amnon Heiman	725231a7a0	api: set the api_doc before registering any api This is a left over from the re ordering of the API init. The api_doc should be set first, so later API registration will enable their relevent swagger doc. Currently, the swagger documentation of the system API is not available. Fixes #1160 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1459750490-15996-1-git-send-email-amnon@scylladb.com>	2016-04-04 11:37:59 +03:00
Avi Kivity	6a3cf4ac41	cql: unlock ALTER TABLE syntax It was marked experimental for 1.0, but will be fully supported in the next release. Message-Id: <1459707946-5860-1-git-send-email-avi@scylladb.com>	2016-04-04 11:36:11 +03:00
Piotr Jastrzebski	613e7d8618	Add more info to wrong RPC address error If listening on RPC address failed then report IP address and port in the error message. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <c4db3527df2ce6dccb3b619584ee3fcb1e70ffd1.1459512258.git.piotr@scylladb.com>	2016-04-03 12:57:19 +03:00
Takuya ASADA	cad5edc53b	dist: fix build error at copy symlinks Both build_rpm.sh and build_deb.sh will fail with "cannot stat 'xxx': No such file or directory" when scylla-server package is not installed, need to prevent it by --no-dereference option of cp. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1459523585-9108-1-git-send-email-syuu@scylladb.com>	2016-04-03 12:49:55 +03:00
Tomasz Grabiec	0fc4c36952	tests: sstable_mutation_test: Compare keys not representations Representation is opaque at this level of abstraction. Reviewed-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1459508193-7086-1-git-send-email-tgrabiec@scylladb.com>	2016-04-03 11:39:03 +03:00
Nadav Har'El	6c4ee49bd3	sstables: another test for range tombstone merging This is another unit test for range tombstone merging, introduced in commit `0fc9a5ee4d` and rewritten in commit `99ecda3c96`. In this test, a single large deletion was broken up into several smaller ranges, all with the same time stamps, so we should recombine them into one row tombstone, instead of failing the read. The sstable in this test case was artificially created using json2sstable. We don't know how yet to produce such a case using Cassandra 2, but we have seen a similar occurance in the wild, in a real SSTable. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1459429243-15821-1-git-send-email-nyh@scylladb.com>	2016-04-01 11:55:14 +02:00
Takuya ASADA	d59c1c7648	dist/redhat: drop very old %pre script These lines are needed for very old version of scylla, not for 1.0. Can be removed now. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1459177601-20269-1-git-send-email-syuu@scylladb.com>	2016-04-01 09:41:18 +03:00
Pekka Enberg	b9a1aef670	Merge "Random exception safety fixes" from Paweł "These patches fix some of the problems found by randomly injecting memory allocation failures."	2016-04-01 08:58:00 +03:00
Paweł Dziepak	8f78b8e190	log: ignore logging exceptions Logging is used in many places including those that shouldn't really throw any exceptions (destructors, noexcept functions). Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-03-31 16:43:32 +01:00
Paweł Dziepak	c8159eca52	commitlog: make sure that segment destructor doesn't throw Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-03-31 16:42:56 +01:00
Paweł Dziepak	3e0555809e	storage_proxy: catch all exceptions in read executor abstract_read_executor::reconcile() is supposed to make sure that _result_promise is eventually set to either a result or an exception. That may not happen however if reconciliation throws any exception since only read timeouts are being caught. When that happends the continuation chain becomes stuck. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-03-31 16:38:41 +01:00
Paweł Dziepak	3c107c4b05	sstables: remove HyperLogLog throw() specifier HyperLogLog constructor promises that it only throws instances of std::invalid_argument. That's a lie since it also adds elements to a vector (and doesn't catch potential bad_allocs). Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-03-31 16:36:53 +01:00
Avi Kivity	417bcb122d	commitlog: ignore commitlog segments generated by Cassandra-derived tools Cassandra-derived tools (such as sstable2json) may write commitlog segments, that Scylla cannot recognize. Since we now write them with a distinct name, we can recognize the name and ignore these segments, as we know the data they contain is not interesting. Fixes #1112. Message-Id: <1459356904-20699-1-git-send-email-avi@scylladb.com>	2016-03-31 16:01:08 +03:00
Nadav Har'El	78c9f49585	sstables: Move check_marker() to source file The check_marker() function is use as a sanity-check of data we read from sstable, so instead of the header file key.hh, let's move it to the sstable-parsing source file partition.cc. In addition to having less code in header files, another benefit is that the function can now throw a more specific exception (malformed sstable exception). Also fixed the exception's message (which had a second "%d" but only one parameter). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1459420430-5968-1-git-send-email-nyh@scylladb.com>	2016-03-31 14:22:51 +03:00
Nadav Har'El	99ecda3c96	sstables: overhaul range tombstone reading Until recently, we believed that range tombstones we read from sstables will always be for entire rows (or more generalized clustering-key prefixes), not for arbitrary ranges. But as we found out, because Cassandra insists that range tombstones do not overlap, it may take two overlapping row tombstones and convert them into three range tombstones which look like general ranges (see the patch for a more detailed example). Not only do we need to accept such "split" range tombstones, we also need to convert them back to our internal representation which, in the above example, involves two overlapping tombstones. This is what this patch does. This patch also contains a test for this case: We created in Cassandra an sstable with two overlapping deletions, and verify that when we read it to Scylla, we get these two overlapping deletions - despite the sstable file actually having contained three non-overlapping tombstones. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <b7c07466074bf0db6457323af8622bb5210bb86a.1459399004.git.glauber@scylladb.com>	2016-03-31 12:49:50 +03:00
Pekka Enberg	2629389d5d	dist/docker/ubuntu: Use bash in start-scylla script The default shell in Ubuntu is "dash" which causes the following error when "scylla-start" script is executed: /start-scylla: 8: /start-scylla: source: not found Message-Id: <1459406561-20141-1-git-send-email-penberg@scylladb.com>	2016-03-31 11:21:36 +03:00

1 2 3 4 5 ...

9082 Commits