scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-04 22:13:19 +00:00

Author	SHA1	Message	Date
Piotr Jastrzebski	ec51c8e1b8	Fix after free access bug in storage proxy Due to speculative reads we can't guarantee that all fibers started by storage_proxy::query will be finished by the time the method returns a result. We need to make sure that no parameter passed to this method ever changes. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <31952e323e599905814b7f378aafdf779f7072b8.1471005642.git.piotr@scylladb.com> (cherry picked from commit `f212a6cfcb`) [tgrabiec: resolved trivial conflict]	2016-08-12 16:38:21 +02:00
Duarte Nunes	e9b7352adb	storage_service: Fix get_range_to_address_map_in_local_dc This patch fixes a couple of bugs in get_range_to_address_map_in_local_dc. Fixes #1517 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1469782666-21320-1-git-send-email-duarte@scylladb.com> (cherry picked from commit `7d1b7e8da3`)	2016-07-29 11:24:50 +02:00
Pekka Enberg	e5d24d5940	service/storage_service: Make do_isolate_on_error() more robust Currently, we only stop the CQL transport server. Extract a stop_transport() function from drain_on_shutdown() and call it from do_isolate_on_error() to also shut down the inter-node RPC transport, Thrift, and other communications services. Fixes #1353 (cherry picked from commit `d72c608868`) Conflicts: service/storage_service.cc (cherry picked from commit `7e052a4e91`)	2016-06-16 14:01:33 +03:00
Gleb Natapov	15ad4c9033	storage_proxy: drop debug output Message-Id: <20160601132641.GK2381@scylladb.com> (cherry picked from commit `26b50eb8f4`)	2016-06-01 17:14:32 +03:00
Avi Kivity	3f6ecb9f28	Merge "cancel cross DC read repair if non matching data was recently modified" from Gleb	2016-05-29 15:58:55 +03:00
Gleb Natapov	2efbccc901	storage_proxy: do only local read repair if non matching data was recently modified When read/write to a partition happens in parallel reader may detect digest mismatch that may potentially cause cross DC read repair attempt, but the repair is not really needed, so added latency is not justified. This patch tries to prevent such parallel access from causing heavy cross DC repair operation buy checking a timestamp of most resent modification. If the modification happens less then "write timeout" seconds ago the patch assumes that the read operation raced with write one and cancel cross DC repair, but only if CL is LOCAL_*.	2016-05-29 15:26:51 +03:00
Asias He	f1b3cb4a08	storage_service: Catch and fail an invalid configuration with --replace-address Vlad reported a strange user configuration: SCYLLA_ARGS="--log-to-syslog 1 --log-to-stdout 0 --default-log-level info --collectd-address=127.0.0.1:25826 --collectd=1 --collectd-poll-period 60000 --network-stack posix --num-io-queues 32 --max-io-requests 128 --replace-address 10.0.4.131" seed_provider: - class_name: org.apache.cassandra.locator.SimpleSeedProvider parameters: - seeds: "10.0.4.131" In the mean while, 10.0.4.131 is the IP address of the node itself. When the node was started, the following message were reported. Apr 13 06:31:12 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (20 seconds passed) Apr 13 06:31:13 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (21 seconds passed) Apr 13 06:31:14 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (22 seconds passed) Apr 13 06:31:15 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (23 seconds passed) The configruation is invalid, becasue for --replace-address to work, at least one working seed node should be alive. Catch the configuration error and fail it with an appropriate error message. Fixes #1183 Message-Id: <a94a082d896313e7a668915ae21fe2c03719da3a.1464164058.git.asias@scylladb.com>	2016-05-25 14:42:19 +03:00
Pekka Enberg	ceb29f9d32	Merge "Introduce upload dir for sstable migration" from Raphael "This change is intended to make migration process safer and easier. All column families will now have a directory called upload. With this feature, users may choose to copy migrated sstables to upload directory of respective column families, and run 'nodetool refresh'. That's supposed to be the preferred option from now on."	2016-05-24 16:36:47 +03:00
Gleb Natapov	12cf60c302	messaging_service: add timestemp of last modification to READ_DIGEST verb return value	2016-05-24 13:27:34 +03:00
Avi Kivity	9637c2232c	Merge "Move the JMX timer polling logic to Scylla" from Amnon	2016-05-24 13:07:52 +03:00
Raphael S. Carvalho	e5f0314afd	db: introduce upload directory for sstable migration This change is intended to make migration process safer and easier. All column families will now have a directory called upload. With this feature, users may choose to copy migrated sstables to upload directory of respective column families, and call 'nodetool refresh'. That's supposed to be the preferred option from now on. For each sstable in upload directory, refresh will do the following: 1) Mutate sstable level to 0. 2) Create hard links to its components in column family dir, using a new generation. We make it safe by creating a hard link to temporary TOC first. 3) Remove all of its components in upload directory. This new code runs after refresh checked for new sstables in the column family directory. Otherwise, we could have a generation conflict. Unlike the first step, this new step runs with sstable write enabled. It's easier here because we know exactly which sstables are new. After that, refresh will load new sstables found in column family and upload directories. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-20 17:26:21 -03:00
Amnon Heiman	64e0c8cd1b	storage_proxy: Change histogram to timed_rate_moving_average_and_histogram As part of moving the derived statistic in to scylla, this replaces the histogram object in the storage_proxy to timed_rate_moving_average_and_histogram. and the read, write and range counters where replaced by rate_moving_average. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:52:16 +03:00
Piotr Jastrzebski	dcba6f5c45	Pass clustering_row_ranges to mutation readers. This will allow readers to reduce the amount of data read. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 14:36:57 +02:00
Tomasz Grabiec	1eabe9b840	storage_proxy: Add trace-level logging for mutating Message-Id: <1462978554-31217-1-git-send-email-tgrabiec@scylladb.com>	2016-05-12 13:52:56 +03:00
Tomasz Grabiec	7207cc8b1a	storage_proxy: Improve error reporting Knowing the source node can help in debugging the issue. Message-Id: <1462978535-31164-1-git-send-email-tgrabiec@scylladb.com>	2016-05-12 13:52:39 +03:00
Pekka Enberg	b5d9aa866d	Merge "Fixes for schema synchronization" from Tomek "Writes may start to be rejected by replicas after issuing alter table which doesn't affect columns. This affects all versions with alter table support. Fixes #1258"	2016-05-12 09:43:25 +03:00
Duarte Nunes	7dbeef3c39	storage_service: Fix ignored future in on_alive This patch ensures the future created by invoke_on_all is not ignored by waiting on it, which is safe to do since we are within a seastar::async context. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1462989837-7326-1-git-send-email-duarte@scylladb.com>	2016-05-12 09:03:46 +03:00
Tomasz Grabiec	8703136a4f	migration_manager: Fix schema syncing with older version The problem was that "s" would not be marked as synced-with if it came from shard != 0. As a result, mutation using that schema would fail to apply with an exception: "attempted to mutate using not synced schema of ..." The problem could surface when altering schema without changing columns and restarting one of the nodes so that it forgets past versions. Fixes #1258. Will be covered by dtest: SchemaManagementTest.test_prepared_statements_work_after_node_restart_after_altering_schema_without_changing_columns	2016-05-11 17:29:14 +02:00
Calle Wilund	63b6c6bb5a	migration_manager: Implement announce_keyspace_update More or less the same as create keyspace...	2016-05-10 14:34:51 +00:00
Calle Wilund	437ebe7128	cql_server: Use credentials_builder to init tls Slightly cleaner, and shard-safe tls init. Message-Id: <1462283265-27051-3-git-send-email-calle@scylladb.com>	2016-05-09 14:12:59 +03:00
Calle Wilund	709dd82d59	storage_service: Add logging to match origin Pointing out if CQL server is listing in SSL mode. Message-Id: <1462368016-32394-2-git-send-email-calle@scylladb.com>	2016-05-06 13:27:55 +03:00
Gleb Natapov	3039e4c7de	storage_proxy: stop range query with limit after the limit is reached	2016-05-02 15:10:15 +03:00
Gleb Natapov	41c586313a	storage_proxy: fix calculation of concurrency queried ranges	2016-05-02 15:10:15 +03:00
Gleb Natapov	c364ab9121	storage_proxy: add logging for range query row count estimation	2016-05-02 15:10:15 +03:00
Calle Wilund	cdd0f00de5	client_state: Remove unwarranted keyspace check "has_keyspace_access" is not supposed to (according to origin) verify that a keyspace exists. Remove. It (and all others) are however supposed to check "ks" (name) not empty. Add this. Message-Id: <1461578072-24113-1-git-send-email-calle@scylladb.com>	2016-04-25 13:16:36 +03:00
Pekka Enberg	f6da9bc92b	Merge "Additional mutations/queries related collectd metrics" from Vlad "This series introduces some additional metrics (mostly) in a storage_proxy and a database level that are meant to create a better picture of how data flows in the cluster. First of all where possible counters of each category (e.g. total writes in the storage proxy level) are split into the following categories: - operations performed on a local Node - operations performed on remote Nodes aggregated per DC In a storage_proxy level there are the following metrics that have this "split" nature (all on a sending side): - total writes (attempts/errors) - writes performed as a result of a Read Repair logic - total data reads (attempts/completed/errors) - total digest reads (attempts/completed/errors) - total mutations data reads (attempts/completed/errors) In a batchlog_manager: - writes performed as a result of a batchlog replay logic Thereby if for instance somebody wants to get an idea of how many writes the current Node performs due to user requested mutations only he/she has to take a counter of total writes and subtract the writes resulted by Read Repairs and batchlog replays. On a receiving side of a storage_proxy we add the two following counters: - total number of received mutations - total number of forwarded mutations (attempts/errors) In order to get a better picture of what is going on on a local Node we are adding two counters on a database level: - total number of writes - total number of reads Comparing these to total writes/reads in a storage_proxy may give a good idea if there is an excessive access to a local DB for example."	2016-04-21 15:58:45 +03:00
Pekka Enberg	3f1fcca3bc	cql3: Fix DROP KEYSPACE error message when keyspace does not exist Commit `d3fe0c5` ("Refactor db/keyspace/column_family toplogy") changed database::find_keyspace() to throw a std::nested_exception so the catch block in migration_manager::announce_keyspace_drop() no longer catches the exception. Fix the issue by explicitly checking if the keyspace exists and throwing the correct exception type if it doesn't. Fixes TestCQL.keyspace_test. Message-Id: <1461218910-26691-1-git-send-email-penberg@scylladb.com>	2016-04-21 12:42:45 +02:00
Vlad Zolotarov	9bf8253412	storage_proxy: add read requests split counters Add split (local Nodes, external Nodes aggregated per Nodes' DCs) counters for the following read categories: - data reads - digest reads - mutation data reads Each category is added attempts, completions and errors metrics. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-04-21 11:28:19 +03:00
Vlad Zolotarov	cbcbdc3b4a	storage_proxy: add split counters for writes Added split metrics for operations on a local Node and on external Nodes aggregated per Nodes' DCs. Added separate split counters for: - total writes attempts/errors - read repair write attempts (there is no easy way to separate errors at the moment) Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-04-21 11:28:15 +03:00
Vlad Zolotarov	c92654b281	storage_proxy: add counters for received and forwarded mutations Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-04-21 11:27:29 +03:00
Duarte Nunes	08a7bba4ed	udt: Announce UDT migrations This patch defines the member functions responsible for announce create, update and drop user defined types migration. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-04-20 09:54:06 +02:00
Duarte Nunes	37a1547971	udt: Add migration notifications This patch adds migration notifications for user defined types. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-04-20 09:54:06 +02:00
Calle Wilund	dac6cf69eb	service::client_state: Add authorization checkers	2016-04-19 11:49:05 +00:00
Gleb Natapov	9801d69d53	storage_proxy: add query result row count to brief format Report number of rows in brief reporting format, but only if we can count them without linearizing result's buffer.	2016-04-14 19:26:00 +03:00
Gleb Natapov	53993527ed	storage_proxy: move verbose query result printing into separate logger If query result is large tracing cannot be done since printing the result takes too much time and space.	2016-04-14 19:26:00 +03:00
Gleb Natapov	46e5d05220	storage_proxy: cleanup query logging. Since commit `c1cffd06` logger catch errors internally, so no need to catch most of them at the top level. Only those that can happen during parameter evaluation can reach here. Change parameters to not throw too.	2016-04-14 19:26:00 +03:00
Pekka Enberg	a1a9294d8c	Merge "Support nodetool removenode force and status" from Asias "With this series, we support all the 3 nodetool removenode commands, e.g., $ nodetool removenode 778948bf-6709-4eb5-80fe-bee911e9c3bf $ nodetool removenode status RemovalStatus: Removing token (-8969872965815280276). Waiting for replication confirmation from [127.0.0.3,127.0.0.1]. $ nodetool removenode force RemovalStatus: No token removals in process. Tested with: 1) - start 3 nodes - inject data with cassandra-stress write no-warmup cl=TWO n=2000000 -schema 'replication(factor=2)' - kill -9 node2 - wait for node2 to be in DOWN state - run nodetool removenode host2_host_id on node1 2) - start 3 nodes - inject data with cassandra-stress write no-warmup cl=TWO n=2000000 -schema 'replication(factor=2)' - kill -9 node2 - wait for node2 to be in DOWN state - run nodetool removenode host2_host_id on node1 - kill -9 node3 - nodetool removenode will wait forever since node3 is gonne, node3 will never send the replication confirmation to node1 - run nodetool removenode force on node1 nodetool removenode completes with the following error: $ nodetool removenode 31690b82-ebb0-4594-8bcf-1ce82b6e0f6e nodetool: Scylla API server HTTP POST to URL '/storage_service/remove_node' failed: nodetool removenode force is called by user nodetool removenode force completes sucessfully $ nodetool removenode force RemovalStatus: Removing token (-9171569494049085776). Waiting for replication confirmation from [127.0.0.3,127.0.0.1]. Fixes #1135."	2016-04-14 15:44:33 +03:00
Gleb Natapov	6f13715f8c	storage_proxy: add logging to read executor creation path Message-Id: <1460549369-29523-4-git-send-email-gleb@scylladb.com>	2016-04-14 14:58:02 +03:00
Gleb Natapov	14ecadb247	storage_proxy: add logging for mutation write path Message-Id: <1460549369-29523-3-git-send-email-gleb@scylladb.com>	2016-04-14 14:57:29 +03:00
Gleb Natapov	dfdbb1e703	storage_proxy: move hack to make coordinator most preferable node for read into sorting function This is kind of sorting, so it belongs there, but it also fixes a bug in storage_proxy::get_read_executor() that assumes filter_for_query() do not change order of nodes in all_nodes when extra replica is chosen. Otherwise if coordinator ip happens to be last in all_nodes then it will be chosen as extra replica and will be quired twice. Message-Id: <1460549369-29523-1-git-send-email-gleb@scylladb.com>	2016-04-14 14:56:21 +03:00
Asias He	891e947314	storage_service: Rename remove_node to removenode nodetool uses removenode command to remove a node. Rename the implementation in storage_service to match the command.	2016-04-13 14:53:28 +08:00
Asias He	9ffb95216d	storage_service: Add force_remove_completion It is needed by the $ nodetool removenode force command.	2016-04-13 14:53:28 +08:00
Asias He	7c7e5967f6	storage_service: Add get_removal_status It is needed by the $ nodetool removenode status command.	2016-04-13 14:53:28 +08:00
Asias He	8d7cd07d6c	storage_service: Add print info in confirm_replication The message is rare but it is very useful to debug removenode operation.	2016-04-13 14:53:28 +08:00
Pekka Enberg	64c9ebb962	Merge "More exception safety fixes" from Paweł "This is the second part of exception safety fixes for issues discovered using memory allocation failure injector."	2016-04-12 08:08:00 +03:00
Paweł Dziepak	d53354947c	storage_proxy: mark hint_to_dead_endpoints() noexcept Hints are currently unimplemented but there is code depending on the fact that hint_to_dead_endpoints() doesn't throw. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-04-12 00:06:10 +01:00
Paweł Dziepak	b75c4098f2	storage_proxy: catch all errors in abstract_read_executor::execute() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-04-11 23:52:13 +01:00
Gleb Natapov	3734dcbace	storage_proxy: cleanup data_read_resolver::resolve() live_row_count is summed several times in the same function. Do it only once. -- v1->v2: - call get() on std::reference_wrapper<std::vector<partition>> to get to reference for moving out of it. Message-Id: <20160411123829.GE21479@scylladb.com>	2016-04-11 17:13:48 +02:00
Calle Wilund	7ebac35779	client_state: break up setting login/validation transport::server uses client_state in a move-temporary-around fashion. Having a setter that does continuation-bound validation makes this messier. Break them up to separate "this" placement from the actual validation continuation logic	2016-04-11 09:10:41 +00:00
Calle Wilund	83e2604bc6	client_state: Propagate login user in merge	2016-04-11 09:10:41 +00:00

1 2 3 4 5 ...

804 Commits