scylladb

Author	SHA1	Message	Date
Gleb Natapov	c9bd069815	messaging_service: log rpc errors Message-Id: <20160125155005.GC23862@scylladb.com>	2016-01-25 17:59:26 +02:00
Asias He	ad80916905	messaging_service: Add streaming implementation for idl - stream_request - stream_summary - prepare_message Please enter the commit message for your changes. Lines starting	2016-01-25 22:36:58 +08:00
Avi Kivity	5c5207f122	Merge "Another round of streaming cleanup" from Asias "- Merge stream_init_message and stream_parepare_message - Drop session_index / keep_ss_table_level / file_message_header"	2016-01-25 12:54:30 +02:00
Asias He	77684a5d4c	messaging_service: Drop STREAM_INIT_MESSAGE The verb is not used anymore. Message-Id: <1453719054-29584-1-git-send-email-asias@scylladb.com>	2016-01-25 12:53:08 +02:00
Asias He	53c6cd7808	gossip: Rename echo verb to gossip_echo It is used by gossip only. I really could not allow myself to get along this inconsistence. Change before we still can. Message-Id: <1453719054-29584-2-git-send-email-asias@scylladb.com>	2016-01-25 12:53:07 +02:00
Asias He	ad4a096b80	streaming: Get rid of stream_init_message Unlike streaming in c*, scylla does not need to open tcp connections in streaming service for both incoming and outgoing messages, seastar::rpc does the work. There is no need for a standalone stream_init_message message in the streaming negotiation stage, we can merge the stream_init_message into stream_prepare_message.	2016-01-25 16:24:16 +08:00
Asias He	4ce08ff251	messaging_service: Add heart_beat_state implementation	2016-01-25 11:28:29 +08:00
Asias He	ecca969adf	messaging_service: Add gossip::endpoint_state implementation	2016-01-25 11:28:29 +08:00
Asias He	2a0b6589dd	messaging_service: Add versioned_value implementation	2016-01-25 11:28:29 +08:00
Asias He	15f2b353b9	messaging_service: Add gossip_digest implementation	2016-01-25 11:28:29 +08:00
Asias He	d81fc12af3	messaging_service: Add gossip_digest_ack2 implementation	2016-01-25 11:28:29 +08:00
Asias He	e67cecaee1	messaging_service: Add gossip_digest_syn implementation	2016-01-25 11:28:29 +08:00
Avi Kivity	65a140481c	Merge " streaming COMPLETE_MESSAGE failure and message retry logic fix" from Asias "This series: - Add more debug info to stream session - Fail session if we fail to send COMPLETE_MESSAGE - Handle message retry logic for verbs used by streaming See commit log for details."	2016-01-24 16:41:06 +02:00
Gleb Natapov	067bdb23cd	Move reconcilable_result and frozen_mutation to idl	2016-01-24 12:45:41 +02:00
Gleb Natapov	18dff5ebc8	Move smart pointer serialization helpers to .cc file. They are not used outside of the .cc file, so should not be in the header.	2016-01-24 12:45:41 +02:00
Gleb Natapov	93da9b2725	Remove redundant vector serialization code. IDL serializer has the code to serialize vectors, so use it instead.	2016-01-24 12:45:41 +02:00
Gleb Natapov	afc407c6e5	Move query::result to use idl.	2016-01-24 12:45:41 +02:00
Gleb Natapov	4ae906b204	Add serializer overload for query::partition_range. From now on query::partition_range will use generated code.	2016-01-24 12:45:41 +02:00
Gleb Natapov	2d1b2765e6	Add serializer overload for query::read_command. From now on query::read_command will use generated code.	2016-01-24 12:45:41 +02:00
Amnon Heiman	577ce0d231	Adding a sepcific template initialization in messaging_service to use the serializer This patch adds a specific template initialization so that the rpc would use the serializer and deserializer that are auto-generated. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-01-24 12:29:21 +02:00
Asias He	7ac3e835a6	messaging_service: Fix send_message_timeout_and_retry When a verb timeout, if we resend the message again, the peer could receive the message more than once. This would confuse the receiver. Currently, only the streaming code use the retry logic. - In case of rpc:timeout_error: Instead of doing timeout in a relatively short time and resending a few times, we make the timeout big enough and let the tcp to do the resend. Thus, we can avoid resending the message more than once, of course, the receiver would not receive the message more than once. - In case of rpc::closed_error: There are two cases: 1) Failing to establish a connection. For instance, the peer is down. It is safe to resend since we know for sure the receiver hasn't received the message yet. 2) The connection is established. We can not figure out if the remote peer have received the message already or not upon receiving the rpc::closed_error exception. Currently, we still sleep & resend the message again, so the receiver might receive the message more than once. We do not have better choice in this case, if we want the resend to recover the sending error due to temporary network issue, since failing the whole stream_session due to failing to send a single message is not wise. NOTE: If the duplicated message is received when the stream_session is done, it will be ignored since it can not find the stream_manager anymore. For message like, STREAM_MUTATION, it is ok to receive twice (we apply the mutation twice). TODO: For other messages which uses the retry logic, we need to make sure it is ok to receive more than once.	2016-01-22 08:20:48 +08:00
Avi Kivity	221ef4536c	messaging service: limit rpc server resources Otherwise, a slow node can be overwhelmed by other nodes and run out of memory. Fixes #596. Message-Id: <1452776394-13682-1-git-send-email-avi@scylladb.com>	2016-01-18 11:16:45 +02:00
Avi Kivity	d5050e4c6a	storage_proxy: make MUTATION and MUTATION_DONE verbs sychronous at the server side While MUTATION and MUTATION_DONE are asynchronous by nature (when a MUTATION completes, it sends a MUTATION_DONE message instead of responding synchronously), we still want them to be synchronous at the server side wrt. the RPC server itself. This is because RPC accounts for resources consumed by the handler only while the handler is executing; if we return immediately, and let the code execute asynchronously, RPC believes no resources are consumed and can instantiate more handlers than the shard has resources for. Fix by changing the return type of the handlers to future<no_wait_type> (from a plain no_wait_type), and making that future complete when local processing is over. Ref #596. Message-Id: <1453048967-5286-1-git-send-email-avi@scylladb.com>	2016-01-18 09:59:34 +02:00
Tomasz Grabiec	e88f41fb3f	messaging_service: Move REPAIR_CHECKSUM_RANGE verb out of the streaming verbs group Message-Id: <1452620321-17223-1-git-send-email-tgrabiec@scylladb.com>	2016-01-12 20:17:08 +02:00
Vlad Zolotarov	9232ad927f	messaging_service::get_rpc_client(): fix the encryption logic According to specification (here https://wiki.apache.org/cassandra/InternodeEncryption) when the internode encryption is set to `dc` the data passed between DCs should be encrypted and similarly, when it's set to `rack` the inter-rack traffic should encrypted. Currently Scylla would encrypt the traffic inside a local DC in the first case and inside the local RACK in the later one. This patch fixes the encryption logic to follow the specification above. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Message-Id: <1452501794-23232-1-git-send-email-vladz@cloudius-systems.com>	2016-01-12 16:22:26 +02:00
Tomasz Grabiec	e1e8858ed1	service: Fetch and sync schema	2016-01-11 10:34:53 +01:00
Tomasz Grabiec	cdca20775f	messaging_service: Introduce get_source()	2016-01-11 10:34:53 +01:00
Tomasz Grabiec	da3a453003	service: Add GET_SCHEMA_VERSION remote call The verb belongs to a seaprate client to avoid potential deadlocks should the throttling on connection level be introduced in the future. Another reason is to reduce latency for version requests as it can potentially block many requests.	2016-01-11 10:34:52 +01:00
Asias He	2345cda42f	messaging_service: Rename shard_id to msg_addr Use shard_id as the destination of the messaging_service is confusing, since shard_id is used in the context of cpu id. Message-Id: <8c9ef193dc000ef06f8879e6a01df65cf24635d8.1452155241.git.asias@scylladb.com>	2016-01-07 10:36:35 +02:00
Nadav Har'El	f5b2135a80	repair: repair_checksum_range message This patch adds a new type of message, "REPAIR_CHECKSUM_RANGE" to scylla's "messaging_service" RPC mechanism, for the use of repair: With this message the repair's master host tells a slave host to calculate the checksum of a column-family's partitions in a given token range, and return that checksum. The implementation of this message uses the checksum_range() function defined in the previous patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2016-01-05 15:38:40 +02:00
Gleb Natapov	fae98f5d67	Revert "messaging_service: wait for outstanding requests" This reverts commit `9661d8936b`. Message-Id: <1450690729-22551-3-git-send-email-gleb@scylladb.com>	2016-01-03 16:06:39 +02:00
Gleb Natapov	de0771f1d1	Revert "messaging_service: restore indentation" This reverts commit `dcbba2303e`. Message-Id: <1450690729-22551-2-git-send-email-gleb@scylladb.com>	2016-01-03 16:06:38 +02:00
Asias He	1b3d2dee8f	streaming: Drop src_cpu_id parameter Now that we can get the src_cpu_id from rpc::client_info. No need to pass it as verb parameter.	2015-12-31 11:25:09 +01:00
Asias He	3ae21e06b5	messaging_service: Add src_cpu_id to CLIENT_ID verb It is useful to figure out which shard to send messages back to the sender.	2015-12-31 11:25:09 +01:00
Asias He	22d0525bc0	streaming: Get rid of the _from_ parameter Get this from cinfo.retrieve_auxiliary inside the rpc handler.	2015-12-31 11:25:08 +01:00
Asias He	89b79d44de	streaming: Get rid of the _connecting_ parameter messaging_service will use private ip address automatically to connect a peer node if possible. There is no need for the upper level like streaming to worry about it. Drop it simplifies things a bit.	2015-12-31 11:25:08 +01:00
Gleb Natapov	2bcfe02ee6	messaging: remove unused verbs	2015-12-30 15:06:35 +01:00
Gleb Natapov	f0e8b8805c	messaging: constify some handlers	2015-12-30 15:06:35 +01:00
Calle Wilund	d1badfa108	messaging_service: Optionally create SSL endpoints * Accept port + credentials + option for what to encrypt * If set, enable a SSL listener at ssl_port * Check outgoing connections by IP to determine if they should go to SSL/normal endpoint Requires seastar RPC patch Note: currently, the connections created by messaging service does _not_ do certificate name verification. While DNS lookup is probably not that expensive here, I am not 100% sure it is the desired behaviour. Normal trust is however verified.	2015-12-28 10:10:35 +00:00
Avi Kivity	827a4d0010	Merge "streaming: Invalidate cache upon receiving of stream" from Asias "When a node gain or regain responsibility for certain token ranges, streaming will be performed, upon receiving of the stream data, the row cache is invalidated for that range. Refs #484."	2015-12-28 10:24:46 +02:00
Avi Kivity	2b22772e3c	Merge "Introduce keep alive timer for stream_session" from Asias "Fixes stream_session hangs: 1) if the sending node is gone, the receiving peer will wait forever 2) if the node which should send COMPLETE_MESSAGE to the peer node is gone, the peer node will wait forever"	2015-12-27 16:56:32 +02:00
Avi Kivity	f3980f1fad	Merge seastar upstream * seastar 51154f7...8b2171e (9): > memcached: avoid a collision of an expiration with time_point(-1). > tutorial: minor spelling corrections etc. > tutorial: expand semaphores section > Merge "Use steady_clock where monotonic clock is required" from Vlad > Merge "TLS fixes + RPC adaption" from Calle > do_with() optimization > tutorial: explain limiting parallelism using semaphores > submit_io: change pending flushes criteria > apps: remove defunct apps/seastar Adjust code to use steady_clock instead of high_resolution_clock.	2015-12-27 14:40:20 +02:00
Asias He	f527e07be6	streaming: Get stream_session in STREAM_MUTATION handler Get from address from cinfo. It is needed to figure out which stream session this mutation is belonged to, since we need to update the keep alive timer for this stream session.	2015-12-24 20:34:44 +08:00
Asias He	bd276fd087	streaming: Increase retry timeout Currently, if the node is actually down, although the streaming_timeout is 10 seconds, the sending of the verb will return rpc_closed error immediately, so we give up in 20 * 5 = 100 seconds. After this change, we give up in 10 * 30 = 300 seconds at least, and 10 * (30 + 30) = 600 seconds at most.	2015-12-24 20:34:44 +08:00
Asias He	eaea09ee71	streaming: Retransmit COMPLETE_MESSAGE message It is oneway message at the moment. If a COMPLETE_MESSAGE is lost, no one will close the session. The first step to fix the issue is to try to retransmit the message.	2015-12-24 20:34:44 +08:00
Asias He	2d32195c32	streaming: Invalidate cache upon receiving of stream When a node gain or regain responsibility for certain token ranges, streaming will be performed, upon receiving of the stream data, the row cache is invalidated for that range. Refs #484.	2015-12-21 14:44:13 +08:00
Paweł Dziepak	dcbba2303e	messaging_service: restore indentation Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-12-17 14:06:41 +01:00
Paweł Dziepak	9661d8936b	messaging_service: wait for outstanding requests Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-12-17 14:06:41 +01:00
Avi Kivity	b34a1f6a84	Merge "Preliminary changes for handling of schema changes" from Tomasz "I extracted some less controversial changes on which the schema changes series will depend o somewhat reduce the noise in the main series."	2015-12-16 19:08:22 +02:00
Tomasz Grabiec	872bfadb3d	messaging_service: Remove unused parameters from send_migration_request()	2015-12-16 18:06:54 +01:00

1 2 3 4

160 Commits