"Here's another round of cleanups to the CQL code. Nothing exciting here,
mostly moving code to source files, which makes changing the code less
painful in terms of compilation times."
My plan was originally to have two separate sets of tests: one for the index,
and one for the data. With most of the code having ended up in the .hh file anyway,
this distinction became a bit pointless.
Let's put everything here.
Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>
"Initial implementation/transposition of commit log replay.
* Changes replay position to be shard aware
* Commit log segment IDs now follow basically the same scheme as Origin:
max(previous ID, wall clock time in ms), plus shard info (for us).
* Sstables now use the DB definition of replay_position.
* Stores and propagates (compaction) flush replay positions in sstables
* If CL segments are left over from a previous run, they and the existing
sstables are inspected for the high water mark, and the log is then replayed
from that mark to recover mutations potentially lost in a crash (a sketch of
the idea follows this list).
* Note that a CPU count change is "handled" only insofar as shard matching is
done against the _previous_ run's shards, not the current ones.
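For illustration, here is a rough, self-contained sketch of the high-water-mark
idea mentioned above; the types and names are toy stand-ins, not Scylla's actual
replay_position or commitlog classes:

// Sketch only: each sstable records the latest commit log position it covers;
// replay re-applies only entries written after the maximum of those marks.
#include <algorithm>
#include <cstdint>
#include <vector>

struct replay_position {
    uint64_t segment_id; // monotonically increasing commit log segment id
    uint32_t offset;     // byte offset within that segment
    bool operator<(const replay_position& o) const {
        return segment_id != o.segment_id ? segment_id < o.segment_id
                                          : offset < o.offset;
    }
};

// The high water mark is the latest position already made durable in sstables.
replay_position high_water_mark(const std::vector<replay_position>& sstable_marks) {
    replay_position hwm{0, 0};
    for (const auto& rp : sstable_marks) {
        hwm = std::max(hwm, rp);
    }
    return hwm;
}

// Entries at or before the mark are already in sstables and can be skipped.
bool should_replay(const replay_position& entry, const replay_position& hwm) {
    return hwm < entry;
}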
Known limitations:
* Mutations deserialized from old CL segments are _not_ fully validated
against existing schemas.
* System::truncated_at (not currently used) does not handle sharding as far
as I know, so watermark IDs coming from there are dubious.
* Mutations that fail to apply (invalid, broken) are not placed in blob files
as in Origin. Partly because I am lazy, but also partly because our serialization
format differs, and we currently have no tools to do anything useful with such files.
* No replay filtering (Origin allows a system property to designate a filter
file detailing which keyspaces/CFs to replay), partly because we have no
system properties.
There is no unit test for the commit log replayer (yet), because I could not
really come up with a good one given the existing test infrastructure (it is
tricky to kill things just "right").
The functionality is verified by manual testing, i.e., running scylla,
building up data (cassandra-stress), then kill -9 + restart.
This of course does not fully validate that the resulting DB is 100% identical
to the one at the moment of the kill -9, but it at least verifies that replay
took place and mutations were applied.
(Note that Origin also lacks validity testing.)"
Like boost::dynamic_bitset, but less capable. On the other hand, it avoids
the very large allocations incurred by the bloom filter's bitset on even
moderately sized sstables.
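A minimal sketch of the shape such a bitset can take (illustrative only, not
the actual class added here): the bits live in independently allocated,
fixed-size chunks, so even a huge bloom filter never needs one contiguous
multi-megabyte buffer.

#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

class chunked_bitset {
    static constexpr std::size_t bits_per_chunk = 128 * 1024 * 8; // 128 KiB chunks
    std::vector<std::unique_ptr<uint64_t[]>> _chunks;             // one small allocation each
    std::size_t _nbits;
public:
    explicit chunked_bitset(std::size_t nbits) : _nbits(nbits) {
        std::size_t nchunks = (nbits + bits_per_chunk - 1) / bits_per_chunk;
        for (std::size_t i = 0; i < nchunks; i++) {
            // make_unique value-initializes the words to zero
            _chunks.push_back(std::make_unique<uint64_t[]>(bits_per_chunk / 64));
        }
    }
    void set(std::size_t i) {
        _chunks[i / bits_per_chunk][(i % bits_per_chunk) / 64] |= uint64_t(1) << (i % 64);
    }
    bool test(std::size_t i) const {
        return (_chunks[i / bits_per_chunk][(i % bits_per_chunk) / 64] >> (i % 64)) & 1;
    }
    std::size_t size() const { return _nbits; }
};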
This heavily used function shows up in many places in the profile (as part
of other functions), so it's worth optimizing by eliminating the special
case for the standard allocator. Use a statically allocated object instead.
(A non-thread-local object is fine since it has no data members.)
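A minimal sketch of the pattern, with illustrative names (not the actual
Scylla allocation-strategy API): instead of branching on "is the standard
allocator in use?" in the hot function, the default case is a single
statically allocated, stateless object, so the hot path is always the same
pointer dispatch.

#include <cstddef>
#include <cstdlib>

struct allocation_strategy {
    virtual void* alloc(std::size_t n) = 0;
    virtual void free(void* p) = 0;
    virtual ~allocation_strategy() = default;
};

struct standard_allocation_strategy final : allocation_strategy {
    void* alloc(std::size_t n) override { return std::malloc(n); }
    void free(void* p) override { std::free(p); }
};

// Stateless, so one shared, non-thread-local instance is safe for all shards.
static standard_allocation_strategy standard_strategy;

// Every thread's "current allocator" starts out pointing at the static object,
// so the hot path never needs a special case for the standard allocator.
thread_local allocation_strategy* current_allocator = &standard_strategy;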
"This series introduces the i_endpoint_snitch::reset_snitch() static method
that allows to replace the current (global) snitch instance with the new one.
This is done in an (per-shard) atomic way transparent so anyone holding a reference
to snitch_ptr.
This series starts with some cleanups, adds the above method and the unit test
that verifies its functionality."
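A minimal sketch of the mechanism this describes (toy types, not the real
i_endpoint_snitch/snitch_ptr classes): callers hold a reference to a per-shard
wrapper rather than to the snitch itself, so swapping the wrapped instance is
invisible to them; since each shard is a single thread, the swap needs no
locking within the shard.

#include <memory>
#include <string>
#include <utility>

struct snitch {                       // stand-in for i_endpoint_snitch
    virtual std::string get_rack() const = 0;
    virtual ~snitch() = default;
};

class snitch_wrapper {                // stand-in for snitch_ptr
    std::unique_ptr<snitch> _impl;
public:
    explicit snitch_wrapper(std::unique_ptr<snitch> s) : _impl(std::move(s)) {}
    snitch* operator->() const { return _impl.get(); }
    // reset(): swap in a new implementation; anyone holding a reference to
    // this wrapper transparently sees the new snitch on the next call.
    void reset(std::unique_ptr<snitch> new_snitch) {
        _impl = std::move(new_snitch);
    }
};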
"I am currently looking at the performance of our index_read, since it was in
the past pinpointed at the source of problems.
While the read side is the one that is mostly interesting, I would like to test
both - besides anything else, it is easier to test reads after writes so we
don't have to create synthetic data with outside tools.
This patch introduces the write side benchmark (read side will hopefully come
tomorrow). While the write side is, as mentioned, not the most interesting
part, I did see some standing from the flamegraph that allowed me to optimize
one particular function, yielding a 8.6 % improvement."
This is a test that allows us to measure the performance of our sstable index
reads and writes (currently only writes are implemented). A lot of potentially
common code is put into a header, which will make writing new tests easier if
needed.
We don't want to take shortcuts for this, so all reading and writing is done
through public sstable interfaces.
For writing, there is no way to write the index without writing the datafile.
But because we are only writing the primary key, the datafile will not contain
anything else. This is the closest we can get to index-only testing with the
public interfaces.
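The timing side of such a benchmark is simple; a generic sketch of its
skeleton follows (illustrative only - the real test drives the public sstable
writer, and write_one_key here is just a placeholder callable): write N
partitions that carry nothing but their key, and report throughput.

#include <chrono>
#include <cstddef>
#include <cstdio>

// write_one_key(i) is whatever writes a single key-only partition through the
// public sstable interface; it is passed in so this skeleton stays generic.
template <typename WriteOneKey>
void run_index_write_benchmark(std::size_t n, WriteOneKey write_one_key) {
    auto start = std::chrono::steady_clock::now();
    for (std::size_t i = 0; i < n; i++) {
        write_one_key(i);
    }
    auto secs = std::chrono::duration<double>(
        std::chrono::steady_clock::now() - start).count();
    std::printf("%zu keys in %.3f s (%.0f keys/s)\n", n, secs, n / secs);
}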
Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>
Currently, each column family creates a fiber to handle compaction requests
in parallel with the rest of the system. If there are N column families, N
compactions could be running in parallel, which is definitely horrible.
To solve that problem, a per-database compaction manager is introduced here.
The compaction manager services compaction requests from all N column
families. Parallelism is provided by creating more than one fiber to service
the requests; that is, N compaction requests will be served by M fibers.
A submitted compaction request goes to a job queue shared between all fibers,
and the fiber with the fewest pending jobs is signalled.
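A self-contained toy model of that scheme (std::thread stands in for a fiber;
the names are illustrative, not the actual compaction_manager API): jobs land
in one shared queue, and submission wakes the worker with the fewest
outstanding wake-ups.

#include <condition_variable>
#include <deque>
#include <functional>
#include <memory>
#include <mutex>
#include <thread>
#include <vector>

class compaction_manager_sketch {
    struct worker {
        std::condition_variable cv;
        int pending = 0;                     // wake-ups handed to this worker
    };
    std::mutex _mu;
    std::deque<std::function<void()>> _jobs; // job queue shared by all workers
    std::vector<std::unique_ptr<worker>> _workers;
    std::vector<std::thread> _threads;
    bool _stopping = false;
public:
    explicit compaction_manager_sketch(int m) {
        for (int i = 0; i < m; i++) {
            _workers.push_back(std::make_unique<worker>());
            _threads.emplace_back([this, w = _workers.back().get()] {
                std::unique_lock<std::mutex> lk(_mu);
                while (true) {
                    w->cv.wait(lk, [&] { return _stopping || w->pending > 0; });
                    if (_stopping) {
                        return;              // pending jobs are dropped in this toy
                    }
                    w->pending--;
                    auto job = std::move(_jobs.front());
                    _jobs.pop_front();
                    lk.unlock();
                    job();                   // run the compaction request
                    lk.lock();
                }
            });
        }
    }
    // Submit a request: enqueue it and signal the least-loaded worker.
    void submit(std::function<void()> job) {
        std::lock_guard<std::mutex> lk(_mu);
        _jobs.push_back(std::move(job));
        worker* least = _workers.front().get();
        for (auto& w : _workers) {
            if (w->pending < least->pending) {
                least = w.get();
            }
        }
        least->pending++;
        least->cv.notify_one();
    }
    ~compaction_manager_sketch() {
        { std::lock_guard<std::mutex> lk(_mu); _stopping = true; }
        for (auto& w : _workers) {
            w->cv.notify_one();
        }
        for (auto& t : _threads) {
            t.join();
        }
    }
};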
Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>
There's nothing legacy about it, so rename legacy_schema_tables to
schema_tables. The naming comes from a Cassandra 3.x development branch
which is not relevant for us in the near future.
Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>
This patch adds the beginning of node repair support. Repair is initiated
on a node using the REST API. For example, to repair all the column families
in the "try1" keyspace, you can use:
curl -X GET --header "Content-Type: application/json" --header "Accept: application/json" "http://127.0.0.1:10000/storage_service/repair_async/try1"
I tested that the repair already works (it exchanges mutations with all the
other replicas and successfully repairs them), so I think this can be
committed, but more work is needed to complete it:
1. Repair options are not yet supported (range repair, sequential/parallel
repair, choice of hosts, datacenters and column families, etc.).
2. *All* the data of the keyspace is exchanged - Merkle Trees (or an
alternative optimization) and partial data exchange haven't been
implemented yet.
3. Full repair for nodes with multiple separate ranges is not yet
implemented correctly. E.g., consider 10 nodes with vnodes and RF=2:
each vnode's range has a different host as a replica, so we need
to exchange each key range separately with a different remote host.
4. Our repair operation returns a numeric operation id (like Origin),
but we don't yet provide any means to use this id to check on ongoing
repairs like Origin allows.
5. Error handling, logging, etc., needs to be improved.
6. SMP nodes (with multiple shards) should work correctly (thanks to
Asias's latest patch for SMP mutation streaming) but haven't been
tested.
7. Incremental repair is not supported (see
http://www.datastax.com/dev/blog/more-efficient-repairs)
Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>
This patch adds a "ninja clean", better than the current "ninja -t clean".
Ninja's "ninja -t clean" is a nice trick, designed to save the Makefile writer
the tedious chore of listing the targets to remove, by automatically gathering
this list. But our build system, following OSv's, actually uses a much
cooler (and better) trick: all build files are generated in a single
subdirectory, "build/", and cleaning the build products is as simple as
"rm -rf build".
So this patch adds a target, "ninja clean", which does exactly this (rm -rf
build). "ninja clean" is not only easier to type than "ninja -t clean", it
also has one important benefit: When the ninja rules change, "ninja -t clean"
doesn't remember to delete now-defunct targets, and they stay behind. On my
build machine, "ninja -t clean" left behind almost a gigabyte of old crap.
Moreover, when the ninja file changes drastically (as it changed a few days
ago), not cleaning up everything can even cause new builds to break - e.g.,
when something was previously a file and now needs to be a directory.
Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>
This patch introduces an init.cc file which hosts all the initialization
code. The benefits are: 1) we can share initialization code with the test
code; 2) all the service startup dependency/ordering code is in one
single place instead of scattered everywhere.
The utils file will hold general-purpose utilities that need to be used by
multiple modules.
As a start, it holds the histogram definition.
Signed-off-by: Amnon Heiman <amnon@cloudius-systems.com>
Instead of trying to second-guess the seastar build system, always rebuild
libseastar.a. Specify restat = 1 so that binaries are only relinked if
something truly changed.
The idea is to reuse the same testing code on any mutation_source, for
example on a memtable.
The range query test cases are now part of a generic mutation_source
test suite.
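A small, self-contained illustration of the pattern (toy types, not the actual
mutation/mutation_source classes): the suite is parameterized by a "populate"
factory, so the same assertions run against any backend that can build one -
here an ordered in-memory map standing in for a memtable.

#include <cassert>
#include <functional>
#include <map>
#include <memory>
#include <string>
#include <utility>
#include <vector>

using mutation = std::pair<int, std::string>;        // toy key -> value
struct mutation_source {                             // toy read-only interface
    std::function<std::vector<mutation>(int lo, int hi)> read_range;
};
using populate_fn = std::function<mutation_source(std::vector<mutation>)>;

// The generic suite: any backend that can implement populate_fn gets the same
// range-query coverage for free.
void run_mutation_source_tests(const populate_fn& populate) {
    mutation_source ms = populate({{1, "a"}, {3, "b"}, {7, "c"}});
    assert(ms.read_range(0, 5).size() == 2);         // keys 1 and 3
    assert(ms.read_range(4, 10).size() == 1);        // key 7
}

int main() {
    // Example backend: an in-memory map playing the role of a memtable.
    run_mutation_source_tests([](std::vector<mutation> rows) {
        auto table = std::make_shared<std::map<int, std::string>>(rows.begin(), rows.end());
        return mutation_source{[table](int lo, int hi) {
            std::vector<mutation> out;
            for (auto it = table->lower_bound(lo); it != table->end() && it->first <= hi; ++it) {
                out.emplace_back(it->first, it->second);
            }
            return out;
        }};
    });
}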
The functionality is similar to RuntimeMXBean.getUptime(), which is needed
in the schema pulling logic.
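A minimal sketch of one way to provide it (illustrative, assuming a
steady-clock timestamp captured at startup): record the start time once and
report elapsed milliseconds, mirroring what the JVM's RuntimeMXBean.getUptime()
returns.

#include <chrono>
#include <cstdint>

// Captured once during static initialization, i.e. at process startup.
static const auto process_start = std::chrono::steady_clock::now();

// Milliseconds the process has been up.
int64_t uptime_ms() {
    return std::chrono::duration_cast<std::chrono::milliseconds>(
        std::chrono::steady_clock::now() - process_start).count();
}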
Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>