scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-24 18:40:38 +00:00

Author	SHA1	Message	Date
Nadav Har'El	506e074ba4	sstable decompression: fix skip() to end of file The skip() implementation for the compressed file input stream incorrectly handled the case of skipping to the end of file: In that case we just need to update the file pointer, but not skip anywhere in the compressed disk file; In particular, we must NOT call locate() to find the relevant on-disk compressed chunk, because there is none - locate() can only be called on actual positions of bytes, not on the one-past-end-of-file position. Fixes #2143 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20170308100057.23316-1-nyh@scylladb.com>	2017-03-08 12:35:05 +02:00
Raphael S. Carvalho	c26090a6b2	sstables/compress: fix error message for snappy uncompression Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <898ad07db705355bdbf780afdb3aa982b8ca3823.1482364125.git.raphaelsc@scylladb.com>	2016-12-22 09:08:34 +01:00
Duarte Nunes	cc3f26c993	lz4: Conditionally use LZ4_compress_default() Since not all distributions have a version of LZ4 with LZ4_compress_default(), we use it conditionally. This is especially important beginning with version 1.7.3 of LZ4, which deprecates the LZ4_compress() function in favour of LZ4_compress_default() and thus prevents Scylla from compiling due to the deprecation warning. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20161124092339.23017-1-duarte@scylladb.com>	2016-11-24 11:25:03 +02:00
Paweł Dziepak	2124d08b88	sstables: add skip() to compressed_file_data_source Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Avi Kivity	106e3703d9	sstables: stop using unaligned_cast unaligned_cast violates strict aliasing, and causes code misgeneration on gcc 6. Replace it with read_be/write_be, which are nicer anyway. Message-Id: <1469122850-7511-1-git-send-email-avi@scylladb.com>	2016-07-22 07:03:08 +01:00
Paweł Dziepak	4513f8b52c	sstables: add compressed_file_data_source_impl::close() compressed_file_data_source_impl should close the underlying data source properly when asked to. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-30 13:07:07 +01:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Nadav Har'El	02ba8ffbe8	Allow uncompression at end of file Asking to read from byte 100 when a file has 50 bytes is an obvious error. But what if we ask to read from byte 50? What if we ask to read 0 bytes at byte 50? :-) Before this patch, code which asked to read from the EOF position would get an exception. After this patch, it would simply read nothing, without error. This allows, for example, reading 0 bytes from position 0 on a file with 0 bytes, which apparently happened in issue #1039... A read which starts at a position higher than the EOF position still generates an exception. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1458137867-10998-1-git-send-email-nyh@scylladb.com>	2016-03-16 17:50:23 +02:00
Nadav Har'El	73297c7872	Fix out-of-range exception when uncompressing 0 bytes The uncompression code reads the compressed chunks containing the bytes pos through pos + len - 1. This, however, is not correct when len==0, and pos + len - 1 may even be -1, causing an out-of-range exception when calling locate() to find the chunks containing this byte position. So we need to treat len==0 specially, and in this case we don't read anything, and don't need to locate() the chunks to read. Refs #1039. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1458135987-10200-1-git-send-email-nyh@scylladb.com>	2016-03-16 15:54:48 +02:00
Nadav Har'El	1b4f8842ee	sstable: fix compressed data file overread Since commit `2f56577` ("sstables: more efficient read of compressed data file"), the compressed_file_input_stream uses a file_input_stream to efficiently read the compressed data in chunks of some desired size (128 KB is our default) instead of in smaller compressed chunks. However, I had a bug where I mis-calculated the desired length of the read (giving the end byte instead of the length!) and as a result file_input_stream did not know where the read was supposed to stop, and always read 128 KB buffers. The results were not incorrect, because the sstable reader stops when it needs to, even if given too much data. But it was inefficient because too much data was read in the last buffer. With this patch, the length is correctly given to the input stream, and it can read a much smaller buffer at the end of the read, not the full 128 KB. I tested that this actually happens. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1457633616-15193-1-git-send-email-nyh@scylladb.com>	2016-03-11 09:17:50 +02:00
Nadav Har'El	2f56577794	sstables: more efficient read of compressed data file Before this patch, reading large ranges from a compressed data file involved two inefficiencies: 1. The compressed data file was read one compressed chunk at a time. Such a chunk is around 30 KB in size, well below our desired sstable read-ahead size (sstable_buffer_size = 128 KB). 2. Because the compressed chunks have variable length (the uncompressed chunk has a fixed length) they are not aligned to disk blocks, so consecutive chunks have overlapping blocks which were unnecessarily read twice. The fix for both issues is to build the compressed_file_input_stream on an existing file_input_stream, instead of using direct file IO to read the individual chunks. file_input_stream takes care of doing the appropriate amount of read-ahead, and the compressed_file_input_stream layer does the decompression of the data read from the underlying layer. Fixes #992. Historical note: Implementing compressed_file_input_stream on top of file_input_stream was already tried in the past, and rejected. The problem at that time was that compressed_file_input_stream's constructor did not specify the end of the range to read, so that when we wanted to read only a small range we got too much read-ahead beyond the exactly one compressed chunk that we needed to read. Following the fix to issue #964, we now know on every streaming read also the intended end of the stream, so we can now use this to stop reading at the end of the last required chunk, even when we use a read-ahead buffer much larger than a chunk. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1457304335-8507-1-git-send-email-nyh@scylladb.com>	2016-03-09 10:14:15 +02:00
Glauber Costa	8e4bf025ae	sstables: wire priority for read path All the SSTable read path can now take an io_priority. The public functions will take a default parameter which is Seastar's default priority. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-01-25 15:20:38 -05:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Nadav Har'El	4edf7fe206	clean up uses of lw_shared_ptr<file> recently, "file" started to use a shared_ptr internally, and is already copy-able and reference counted, and there is no reason to use lw_shared_ptr<file>. This patch cleans up a few remaining places where lw_shared_ptr<file> was used. Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-07-22 11:51:40 +03:00
Raphael S. Carvalho	657c817d14	sstables: add support to deflate compression Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-06-17 11:55:02 -03:00
Raphael S. Carvalho	7244d392cd	sstables: add support to snappy compression Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-06-17 11:55:00 -03:00
Raphael S. Carvalho	3bfb86f541	sstables: add compress_max_size to compression used to return maximum size which compressor may output. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com> Reviewed-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-06-16 09:48:00 -03:00
Raphael S. Carvalho	4175fa92b3	sstables: fix lz4 compress compress_lz4 must account for the input length prepended to the output buffer. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com> Reviewed-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-06-16 08:34:39 -03:00
Pekka Enberg	a9d08438cd	sstable: Inline adler32 checksum functions They're called in the fast-path so inline the functions to avoid an extra function call. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-06-04 15:48:37 +03:00
Raphael S. Carvalho	bdd3fe61c5	sstables: add initial support to generation of CRC component CRC component is composed of chunk size, and a vector of checksums for each chunk (at most chunk size bytes) composing the data file. The implementation is about computing the checksum every time the output stream of data file gets written. A write to output stream may cross the chunk boundary, so that must be handled properly. Note that CRC component will only be created if compression isn't being used. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-06-01 12:25:01 -03:00
Vlad Zolotarov	81c9b327c6	compress: Use file::dma_read_exactly() instead of a local version of it. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2015-05-31 11:21:56 +03:00
Raphael S. Carvalho	fdf50ef643	sstables: add initial support to compression Starting with LZ4, the default compressor. Stub functions were added to other compression algorithms, which should eventually be replaced with an actual implementation. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com> Reviewed-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-04-19 10:07:29 +03:00
Nadav Har'El	de58d08e59	sstable: fix compressed data file stream bug We need to update _pos after we read, or we keep reading the same chunk over and over :-( Also, don't read anything if we're already past the end of file. Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-04-07 10:47:29 +03:00
Nadav Har'El	f80ac5a629	sstables: rework compression metadata to fix test. Previously we had both a "compression" structure (read from the Compression Info file on disk) and a "compression_metadata" class with additional information, which std::move()ed parts of the compression structure. This caused problems for the simplistic sstable-writing test (which does the non-interesting thing of writing a previously-read sstable). I'm ashamed to say, fixing this was very hard, because all this code is built like a house of cards - try to change one thing, and everything falls apart. After many failed attempts in trying to improve this code, what I ended up doing is simply extending the "compression" structure - the extended part isn't read or written, but it is in the structure. We also no longer move a shared pointer to the compression structure, but rather just an ordinary pointer; the assumption is that the user will already make sure that the sstable structure will live for the duration of any processing on it - and the compression structure is just one part of this sstable structure. Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-03-29 16:14:53 +03:00
Nadav Har'El	4d57c8fd28	sstables: fix LZ4 decompression It turns out that Cassandra's LZ4Compressor doesn't use the LZ4 compressor directly - instead it prepends the uncompressed length, in 4-byte little-endian (!) encoding, to the compressed chunk. We don't need this extra information - we already know the expected uncompressed chunk length, so we need to just skip it. Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-03-25 12:35:32 +02:00
Nadav Har'El	c6eb2a87ea	Move compress.{cc,hh} to sstables/ Move compress.{cc,hh} from db/ to sstables/. This makes more sense, as this code is only used for sstables (un)compression. Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-03-24 16:54:58 +02:00
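
Commits 4d57c8fd28 and 4175fa92b3 both turn on the same detail: an LZ4 chunk as written by Cassandra's LZ4Compressor starts with the uncompressed length in 4-byte little-endian encoding, so a reader has to skip 4 bytes and a writer has to budget 4 extra bytes. The following is a minimal sketch of that framing only, not the code in sstables/compress.cc. The names `length_prefix_size`, `frame_chunk`, `read_length_prefix` and `payload_offset` are hypothetical, and the payload is treated as opaque bytes so that no LZ4 library call is assumed.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical helpers for illustration; these are not ScyllaDB functions.
constexpr std::size_t length_prefix_size = 4;

// Build a framed chunk: 4 little-endian bytes holding the uncompressed
// length, followed by the (already compressed) payload bytes.
std::vector<uint8_t> frame_chunk(uint32_t uncompressed_len,
                                 const std::vector<uint8_t>& payload) {
    std::vector<uint8_t> out;
    // The output needs room for the prefix as well as the payload.
    out.reserve(length_prefix_size + payload.size());
    for (int shift = 0; shift < 32; shift += 8) {
        out.push_back(static_cast<uint8_t>(uncompressed_len >> shift));
    }
    out.insert(out.end(), payload.begin(), payload.end());
    return out;
}

// Decode the little-endian length prefix of a framed chunk.
uint32_t read_length_prefix(const std::vector<uint8_t>& chunk) {
    if (chunk.size() < length_prefix_size) {
        throw std::runtime_error("chunk shorter than its length prefix");
    }
    return uint32_t(chunk[0])
         | uint32_t(chunk[1]) << 8
         | uint32_t(chunk[2]) << 16
         | uint32_t(chunk[3]) << 24;
}

// Offset at which the compressed payload starts inside a framed chunk.
constexpr std::size_t payload_offset() {
    return length_prefix_size;
}
```

A reader that already knows the expected uncompressed chunk length, as the 4d57c8fd28 message describes, would ignore `read_length_prefix` and simply start decompressing at `payload_offset()`.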

26 Commits
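
Three of the fixes above (506e074ba4, 02ba8ffbe8, 73297c7872) settle the same boundary rules: a zero-length read or a read or skip that lands exactly on the end-of-file position does nothing and must not call locate(), while a read starting beyond EOF is still an error. A minimal sketch of those rules follows, under the assumption that positions and lengths are unsigned byte counts. `read_action` and `classify_read` are hypothetical names used only for illustration and do not appear in the ScyllaDB source.

```cpp
#include <cstdint>
#include <stdexcept>

// Hypothetical classification of a read request against a file length.
enum class read_action {
    nothing_to_read,  // succeed without touching the compressed file
    locate_chunks     // safe to look up the chunks covering the range
};

read_action classify_read(uint64_t pos, uint64_t len, uint64_t file_len) {
    // Starting beyond the one-past-end position remains an error.
    if (pos > file_len) {
        throw std::out_of_range("read starts past end of file");
    }
    // len == 0: pos + len - 1 would not name a real byte, so there is
    // nothing to locate. pos == file_len: the one-past-end position has
    // no on-disk chunk either.
    if (len == 0 || pos == file_len) {
        return read_action::nothing_to_read;
    }
    return read_action::locate_chunks;
}
```

With this check in front, a chunk lookup is only ever attempted for positions that name an actual byte, which is the precondition the 506e074ba4 message states for locate().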