scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 20:27:03 +00:00

Author	SHA1	Message	Date
Pekka Enberg	380a7be54b	dist/docker: Add support for Alternator This adds a "alternator-address" and "alternator-port" configuration options to the Docker image, so people can enable Alternator with "docker run" with: docker run --name some-scylla -d <image> --alternator-port=8080 Message-Id: <20190902110920.19269-1-penberg@scylladb.com>	2019-09-11 18:01:05 +03:00
Takuya ASADA	31ddb2145a	dist/common/scripts: support nonroot mode on setup scripts Since nonroot mode requires to run everything on non-privileged user, most of setup scripts does not able to use nonroot mode. We only provide following functions on nonroot mode: - EC2 check - IO setup - Node exporter installer - Dev mode setup Rest of functions will be skipped on scylla_setup. To implement nonroot mode on setup scripts, scylla_util provides utility functions to abstract difference of directory structure between normal installation and nonroot mode.	2019-09-03 20:06:35 +09:00
Takuya ASADA	cfa8885ae1	reloc/python3: add install.sh on python relocatable package To support nonroot installation on scylla-python3, add install.sh on scylla-python3 relocatable package.	2019-09-03 20:06:30 +09:00
Takuya ASADA	2de14e0800	install.sh: add --nonroot mode This implements the way to install Scylla without requires root privilege, not distribution dependent, does not uses package manager.	2019-09-03 20:06:24 +09:00
Takuya ASADA	cde798dba5	dist/common/systemd: untemplataize *.service, use drop-in units instead Since systemd unit can override parameters using drop-in unit, we don't need mustache template for them. Also, drop --disttype and --target options on install.sh since it does not required anymore, introduce --sysconfdir instead for non-redhat distributions.	2019-09-03 20:06:15 +09:00
Takuya ASADA	49a360f234	dist/debian: delete debian/.install, debian/.dirs Since `ac9b115`, we switched to install.sh on Debian so we don't rely on .deb specific packaging scripts anymore. Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2019-09-03 20:06:09 +09:00
Avi Kivity	07010af44c	scylla_prepare: verify processor satisfies instruction set requirements Scylla requires the CLMUL and SSE 4.2 instruction sets and will fail without them. There is a check in main(), but that happens after the code is running and it may already be too late. Add a check in scylla_prepare which runs before the main executable.	2019-08-29 15:34:29 +03:00
Avi Kivity	f1d73d0c13	Merge "systemd: put scylla processes in systemd slices. #4743 " from Glauber " It is well known that seastar applications, like Scylla, do not play well with external processes: CPU usage from external processes may confuse the I/O and CPU schedulers and create stalls. We have also recently seen that memory usage from other application's anonymous and page cache memory can bring the system to OOM. Linux has a very good infrastructure for resource control contributed by amazingly bright engineers in the form of cgroup controllers. This infrastructure is exposed by SystemD in the form of slices: a hierarchical structure to which controllers can be attached. In true systemd way, the hierarchy is implicit in the filenames of the slice files. a "-" symbol defines the hierarchy, so the files that this patch presents, scylla-server and scylla-helper, essentially create a "scylla" cgroup at the top level with "server" and "helper" children. Later we mark the Services needed to run scylla as belonging to one or the other through the Slice= directive. Scylla DBAs can benefit from this setup by using the systemd-run utility to fire ad-hoc commands. Let's say for example that someone wants to hypothetically run a backup and transfer files to an external object store like S3, making sure that the amount of page cache used won't create swap pressure leading to database timeouts. One can then run something like: sudo systemd-run --uid=id -u scylla --gid=id -g scylla -t --slice=scylla-helper.slice /path/to/my/magical_backup_tool (or even better, the backup tool can itself be a systemd timer) " * 'slices' of https://github.com/glommer/scylla: systemd: put scylla processes in systemd slices. move postinst steps to an external script	2019-08-26 20:16:55 +03:00
Pekka Enberg	118a141f5d	scylla_blocktune.py: Kill btrfs related FIXME The scylla_blocktune.py has a FIXME for btrfs from 2016, which is no longer relevant for Scylla deployments, as Red Hat dropped support for the file system in 2017. Message-Id: <20190823114013.31112-1-penberg@scylladb.com>	2019-08-24 20:40:08 +03:00
Avi Kivity	698b72b501	relocatable: switch from run-time relocation to install-time relocation Our current relocation works by invoking the dynamic linker with the executable as an argument. This confuses gdb since the kernel records the dynamic linker as the executable, not the real executable. Switch to install-time relocation with patchelf: when installing the executable and libraries, all paths are known, and we can update the path to the dynamic loader and to the dynamic libraries. Since patchelf itself is dynamically linked, we have to relocate it dynamically (with the old method of invoking it via the dynamic linker). This is okay since it's a one-time operation and since we don't expect to debug core dumps of patchelf crashes. We lose the ability to run scylla directly from the uninstalled tarball, but since the nonroot installer is already moving in the direction of requiring install.sh, that is not a great loss, and certainly the ability to debug is more important. dh_strip barfs on some binaries which were treated with patchelf, so exclude them from dh_strip. This doesn't lose any functionality, since these binaries didn't have debug information to begin with (they are already-stripped Fedora executables). Fixes #4673.	2019-08-20 00:25:43 +02:00
Tomasz Grabiec	b9447d0319	Revert "relocatable: switch from run-time relocation to install-time relocation" This reverts commit `4ecce2d286`. Should be committed via the next branch.	2019-08-20 00:22:30 +02:00
Avi Kivity	4ecce2d286	relocatable: switch from run-time relocation to install-time relocation Our current relocation works by invoking the dynamic linker with the executable as an argument. This confuses gdb since the kernel records the dynamic linker as the executable, not the real executable. Switch to install-time relocation with patchelf: when installing the executable and libraries, all paths are known, and we can update the path to the dynamic loader and to the dynamic libraries. Since patchelf itself is dynamically linked, we have to relocate it dynamically (with the old method of invoking it via the dynamic linker). This is okay since it's a one-time operation and since we don't expect to debug core dumps of patchelf crashes. We lose the ability to run scylla directly from the uninstalled tarball, but since the nonroot installer is already moving in the direction of requiring install.sh, that is not a great loss, and certainly the ability to debug is more important. dh_strip barfs on some binaries which were treated with patchelf, so exclude them from dh_strip. This doesn't lose any functionality, since these binaries didn't have debug information to begin with (they are already-stripped Fedora executables). Fixes #4673.	2019-08-20 00:20:19 +02:00
Glauber Costa	da260ecd61	systemd: put scylla processes in systemd slices. It is well known that seastar applications, like Scylla, do not play well with external processes: CPU usage from external processes may confuse the I/O and CPU schedulers and create stalls. We have also recently seen that memory usage from other application's anonymous and page cache memory can bring the system to OOM. Linux has a very good infrastructure for resource control contributed by amazingly bright engineers in the form of cgroup controllers. This infrastructure is exposed by SystemD in the form of slices: a hierarchical structure to which controllers can be attached. In true systemd way, the hierarchy is implicit in the filenames of the slice files. a "-" symbol defines the hierarchy, so the files that this patch presents, scylla-server and scylla-helper, essentially create a "scylla" cgroup at the top level with "server" and "helper" children. Later we mark the Services needed to run scylla as belonging to one or the other through the Slice= directive. Scylla DBAs can benefit from this setup by using the systemd-run utility to fire ad-hoc commands. Let's say for example that someone wants to hypothetically run a backup and transfer files to an external object store like S3, making sure that the amount of page cache used won't create swap pressure leading to database timeouts. One can then run something like: ``` sudo systemd-run --uid=`id -u scylla` --gid=`id -g scylla` -t --slice=scylla-helper.slice /path/to/my/magical_backup_tool ``` (or even better, the backup tool can itself be a systemd timer) Changes from last version: - No longer use the CPUQuota - Minor typo fixes - postinstall fixup for small machines Benchmark results: ================== Test: read from disk, with 100% disk util using a single i3.xlarge (4 vCPUs). We have to fill the cache as we read, so this should stress CPU, memory and disk I/O. cassandra-stress command: ``` cassandra-stress read no-warmup duration=5m -rate threads=20 -node 10.2.209.188 -pop dist=uniform\(1..150000000\) ``` Baseline results: ``` Results: Op rate : 13,830 op/s [READ: 13,830 op/s] Partition rate : 13,830 pk/s [READ: 13,830 pk/s] Row rate : 13,830 row/s [READ: 13,830 row/s] Latency mean : 1.4 ms [READ: 1.4 ms] Latency median : 1.4 ms [READ: 1.4 ms] Latency 95th percentile : 2.4 ms [READ: 2.4 ms] Latency 99th percentile : 2.8 ms [READ: 2.8 ms] Latency 99.9th percentile : 3.4 ms [READ: 3.4 ms] Latency max : 12.0 ms [READ: 12.0 ms] Total partitions : 4,149,130 [READ: 4,149,130] Total errors : 0 [READ: 0] Total GC count : 0 Total GC memory : 0.000 KiB Total GC time : 0.0 seconds Avg GC time : NaN ms StdDev GC time : 0.0 ms Total operation time : 00:05:00 ``` Question 1: =========== Does putting scylla in a special slice affect its performance ? Results with Scylla running in a slice: ``` Results: Op rate : 13,811 op/s [READ: 13,811 op/s] Partition rate : 13,811 pk/s [READ: 13,811 pk/s] Row rate : 13,811 row/s [READ: 13,811 row/s] Latency mean : 1.4 ms [READ: 1.4 ms] Latency median : 1.4 ms [READ: 1.4 ms] Latency 95th percentile : 2.2 ms [READ: 2.2 ms] Latency 99th percentile : 2.6 ms [READ: 2.6 ms] Latency 99.9th percentile : 3.3 ms [READ: 3.3 ms] Latency max : 23.2 ms [READ: 23.2 ms] Total partitions : 4,151,409 [READ: 4,151,409] Total errors : 0 [READ: 0] Total GC count : 0 Total GC memory : 0.000 KiB Total GC time : 0.0 seconds Avg GC time : NaN ms StdDev GC time : 0.0 ms Total operation time : 00:05:00 ``` Conclusion : No significant change Question 2: =========== What happens when there is a CPU hog running in the same server as scylla? CPU hog: ``` taskset -c 0 /bin/sh -c "while true; do true; done" & taskset -c 1 /bin/sh -c "while true; do true; done" & taskset -c 2 /bin/sh -c "while true; do true; done" & taskset -c 3 /bin/sh -c "while true; do true; done" & sleep 330 ``` Scenario 1: CPU hog runs freely: ``` Results: Op rate : 2,939 op/s [READ: 2,939 op/s] Partition rate : 2,939 pk/s [READ: 2,939 pk/s] Row rate : 2,939 row/s [READ: 2,939 row/s] Latency mean : 6.8 ms [READ: 6.8 ms] Latency median : 5.3 ms [READ: 5.3 ms] Latency 95th percentile : 11.0 ms [READ: 11.0 ms] Latency 99th percentile : 14.9 ms [READ: 14.9 ms] Latency 99.9th percentile : 17.1 ms [READ: 17.1 ms] Latency max : 26.3 ms [READ: 26.3 ms] Total partitions : 884,460 [READ: 884,460] Total errors : 0 [READ: 0] Total GC count : 0 Total GC memory : 0.000 KiB Total GC time : 0.0 seconds Avg GC time : NaN ms StdDev GC time : 0.0 ms Total operation time : 00:05:00 ``` Scenario 2: CPU hog runs inside scylla-helper slice ``` Results: Op rate : 13,527 op/s [READ: 13,527 op/s] Partition rate : 13,527 pk/s [READ: 13,527 pk/s] Row rate : 13,527 row/s [READ: 13,527 row/s] Latency mean : 1.5 ms [READ: 1.5 ms] Latency median : 1.4 ms [READ: 1.4 ms] Latency 95th percentile : 2.4 ms [READ: 2.4 ms] Latency 99th percentile : 2.9 ms [READ: 2.9 ms] Latency 99.9th percentile : 3.8 ms [READ: 3.8 ms] Latency max : 18.7 ms [READ: 18.7 ms] Total partitions : 4,069,934 [READ: 4,069,934] Total errors : 0 [READ: 0] Total GC count : 0 Total GC memory : 0.000 KiB Total GC time : 0.0 seconds Avg GC time : NaN ms StdDev GC time : 0.0 ms Total operation time : 00:05:00 ``` Conclusion: With systemd slice we can keep the performance very close to baseline Question 3: =========== What happens when there is a CPU hog running in the same server as scylla? I/O hog: (Data in the cluster is 2x size of memory) ``` while true; do find /var/lib/scylla/data -type f -exec grep glauber {} + done ``` Scenario 1: I/O hog runs freely: ``` Results: Op rate : 7,680 op/s [READ: 7,680 op/s] Partition rate : 7,680 pk/s [READ: 7,680 pk/s] Row rate : 7,680 row/s [READ: 7,680 row/s] Latency mean : 2.6 ms [READ: 2.6 ms] Latency median : 1.3 ms [READ: 1.3 ms] Latency 95th percentile : 7.8 ms [READ: 7.8 ms] Latency 99th percentile : 10.9 ms [READ: 10.9 ms] Latency 99.9th percentile : 16.9 ms [READ: 16.9 ms] Latency max : 40.8 ms [READ: 40.8 ms] Total partitions : 2,306,723 [READ: 2,306,723] Total errors : 0 [READ: 0] Total GC count : 0 Total GC memory : 0.000 KiB Total GC time : 0.0 seconds Avg GC time : NaN ms StdDev GC time : 0.0 ms Total operation time : 00:05:00 ``` Scenario 2: I/O hog runs in the scylla-helper systemd slice: ``` Results: Op rate : 13,277 op/s [READ: 13,277 op/s] Partition rate : 13,277 pk/s [READ: 13,277 pk/s] Row rate : 13,277 row/s [READ: 13,277 row/s] Latency mean : 1.5 ms [READ: 1.5 ms] Latency median : 1.4 ms [READ: 1.4 ms] Latency 95th percentile : 2.4 ms [READ: 2.4 ms] Latency 99th percentile : 2.9 ms [READ: 2.9 ms] Latency 99.9th percentile : 3.5 ms [READ: 3.5 ms] Latency max : 183.4 ms [READ: 183.4 ms] Total partitions : 3,984,080 [READ: 3,984,080] Total errors : 0 [READ: 0] Total GC count : 0 Total GC memory : 0.000 KiB Total GC time : 0.0 seconds Avg GC time : NaN ms StdDev GC time : 0.0 ms Total operation time : 00:05:00 ``` Conclusion: With systemd slice we can keep the performance very close to baseline Signed-off-by: Glauber Costa <glauber@scylladb.com>	2019-08-19 14:31:28 -04:00
Takuya ASADA	f574112301	dist/debian: handle --dist correctly On `ac9b115`, it mistakenly ignores --dist option. It should set 'housekeeping' template variable to 'enable'. Fixes #4857 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190816120127.14099-1-syuu@scylladb.com>	2019-08-18 15:00:33 +03:00
Glauber Costa	ffc328c924	move postinst steps to an external script There are systemd-related steps done in both rpm and deb builds. Move that to a script so we avoid duplication. The tests are so far a bit specific to the distributions, so it needs to be adapted a bit. Also note that this also fixes a bug with rpm as a side-effect: rpm does not call daemon-reload after potentially changing the systemd files (it is only implied during postun operations, that happen during uninstall). daemon-reload was called explicitly for debian packages, and now it is called for both. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2019-08-15 10:43:17 -04:00
Avi Kivity	ca28fdc37d	Revert "dist/docker/redhat: change user of scylla services to 'scylla'" This reverts commit `b1226fb15a`. When the data volume is mounted from the host (as is usual in container deployments), we can't expect that the files will be owned by the in-container scylla user. So that commit didn't really fix #4536. A follow-up patch will relax the check so it passes in a container environment.	2019-08-13 14:36:00 +03:00
Vlad Zolotarov	15eaf2fd8e	dist: scylla_util.py: get_mode_cpuset(): don't let false alarm error messages Don't let perftune.py print false alarm error message when we calculate a compute CPU set for tuning modes. This may happen when we calculate a CPU set for non-MQ tuning modes on small systems on which these modes are forbidden because they would result in a zero CPU set, e.g. sq_split on a system with a single physical core. We are going to utilize a newly introduced --get-cpu-mask-quiet execution mode introduced to the seastar/script/perftune.py by the "perftune.py: introduce --get-cpu-mask-quiet" series which would return a zero CPU set if that's what it turns out to be instead of exiting with an error what --get-cpu-mask would do in such a case. The rest of scylla_util.py logic is going to handle a zero CPU set returned by get_mode_cpuset() correctly. Fixes #4211 Fixes #4443 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <20190731212901.9510-1-vladz@scylladb.com>	2019-08-01 11:14:39 +03:00
Takuya ASADA	60ec8b2a04	install.sh: install everything when --pkg is not specified On previous commit `ac9b115a8f`, install.sh requires to specify single package using --pkg, there is no way to select all. It should be select all packages when running install.sh without --pkg. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190731013245.5857-1-syuu@scylladb.com>	2019-07-31 16:43:57 +03:00
Takuya ASADA	ac9b115a8f	dist/debian: use install.sh on Debian Currently, install.sh just used for building .rpm, we have similar build script under dist/debian, sometimes it become inconsistent with install.sh. Since most of package build process are same, we should share install.sh on both .rpm and .deb package build process. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190725123207.2326-1-syuu@scylladb.com>	2019-07-25 18:22:42 +03:00
Takuya ASADA	bd7d1b2d38	dist/common/systemd: change stop timeout sec to 900s Currently scylla-server.service uses DefaultTimeoutStopSec = 90, if Scylla does not able to clean-shutdown in 90sec we may have data corruption on the node. Since we already set TimeoutStartSec = 900, we can use TimeoutSec to set both TimeoutStartSec and TimeoutStopSec to 900. See #4700 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190717095416.10652-1-syuu@scylladb.com>	2019-07-17 15:37:47 +03:00
Takuya ASADA	02c6db29c8	dist/debian: manage .pyc as a part of package Since `828b63f4fb` only add .pyc on .rpm package, we also need it to .deb package. See #4612 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190629023739.8472-1-syuu@scylladb.com>	2019-06-30 15:54:42 +03:00
Amos Kong	f0cd589a75	dist: suppress the yaml load warning YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe. Please read https://msg.pyyaml.org/load for full details. Fix it by use new safe interface - yaml.safe_load() Signed-off-by: Amos Kong <amos@scylladb.com> Cc: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <9b68601845117274573474ede0341cc81f80efa6.1561156205.git.amos@scylladb.com>	2019-06-25 19:05:29 +03:00
Takuya ASADA	2135d2ae7f	dist/debian: install capabilities.conf on postinst script We still has "{{^jessie}}" tag on scylla-server systemd unit file to skip using AmbientCapabilities on Debian 8, but it does not able to work anymore since we moved to single binary .deb package for all debian variants, we must share same systemd unit file across all Debian variants. To do so we need to have separated file on /etc/systemd to define AmbientCapabilities, create the file while running postinst script only if distribution is not Debian 8, just like we do in .rpm. See #3344 See #3486 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190619064224.23035-1-syuu@scylladb.com>	2019-06-23 16:03:06 +03:00
Takuya ASADA	e4320d6537	dist/debian: run 'systemctl daemon-reload' automatically on package install/uninstall Since we cannot use dh --with=systemd because we don't want to automatically enabling systemd units, manage them by our setup scripts, we have to do 'systemctl daemon-reload' manually. (On dh --with=systemd, systemd helper automatically provides such scirpts) Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190618000210.28972-1-syuu@scylladb.com>	2019-06-23 16:03:06 +03:00
Takuya ASADA	7320c966bc	dist/common/scripts/scylla_setup: don't proceed with empty NIC name Currently NIC selection prompt on scylla_setup just proceed setup when user just pressed Enter key on the prompt. The prompt should ask NIC name again until user input correct NIC name. Fixes #4517 Message-Id: <20190617124925.11559-1-syuu@scylladb.com>	2019-06-17 15:52:29 +03:00
Takuya ASADA	f582a759ee	dist: merge /usr/lib/scylla to /opt/scylladb We used to use /opt/scylladb just for Scylla build toolchain and dependency libraries, not for Scylla main package. But since we merged relocatable package, Scylla main binary and dependency libraries are all located under /opt/scylladb, only setup scripts remained on /usr/lib/scylla. It strange to keep using both /usr/lib/<app name> and /opt/<app name>, we should merge them into single place. Message-Id: <20190614011038.17827-1-syuu@scylladb.com>	2019-06-14 21:03:36 +03:00
Glauber Costa	8a3fe3ac9b	debian: correctly relocate python scripts Relocation of python scripts mentions scylla-server in paths explicitly. It should use {{product}} instead. The current build is failing when {{product}} is different than scylla-server Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190613012518.28784-1-glauber@scylladb.com>	2019-06-13 09:39:36 +03:00
Takuya ASADA	b1226fb15a	dist/docker/redhat: change user of scylla services to 'scylla' On branch-3.1 / master, we are getting following error: ERROR 2019-06-11 10:58:49,156 [shard 0] database - /var/lib/scylla/data: File not owned by current euid: 0. Owner is: 999 ERROR 2019-06-11 10:58:49,156 [shard 0] init - Failed owner and mode verification: std::runtime_error (File not owned by current euid: 0. Owner is: 999) ERROR 2019-06-11 10:58:49,156 [shard 0] database - /var/lib/scylla/hints: File not owned by current euid: 0. Owner is: 999 ERROR 2019-06-11 10:58:49,156 [shard 0] init - Failed owner and mode verification: std::runtime_error (File not owned by current euid: 0. Owner is: 999) ERROR 2019-06-11 10:58:49,156 [shard 0] database - /var/lib/scylla/commitlog: File not owned by current euid: 0. Owner is: 999 ERROR 2019-06-11 10:58:49,156 [shard 0] init - Failed owner and mode verification: std::runtime_error (File not owned by current euid: 0. Owner is: 999) ERROR 2019-06-11 10:58:49,156 [shard 0] database - /var/lib/scylla/view_hints: File not owned by current euid: 0. Owner is: 999 ERROR 2019-06-11 10:58:49,156 [shard 0] init - Failed owner and mode verification: std::runtime_error (File not owned by current euid: 0. Owner is: 999) It seems like owner verification of data directory fails because scylla-server process is running in root but data directory owned by scylla, so we should run services as scylla user. Fixes #4536 Message-Id: <20190611113142.23599-1-syuu@scylladb.com>	2019-06-12 20:29:06 +03:00
Takuya ASADA	60d8a99f05	dist/common/scripts/scylla_setup: verify system umask is acceptable for scylla-server To avoid 'Bad permmisons' error when user changed default umask, we need to verify system umask is acceptable for scylla-server. Fixes #4157 Message-Id: <20190612130343.6043-1-syuu@scylladb.com>	2019-06-12 20:29:06 +03:00
Takuya ASADA	30414d9c23	dist/ami: install scylla debug symbols by default On AMI creation, install scylla-debuginfo by default. closes #4542 Message-Id: <20190612102355.21386-1-syuu@scylladb.com>	2019-06-12 13:49:46 +03:00
Benny Halevy	3ad005ba17	build-ami: fix branch detection failure when not in git tree Introduced in `513d01d53e` The script is trying to determine the branch to shallow clone when an rpm is missing and has to be built. This functionality in the current implementation assumes it is being run inside a git repository, but that must not be the case if the script is triggered after local rpms were placed on the local directory. This happens when putting all necessary rpm files in: dist/ami/files And then running: dist/ami/build_ami.sh --localrpm The dist/ami/ and dist/ami/files are the only ones required for this action so querying the git repository in that situation makes no sense. Fixes #4535 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190611112455.13862-1-bhalevy@scylladb.com>	2019-06-11 19:08:02 +03:00
Benny Halevy	2017de9387	build-ami: delete extra parenthesis in branch_arg calculation Fixing a typo Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190610062113.5604-1-bhalevy@scylladb.com>	2019-06-10 11:29:44 +03:00
Benny Halevy	c188f838bc	build-ami: use ssh git URLs Rather than https, for cert-based passwordless access. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190606133648.15877-2-bhalevy@scylladb.com>	2019-06-06 20:02:13 +03:00
Benny Halevy	513d01d53e	build-ami: use current git branch for shallow-clone of other repos We want to use the same branch on the other repos build-ami needs as the one we're building for. Automatically find the current branch using the `git branch` command. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190606133648.15877-1-bhalevy@scylladb.com>	2019-06-06 20:02:13 +03:00
Takuya ASADA	25112408a7	dist/debian: support relocatable python3 on Debian variants Unlike CentOS, Debian variants has python3 package on official repository, so we don't have to use relocatable python3 on these distributions. However, official python3 version is different on each distribution, we may have issue because of that. Also, our scripts and packaging implementation are becoming presuppose existence of relocatable python3, it is causing issue on Debian variants. Switching to relocatable python3 on Debian variants avoid these issues, it will easier to manage Scylla python3 environments accross multiple distributions. Fixes #4495 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190531112707.20082-1-syuu@scylladb.com>	2019-06-02 14:59:43 +03:00
Avi Kivity	7a0c6cd583	Revert "dist/debian: support relocatable python3 on Debian variants" This reverts commit `4d119cbd6d`. It breaks build_deb.sh: 18:39:56 + seastar/scripts/perftune.py seastar/scripts/seastar-addr2line seastar/scripts/perftune.py 18:39:56 Traceback (most recent call last): 18:39:56 File "./relocate_python_scripts.py", line 116, in <module> 18:39:56 fixup_scripts(archive, args.scripts) 18:39:56 File "./relocate_python_scripts.py", line 104, in fixup_scripts 18:39:56 fixup_script(output, script) 18:39:56 File "./relocate_python_scripts.py", line 79, in fixup_script 18:39:56 orig_stat = os.stat(script) 18:39:56 FileNotFoundError: [Errno 2] No such file or directory: '/data/jenkins/workspace/scylla-master/unified-deb/scylla/build/debian/scylla-package/+' 18:39:56 make[1]: *** [debian/rules:19: override_dh_auto_install] Error 1	2019-05-29 13:58:41 +03:00
Takuya ASADA	4d119cbd6d	dist/debian: support relocatable python3 on Debian variants Unlike CentOS, Debian variants has python3 package on official repository, so we don't have to use relocatable python3 on these distributions. However, official python3 version is different on each distribution, we may have issue because of that. Also, our scripts and packaging implementation are becoming presuppose existence of relocatable python3, it is causing issue on Debian variants. Switching to relocatable python3 on Debian variants avoid these issues, it will easier to manage Scylla python3 environments accross multiple distributions. Fixes #4495 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190526105138.677-1-syuu@scylladb.com>	2019-05-26 13:56:30 +03:00
Glauber Costa	71c4375a66	scylla_io_setup: adjust values for i3en instances Apparently we are having some issues running iotune in the i3en instances, as the values not always make sense. We believe it is something that XFS is doing, and running fio directly on the device (no filesystem) provides more meaningful results (more according to AWS published expected values). For now, let's use fio instead. In this patch I have ran fio for our 4 dimensions in each of the three types of disks (large, xlarge, 3xlarge). Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190524111454.27956-1-glauber@scylladb.com>	2019-05-24 19:37:58 +03:00
Takuya ASADA	a55330a10b	dist/ami: output scylla version information to AMI tags and description Users may want to know which version of packages are used for the AMI, it's good to have it on AMI tags and description. To do this, we need to download .rpm from specified .repo, extract version information from .rpm. Fixes #4499 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190520123924.14060-2-syuu@scylladb.com>	2019-05-20 15:46:06 +03:00
Takuya ASADA	abe44c28c5	dist/ami: build scylla-python3 when specified --localrpm Since we switched to relocatable python3, we need to build it for AMI too. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190520123924.14060-1-syuu@scylladb.com>	2019-05-20 15:46:05 +03:00
Takuya ASADA	f625284113	dist/debian: apply product name variable on override_dh_auto_install To make product name templatization works correctly, we cannot use "debian/scylla-server" as package contents directory path, need to use template like "debian/{{product}}-server" instead. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190517121946.18248-1-syuu@scylladb.com>	2019-05-19 10:46:08 +03:00
Glauber Costa	3b64727244	scylla_setup: be nicer about unrecognized OS Right now if the user tries to execute this in an unrecognized OS, the following will be thrown: Traceback (most recent call last): File "/usr/lib/scylla/libexec/scylla_setup", line 214, in <module> do_verify_package('scylla-enterprise-jmx') File "/usr/lib/scylla/libexec/scylla_setup", line 73, in do_verify_package if res != 0: UnboundLocalError: local variable 'res' referenced before assignment It would be a lot nicer to exit gracefully and print a messge saying what is going on. This was caught when running on OEL, which the previous patch fixed. Still, there are other unknown OS out there the users may try to run on. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2019-05-13 14:31:49 -04:00
Glauber Costa	6c15ae5b36	scylla_util: recognize OEL as part of the RHEL family Oracle Linux is a RHEL-like distribution and we support it just fine, but our new incarnation of scylla_setup is failing to recognize it. os-release for OEL is a bit different. It doesn't have an ID_LIKE string, and only shows an ID string, which is set to 'ol'. So let's recognize this. Fixes: #4493 Branches: 3.1 Signed-off-by: Glauber Costa <glauber@scylladb.com>	2019-05-13 14:31:38 -04:00
Takuya ASADA	19a973cd05	dist/ami: fix wrong path of SCYLLA-PRODUCT-FILE Since other build_*.sh are for running inside extracted relocatable package, they have SCYLLA-PRODUCT-FILE on top of the directory, but build_ami.sh is not running in such condition, we need to run SCYLLA-VERSION-GEN first, then refer to build/SCYLLA-PRODUCT-FILE. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190509110621.27468-1-syuu@scylladb.com>	2019-05-09 14:45:31 +03:00
Glauber Costa	a23531ebd5	Support AWS i3en instances AWS just released their new instances, the i3en instances. The instance is verified already to work well with scylla, the only adjustments that we need is advertise that we support it, and pre-fill the disk information according to the performance numbers obtained by running the instance. Fixes #4486 Branches: 3.1 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190508170831.6003-1-glauber@scylladb.com>	2019-05-08 20:09:44 +03:00
JP-Reddy	56420dc650	scylla_io_setup: TypeError in iotune_args array from scylla_io_setup script Whenever the iotune_args array uses "--smp", it needs cpudata.smp() which returns an integer instead of a string. So when iotune_args is passed to subprocess.check_call(), it actually throws "TypeError: expected str, bytes or os.PathLike object, not int" but "%s did not pass validation tests, it may not be on XFS..." is shown as the exception. Even though the user inputs correct arguments, it might still throw an error and confuse the user that he/she has not passed the right arguments. One simple fix is to use str(cpudata.smp()) instead of cpudata.smp(). Signed-off-by: JP-Reddy <guthijp.reddy@gmail.com> Message-Id: <20190406070118.48477-1-guthijp.reddy@gmail.com>	2019-05-07 20:13:54 +03:00
Glauber Costa	79a5351651	scylla-housekeeping: timeout eventually scylla-housekeeping always wants to run in the installation to check if we are running the latest version. This happens regardless of whether or not we said yes or no to the housekeeping scylla_setup question - as that question only deals with whether or not we want to do this through a timer. It is fine to try to run scylla-housekeeping, as long as we time it out. The current code doesn't. The naive solution is to add a timeout parameter to urllib.request.open. However, that timeout is not respected and in my tests I saw real timeouts up to four times higher the timeout we set. For a reasonable 5s timeout, this mean a 20s real timeout which can lead to a very bad user experience. This seems to be a known problem with this module according to a quick Google search. This patch then takes a slightly more complex solution and uses multiprocess to enforce a well-defined user-visible timeout. Fixes #3980 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190506122335.5707-1-glauber@scylladb.com>	2019-05-06 17:37:59 +03:00
Glauber Costa	47d04e49e8	scylla_setup: respect user's decision not to call housekeeping The setup script asks the user whether or not housekeeping should be called, and in the first time the script is executed this decision is respected. However if the script is invoked again, that decision is not respected. This is because the check has the form: if (housekeeping_cfg_file_exists) { version_check = ask_user(); } if (version_check) { do_version_check() } else { dont_do_it() } When it should have the form: if (housekeeping_cfg_file_exists) { version_check = ask_user(); if (version_check) { do_version_check() } else { dont_do_it() } } (Thanks python) This is problematic in systems that are not connected to the internet, since housekeeping will fail to run and crash the setup script. Fixes #4462 Branches: master, branch-3.1 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190502034211.18435-1-glauber@scylladb.com>	2019-05-02 18:46:41 +03:00
Glauber Costa	99c00547ad	make scylla_util OS detection robust against empty lines Newer versions of RHEL ship the os-release file with newlines in the end, which our script was not prepared to handle. As such, scylla_setup would fail. This patch makes our OS detection robust against that. Fixes #4473 Branches: master, branch-3.1 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190502152224.31307-1-glauber@scylladb.com>	2019-05-02 18:33:35 +03:00
Amnon Heiman	9ad63efcfe	Adding node_exporter to docker This patch add the node_exporter to the docker image. It install it create and run a service with it. After this patch node_exporter will run and will be part of scylla Docker image. Fixes #4300 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20190421130643.6837-1-amnon@scylladb.com>	2019-04-21 18:12:58 +03:00

1 2 3 4 5 ...

1082 Commits