scylladb

Author	SHA1	Message	Date
Takuya ASADA	eb30594a60	dist: detect corrupted NUMA topology information There are some environment which has corrupted NUMA topology information, such as some instance types on AWS EC2 with specific Linux kernel images. On such environment, we cannot get HW information correctly from hwloc, so we cannot proceed optimization on perftune. To avoid causing script error, check NUMA topology information and skip running perftune if the information corrupted. Related scylladb/seastar#2925 Closes scylladb/scylladb#26344	2025-10-22 01:11:14 +03:00
Takuya ASADA	781dec5852	dist/docker: run the container as non-root user Since it is requirement for Red Hat OpenShift Certification, we need to run the container as non-root user. Related scylladb/scylla-pkg#4858 Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2025-03-05 23:39:56 +09:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Takuya ASADA	6fe09a5a16	scylla_raid_setup: support installing semanage on Amazon Linux 2 Since Amazon Linux 2 has different package name for semange, we need to adjust package name. Fixes #21351	2024-11-11 17:27:24 +09:00
Takuya ASADA	7ad5e69c54	scylla_raid_setup: fix failure on SELinux package installation After merged `5a470b2`, we found that scylla_raid_setup fails on offline mode installation. This is because pkg_install() just print error and exit script on offline mode, instead of installing packages since offline mode not supposed able to connect internet. Seems like it occur because of missing "policycoreutils-python-utils" package, which is the package for "semange" command. So we need to implement the relabeling patch without using the command. Fixes #21441	2024-11-11 17:27:24 +09:00
Yaron Kaikov	8221a178d8	Revert "dist: support nonroot and offline mode for scylla-housekeeping" This reverts commit `c3bea539b6`. Since it breaking offline-installer artifact-tests. Also, it seems that we should have merged it in the first place since we don't need scylla-housekeeping checks for offline-installer Closes scylladb/scylladb#19976	2024-08-04 10:55:26 +03:00
Takuya ASADA	c3bea539b6	dist: support nonroot and offline mode for scylla-housekeeping Introduce support nonroot and offline mode for scylla-housekeeping. Closes #13084 Closes scylladb/scylladb#13088	2024-07-23 07:57:32 +03:00
Avi Kivity	dffd0901b3	dist: scylla_util: sysconfig_parser: replace deprecated ConfigParser.readfp ConfigParser.readfp was deprecated in Python 3.2 and removed in Python 3.12. Under Fedora 40, the container fails to launch because it cannot parse its configuration. Fix by using the newer read_file(). Closes scylladb/scylladb#19236	2024-06-12 10:07:10 +03:00
Takuya ASADA	7275b614aa	scylla_util.py: wait for apt operation on other processes apt_install() / apt_uninstall() may fail if background process running apt operation, such as unattended-upgrades. To avoid this, we need to add two things: 1. For apt-get install / remove, we need to option "DPkg::Lock::Timeout=-1" to wait for dpkg lock. 2. For apt-get update, there is no option to wait for cache lock. Therefore, we need to implement retry-loop to wait for apt-get update succeed. Fixes #16537 Closes scylladb/scylladb#16561	2023-12-28 19:00:36 +02:00
Takuya ASADA	a4aeef2eb0	scylla_util.py: run apt-get update before apt-get install if it necessary Unlike yum, "apt-get install" may fails because package cache is outdated. Let's check package cache mtime and run "apt-get update" if it's too old. Fixes #4059 Closes scylladb/scylladb#15960	2023-11-07 20:40:16 +02:00
Petr Gusev	0152c000bb	commitlog: use separate directory for schema commitlog The commitlog api originally implied that the commitlog_directory would contain files from a single commitlog instance. This is checked in segment_manager::list_descriptors, if it encounters a file with an unknown prefix, an exception occurs in commitlog::descriptor::descriptor, which is logged with the WARN level. A new schema commitlog was added recently, which shares the filesystem directory with the main commitlog. This causes warnings to be emitted on each boot. This patch solves the warnings problem by moving the schema commitlog to a separate directory. In addition, the user can employ the new schema_commitlog_directory parameter to move the schema commitlog to another disk drive. By default, the schema commitlog directory is nested in the commitlog_directory. This can help avoid problems during an upgrade if the commitlog_directory in the custom scylla.yaml is located on a separate disk partition. This is expected to be released in 5.3. As #13134 (raft tables->schema commitlog) is also scheduled for 5.3, and it already requires a clean rolling restart (no cl segments to replay), we don't need to specifically handle upgrade here. Fixes: #11867	2023-03-30 21:55:50 +04:00
Takuya ASADA	464b5de99b	scylla_setup: allow symlink to --disks option Currently, --disks options does not allow symlinks such as /dev/disk/by-uuid/* or /dev/disk/azure/*. To allow using them, is_unused_disk() should resolve symlink to realpath, before evaluating the disk path. Fixes #11634 Closes #11646	2022-10-28 07:24:11 +03:00
Takuya ASADA	cd6030d5df	scylla_util.py: adding unescape for sysconfig_parser Even we have __escape() for escaping " middle of the value to writing sysconfig file, we didn't unescape for reading from sysconfig file. So adding __unescape() and call it on get().	2022-10-27 16:39:47 +09:00
Takuya ASADA	de57433bcf	scylla_util.py: on sysconfig_parser, don't use double quote when it's possible It seems like distribution original sysconfig files does not use double quote to set the parameter when the value does not contain space. Adding function to detect spaces in the value, don't usedouble quote when it not detected. Fixes #9149	2022-10-27 16:36:27 +09:00
Takuya ASADA	7501465b7c	scylla_util.py: change debug log directory to /var/tmp/scylla Current debug log is bit difficult to collect in CI, to find the debug log we must know which script caused Exception. Because the filename does not include prefix, and also specified directory is shared with other programs. To make things more easily, let's change debug log directory to /var/tmp/scylla. Closes #10730	2022-07-05 14:49:00 +03:00
Israel Fruchter	d2ca2455db	scripts/scylla_util.py: introduce back user/group arguments for out() since #10467 remove the user/group parameters needed for the housekeeping call, need to introuce them back Fixes: #10804 Closes #10818	2022-06-16 13:50:17 +03:00
Takuya ASADA	5643c6de56	scylla_util.py: fix "systemctl is-active" causes error On `48b6aec16a` we mistakenly allowed check=True on systemd_unit.is_active(), it should be check=False. We check unit's status by "systemctl is-active" output string, it returns "active" or "inactive". But systemctl command returns non-zero status when it returning "inactive", so we are getting Exception here. To fix this, we need new option "ignore_error=True" for out(), and use it in systemd_unit.is_active(). Fixes #10455 Closes #10467	2022-06-13 13:45:50 +03:00
Takuya ASADA	b6003989f9	scylla_setup: stop using sudo -u, use user/group parameter on subprocess module To run scylla-housekeeping we currently use "sudo -u scylla <cmd>" to switch scylla user, but it fails on some environment. Since recent version of Python 3 supports to switch user on subprocess module, let's use python native way and drop sudo. Fixes #10483 Closes #10538	2022-05-19 17:21:35 +03:00
Takuya ASADA	883b97d8b2	dist/common/scripts: generate debug log when exception occurred Using traceback_with_variables module, generate more detail traceback with variables into debug log. This will help fixing bugs which is hard to reproduce. Closes #10472 [avi: regenerate frozen toolchain]	2022-05-17 13:18:27 +03:00
Takuya ASADA	48b6aec16a	scripts: use "out()" function for all capture_output subprocesses On `acaf0bb` we applied out() just for perftune.py because we had issue #10390 with this script. But the issue can happen with other commands too, let's apply it to all commands which uses capture_output. related #10390 Closes #10414	2022-04-26 13:56:52 +03:00
Takuya ASADA	acaf0bb88a	scripts: print perftune.py error message when capture_output=True We currently does not able to get any error message from subprocess when we specified capture_output=True on subprocess.run(). This is because CalledProcessError does not print stdout/stderr when it raised, and we don't catch the exception, we just let python to cause Traceback. Result of that, we only able to know exit status and failed command but not able to get stdout/stderr. This is problematic especially working on perftune.py bug, since the script should caused Traceback but we never able to see it. To resolve this, add wrapper function "out()" for capture output, and print stdout/stderr with error message inside the function. Fixes #10390 Closes #10391	2022-04-18 14:06:51 +03:00
Takuya ASADA	c2ccdac297	move cloud related code from scylla repository to scylla-machine-image Currently, cloud related code have cross-dependencies between scylla and scylla-machine-image. It is not good way to implement, and single change can break both package. To resolve the issue, we need to move all cloud related code to scylla-machine-image, and remove them from scylla repository. Change list: - move cloud part of scylla_util.py to scylla-machine-image - move cloud part of scylla_io_setup to scylla-machine-image - move scylla_ec2_check to scylla-machine-image - move cloud part of scylla_bootparam_setup to scylla-machine-image Closes #9957	2022-02-01 11:26:59 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Valerii Ponomarov	12fa68fe67	scylla_util: return boolean calling systemd_unit.available As of now, 'systemd_unit.available' works ok only when provided unit is present. It raises Exception instead of returning boolean when provided systemd unit is absent. So, make it return boolean in both cases. Fixes https://github.com/scylladb/scylla/issues/9848 Closes #9849	2021-12-28 15:14:04 +02:00
Takuya ASADA	097a6ee245	dist: add support im4gn/is4gen instance on AWS Add support next-generation, storage-optimized ARM64 instance types. Fixes #9711 Closes #9730	2021-12-05 13:20:01 +02:00
Takuya ASADA	d646673705	Revert "scylla_util.py: return bool value on systemd_unit.is_active()" This reverts commit `2545d7fd43`. Fixes #9627 Fixes scylladb/scylla-machine-image#241	2021-11-15 19:50:31 +09:00
Takuya ASADA	9b4cf8c532	scylla_util.py: On is_gce(), return False when it's on GKE GKE metadata server does not provide same metadata as GCE, we should not return True on is_gce(). So try to fetch machine-type from metadata server, return False if it 404 not found. Fixes #9471 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Closes #9582	2021-11-04 12:49:06 +02:00
Avi Kivity	075ceb8918	Merge 'AWS: add scylla_io_setup preset parameters for ARM instances' from Takuya ASADA Currently, scylla-server fails to start on ARM instances because scylla_io_setup does not have preset parameters even instance type added to 'supported instance'. To fix this, we need to add io parameter preset on scylla_io_setup. Also, we mistakenly added EBS only instances at `a004b1da30`, need to remove them. Instrances does not have ephemeral disk should be 'unsupported instance', we still run our AMI on it, but we print warning message on login prompt, and user requires to run scylla_io_setup. Fixes #9493 Closes #9532 * github.com:scylladb/scylla: scylla_util.py: remove EBS only ARM instances from support instance list scylla_io_setup: support ARM instances on AWS	2021-11-03 10:19:59 +02:00
Takuya ASADA	4a96a8145e	scylla_util.py: remove EBS only ARM instances from support instance list Since we required ephemeral disks for our AMI, these EBS only ARM instances cannot add in it is 'supported instance' list. We still able to run our AMI on these instance types but login message warns it is 'unsupported instance type', and requires to run scylla_io_setup manually.	2021-11-03 10:26:42 +09:00
Takuya ASADA	13ffe3c094	scylla_util.py: detect ephemeral/EBS disks correctly on Nitro System Currently, aws_instance.ephemeral_disks() returns both ephemeral disks and EBS disks on Nitro System. This is because both are attached as NVMe disks, we need to add disk type detection code on NVMe handle logic. Fixes #9440 Closes #9462	2021-10-28 08:58:25 +03:00
Takuya ASADA	9c830297ac	scylla_util.py: add persistent disk support for GCE Just like EBS disks for EC2, we want to use persistent disk on GCE. We won't recommend to use it, but still need to support it. Related scylladb/scylla-machine-image#215 Closes #9395	2021-10-03 17:58:18 +03:00
Takuya ASADA	d87b80ad14	scylla_util.py: add persistent disk support for Azure Just like EBS disks for EC2, we want to use persistent disk on Azure. We won't recommend to use it, but still need to support it. Related https://github.com/scylladb/scylla-machine-image/issues/218 Closes #9417	2021-10-03 17:56:31 +03:00
Pekka Enberg	ef5b2934e8	scylla_util: Use AzureSnitch on Azure Fixes #8593	2021-07-28 14:07:42 +03:00
Yaron Kaikov	a004b1da30	scylla_util:add AWS arm based instance to supported list Today we have a Scylla AMI image based on x86 archituctre only. Following the work we did in https://github.com/scylladb/scylla-machine-image/pull/153 we can build ARM based AMI image Let's add ARM based instance to supported list Closes #9064	2021-07-22 15:48:29 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Yaron Kaikov	6a447db8a8	scylla_util.py: Fix Azure support for machine-image In https://github.com/scylladb/scylla/pull/7807 we added support for Azure instance in Scylla. The following changes are required in order machine-image to work: 1) fix wrong metadata URL and updating metadata path values (was intreduce in `f627fcbb0c`) 2) fix function naming which been used my machine image 3) add missing function which are reuqired by mahcine-image 4) cleanup unused functions Closes #8596	2021-06-06 09:21:23 +03:00
Lubos Kosco	777771df34	scylla_util.py: Relax GCE setup NVMe device checks We don't want to fail I/O setup if there are more than one NVMe devices mounted as root nor if there are no NVMe devices. Fixes #8032 Closes #8444	2021-06-06 09:21:23 +03:00
Yaron Kaikov	dd453ffe6a	install.sh: Setup aio-max-nr upon installation This is a follow up change to #8512. Let's add aio conf file during scylla installation process and make sure we also remove this file when uninstall Scylla As per Avi Kivity's suggestion, let's set aio value as static configuration, and make it large enough to work with 500 cpus. Closes #8650	2021-05-24 14:24:20 +03:00
Yaron Kaikov	588a065304	scylla_io_setup: configure "aio-max-nr" before iotune On severl instance types in AWS and Azure, we get the following failure during scylla_io_setup process: ``` ERROR 2021-04-14 07:50:35,666 [shard 5] seastar - Could not setup Async I/O: Resource temporarily unavailable. The most common cause is not enough request capacity in /proc/sys/fs/aio-max-nr. Try increasing that number or reducing the amount of logical CPUs available for your application ``` We have scylla_prepare:configure_io_slots() running before the scylla-server.service start, but the scylla_io_setup is taking place before 1) Let's move configure_io_slots() to scylla_util.py since both scylla_io_setup and scylla_prepare are import functions from it 2) cleanup scylla_prepare since we don't need the same function twice 3) Let's use configure_io_slots() during scylla_io_setup to avoid such failure Fixes: #8587 Closes #8512	2021-05-11 18:39:10 +03:00
Lubos Kosco	f627fcbb0c	scylla_util.py: add new class for Azure cloud support	2021-05-04 13:12:42 +02:00
Peter Veentjer	c255903fb0	dist: Added r5b to ena instance_class. The r5b instances also have ena support. For a confirmation that all r5b instances have ena, go to the following page: https://instances.vantage.sh/ Select the r5b and add the 'enhanced networking' column. Then it will show that for every r5b type there is ena support Closes #8546	2021-04-27 15:39:24 +03:00
Takuya ASADA	2545d7fd43	scylla_util.py: return bool value on systemd_unit.is_active() Currently, 'if unit.is_active():' is always True since is_active() returns result in string (active, inactive, unknown). To avoid such scripting bug, change return value in bool.	2021-04-08 21:54:05 +09:00
Takuya ASADA	0b2c1edddc	scylla_ntp_setup: support systemd-timesyncd On Ubuntu/Debian systemd-timesyncd is default NTP client, and installed by default. So use it instead of installing chrony. Fixes #8339 Closes #8344	2021-04-06 15:28:34 +03:00
Takuya ASADA	3af31eebeb	scylla_setup: stop hardcode product name on scylla_setup Stop hardcode product name on scylla_setup, dynamically generate scylla_product.py in install.sh. Fixes #8367 Closes #8384	2021-04-01 15:07:58 +03:00
Takuya ASADA	6f678ab7ff	aws: initialize self._disks['ebs'] when no EBS disks Seems like aws_instance.ebs_disks() causes traceback when no EBS disks available, need to initialize with empty list. Fixes #8365 Closes #8366	2021-03-29 17:21:14 +03:00
Takuya ASADA	e3b5ffcf14	dist: install optional packages for SLES Support SUSE original package manager 'zypper' for pkg_install() function.	2021-03-15 19:17:48 +09:00
Takuya ASADA	32d4ec6b8a	scylla_util.py: resolve /dev/root to get actual device on aws When psutil.disk_paritions() reports / is /dev/root, aws_instance mistakenly reports root partition is part of ephemeral disks, and RAID construction will fail. This prevents the error and reports correct free disks. Fixes #8055 Closes #8040	2021-02-18 20:25:45 +02:00
Benny Halevy	55e3df8a72	dist: scylla_util: prevent IndexError when no ephemeral_disks were found Currently we call firstNvmeSize before checking that we have enough (at least 1) ephemeral disks. When none are found, we hit the following error (see #7971): ``` File "/opt/scylladb/scripts/libexec/scylla_io_setup", line 239, in if idata.is_recommended_instance(): File "/opt/scylladb/scripts/scylla_util.py", line 311, in is_recommended_instance diskSize = self.firstNvmeSize File "/opt/scylladb/scripts/scylla_util.py", line 291, in firstNvmeSize firstDisk = ephemeral_disks[0] IndexError: list index out of range ``` This change reverses the order and first checks that we found enough disks before getting the fist disk size. Fixes #7971 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #8027	2021-02-03 11:30:18 +02:00
Takuya ASADA	2a4d293841	dist: add package name translation Translate package name from CentOS package to different distribution package name, to use single package name for pkg_install().	2021-01-13 21:27:14 +09:00
Takuya ASADA	0a9843842d	dist: support SLES/OpenSUSE Add support SLES/OpenSUSE on setup script.	2021-01-13 19:32:46 +09:00

1 2 3 4

157 Commits