mirror of
https://github.com/SCST-project/scst.git
synced 2026-05-21 12:41:26 +00:00
svn+ssh://yanb123@svn.code.sf.net/p/scst/svn/trunk
........
r5536 | vlnb | 2014-05-22 06:06:46 +0300 (Thu, 22 May 2014) | 3 lines
Version changed to 3.1.0-pre1
........
r5537 | vlnb | 2014-05-22 06:18:27 +0300 (Thu, 22 May 2014) | 3 lines
Web updates
........
r5538 | bvassche | 2014-05-22 10:16:04 +0300 (Thu, 22 May 2014) | 1 line
nightly build: Update kernel versions
........
r5539 | vlnb | 2014-05-23 05:20:35 +0300 (Fri, 23 May 2014) | 9 lines
vdisk_nullio: Add "read_zero" attribute
Add an attribute called "read_zero" to vdisk_nullio devices that
controls whether or not READs from a vdisk_nullio device return
zeroed data buffers.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5543 | bvassche | 2014-05-23 10:33:53 +0300 (Fri, 23 May 2014) | 1 line
RHEL 7 build fixes
........
r5545 | bvassche | 2014-05-23 11:36:36 +0300 (Fri, 23 May 2014) | 1 line
scripts/rebuild-rhel-kernel-rpm: Add RHEL 7 RC support
........
r5547 | vlnb | 2014-05-24 06:10:34 +0300 (Sat, 24 May 2014) | 3 lines
Optimize read_zero functionality
........
r5555 | bvassche | 2014-05-27 14:59:11 +0300 (Tue, 27 May 2014) | 5 lines
qla2x00t: Documentation / source code comment / log messages spelling fix
Change a few occurrences of "conformation" into "confirmation". See also the
QLogic 2500 Series Firmware Interface Specification.
........
r5557 | vlnb | 2014-05-30 03:42:34 +0300 (Fri, 30 May 2014) | 5 lines
Small code reorganization.
No functionality changed
........
r5558 | vlnb | 2014-05-30 06:00:07 +0300 (Fri, 30 May 2014) | 3 lines
Logging fixes
........
r5560 | bvassche | 2014-06-02 18:31:50 +0300 (Mon, 02 Jun 2014) | 1 line
Makefile: Only report which RPMs have been built if "make rpm" is run as a non-privileged user
........
r5561 | bvassche | 2014-06-03 09:04:47 +0300 (Tue, 03 Jun 2014) | 1 line
nightly build: Update kernel versions
........
r5562 | vlnb | 2014-06-04 04:54:21 +0300 (Wed, 04 Jun 2014) | 3 lines
Decrease max WRITE SAME length for better latencies
........
r5563 | vlnb | 2014-06-04 05:16:51 +0300 (Wed, 04 Jun 2014) | 3 lines
Enforce limit on max unmap LBAs
........
r5566 | bvassche | 2014-06-04 18:14:22 +0300 (Wed, 04 Jun 2014) | 1 line
ib_srpt: Fix an error message
........
r5567 | bvassche | 2014-06-04 18:17:59 +0300 (Wed, 04 Jun 2014) | 1 line
ib_srpt: Avoid triggering a SCSI command timeout after login
........
r5568 | bvassche | 2014-06-05 09:34:19 +0300 (Thu, 05 Jun 2014) | 1 line
scst_vdisk: Build fix for kernel versions <= 2.6.32
........
r5569 | bvassche | 2014-06-05 09:46:57 +0300 (Thu, 05 Jun 2014) | 1 line
scst_vdisk: Fix a kernel version < 2.6.38 compiler warning
........
r5570 | vlnb | 2014-06-06 06:20:26 +0300 (Fri, 06 Jun 2014) | 8 lines
scst_lib: Fix a compiler warning triggered by the WRITE SAME implementation
Avoid for release builds that the compiler reports that the variable
'ws_sg_cnt' is not used.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5571 | vlnb | 2014-06-06 06:22:14 +0300 (Fri, 06 Jun 2014) | 7 lines
nullio_exec_read(): Fix kunmap() argument
The argument of kunmap() is of type struct page *. Detected by smatch.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5572 | vlnb | 2014-06-06 06:24:03 +0300 (Fri, 06 Jun 2014) | 11 lines
scst: Leave out FSF mail address
This avoids that the following checkpatch complaint is triggered:
Do not include the paragraph about writing to the Free Software Foundation's
mailing address from the sample GPL notice. The FSF has changed addresses in
the past, and may do so again. Linux already includes a copy of the GPL.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5573 | vlnb | 2014-06-06 06:26:55 +0300 (Fri, 06 Jun 2014) | 10 lines
scst: Make lockdep_assert_held() easier to use
The lockdep_assert_held() macro is a convenient debugging tool.
However, it is inconvenient to surround each invocation of that
macro by an #ifdef/#endif pair. Hence make it easier to use this
macro with older kernel versions.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5574 | vlnb | 2014-06-07 00:59:24 +0300 (Sat, 07 Jun 2014) | 3 lines
Use limits.discard_zeroes_data to set LBPRZ
........
r5575 | bvassche | 2014-06-07 13:46:49 +0300 (Sat, 07 Jun 2014) | 1 line
nightly build: Update kernel versions
........
r5577 | bvassche | 2014-06-10 17:16:14 +0300 (Tue, 10 Jun 2014) | 1 line
ib_srpt: Make the test for IB_EVENT_GID_CHANGE support more robust
........
r5578 | bvassche | 2014-06-10 17:49:59 +0300 (Tue, 10 Jun 2014) | 1 line
ib_srpt: Make IB_EVENT_GID_CHANGE test independent of the OFED detection code
........
r5579 | bvassche | 2014-06-11 13:02:15 +0300 (Wed, 11 Jun 2014) | 1 line
ib_srpt: RHEL 5 build fix
........
r5581 | bvassche | 2014-06-11 18:27:06 +0300 (Wed, 11 Jun 2014) | 1 line
regression tests: Sync with a recent sysfs change
........
r5582 | bvassche | 2014-06-11 18:27:48 +0300 (Wed, 11 Jun 2014) | 1 line
regression tests: Sort hash keys before comparing
........
r5583 | bvassche | 2014-06-11 18:41:01 +0300 (Wed, 11 Jun 2014) | 1 line
nightly build: Update kernel versions
........
r5584 | vlnb | 2014-06-11 22:33:18 +0300 (Wed, 11 Jun 2014) | 8 lines
scst: RHEL 5 build fix
Avoid that building the scst kernel module fails on RHEL 5 due to
a missing kvasprintf() implementation.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5585 | vlnb | 2014-06-11 22:38:10 +0300 (Wed, 11 Jun 2014) | 11 lines
scst: Remove unused variables
Avoid that building scst with W=1 triggers compiler warnings about
variables that are set but not used. See also the documentation of
the gcc compiler flag -Wunused-but-set-variable.
This patch does not change any functionality.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5586 | vlnb | 2014-06-11 22:39:51 +0300 (Wed, 11 Jun 2014) | 9 lines
scst_lib: Introduce additional temporary variables
Make the code slightly easier to read by introducing temporary
variables for the expressions 'tgt_dev->sess' and 'sess->tgt->tgtt'.
This patch does not change any functionality.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5587 | vlnb | 2014-06-11 23:57:03 +0300 (Wed, 11 Jun 2014) | 10 lines
scst: Add support for 64-bit LUNs
The datatype of scsi_device.lun will be changed from u32 into u64
in the near future. Update SCST accordingly. These changes have
been implemented such that these are compatible with 32-bit and
64-bit LUNs.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5588 | vlnb | 2014-06-12 00:00:16 +0300 (Thu, 12 Jun 2014) | 9 lines
scst_local: Support LUN numbers >= 16384
Add support for 32-bit LUN numbers. As soon as the patches that add
64-bit LUN support are upstream this patch will also make 64-bit
LUN support available in scst_local.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5589 | vlnb | 2014-06-12 00:42:08 +0300 (Thu, 12 Jun 2014) | 8 lines
scst: Clean up __scst_resume_activity()
Move all management commands from scst_delayed_mgmt_cmd_list to the
active command list during resume instead of only the first one.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5590 | vlnb | 2014-06-12 01:07:00 +0300 (Thu, 12 Jun 2014) | 9 lines
scst: Introduce scst_lookup_tgt_dev()
This patch does not change any functionality.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
with some improvements
........
r5592 | bvassche | 2014-06-12 11:38:45 +0300 (Thu, 12 Jun 2014) | 7 lines
scst.h: Move definition of swap()
Make sure that the definition of swap() is guarded by
"#if LINUX_VERSION_CODE < KERNEL_VERSION(2, 6, 29)" only instead
of "#if LINUX_VERSION_CODE < KERNEL_VERSION(2, 6, 28)" and
"#if LINUX_VERSION_CODE < KERNEL_VERSION(2, 6, 29)".
........
r5593 | bvassche | 2014-06-12 12:15:50 +0300 (Thu, 12 Jun 2014) | 1 line
nightly build: Update kernel versions
........
r5594 | bvassche | 2014-06-12 14:33:00 +0300 (Thu, 12 Jun 2014) | 1 line
ib_srpt: Set MOFED include path correctly if MOFED has been installed with --add-kernel-support
........
r5595 | bvassche | 2014-06-12 16:38:38 +0300 (Thu, 12 Jun 2014) | 1 line
ib_srpt: Make non-OFED build work again
........
r5596 | vlnb | 2014-06-13 07:52:18 +0300 (Fri, 13 Jun 2014) | 16 lines
scst: Switch from the cpu_*() to the cpumask_*() API
The cpus_*() functions were deprecated via patch "cpumask:
introduce new API, without changing anything" (November 2008,
commit ID 2d3854a37e8b). Hence switch from the cpus_*() API to
the cpumask_*() API.
This patch has the intended side effect of not adding the "[key]"
property to cpumask sysfs attributes that contain the default
cpumask. The current code namely reads uninitialized bits on
systems where nr_cpu_ids < NR_CPUS because cpus_equal() compares
more bits than those that were set by cpumask_copy().
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5597 | vlnb | 2014-06-13 08:03:17 +0300 (Fri, 13 Jun 2014) | 3 lines
Forgotten versions updated
........
r5598 | bvassche | 2014-06-13 09:55:23 +0300 (Fri, 13 Jun 2014) | 1 line
ib_srpt: Make one_target_per_port the default mode
........
r5600 | vlnb | 2014-06-14 01:24:06 +0300 (Sat, 14 Jun 2014) | 9 lines
scst: Avoid that W=1 triggers complaints about unused variables
Avoid that building scst with W=1 triggers compiler warnings about
variables that are set but not used. See also the documentation of
the gcc compiler flag -Wunused-but-set-variable.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5601 | vlnb | 2014-06-14 01:31:42 +0300 (Sat, 14 Jun 2014) | 8 lines
scst_local: Add close_session() callback function
This is useful for triggering the session reassignment code via
the scst_local driver.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5602 | vlnb | 2014-06-14 02:57:26 +0300 (Sat, 14 Jun 2014) | 8 lines
scst_pr_read_reservation(): Initialize returned buffer
Avoid that this function returns an uninitialized buffer to the
initiator if buffer_size < 8. Detected by Coverity.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5603 | vlnb | 2014-06-14 02:58:28 +0300 (Sat, 14 Jun 2014) | 5 lines
scst: Help Coverity recognize that vmalloc(0) returns NULL
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5605 | bvassche | 2014-06-14 20:10:58 +0300 (Sat, 14 Jun 2014) | 1 line
fcst: Remove an unused variable
........
r5606 | bvassche | 2014-06-14 20:17:56 +0300 (Sat, 14 Jun 2014) | 5 lines
fcst: Move exch_done() calls into ft_cmd_done()
This patch ensures that exch_done() gets called if an fcst
callback returns SCST_TGT_RES_FATAL_ERROR.
........
r5607 | bvassche | 2014-06-14 20:18:34 +0300 (Sat, 14 Jun 2014) | 10 lines
fcst: Handle frame send failures properly
Retry sending XFER_RDY, data and response frames if the network
driver reports that sending failed (-ENOMEM) instead of reporting
a kernel warning (WARN_ON(1)). If sending XFER_RDY or data frames
failed for another reason, report this to the initiator as a
write error (ASC = 03; ASCQ = 00 which stands for PERIPHERAL
DEVICE WRITE FAULT). If sending a response frame failed with
another error code than -ENOMEM, do not send a response.
........
r5608 | vlnb | 2014-06-17 03:50:46 +0300 (Tue, 17 Jun 2014) | 10 lines
scst: Make access control group removal behavior configurable
SCST rejects removal of an access control group with one or more
sessions with error code -EBUSY. Make it easy to change this
behavior into forcibly closing sessions when an access control
group is removed.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5609 | bvassche | 2014-06-17 09:37:08 +0300 (Tue, 17 Jun 2014) | 1 line
nightly build: Update kernel versions
........
r5610 | vlnb | 2014-06-19 06:51:48 +0300 (Thu, 19 Jun 2014) | 3 lines
Update for 3.15 kernels
........
r5611 | bvassche | 2014-06-19 10:09:53 +0300 (Thu, 19 Jun 2014) | 1 line
nightly build: Add kernel 3.15 build infrastructure
........
r5612 | bvassche | 2014-06-19 15:48:25 +0300 (Thu, 19 Jun 2014) | 1 line
kernel module installation: Skip "depmod" when building an RPM
........
r5613 | vlnb | 2014-06-20 07:00:41 +0300 (Fri, 20 Jun 2014) | 8 lines
scst: Convert a loop to keep smatch happy
Avoid that smatch reports the following warning:
scst_init_session() info: loop could be replaced with if statement.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5614 | vlnb | 2014-06-20 07:02:00 +0300 (Fri, 20 Jun 2014) | 13 lines
iscsi-scst: Suppress a compiler warning
Avoid that the following compiler warning is reported when compiling
iscsi-scst:
chap.c: In function 'chap_rand':
chap.c:348:5: warning: ignoring return value of 'read', declared with attribute warn_unused_result [-Wunused-result]
(void)read(fd, &r, sizeof(r));
^
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5615 | vlnb | 2014-06-20 07:03:40 +0300 (Fri, 20 Jun 2014) | 5 lines
scst, iscsi-scst: Fix RHEL 5 compilation warnings
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5616 | vlnb | 2014-06-20 07:05:11 +0300 (Fri, 20 Jun 2014) | 10 lines
scst: Exclude certain locking code from static analysis
Loops with locking statements and also lock and unlock
statements guarded by an if-statement trigger false positive
warnings when analyzing the SCST code with smatch and/or sparse.
Hence exclude such code from static analysis.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5617 | vlnb | 2014-06-20 07:09:11 +0300 (Fri, 20 Jun 2014) | 10 lines
scst: Avoid that sparse complains about unreachable code
Remove the code after BUG() statements to avoid that smatch
complains about unreachable code. Hide the spin_unlock() statements
before BUG() statements for static analysis tools to avoid that
sparse complains about locking imbalances.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5618 | vlnb | 2014-06-20 07:10:40 +0300 (Fri, 20 Jun 2014) | 12 lines
Change BUG_ON(1) into BUG()
With CONFIG_BUG=y both BUG() and BUG_ON(1) halt the system. However,
with CONFIG_BUG=n BUG() halts the system but BUG_ON(1) not. To avoid
such subtleties, change BUG_ON(1) into BUG().
See also patch Josh Triplett, "bug: Make BUG() always stop the machine",
7 April 2014 (commit ID a4b5d580e07875f9be29f62a57c67fbbdbb40ba2).
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5619 | bvassche | 2014-06-20 08:56:36 +0300 (Fri, 20 Jun 2014) | 1 line
nightly build: Add kernel version 3.15.1
........
r5620 | vlnb | 2014-06-24 07:45:08 +0300 (Tue, 24 Jun 2014) | 9 lines
scst_vdisk: Split vdisk_exec_inquiry()
Make vdisk_exec_inquiry() easier to read by moving the code
for the implementation of each VPD page into a separate function.
This patch does not change any functionality.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5621 | bvassche | 2014-06-24 16:32:18 +0300 (Tue, 24 Jun 2014) | 1 line
ib_srpt: Complain if another ib_srpt.ko kernel module already exists
........
r5622 | bvassche | 2014-06-24 16:33:23 +0300 (Tue, 24 Jun 2014) | 2 lines
ib_srpt: Set SCSI residual fields in SRP_CMD reply
........
r5624 | bvassche | 2014-06-25 14:50:40 +0300 (Wed, 25 Jun 2014) | 1 line
nightly build: Use http instead of ftp for downloading kernel source code
........
r5625 | vlnb | 2014-06-26 00:38:19 +0300 (Thu, 26 Jun 2014) | 5 lines
scst_debug.h: Make EXTRACHECKS_*_ON() statements visible to Coverity
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5626 | vlnb | 2014-06-27 02:26:25 +0300 (Fri, 27 Jun 2014) | 10 lines
scst_vdisk: Three more put_unaligned_*() conversions
Convert three more *(__be16 *)p = cpu_to_be16(v) statements into
put_unaligned_be16(v, p) since the latter is easier to read. Also
convert one "cmd->dev" into "dev" expression. This patch does not
change any functionality.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
........
r5627 | bvassche | 2014-06-27 13:32:02 +0300 (Fri, 27 Jun 2014) | 1 line
nightly build: Update kernel versions
........
r5628 | bvassche | 2014-06-28 22:56:36 +0300 (Sat, 28 Jun 2014) | 1 line
ib_srpt: Remove existing ib_srpt.ko kernel modules before installation
........
r5629 | bvassche | 2014-06-28 22:58:44 +0300 (Sat, 28 Jun 2014) | 6 lines
scst_vdisk: Fix 32-bit build
Avoid 64-bit modulo computations since these result in undefined symbol
errors on 32-bit systems (__moddi3 / __umoddi3). Support sizes >= 2**32
bytes on 32-bit systems.
........
r5630 | bvassche | 2014-06-28 23:00:22 +0300 (Sat, 28 Jun 2014) | 1 line
scst.spec.in: Follow-up for r5628
........
r5631 | bvassche | 2014-06-28 23:15:45 +0300 (Sat, 28 Jun 2014) | 1 line
scst_local: Avoid that session deletion triggers a kernel warning
........
r5647 | bvassche | 2014-06-30 10:18:53 +0300 (Mon, 30 Jun 2014) | 1 line
scst: Build fix for Linux kernel versions 2.6.33 and 2.6.34
........
r5648 | bvassche | 2014-06-30 10:28:18 +0300 (Mon, 30 Jun 2014) | 1 line
scst: Build fix for kernel versions <= 2.6.31
........
r5649 | bvassche | 2014-06-30 11:40:23 +0300 (Mon, 30 Jun 2014) | 6 lines
scst_vdisk: Fix a checkpatch warning
Address the following checkpatch warning:
char * array declaration might be better as static const
........
r5650 | bvassche | 2014-06-30 11:52:06 +0300 (Mon, 30 Jun 2014) | 1 line
nightly build: Correct a kernel version
........
r5651 | bvassche | 2014-06-30 12:18:41 +0300 (Mon, 30 Jun 2014) | 1 line
nightly build: Correct a kernel version
........
r5654 | bvassche | 2014-07-01 09:38:13 +0300 (Tue, 01 Jul 2014) | 6 lines
scst_vdisk: Fix a checkpatch warning
Avoid that checkpatch reports the following warning:
WARNING: static const char * array should probably be static const char * const
........
r5657 | bvassche | 2014-07-01 19:46:12 +0300 (Tue, 01 Jul 2014) | 1 line
nightly build: Update kernel versions
........
r5658 | bvassche | 2014-07-03 11:36:48 +0300 (Thu, 03 Jul 2014) | 1 line
scripts/kernel-functions: Handle 3.x.0 kernel versions correctly
........
r5659 | bvassche | 2014-07-03 11:42:08 +0300 (Thu, 03 Jul 2014) | 1 line
scripts/generate-patched-kernel: Clean up
........
r5661 | bvassche | 2014-07-04 08:39:28 +0300 (Fri, 04 Jul 2014) | 1 line
Make scripts/kernel-functions again compatible with 2.6.x kernels
........
r5662 | bvassche | 2014-07-06 11:02:28 +0300 (Sun, 06 Jul 2014) | 1 line
scripts/run-regression-tests: Add command-line option -4 (disable IPv6)
........
git-svn-id: http://svn.code.sf.net/p/scst/svn/branches/iser@5666 d57e44dd-8a1f-0410-8b47-8ef2f437770f
454 lines
19 KiB
Plaintext
454 lines
19 KiB
Plaintext
SCSI RDMA Protocol (SRP) Target driver for Linux
|
|
=================================================
|
|
|
|
The SRP target driver has been designed to work on top of the Linux RDMA
|
|
kernel drivers -- either the RDMA drivers included with a Linux distribution
|
|
or the OFED RDMA drivers. For more information about using the SRP target
|
|
driver in combination with OFED, see also README.ofed.
|
|
|
|
The SRP target driver has been implemented as an SCST driver. This
|
|
makes it possible to support a lot of I/O modes on real and virtual
|
|
devices. A few examples of supported device handlers are:
|
|
|
|
1. scst_disk. This device handler implements transparent pass-through
|
|
of SCSI commands and allows SRP to access and to export real
|
|
SCSI devices, i.e. disks, hardware RAID volumes, tape libraries
|
|
as SRP LUNs.
|
|
|
|
2. scst_vdisk, either in fileio or in blockio mode. This device handler
|
|
allows to export software RAID volumes, LVM volumes, IDE disks, and
|
|
normal files as SRP LUNs.
|
|
|
|
3. nullio. The nullio device handler allows to measure the performance
|
|
of the SRP target implementation without performing any actual I/O.
|
|
|
|
|
|
Installation
|
|
------------
|
|
|
|
Building and installing the SRP target driver is possible as follows:
|
|
|
|
cd ${SCST_DIR}
|
|
if type -p rpm >/dev/null; then
|
|
make -s rpm
|
|
sudo rpm -U rpmbuilddir/RPMS/*/*rpm scstadmin/rpmbuilddir/RPMS/*/*rpm
|
|
else
|
|
make -s scst_clean srpt_clean scst srpt scstadmin
|
|
sudo make -s scst_install srpt_install scstadm_install
|
|
fi
|
|
|
|
The ib_srpt kernel module supports the following parameters:
|
|
* one_target_per_port (boolean) and
|
|
* use_node_guid_in_target_name (boolean)
|
|
ib_srpt can operate in one of the following three modes:
|
|
1. Access control configuration per HCA and assigning a "ib_srpt_target_<n>"
|
|
style name to each HCA.
|
|
2. Access control configuration per HCA and referring to a HCA via its node
|
|
GUID (e.g. 0002:c903:0005:f34a).
|
|
3. Access control configuration per HCA port and referring to a HCA via its
|
|
port GID (e.g. fe80:0000:0000:0000:0002:c903:0005:f34b).
|
|
Mode (1) is choosen if both one_target_per_port and
|
|
use_node_guid_in_target_name are false. Mode (2) is choosen if
|
|
one_target_per_port is false and use_node_guid_in_target_name is true. Mode
|
|
(3) is choosen if one_target_per_port is true. This last mode is the
|
|
default mode.
|
|
* rdma_cm_port (number)
|
|
A 16-bit number that specifies the port number to be registered via the
|
|
RDMA/CM. Must be specified to make communication over RoCE or iWARP
|
|
possible. If this parameter is zero (the default value) the SRP target
|
|
driver does not register with the RDMA/CM.
|
|
* srp_max_req_size (number)
|
|
Maximum size of an SRP control message in bytes. Examples of SRP control
|
|
messages are: login request, logout request, data transfer request, ...
|
|
The larger this parameter, the more scatter/gather list elements can be
|
|
sent at once. Use the following formula to compute an appropriate value
|
|
for this parameter: 68 + 16 * (sg_tablesize). The default value of
|
|
this parameter is 4148, which corresponds to an sg table size of 255.
|
|
* srp_max_rsp_size (number)
|
|
Maximum size of an SRP response message in bytes. Sense data is sent back
|
|
via these messages towards the initiator. The default size is 256 bytes.
|
|
With this value there remains (256-36) = 220 bytes for sense data.
|
|
* srp_max_rdma_size (number)
|
|
Maximum number of bytes that may be transferred at once via RDMA. Defaults
|
|
to 65536 bytes, which is sufficient to use the full bandwidth of low-latency
|
|
HCAs. Increasing this value may decrease latency for applications
|
|
transferring large amounts of data at once.
|
|
* srpt_srq_size (number, default 4095)
|
|
ib_srpt uses a shared receive queue (SRQ) for processing incoming SRP
|
|
requests. This number may have to be increased when a large number of
|
|
initiator systems is accessing a single SRP target system.
|
|
* srpt_sq_size (number, default 4096)
|
|
Per-channel InfiniBand send queue size. The default setting is sufficient
|
|
for a credit limit of 128. Changing this parameter to a smaller value may
|
|
cause RDMA requests to be retried and hence may slow down data transfer
|
|
severely.
|
|
* trace_flag (unsigned integer, only available in debug builds)
|
|
The individual bits of the trace_flag parameter define which categories of
|
|
trace messages should be sent to the kernel log and which ones not.
|
|
|
|
|
|
Configuring the SRP Target System
|
|
---------------------------------
|
|
|
|
The first step is to choose whether access control will be controlled per
|
|
HCA or per HCA port and to create a modprobe configuration file that reflects
|
|
this choice. An example:
|
|
|
|
# cat /etc/modprobe.d/ib_srpt.conf
|
|
options ib_srpt one_target_per_port=1
|
|
|
|
Next, create the file /etc/scst.conf. You can create this file with
|
|
the scstadmin tool as follows:
|
|
|
|
/etc/init.d/scst stop
|
|
/etc/init.d/scst start
|
|
|
|
Now configure SCST using scstadmin - see also the scstadmin documentation for
|
|
further information. Once finished, save the configuration to /etc/scst.conf:
|
|
|
|
scstadmin -write_config /etc/scst.conf (sysfs version)
|
|
or
|
|
scstadmin -WriteConfig /etc/scst.conf (procfs version)
|
|
|
|
One can verify the contents of scst.conf e.g. as follows:
|
|
|
|
cat /etc/scst.conf
|
|
|
|
Now verify that loading the configuration from file works correctly:
|
|
|
|
/etc/init.d/scst reload
|
|
|
|
Note: when using InfiniBand loading the ib_ipoib kernel module and assigning
|
|
an IP address to each IPoIB interface is only needed when using the RDMA/CM.
|
|
When using the IB/CM however, it is allowed but not necessary to load the
|
|
ib_ipoib kernel module.
|
|
|
|
|
|
Configuring the SRP Initiator System
|
|
------------------------------------
|
|
|
|
First of all, load the SRP kernel module as follows:
|
|
|
|
modprobe ib_srp
|
|
|
|
Next, when using InfiniBand, discover the new SRP target by running the
|
|
srp_daemon command:
|
|
|
|
for d in /dev/infiniband/umad*; do srp_daemon -oacd$d; done
|
|
|
|
If you want to let the initiator system log in to all SRP targets available
|
|
in the same InfiniBand subnet that is possible as follows (-e = execute):
|
|
|
|
for d in /dev/infiniband/umad*; do srp_daemon -oecd$d; done
|
|
|
|
If you want to let the initiator log in to a specific target you can do that
|
|
e.g. as follows:
|
|
|
|
echo "id_ext=0002c903000f1366,ioc_guid=0002c903000f1366,dgid=fe800000000000000002c903000f1367,pkey=ffff,service_id=0002c903000f1366" > /sys/class/infiniband_srp/${SRP_HCA_NAME}/add_target; done
|
|
|
|
The meaning of the parameters in the above command is as follows:
|
|
* id_ext: must match ioc_guid.
|
|
* ioc_guid: see also the documentation of the ib_srpt ioc_guid parameter.
|
|
* dgid: target HCA port GID to connect to.
|
|
* pkey: IB partition key (P_Key) of the target to connect to.
|
|
* service_id: must match ioc_guid.
|
|
|
|
When using RoCE or iWARP, log in to the target system to determine the id_ext
|
|
and ioc_guid parameters and use these to log in. An example:
|
|
|
|
[ target system ]
|
|
# sed 's/,\(pkey\|dgid\|service_id\)=[^,]*//g' $(find /sys/kernel/scst_tgt/targets/ib_srpt -name login_info) | uniq
|
|
id_ext=0002c90300a34270,ioc_guid=0002c90300a34270
|
|
|
|
[ initiator system ]
|
|
echo dest=192.168.5.1:5000,id_ext=0002c90300a34270,ioc_guid=0002c90300a34270
|
|
>/sys/class/infiniband_srp/srp-mlx4_0-1/add_target
|
|
echo dest=192.168.6.1:5000,id_ext=0002c90300a34270,ioc_guid=0002c90300a34270
|
|
>/sys/class/infiniband_srp/srp-mlx4_0-2/add_target
|
|
|
|
Initiator port GIDs can be queried e.g. via sysfs:
|
|
|
|
$ for f in /sys/devices/*/*/*/infiniband/*/ports/*/gids/0; do echo $f; \
|
|
cat $f | sed 's/://g'; done
|
|
/sys/devices/pci0000:00/0000:00:01.0/0000:01:00.0/infiniband/mlx4_0/ports/1/gids/0
|
|
fe800000000000000002c9030005f34b
|
|
/sys/devices/pci0000:00/0000:00:01.0/0000:01:00.0/infiniband/mlx4_0/ports/2/gids/0
|
|
fe800000000000000002c9030005f34c
|
|
/sys/devices/pci0000:00/0000:00:1c.0/0000:05:00.0/infiniband/mlx4_1/ports/1/gids/0
|
|
fe800000000000000002c9030003cca7
|
|
/sys/devices/pci0000:00/0000:00:1c.0/0000:05:00.0/infiniband/mlx4_1/ports/2/gids/0
|
|
fe800000000000000002c9030003cca8
|
|
|
|
Finally run lsscsi to display the details of the newly discovered SCSI disks:
|
|
|
|
lsscsi
|
|
|
|
SRP targets can be recognized in the output of lsscsi by looking for
|
|
the disk names assigned on the SCST target ("disk01" in the example below):
|
|
|
|
[8:0:0:0] disk SCST_FIO disk01 102 /dev/sdb
|
|
|
|
|
|
Target names
|
|
------------
|
|
|
|
The name assigned by the ib_srpt target driver to an SCST target is either
|
|
ib_srpt_target_<n>, the node GUID of a HCA in hexadecimal form with a colon
|
|
after every fourth digit or the port GID with a colon afer every fourth
|
|
digit. The HCA node GUID and the port GIDs can be obtained via the
|
|
ibv_devinfo command. An example:
|
|
|
|
# ibv_devinfo -v | grep -E '[^a-z]port:|guid|GID'
|
|
node_guid: 0002:c903:0005:f34e
|
|
sys_image_guid: 0002:c903:0005:f351
|
|
port: 1
|
|
GID[0]: fe80:0000:0000:0000:0002:c903:0005:f34f
|
|
port: 2
|
|
GID[0]: fe80:0000:0000:0000:0002:c903:0005:f350
|
|
|
|
Once the ib_srpt driver has been loaded the available SCST targets can be
|
|
queried as follows:
|
|
|
|
# (cd /sys/kernel/scst_tgt/targets/ib_srpt && ls -d [0-9a-f]*)
|
|
fe80:0000:0000:0000:0002:c903:0005:f34f
|
|
fe80:0000:0000:0000:0002:c903:0005:f350
|
|
|
|
|
|
Session names
|
|
-------------
|
|
|
|
The name assigned by the ib_srpt target driver to a session depends on the
|
|
mode in which it is operating. If one_target_per_port=y then the source port
|
|
GID is used as the session name. If one_target_per_port=n then the 128-bit SRP
|
|
initiator port identifier is used as the session name. This identifier is sent
|
|
by the SRP initiator to the SRP target via the SRP_LOGIN_REQ information unit.
|
|
The Linux SRP initiator (ib_srp) generates the initiator port identifier as
|
|
follows:
|
|
- The first eight bytes are the identifier extension ('initiator_ext' parameter
|
|
specified in the login string echoed into the sysfs file 'add_target').
|
|
- The last eight bytes are the GUID of the initiator HCA port used to
|
|
communicate with the target.
|
|
|
|
An example:
|
|
|
|
[ INITIATOR ]
|
|
|
|
$ for f in /sys/devices/*/*/*/infiniband/*/ports/*/gids/0; do echo
|
|
f; cat $f; done
|
|
/sys/devices/pci0000:00/0000:00:01.0/0000:01:00.0/infiniband/mlx4_0/ports/1/gids/0
|
|
fe80:0000:0000:0000:0002:c903:0005:f34b
|
|
/sys/devices/pci0000:00/0000:00:01.0/0000:01:00.0/infiniband/mlx4_0/ports/2/gids/0
|
|
fe80:0000:0000:0000:0002:c903:0005:f34c
|
|
/sys/devices/pci0000:00/0000:00:1c.0/0000:05:00.0/infiniband/mlx4_1/ports/1/gids/0
|
|
fe80:0000:0000:0000:0002:c903:0003:cca7
|
|
/sys/devices/pci0000:00/0000:00:1c.0/0000:05:00.0/infiniband/mlx4_1/ports/2/gids/0
|
|
fe80:0000:0000:0000:0002:c903:0003:cca8
|
|
|
|
[ TARGET, after login ]
|
|
|
|
$ (cd /sys/kernel/scst_tgt/targets/ib_srpt/[0-9a-f]* && ls -d sessions/*)
|
|
sessions/fe80:0000:0000:0000:0002:c903:0003:cca7
|
|
sessions/fe80:0000:0000:0000:0002:c903:0005:f34b
|
|
|
|
|
|
LUN masking
|
|
-----------
|
|
|
|
In a straightforward configuration every LUN is visible to every initiator.
|
|
It is possible however to make a different set of LUNs visible to each
|
|
initiator by using the LUN masking feature of SCST. SRP initiators are
|
|
identified by their session name (see above). An example of an scst.conf
|
|
file using LUN masking for ib_srpt:
|
|
|
|
TARGET_DRIVER ib_srpt {
|
|
TARGET fe80:0000:0000:0000:0002:c903:0005:f34b {
|
|
enabled 1
|
|
rel_tgt_id 1
|
|
|
|
# LUNs visible by all initiators not listed below
|
|
LUN 0 disk01
|
|
|
|
GROUP grp1 {
|
|
# LUNs visible by initiator system 1
|
|
LUN 0 disk02
|
|
|
|
INITIATOR fe80:0000:0000:0000:0002:c903:0005:f34b
|
|
}
|
|
|
|
GROUP grp2 {
|
|
# LUNs visible by initiator system 2
|
|
LUN 0 disk03
|
|
|
|
INITIATOR fe80:0000:0000:0000:0002:c903:0005:f34c
|
|
}
|
|
}
|
|
}
|
|
|
|
|
|
Adding and Removing LUNs Dynamically
|
|
------------------------------------
|
|
|
|
It is possible to add and/or remove LUNs on the target without restarting
|
|
target or initiator. This can be done either via scstadmin or directly via the
|
|
sysfs interface. Although the SCST core will notify the initiator about LUN
|
|
changes, Linux initiators will ignore these notifications. In order to bring a
|
|
Linux initiator again in sync after a LUN change, the initiator has to be told
|
|
to rescan SCSI devices. Rescanning SCSI devices is e.g. possible via the
|
|
rescsan-scsi-bus.sh script that can be found here:
|
|
http://www.garloff.de/kurt/linux/#rescan-scsi. An example:
|
|
$ rescan-scsi-bus --hosts=${srp_host_id} --channels=0 --ids=0 --luns=0-31
|
|
|
|
|
|
InfiniBand Partitions
|
|
---------------------
|
|
|
|
Just like a VLAN allows to segment traffic on an Ethernet network partitions
|
|
allow to segment traffic on an InfiniBand network. Each InfiniBand partition
|
|
is identified by a partition key which is a 16-bit number. During fabric
|
|
initialization the subnet manager assigns one or more partition keys to
|
|
each InfiniBand port. For opensm partitions are defined in
|
|
/etc/opensm/partitions.conf. ib_srpt uses the partition with index 0. Which
|
|
partition key corresponds to index 0 can be found out by querying sysfs:
|
|
|
|
$ head /sys/class/infiniband/*/ports/*/pkeys/0
|
|
==> /sys/class/infiniband/mlx4_0/ports/1/pkeys/0 <==
|
|
0xffff
|
|
|
|
==> /sys/class/infiniband/mlx4_0/ports/2/pkeys/0 <==
|
|
0xffff
|
|
|
|
|
|
High availability
|
|
-----------------
|
|
|
|
If there are redundant paths in the IB network between initiator and target,
|
|
automatic path failover can be set up on the initiator as follows:
|
|
* Edit /etc/infiniband/openib.conf to load the SRP driver and SRP HA daemon
|
|
automatically: set SRP_LOAD=yes and SRPHA_ENABLE=yes.
|
|
* To set up and use the high availability feature you need the dm-multipath
|
|
driver and multipath tool.
|
|
* Please refer to the OFED-1.x user manual for more detailed instructions
|
|
on how to enable and how to use the HA feature. See e.g.
|
|
http://www.mellanox.com/related-docs/prod_software/Mellanox_OFED%20_Linux_user_manual_1_5_1_2.pdf.
|
|
|
|
A setup with automatic failover between redundant targets is possible by
|
|
installing and configuring DRBD on both targets. If the initiator system
|
|
supports mirroring (e.g. Linux), you can use the following approach:
|
|
* Configure DRBD in Active/Active mode.
|
|
* Configure the initiator(s) for mirroring between the redundant targets.
|
|
If the initiator system does not support mirroring (e.g. VMware ESX), you
|
|
can use the following approach:
|
|
* Configure DRBD in Active/Passive mode and enable STONITH mode in the
|
|
Heartbeat software.
|
|
|
|
For more information, see also:
|
|
* http://www.drbd.org/
|
|
* http://www.linux-ha.org/wiki/Main_Page
|
|
|
|
|
|
Performance Notes - Target Side
|
|
-------------------------------
|
|
|
|
* Building the SCST core and the ib_srpt target driver in release mode
|
|
improves performance compared to debug mode.
|
|
|
|
* When using high-latency storage devices (hard disks), the default value
|
|
choosen by SCST for DEVICE.threads_num should be fine. When using
|
|
low-latency storage devices though (SSDs), DEVICE.threads_num should be set
|
|
to 1 or 2 in /etc/scst.conf in order to reach optimal performance for small
|
|
block sizes (e.g. 4 KB).
|
|
|
|
* When multiple InfiniBand HCA's are present in a target system the Linux
|
|
kernel by default will assign the associated interrupt handlers to CPU 0.
|
|
Even irqbalance will often assign the interrupt handlers of multiple HCA's
|
|
to the same CPU. That is unfortunate because it leads to unfair handling of
|
|
SRP sessions. The solution is to assign InfiniBand HCA interrupts manually
|
|
to different CPU's. That's possible by writing looking up the InfiniBand
|
|
interrupt numbers in /proc/interrupts and by writing proper bitmasks into
|
|
/proc/irq/<n>/smp_affinity.
|
|
|
|
|
|
Performance Notes - Initiator Side
|
|
----------------------------------
|
|
|
|
* Choose a proper value for the ib_srp kernel module parameter
|
|
cmd_sg_entries. The default value 12 works well for buffered reads while
|
|
the throughput for write-dominated workloads improves by changing this value
|
|
into 255. One way to set this kernel module parameter is as follows:
|
|
|
|
echo options ib_srp cmd_sg_entries=255 >>/etc/modprobe.d/ib_srp.conf
|
|
|
|
* For multithreaded workloads using small block sizes changing rq_affinity
|
|
into 2 improves IOPS significantly (Linux kernel 3.1 and later; see also
|
|
commit 5757a6d76cdf6dda2a492c09b985c015e86779b1).
|
|
|
|
* For latency sensitive applications, using the noop scheduler at the initiator
|
|
side can give significantly better results than with other schedulers.
|
|
|
|
* The SRP initiator limits by default the queue depth to 64 commands. If your
|
|
workload benefits from a larger queue depth, enlarge the queue depth by
|
|
setting the max_cmd_per_lun and queue_size parameters in the SRP login
|
|
string.
|
|
|
|
* The following parameters have a small but measurable impact on SRP
|
|
performance:
|
|
* /sys/class/block/${dev}/queue/rotational
|
|
* /sys/class/block/${dev}/queue/rq_affinity
|
|
* /proc/irq/${ib_int_no}/smp_affinity
|
|
|
|
|
|
Performance Notes - Both Sides
|
|
------------------------------
|
|
|
|
* Disabling CONFIG_SCHED_DEBUG and CONFIG_SCHEDSTATS in the kernel config
|
|
improves performance.
|
|
|
|
* Disable CONFIG_IRQSOFF_TRACER such that CONFIG_TRACE_IRQFLAGS is disabled.
|
|
|
|
* Consider which memory allocator to use. With recent kernels using the SLUB
|
|
memory allocator instead of SLAB may help. On multi-socket systems the SLAB
|
|
memory allocator may result in better performance. Please note that SLAB is
|
|
tunable while SLUB is not. See also http://lkml.org/lkml/2010/7/9/264 and
|
|
http://www.ibm.com/developerworks/linux/library/l-linux-slab-allocator/.
|
|
|
|
|
|
Frequently Asked Questions
|
|
--------------------------
|
|
|
|
Q: Every now and then "SRP abort called" and "SRP reset_device called"
|
|
messages are logged at the initiator side. Around the same time I see the
|
|
following message in the target log: "ib_srpt: ***ERROR***: Command ...: IB
|
|
completion for idx ... has not been received in time (SRPT command state
|
|
...)". What is the meaning of these messages mean and how can I fix this ?
|
|
|
|
A: This means that a timeout occurred while a HCA was waiting for an
|
|
acknowledge message. Check the IB network for bad IB cables, bad HCA's
|
|
and/or bad switch ports. Also make sure that the HCA firmware is up to
|
|
date.
|
|
|
|
Q: Loading the kernel module ib_srpt triggers a kernel panic with a call trace
|
|
like the one below. What is the cause of this and how can this be solved ?
|
|
|
|
Call Trace:
|
|
[<ffffffffa02f2a50>] srpt_alloc_ioctx+0x60/0xb0 [ib_srpt]
|
|
[<ffffffffa02f2f0a>] srpt_alloc_ioctx_ring+0xea/0x1e0 [ib_srpt]
|
|
[<ffffffffa02f32e9>] srpt_add_one+0x2e9/0x670 [ib_srpt]
|
|
[<ffffffffa015a480>] ib_register_client+0x80/0xa0 [ib_core]
|
|
[<ffffffffa02421eb>] srpt_init_module+0x1eb/0x235 [ib_srpt]
|
|
[<ffffffff81000344>] do_one_initcall+0x34/0x1a0
|
|
[<ffffffff8107a63c>] sys_init_module+0xdc/0x260
|
|
[<ffffffff81002e3b>] system_call_fastpath+0x16/0x1b
|
|
|
|
A: This means that you are using a system on which OFED has been installed but
|
|
that ib_srpt has been compiled against the in-tree kernel headers instead
|
|
of the OFED kernel headers. You can fix this by rebuilding ib_srpt against
|
|
the OFED kernel headers. The ib_srpt makefile should detect the OFED kernel
|
|
headers automatically - at least if ib_srpt is built after OFED has been
|
|
installed.
|
|
|
|
|
|
Feedback
|
|
--------
|
|
|
|
Send questions about this driver to scst-devel@lists.sourceforge.net.
|