HBA Troubleshooting Guide
In this chapter
• How this document is organized
• Supported hardware and software
• What’s new in this document
• Document conventions
• Notice to the reader
• Additional information
• Getting technical help
• HBA serial number
• Port World-Wide Port Name (PWWN)
• Document feedback
NOTE
This publication is a companion guide to be used with the Brocade Fibre Channel HBA
Administrator’s Guide. That publication provides detailed information on HBA monitoring and
diagnostic tools in Host Connectivity Manager (HCM) and the BCU.
HBA support
The following Fibre Channel host bus adapters (HBAs) are supported in this release.
• Brocade 815. Single-port HBA with a per-port maximum of 8 Gbps using an 8 Gbps SFP+.
• Brocade 825. Dual-port HBA with a per-port maximum of 8 Gbps using an 8 Gbps SFP+.
• Brocade 415. Single-port HBA with a per-port maximum of 4 Gbps using a 4 Gbps SFP.
• Brocade 425. Dual-port HBA with a per-port maximum of 4 Gbps using a 4 Gbps SFP.
Notes:
• This publication only supports the HBA models listed above and does not provide information
about the Brocade 410 and 420 Fibre Channel HBAs, also known as the Brocade 400 Fibre
Channel HBAs.
• Although you can install an 8 Gbps SFP+ into a Brocade 415 or 425 HBA, only 4 Gbps
maximum port speed is possible.
NOTE
Drivers, BCU, and HCM Agent are supported only on the VMware “console” Operating System.
HCM is supported only on the guest operating system on VMware.
Document conventions
This section describes text formatting conventions and important notice formats used in this
document.
Text formatting
The narrative-text formatting conventions that are used are as follows:
bold text     Identifies command names
              Identifies the names of user-manipulated GUI elements
              Identifies keywords and operands
              Identifies text to enter for the HCM or BCU
italic text   Provides emphasis
              Identifies variables
              Identifies paths and Internet addresses
              Identifies document titles
code text     Identifies BCU output
              Identifies command syntax examples
For readability, command names in the narrative portions of this guide are presented in mixed
lettercase: for example, switchShow. In actual examples, command lettercase is often all
lowercase. Otherwise, this manual specifically notes those cases in which a command is case
sensitive.
NOTE
A note provides a tip, guidance or advice, emphasizes important information, or provides a reference
to related information.
ATTENTION
An Attention statement indicates potential damage to hardware or data.
CAUTION
A Caution statement alerts you to situations that can be potentially hazardous to you or cause
damage to hardware, firmware, software, or data.
DANGER
A Danger statement indicates conditions or situations that can be potentially lethal or extremely
hazardous to you. Safety labels are also attached directly to products to warn of these conditions
or situations.
Key terms
For definitions specific to Brocade and Fibre Channel, see the technical glossaries on Brocade
Connect. See “Brocade resources” on page xi for instructions on accessing Brocade Connect.
Microsoft Corporation: Windows, Windows Server 2003, Windows Server 2008, Vista, XP
Sun Microsystems, Inc.: Solaris
Red Hat Inc.: Red Hat Enterprise Linux (RHEL)
Novell, Inc.: SUSE Linux Enterprise Server (SLES)
VMware Inc.: ESX Server
SPARC International, Inc.: SPARC
Additional information
This section lists additional Brocade and industry-specific documentation that you might find
helpful.
Brocade resources
Access a variety of HBA resources, such as driver packages, HCM, product documentation,
compatibility information, product registration, and product support through the Brocade HBA web
site at www.brocade.com/hba.
To get up-to-the-minute information, join Brocade Connect. It’s free! Go to http://www.brocade.com
and click Brocade Connect to register at no cost for a user ID and password.
For practical discussions about SAN design, implementation, and maintenance, you can obtain
Building SANs with Brocade Fabric Switches through:
http://www.amazon.com
For additional Brocade documentation, visit the Brocade SAN Info Center and click the Resource
Library location:
http://www.brocade.com
FT00X0054E9
You can also display the serial number through the following HCM dialog boxes and BCU
commands:
• HBA Properties dialog box in HCM
Select an HBA in the device tree, then click the Properties tab in the right pane.
• BCU adapter --list command
NOTE
For details on using HCM and BCU commands, refer to the Brocade Fibre Channel HBA
Administrator’s Guide.
This command displays HBA information. The <ad_id> parameter is the HBA’s serial number.
port --list <ad_id>
This command lists all the physical ports on the HBA along with their basic attributes. The
<ad_id> parameter is the HBA’s serial number.
Document feedback
Quality is our first concern at Brocade and we have made every effort to ensure the accuracy and
completeness of this document. However, if you find an error or an omission, or you think that a
topic needs further development, we want to hear from you. Forward your feedback to:
documentation@brocade.com
Provide the title and version number of the document and as much detail as possible about your
comment, including the topic heading and page number and your suggestions for improvement.
Chapter 1: Introduction to troubleshooting
In this chapter
• How to use this manual for troubleshooting
• Gathering problem information
• Does the issue persist when the HBA is installed in a different platform or is connected
using a different switch port, SFP, and cable?
• Can this problem be reproduced on one or more HBAs, ports, or host systems? Can you
identify specific steps that consistently reproduce this problem on one or more hosts?
• Is the problem documented in release notes for the HBA, operating system, or host system
BIOS?
• Is the problem documented in release notes for the switch and target storage system?
• Is unexpected behavior intermittent or always present?
If the problem is in a Fibre Channel switch, cabling, storage device, or in connectivity between
these components, refer to documentation, help systems, or service providers of that
equipment.
5. If you cannot resolve the problem, gather and provide problem information to your HBA support
provider for resolution.
The bfa_supportsave and FOS supportsave commands can provide current information for the
topology map. Also, consider using the Brocade SAN Health products to provide information on
your SAN environment, including an inventory of devices, switches, firmware versions, and SAN
fabrics, historical performance data, zoning and switch configurations, and other data. Click
the Support tab on www.brocade.com for more information on these products.
• Run appropriate diagnostic tools for storage targets.
• Use additional HCM, BCU, host system, and Fabric OS commands summarized in Chapter 3,
“Tools for Collecting Data” to gather statistics and problem data on the HBA, host, Fibre
Channel links, and connected devices.
• Determine what has changed in the SAN. For example, if the SAN functioned without problems
before installing the HBA, then the problem is most likely in the HBA installation or
configuration, HBA hardware, or HBA driver package. Other examples to investigate could be
changes in the switch or storage system firmware, an offline switch, or a disconnected or faulty
cable between the HBA, switch, or storage controller fiber optic ports.
• Record the time and frequency of symptoms and the period of time symptoms have been
observed.
• Determine if unexpected behavior is intermittent or always present.
• List steps that have been taken to troubleshoot the problem, including changes attempted to
isolate the problem.
Chapter 2: Isolating Problems
In this chapter
• General problems
• Resolving installation problems
• Host system freezes or crashes
• HCM GUI fails to connect with HCM agent
• Verifying Fibre Channel links
General problems
Table 2 on page 6 describes general problems related to HBA operation, possible causes, and
recommended actions that may fix the problem. Recommended actions may refer you to
information in the following locations as appropriate to gather information to further isolate and
resolve the problem.
• Chapter 3, “Tools for Collecting Data” in this manual.
Contains procedures to perform HBA diagnostics, display HBA statistics and event logs, and
collect data for troubleshooting using BCU commands, HCM options, Fabric OS commands,
and your host system commands.
• Fabric OS Administrator’s Guide.
Provides detailed information on features available on Brocade storage area network (SAN)
products, and how to configure and administer these products.
• Fabric OS Command Reference Manual.
Provides detailed descriptions of command line interface commands to help system
administrators and technicians operate, maintain, and troubleshoot Brocade SAN products.
• Fabric OS Troubleshooting and Diagnostic Guide.
Provides help with isolating problems in other Brocade SAN components.
• Your host’s operating system documentation and help system.
Provides details on commands for gathering information and isolating problems.
Symptom: HBA not reported under server’s PCI subsystem.
Possible causes:
1. Card not seated properly.
2. Server slot issues.
3. HBA not compatible with host operating system or connected storage systems.
Recommended actions:
1. Execute the host operating system command to list PCI devices. Refer to the “List PCI Devices”
row in Table 5 on page 24. If the HBA is not listed, perform the following steps.
2. Reseat the HBA.
3. Replace the HBA with an HBA in known working condition to determine whether there is a slot
malfunction.
4. Verify compatibility by reviewing the Brocade Server Connectivity Compatibility Matrix. To find
this document, log into Brocade Connect on www.brocade.com, then select the Compatibility
Information quick link under Documentation Library.
Symptom: No HBAs reported when using the adapter --list BCU command.
Possible causes:
1. HBA not reported under server’s PCI subsystem.
2. HBA driver not loaded.
Recommended actions:
1. Execute your host’s operating system command to list PCI devices. Refer to the “List PCI
Devices” row in Table 5 on page 24. If the HBA is not listed in the output from this command, go
on to the next step.
2. Refer to “HBA not reported under server’s PCI subsystem” under the “Symptom” column in this
table.
3. The HBA driver may not be loaded. Refer to “Confirming driver package installation” on page 14
for methods to verify driver installation.
Symptom: Port link does not come up.
Possible causes:
1. SFP or cable problems.
2. Link between HBA and switch is down.
3. Switch port is disabled or switch is disabled.
4. HBA port speed or topology mismatch with the switch port.
5. Non-Brocade branded SFP installed.
Recommended actions:
1. Ensure that the SFPs and cables are connected properly on both HBA and switch side. Check for
any cable damage.
2. Verify HBA side link status by executing the BCU port --list command. Check the FC Addr field
for an address and the State field for Linkup. For details on using this command, refer to “Port
list” on page 47.
3. Execute either the Fabric OS switchShow or portShow commands on the attached switch to ensure
that the switch or individual port is not disabled or offline.
4. Check the port topology setting on the switch using the Fabric OS portCfgShow command to ensure
that Locked L_Port is OFF. Use the portCfgLport command to change the setting to OFF if required.
5. On the HBA, execute the port --list and port --query BCU commands. Refer to “Port list” on
page 47 and “Port query” on page 47. If an unsupported SFP is detected, the “Sfp” field displays
“us” (unsupported SFP) for port --list and the “Media” field displays “Unsupported SFP” for
port --query.
Symptom: Target not visible.
Possible causes:
1. No fabric connectivity between HBA and target or target is not online.
2. The target and the HBA are not in the same zone.
3. The HBA driver is not loaded.
4. There is a problem with the remote port.
Recommended actions:
1. Execute the Fabric OS nsAllShow command on the attached switch to verify that the target and
the host are online in the fabric and registered in the name server.
2. Execute the Fabric OS cfgActvShow command on the attached switch and verify that the host and
target are in the same zone (either using domain area members, port area members, or port or node
WWNs).
3. The HBA driver may not be loaded. Refer to “Confirming driver package installation” on page 14
for methods to verify driver installation.
4. Verify that the remote target port (rport) is reporting itself online by comparing rport online
and rport offline statistics (refer to “Remote port statistics” on page 36). The rport online
counter should be one greater than the rport offline counter. If not, clear the counters and try
connecting to the remote port again. Verify the rport online and rport offline statistics again.
Symptom: LUN not visible.
Possible causes:
1. Missing or improper storage array LUN masking setting.
2. HBA driver not loaded.
Recommended actions:
1. Check LUN mapping and masking using storage array configuration tools.
2. The HBA driver may not be loaded. Refer to “Confirming driver package installation” on page 14
for methods to verify driver installation.
Symptom: I/Os are not failing over immediately on a path failure in MPIO setup.
Possible causes: Improper driver mpiomode setting.
Recommended actions: Execute the port --query <port_id> BCU command and ensure fcpim MPIO mode is
enabled (which implies zero Path TOV values) or that fcpim MPIO mode is disabled with the expected
“Path TOV” settings (default is 10 seconds).
Symptom: Unable to create more than 126 Virtual (NPIV) ports.
Possible causes: The switch’s maximum NPIV limit has been reached.
Recommended actions: Execute the Fabric OS configure command on the attached switch and change the
Maximum logins per port parameter under the F_Port login parameters menu to increase the maximum
NPIV IDs allowed per port.
Symptom: On Linux, the maximum IOPS numbers are very low.
Possible causes: The number of disk I/O requests is causing low throughput and high latency.
Recommended actions: Refer to “Linux tuning” on page 51 for suggestions to optimize HBA performance
in Linux systems.
Symptom: On VMware, the maximum IOPS numbers are very low.
Possible causes: The number of disk I/O requests is causing low throughput and high latency.
Recommended actions: Refer to “VMware tuning” on page 53 for suggestions to optimize HBA
performance in VMware systems.
Symptom: QoS performance issues.
Possible causes:
1. QoS not enabled on both switch and HBA port.
2. QoS zones not created properly on switch for high, medium, and low priority targets.
Recommended actions:
1. Verify if QoS is enabled for an HBA port using the qos --query <port_id> BCU command. Verify if
it is enabled on the switch using the islShow command.
2. Verify zones on the switch using the cfgActvShow command.
3. Verify that QoS is configured on the switch using instructions in the Brocade Fabric OS
Administrator’s Guide.
Symptom: The HBA is not showing in the fabric.
Possible causes: There is a problem in the fabric or a protocol issue between the HBA and fabric.
Recommended actions: Check fabric statistics. Refer to “Fabric statistics” on page 36 for methods
to display fabric statistics for the HBA.
• If counts for FLOGI sent and FLOGI accept fabric statistics do not match, suspect a fabric
problem or protocol issue between HBA and fabric.
• If fabric offline counts increase and fabric maintenance is not occurring, this may indicate a
serious fabric problem. Refer to your switch troubleshooting guide.
Symptom: Errors when installing bfa_driver_linux--<version>.noarch.rpm driver package.
Possible causes: Appropriate distribution kernel development packages are not installed on your
host system for the currently running kernel.
Recommended actions: Refer to “Errors when installing driver” on page 13 for more information to
isolate this problem.
Symptom: Installer program does not autorun (Windows only).
Possible causes: Autorun is not enabled on your system.
Recommended actions: Refer to “Installer program does not autorun from CD (Windows only)” on
page 13 for more information to isolate this problem.
Symptom: Host not booting from remote LUN.
Possible causes: Boot from LUN not configured correctly.
Recommended actions: Refer to “Host not booting from remote LUN” on page 14 for more information
to isolate this problem.
Symptom: Host system freezes or crashes.
Possible causes:
1. Faulty fiber optic cabling and connections.
2. Faulty or unseated SFPs or unsupported SFPs.
3. Conflicts with port operating speed or topology of attached devices.
4. HBA not compatible with host system.
Recommended actions: Refer to “Verifying Fibre Channel links” on page 20 for more information to
isolate this problem.
Symptom: Virtual devices not listed in name server.
Possible causes:
• Problem in the fabric or a protocol issue between the HBA and fabric.
• NPIV is not supported or is disabled on the switch.
Recommended actions: Check virtual port statistics, such as FDISC sent, FDISC accept, and No NPIV
support statistics. Refer to “Virtual port statistics” on page 38 for methods to display virtual
port statistics.
Symptom: Operating system errors (blue screen).
Possible causes: HBA driver not loaded.
Recommended actions: Refer to “Confirming driver package installation” on page 14 for methods to
verify driver installation.
Symptom: “Failed to connect to agent on host...” error when using HCM.
Possible causes: The client application cannot connect to the HCM Agent listening on the
configured port for the following reasons:
• The agent is not running.
• The agent is not accepting connections on the expected port.
• The agent is not listening on the expected port.
• Communication between the client and agent is blocked by a firewall preventing access to the
port (usually only a consideration for remote HCM management).
Recommended actions: Refer to “HCM GUI fails to connect with HCM agent” on page 17 to isolate and
resolve the problem.
Symptom: The HBA is not registering with the name server or cannot access storage.
Possible causes:
1. HBA cannot log in to the name server.
2. HBA has a problem registering with the name server.
3. HBA has a problem querying the name server for available storage.
Recommended actions: Display logical port statistics (refer to “Logical port statistics” on
page 37). Check for the following errors:
1. Increasing name server port login (NS PLOGI) error rejects and unknown name server port login
response (NS login unknown rsp) errors mean that the HBA most likely cannot log in to the name
server.
2. Increasing errors of the following types indicate that the HBA has a problem registering with
the name server.
• Name server register symbolic port name identifier (NS RSPN_ID) errors.
• Name server register symbolic port name identifier response (NS RFT_ID rsp) errors.
• Name server register symbolic port name identifier response rejects (NS RFT_ID rejects).
3. Increasing name server get all port ID response (NS GID_FT rsp), rejects (NS GID_FT rejects),
or unknown responses (NS GID_FT unknown rsp) mean that the HBA has a problem querying the name
server for available storage.
Symptom: Driver event messages appearing in host system log files.
Possible causes: Various causes and severity levels.
Recommended actions:
• Follow the recommended action in the message.
• Resolve critical-level messages and multiple major or minor-level messages relating to the same
issue as soon as possible.
• For details on event messages, refer to “Event logs” on page 30.
Symptom: “Files needed for bfad.sys” message appears when removing driver.
Possible causes: Installing the driver using the driver installer program (bfa_installer.exe) or
Brocade Software Installer (GUI or command-based application), then subsequently removing the
driver using the Device Manager.
Recommended actions: Do not uninstall the driver using the Device Manager if you have used the
Brocade installer programs to install driver instances. Always use the Brocade installer programs
to remove the driver. Refer to ““Files needed for bfad.sys” message appears when removing driver”
on page 13 for more information.
Symptom: Cannot roll back driver on all HBA instances using Device Manager.
Possible causes: Installing the driver using the Brocade driver installer program
(bfa_installer.exe) or Software Installer (GUI or command-based application), then rolling back
driver HBA instances using the Device Manager.
Recommended actions:
• Install the driver for each HBA instance using the Device Manager, then roll back the driver
using Device Manager.
• Use the driver installer program (bfa_installer.exe) or Brocade Software Installer (GUI or
command-based application) to install or upgrade the driver, then use the Brocade Software
Uninstaller to roll back drivers on all HBA instances in one step.
• Refer to “Cannot roll back driver on all HBA instances using Device Manager” on page 14 for more
information.
NOTE
If troubleshooting actions in Table 2 do not resolve problems, check the installed version of the HBA
(chip revision) and driver (fw version) using the adapter --query BCU command. To use this
command, refer to “Collecting data using BCU commands” on page 28. Refer to release notes
posted on the Brocade HBA web site (www.brocade.com/hba) for known problems relating to the
HBA and driver versions.
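Several checks in Table 2 come down to comparing two counters. The following Python sketch restates two of them (the rport online/offline rule and the FLOGI sent/accept rule) as plain comparisons. It is illustrative only and is not part of BCU or HCM: the function names are hypothetical, and the counter values are assumed to be read by hand from the statistics displays described in “Remote port statistics” and “Fabric statistics.”

```python
# Hypothetical helpers; counter values are assumed to be copied from
# the BCU or HCM statistics displays.

def rport_reports_online(online_count: int, offline_count: int) -> bool:
    """Per Table 2, a healthy remote port shows an rport online counter
    that is exactly one greater than the rport offline counter."""
    return online_count == offline_count + 1


def flogi_counts_mismatch(flogi_sent: int, flogi_accepts: int) -> bool:
    """Per Table 2, unequal FLOGI sent and FLOGI accept counts suggest a
    fabric problem or a protocol issue between the HBA and the fabric."""
    return flogi_sent != flogi_accepts


if __name__ == "__main__":
    print(rport_reports_online(3, 2))
    print(flogi_counts_mismatch(5, 4))
```

If the first check fails, Table 2 recommends clearing the counters, reconnecting to the remote port, and comparing the counters again.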
Verifying installation
Problems with HBA operation may be due to improper hardware or software installation,
incompatibility between the HBA and your host system, unsupported SFPs installed on the HBA,
improper fiber optic cable connected to the fabric, or the HBA not operating within specifications.
Determine if problems may exist because of these factors by reviewing your installation with
information in the Brocade Fibre Channel HBA Installation and Reference Manual listed in Table 3.
Cannot roll back driver on all HBA instances using Device Manager
When using the Windows Device Manager, you can only roll back the driver for the first HBA
instance.
This occurs if you perform the following sequence of steps:
1. Install the driver using the Brocade driver installer program (bfa_installer.exe) or Software
Installer (GUI or command-based application).
2. Roll back the driver on HBA instances using Device Manager.
To avoid this problem, use one of the following methods:
• Install the driver for each HBA instance using the Device Manager, then roll back the driver
using Device Manager.
• Use the driver installer program (bfa_installer.exe) or Brocade Software Installer (GUI or
command-based application) to install or upgrade the driver, then use the Brocade Software
Uninstaller to roll back drivers on all HBA instances in one step.
You can use HCM and tools available through your host’s operating system to obtain information
such as driver name, driver version, and HBA port WWNs.
Windows
Use the Device Manager to determine driver installation. Verify if the driver is installed and
Windows is recognizing the HBA using the following steps.
1. Open the Device Manager.
2. Expand the list of SCSI and RAID controllers.
3. Right-click the Brocade FC HBA model where you are installing the driver.
If you do not see this entry or Fibre Channel Controller displays with a yellow question mark
under Other Devices, the driver is not installed.
4. Select Properties to display the Properties dialog box.
5. Click the Driver tab to display the driver date and version. Click Driver Details for more
information.
NOTE
If the driver is not installed, try reinstalling the driver, or reinstalling the HBA hardware and
then the driver.
Linux
Verify if the HBA driver installed successfully using the following commands:
• # rpm -qa|grep -i bfa
This command prints the name of the Brocade HBA driver package (bfa) if installed.
• # lspci
This is a utility that displays information about all PCI buses in the system and all devices
connected to them.
• # lsmod
This command displays information about all loaded modules. If bfa appears in the list, the
HBA driver is loaded to the system.
• # dmesg
This command prints kernel boot messages. For the bfa entry, the HBA model and driver version
should display if the hardware and driver are installed successfully.
• # modprobe -l bfa
This lists modules matching bfa that are available to load. If bfa displays, the module is
installed on the system; use lsmod to confirm that it is currently loaded.
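The lsmod check above can be scripted. The following Python sketch is one possible approach, not a Brocade tool: the function name is hypothetical, and the sample text is made-up output in the usual lsmod column layout (module name, size, use count).

```python
def module_loaded(lsmod_output: str, module: str = "bfa") -> bool:
    """Return True if `module` appears as a module name (first column)
    in text captured from the lsmod command."""
    for line in lsmod_output.splitlines():
        fields = line.split()
        # The header row begins with "Module", so it never matches.
        if fields and fields[0] == module:
            return True
    return False


if __name__ == "__main__":
    # Hypothetical sample; sizes and neighbors are illustrative only.
    sample = (
        "Module                  Size  Used by\n"
        "bfa                   300000  0\n"
        "scsi_transport_fc      50000  1 bfa\n"
    )
    print(module_loaded(sample))
```

Matching on the first column only avoids a false positive from lines where bfa appears in the "Used by" column of another module.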
Solaris
Verify if the HBA driver installed successfully using the following commands.
• pkgchk -nv bfa
This checks for and lists the installed HBA driver package files.
• pkginfo -l bfa
This displays details about installed Brocade HBA (bfa) drivers. Look for information as in the
following example. Note that the VERSION may be different, depending on the driver version you
installed. The ARCH and DESC information may also be different, depending on your host system
platform. If the HBA driver package is installed, bfa_pkg should display with a “completely
installed.” status.
PKGINST: bfa
NAME: Brocade Fibre Channel Adapter Driver
CATEGORY: system
ARCH: sparc&i386
VERSION: alpha_bld31_20080502_1205
BASEDIR: /
VENDOR: Brocade
DESC: 32 bit & 64 bit Device driver for Brocade Fibre Channel adapters
PSTAMP: 20080115150824
INSTDATE: May 02 2008 18:22
HOTLINE: Please contact your local service provider
STATUS: completely installed
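The pkginfo -l output shown above is a list of KEY: value pairs, so the STATUS field can be checked programmatically. The following Python sketch assumes the output has been captured as text; the function names are hypothetical, and the sample reuses a few lines from the example above.

```python
def parse_pkginfo(text: str) -> dict:
    """Split 'KEY: value' lines from pkginfo -l output into a dict.
    Only the first colon separates key and value, so values that
    contain colons (such as INSTDATE) are kept whole."""
    info = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            info[key.strip()] = value.strip()
    return info


def driver_package_installed(text: str) -> bool:
    """True when the package is bfa and STATUS is 'completely installed'."""
    info = parse_pkginfo(text)
    return (info.get("PKGINST") == "bfa"
            and info.get("STATUS") == "completely installed")


if __name__ == "__main__":
    sample = (
        "PKGINST: bfa\n"
        "VENDOR: Brocade\n"
        "INSTDATE: May 02 2008 18:22\n"
        "STATUS: completely installed\n"
    )
    print(driver_package_installed(sample))
```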
VMware
Verify if the HBA driver installed successfully using the following commands:
• vmkload_mod -l
This lists installed driver names, R/O and R/W addresses, and whether the ID is loaded. Verify
that an entry for bfa exists and that the ID is loaded.
• cat /proc/vmware/version
This displays the latest versions of installed drivers. Look for a bfa entry and related build
number.
- Try rebooting the system without any connectivity to the switch. This will help isolate any
hang caused by switch and device interactions.
- Reseat SFPs in the HBA. Determine whether the installed SFPs are faulty by observing LED
operation at the HBA ports. If all LEDs are flashing amber, the SFP is invalid and may not be a
required Brocade model. You can also verify SFP operation by replacing them with SFPs in
known operating condition. If the problem is resolved after replacement, the original SFP is
faulty.
- Check for conflicts with attached devices. Verify that data speed (1-8 Gbps) and
connection topology (for example, point-to-point) for devices attached to the HBA are
compatible with settings on the HBA port. Although auto may be set, configuring settings
manually on the HBA port and devices may allow connection. Also, note that the HBA only
supports point-to-point connection topology. Refer to the Brocade Fibre Channel HBA
Administrator’s Guide for procedures to configure HBA ports.
NOTE
Observe the LEDs by HBA ports. Illuminated LEDs indicate connection, link activity, and
connection speed negotiated with the attached device. Refer to “LED Operation” in the
Specifications chapter of the Brocade Fibre Channel HBA Installation and Reference
Manual.
• Communication between the client and agent is blocked by a firewall preventing access to the
port (usually only a consideration for remote HCM management).
NOTE
This command is a single line. The localhost can be replaced with a different IP address.
wget --no-check-certificate https://admin:password@localhost:34568/JSONRPCServiceApp/SupportSaveController.do
If successful, the file SupportSaveController.do (actually a zip format file) will contain the data
from the HCM agent.
4. If you are managing a VMware host system through HCM from a remote system, the host’s
firewall may be blocking TCP/IP port 34568, which allows agent communication with HCM.
Use the following command to open port 34568:
/usr/sbin/esxcfg-firewall -o 34568,tcp,out,https
Use Windows Firewall with Advanced Security (WFAS) to open port 34568.
NOTE
You can change the default communication port (34568) for the agent using procedures in the
“Installation” chapter of the Brocade Fibre Channel HBA Installation and Reference Manual.
Refer to the section on modifying HCM agent operation.
5. If HCM is still unable to connect to the HCM agent after using the preceding steps, collect the
following data and send to your Support representative for analysis:
• Data collected from the previous step in SupportSaveController.do.
• Data from the HCM application SupportSave feature. Select Tools > SupportSave to
generate a supportsave file. The data file name and location display when the
SupportSave feature runs.
• HBA agent files on the HBA host (where the HCM agent is installed). Collect these files
using the following command:
tar cvfz hbaagentfiles.tgz /opt/hbaagent
Output collects to a file and location specified when the SupportSave feature runs.
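Before collecting support data, it can help to confirm from the remote HCM system whether the agent port is reachable at all, since a firewall block and a stopped agent both appear as a failed connection. The following Python sketch attempts a plain TCP connection to the default agent port (34568) named in this section. The function name and the example host address are hypothetical, and a successful connection shows only that something is listening on the port, not that it is the HCM agent.

```python
import socket


def agent_port_reachable(host: str, port: int = 34568,
                         timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened
    within `timeout` seconds, False otherwise."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # "192.0.2.10" is a placeholder documentation address; replace it
    # with the address of the host where the HCM agent is installed.
    print(agent_port_reachable("192.0.2.10", timeout=1.0))
```

If the agent is running on the host but this check fails from the remote system, the firewall steps above are the likely place to look.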
Windows systems
Perform the following tasks to isolate and resolve the problem.
1. Verify that the agent is running by executing the appropriate “status” command for your
operating system described in the “Installation” chapter of the Brocade Fibre Channel HBA
Installation and Reference Manual. Refer to the section on modifying HCM agent operation.
2. If you receive a message that the hcmagent is stopped, restarting the agent should resolve the
problem. To restart, use the appropriate “start” command for your operating system which is
also described in the Brocade Fibre Channel HBA Installation and Reference Manual.
Note that one command described in the manual restarts the agent, but the agent will not
restart if the system reboots or the agent stops unexpectedly. Another command restarts the
agent and also ensures that the agent restarts if the system reboots.
3. If the HCM agent starts, verify which TCP port the agent is listening on by executing the
following command at the Windows command prompt:
netstat -nao | findstr 34568
In the command output, the value in the last column (1960 in this example) is the process
identifier for the Windows process listening on the TCP port. Note that this identifier may be
different on your system.
4. Enter the following command to confirm that the process identifier bound to TCP port 34568 is
for the hcmagent.exe process:
tasklist /svc | findstr 1960
The following should display if the identifier from step 3 is bound to TCP port 34568:
hcmagent.exe 1960 hcmagent
5. If you are managing a Windows 2008 host system through HCM from a remote system, the
host’s firewall may be blocking TCP/IP port 34568.
Use Windows Firewall with Advanced Security (WFAS) to open port 34568.
NOTE
You can change the default communication port (34568) for the agent using procedures in the
“Installation” chapter of the Brocade Fibre Channel HBA Installation and Reference Manual.
Refer to the section on modifying HCM agent operation.
6. If the hcmagent is running and listening on port 34568 and there are no firewall issues (as
explained in step 5), but you get the same “Failed to connect to agent on host...” error when
using HCM, collect the following data. Send this data to your Support representative for
analysis:
• Copies of output from the commands in step 3 and step 4.
• Files from the output directory created after you execute the bfa_supportsave feature.
To collect these files, execute the following command:
bfa_supportsave
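Steps 3 and 4 above (finding the process identifier bound to port 34568, then matching it to hcmagent.exe) can be partly automated by parsing captured netstat -nao output. The following Python sketch is an illustration under assumptions: the function name is hypothetical, and the sample text is made-up output in the usual Windows netstat column layout (protocol, local address, foreign address, state, PID), using the PID 1960 from the example in this section.

```python
from typing import Optional


def listening_pid(netstat_output: str, port: int = 34568) -> Optional[int]:
    """Return the PID of the process listening on the given TCP port,
    or None if no LISTENING entry for that port is found."""
    suffix = f":{port}"
    for line in netstat_output.splitlines():
        fields = line.split()
        # Expected row: TCP <local addr> <foreign addr> LISTENING <pid>
        if (len(fields) == 5 and fields[0] == "TCP"
                and fields[3] == "LISTENING"
                and fields[1].endswith(suffix)):
            return int(fields[4])
    return None


if __name__ == "__main__":
    # Hypothetical sample output for illustration.
    sample = (
        "  Proto  Local Address     Foreign Address   State        PID\n"
        "  TCP    0.0.0.0:34568     0.0.0.0:0         LISTENING    1960\n"
    )
    print(listening_pid(sample))
```

The returned identifier can then be compared against the tasklist /svc output from step 4 to confirm that it belongs to hcmagent.exe.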
NOTE
Also verify LED operation on the switch port that is connected to an HBA port. Refer to the switch
Hardware Reference Manual to analyze LED meaning.
• Fiber optic cables may not be rated or compatible with HBA port speeds. Refer to Fibre Optic
Cable specifications in the Brocade Fibre Channel HBA Installation and Reference Manual.
• Faulty switch or HBA SFPs. To determine whether an SFP is the problem, connect a different
link to the HBA port or, if convenient, replace the cable with a cable of known quality. If the
link still shows errors or invalid data that suggest a cable problem, the SFP may be faulty. Try
replacing the SFP.
• SCSI retries and timeouts determine communication between the HBA and storage. Dropped
packets cause timeouts. Packets can drop because of SFP issues on the HBA or switch; for
example, the SFP may be compatible with the switch but not with the HBA, or vice versa. Run
the BCU port --stats command to display port statistics, such as error frames and dropped frames.
Table 4 lists HCM options and BCU commands, as well as Fabric OS commands that you can use to
determine link status.
In this chapter
• For detailed information
• Data to provide support
• Collecting data using host system commands
• Collecting data using BCU commands and HCM
• Collecting data using Fabric OS commands
• Event logs
• Statistics
• Diagnostics
• Collecting SFP data
• Collecting port data
• Authentication settings
• QoS and target rate limiting settings
• Persistent binding
NOTE
Output from all of these commands is captured using the Support Save feature.
List PCI devices
• Linux: lspci -vv
• Windows: In Windows registry location HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Enum\PCI; devcon find pci\*
• VMware: lspci -vv, esxcfg-info -w
• Solaris: prtdiag -v, prtconf -pv
List installed HW details
• Linux: lsdev
• Windows: msinfo32.exe. Click the plus sign (+) next to Components to view hardware details.
• VMware: esxcfg-info -a
• Solaris: prtdiag -v, prtconf -pv
Process information
• Linux: ps -efl, top
• Windows: Windows Task Manager, tasklist.exe
• VMware: ps -efl, top
• Solaris: ps -efl, top
Memory usage
• Linux: top, vmstat -m
• Windows: Windows Task Manager, tasklist.exe
• VMware: top, vmstat -m
• Solaris: vmstat -s
Performance monitoring
• Linux: iostat, vmstat, sar
• Windows: Windows Task Manager, perfmon.exe
• VMware: vmstat, VM Performance: esxtop [first type 'v', 'e' then enter vm# in the list down], Disk Performance: esxtop [type 'v' then 'd']
• Solaris: iostat -nx 1 5, vmstat, mpstat, sar
List for driver modules
• Linux: lsmod
• Windows: driverquery
• VMware: vmkload_mod -l
• Solaris: modinfo
To check for Brocade Fibre Channel adapter (BFA) driver module
• Linux: lsmod | grep bfa
• Windows: driverquery /v | findstr bfad
• VMware: vmkload_mod -l
• Solaris: modinfo | grep bfa
Show OS distribution info
• Linux: (SuSE) cat /etc/SuSE-release, (RedHat) cat /etc/redhat-release
• Windows: systeminfo.exe
• VMware: cat /etc/vmware-release
• Solaris: uname -a, cat /etc/release
BFA configuration file location
• Linux: /etc/bfa.conf
• Windows: Windows Registry (HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\bfad\Parameters\Device), HBA Flash
• VMware: /etc/bfa.conf
• Solaris: /kernel/drv/bfa.conf
BFA device file location
• Linux: /dev/bfa*
• Windows: Windows Registry (HKEY_LOCAL_MACHINE\HARDWARE\DEVICEMAP\Scsi\Scsi Port x)
• VMware: /dev/bfa*
• Solaris: (Release 1.0) /devices/pci*/pci*/fibre-channel@0:devctl; (Release 1.1 and later) /devices/pci*/pci*/bfa@0:devctl
NOTE
A bfa_supportsave collection can also occur automatically for a port crash event.
NOTE
Master and Application logs are saved when Support Save is initiated through HCM, but not through
BCU.
where:
[output_dir] An optional parameter that specifies the directory where you want output
saved. If not specified, output is saved as a directory in the current working
directory as bfa_ss_out.
Messages display as the system gathers information. When complete, an output file and directory
display. The directory name specifies the date when the file was saved.
For more information on the bfa_supportsave feature, refer to the Host Connectivity Manager
(HCM) Administrator’s Guide.
where localhost is the IP address of the server from which you want to collect the
bfa_supportsave information.
2. Log in using the factory default user name (admin) and password (password). Use the current
user name and password if they have changed from the default.
The File Download dialog box displays, prompting you to save the supportSaveController.do
file.
3. Click Save and navigate to the location where you want to save the bfa_supportsave file.
Port crash events have CRITICAL severity, and you can view the details in the Master Log and
Application Log tables in HCM. For more information on these logs, refer to “HCM logs.”
where:
list Lists all adapters in the system. For each adapter in the system, a brief
summary line is displayed.
The adapter --query command displays adapter information, such as the current version of the HBA
(chip revision) and driver (fw version), maximum port speed, model information, serial number,
number of ports, PCI information, pwwn, nwwn, hardware path, and flash information (such as
firmware version).
adapter --query <ad_id>
where:
ad_id ID of the adapter (HBA) for which you want to query.
• fcpProbeShow
Use this command to display the Fibre Channel Protocol daemon (FCPd) device probing
information for the devices attached to a specified F_Port or FL_Port. This information includes
the number of successful logins and SCSI INQUIRY commands sent over this port and a list of
the attached devices.
• nsShow
Use this command to display local NS information about all devices connected to a specific
switch. This includes information such as the device PID, device type, and the port and node
WWN.
• zoneshow
Use this command without parameters to display all zone configuration information (both
defined and enabled).
• portErrShow
Use this command to display an error summary for all switch ports.
• portLogShow
Use this command to display the port log for ports on a switch.
• portLogShowPort
Use this command to display the port log for a specified switch port.
• portPerfShow
Use this command to display throughput information for all ports on the switch.
• portStatsShow
Use this command to display hardware statistics counters for a specific switch port.
• portShow
Use this command to display information and status of a specified switch port, including the
speed, ID, operating state, type, and WWN.
• secAuthSecret
Use this command to manage the DH-CHAP shared secret key database used for
authentication. This command displays, sets, and removes shared secret key information from
the database.
• sfpShow
Use this command to display detailed information about specific SFPs installed in a switch.
• switchShow
Use this command to display switch and port information. Output may vary depending on the
switch model. Use this information to determine the fabric port WWN and PID connected to an
HBA port. The output also shows the topology, speed, and state of each port on the switch.
Event logs
Event messages that occur during HBA and driver operation are important tools for isolating and
resolving problems. Messages provide descriptions of the event, severity, time and date of the
event, and in some cases, cause and recommended actions. These messages are captured in logs.
Monitoring events in these logs allows early fault detection and isolation on a specific HBA.
The following types of logs are available:
• Host operating system logs
• HCM logs
Table 7 describes the logs for each supported operating system, where the logs are stored, and
how to view them.
1. For ESX Server Console operating system. For Guest system, refer to information in Windows or Linux.
You can view all event messages that can display for a Brocade HBA by viewing HTML files that are
loaded to your system as the driver package installs. These files contain all message information
that can display on system logs for the Brocade HBA. View these files through your internet browser.
Table 8 provides the location of the Brocade HBA message files for each supported system.
Linux /opt/bfa
VMware /opt/bfa
Solaris /opt/bfa
Windows aen.zip loaded to your driver installation
directory. Unzip this file to obtain all message
catalog files.
Adjust the logging level, or the types of messages logged to your system log that relate to HBA
driver operation, using the following HCM options and BCU commands.
• Adjust logging level using this BCU command.
log --level <port_id> [<Trace | Debug | Info | Warn | Error | Fatal>]
• Adjust logging level using the HCM.
Select Configure > HCM Logging Levels. Select a level on the Agent Communication Log
and HCM Debug Log lists. Values are Trace, Debug, Info, Warning, Error, and Fatal.
NOTE
For details on adjusting logging levels, refer to the Brocade Fibre Channel HBA Administrator’s
Guide.
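If you script logging-level changes, it helps to validate the level before invoking the BCU. The Python sketch below builds the argument list for the log --level command shown above. The function name and the example port ID are hypothetical, and the sketch assumes that level names are passed exactly as listed in the command syntax.

```python
# Level names copied from the log --level syntax in this section.
BCU_LOG_LEVELS = ("Trace", "Debug", "Info", "Warn", "Error", "Fatal")


def build_log_level_command(port_id, level=None):
    """Return the argument list for log --level; omit level to leave it unset."""
    args = ["log", "--level", str(port_id)]
    if level is not None:
        if level not in BCU_LOG_LEVELS:
            raise ValueError("unsupported logging level: " + repr(level))
        args.append(level)
    return args


# "1/0" is a placeholder port ID used only for illustration.
print(" ".join(build_log_level_command("1/0", "Debug")))
```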
HCM logs
You can view data about HBA operation through the following HCM logs. Both of these logs display
on the bottom of the HCM main window. Click the Master Log or Application Log to toggle between
logs.
• The Master Log displays informational and error messages during HBA operation. This log
contains the severity level, event description, date and time of event, the function that reported
the event (such as a specific HBA port or remote target port), WWN of device where event
occurs, and other information.
• The Application Log displays informational and error messages related to HBA discovery or
HCM application issues.
Master Log
The Master Log displays event information in seven fields:
• Sr No.
Sequence number assigned to the event, in ascending order of occurrence.
• Severity
Event severity level (informational, minor, major, or critical).
- Critical-level messages indicate that the software has detected serious problems that will
eventually cause a partial or complete failure of a subsystem if not corrected immediately.
Examples of these could be a power supply failure or rise in temperature.
- Major messages represent conditions that do not impact overall system functionality
significantly. Examples of these could be timeouts on certain operations, failures of
certain operations after retries, invalid parameters, or failure to perform a requested
operation.
- Minor messages highlight a current operating condition that should be checked because it
might lead to a failure.
- Information-level messages report the current non-error status of the system components;
for example, the online and offline status of a fabric port.
• WWN
World Wide Name of HBA where event occurred.
• Category
The category or type of event. Categories define the component where events occur:
- Adapter - Events relating to the HBA (Adapter).
- Port - Events relating to a specific port on the HBA.
- LPORT - Events relating to a specific logical port.
- RPORT - Events relating to a specific remote initiator or target port.
- ItNIM - Events relating to an initiator-target nexus. Examples of these include end to end
target discovery, initiator target connectivity, and loss of connectivity.
- Audit - Audit events.
- IOC - Driver and firmware events involving the I/O controller on the HBA.
• Subcategory
Subcategory of main category.
• Description, Date, and Time
Brief description of event and date and time when event occurred.
You can block events from display in the Master Log by severity, category, and WWN of HBA. To
display the Master Log Filter dialog box, click the Filter button in the Master Log section of the main
HCM screen. Select areas that you want to filter and click OK.
Application Log
The Application Log displays all application-related informational and error messages, as well as
the following attributes.
• Date and time the message occurred.
• Severity of the message.
• Description of the message.
• The agent IP address.
Syslog support
You can configure the HCM agent to forward events to a maximum of three syslog destinations
using the Syslog option on the HCM Configure menu. These events will display in the operating
system logs for systems such as Solaris and Linux. For procedures to configure syslog destinations,
refer to the Brocade Fibre Channel HBA Administrator’s Guide.
Statistics
You can access a variety of statistics using BCU commands and HCM. Use these statistics to
monitor HBA performance and traffic between the HBA and LUNs and isolate areas that impact
performance and device login.
You can display statistics for the following:
• HBA ports
• IO controller
• Virtual ports (vport)
• Logical ports (lport)
• Remote ports (rport)
• FCP initiator mode
• Fabric (BCU only)
• Targets
• Security authentication
This section provides an overview of these statistics and how to access them. For more detail, refer
to the Brocade Fibre Channel HBA Administrator’s Guide.
Port statistics
Use BCU and HCM to display a variety of port statistics, such as transmitted and received frames
and words, received loop initialization primitive (LIP) event counts, error frames received, loss of
synchronization, link failure and invalid CRC counts, and end of frame (EOF) errors. Use these
statistics to isolate link and frame errors. For example, loss of synch and loss of signal errors
indicate a physical link problem. To resolve these problems, check cables, SFPs on the HBA or
switch, and patch panel connections.
where:
port_id ID of the port for which you want to display statistics.
IOC statistics
Use BCU and HCM to display port-level statistics for the I/O controller (IOC).
The I/O controller refers to the firmware entity controlling the port. The following types of IOC
statistics are displayed:
• IOC driver
• IOC firmware
• Firmware IO
• Firmware port FPG
• Firmware port PHYSM
• Firmware port LKSM
• Firmware port SNSM
where:
ioc_id ID of the I/O controller for which you want to display statistics.
Fabric statistics
Use BCU and HCM to display statistics for fabric login (FLOGI) activity and fabric offlines and
onlines detected by the port. Use these statistics to help isolate fabric login problems. Following are
two examples of how to use these statistics for troubleshooting:
• If the HBA is not showing in the fabric, check the FLOGI sent and FLOGI accept statistics. If the
counts do not match, the switch or fabric may not be ready to respond. This is normal as long
as it does not persist. If the problem persists, this could indicate a problem in the fabric or a
protocol issue between the HBA and fabric.
• If fabric offline counts increase and fabric maintenance is not being done, this may indicate a
serious fabric problem. Slow fabric performance or hosts unable to address storage could also
be seen.
where:
port_id ID of the HBA port for which you want to display statistics.
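The first troubleshooting example in this section amounts to comparing two counters over time. The following Python sketch illustrates that reasoning. It assumes you have already read the FLOGI sent and FLOGI accept counts from the statistics display at two different times. The function and parameter names are hypothetical, and no particular BCU output format is assumed.

```python
def flogi_gap_persists(first_sample, second_sample):
    """Report whether FLOGI accepts lag FLOGI sends in both samples.

    Each sample is a (flogi_sent, flogi_accepted) tuple read at a different
    time. A gap in only one sample can be a normal transient; a gap in both
    suggests a fabric problem or a protocol issue between HBA and fabric.
    """
    def gap(sample):
        sent, accepted = sample
        return sent - accepted

    return gap(first_sample) > 0 and gap(second_sample) > 0


# Illustrative counter values only.
print(flogi_gap_persists((3, 1), (5, 1)))
print(flogi_gap_persists((3, 1), (3, 3)))
```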
where:
port_id ID of the port for which you want to display rport statistics.
lpwwn Displays the logical port world wide name. This is an optional argument. If the
-l lpwwn argument is not specified, the base port is used.
rpwwn Displays the remote port’s port world wide name.
where:
port_id ID of the port for which you want to display statistics.
lpwwn Logical port world wide name. This is an optional argument. If the -l lpwwn
argument is not specified, the base port is used.
rpwwn Remote port world wide name.
where:
port_id ID of the port for which you want to display statistics.
lpwwn Logical port world wide name for which you want to display statistics. This is
an optional argument. If the -l lpwwn argument is not specified, the base port
is used.
rpwwn Remote port world wide name for which you want to display statistics.
• Check the No NPIV support statistics to verify that NPIV is supported and enabled on the
switch.
where:
port_id ID of the port for which you want to display rport statistics.
vpwwn Displays the statistics for the virtual port by its WWN. If no port WWN is
specified, the information provided is for the base vport.
where:
port_id ID of the port for which you want to display rport statistics.
Diagnostics
Diagnostics, available through BCU commands and HCM, evaluate the integrity of HBA hardware
and end-to-end connectivity in the fabric. All of these diagnostics can be used while the system is
running.
Beaconing
Initiate beaconing on a specific HBA port to flash the port LEDs and make it easier to locate the
HBA in an equipment room.
Initiate link beaconing to flash the LEDs on a specific HBA port and the LEDs on a connected switch
port to verify the connection between HBA and switch. When you initiate link beaconing, commands
are sent to the other side of the link. When the remote port receives these commands, that port’s
LEDs flash. The remote port sends a command back to the originating port. When that port
receives this command, the port’s LEDs flash.
NOTE
To initiate link beaconing, this feature must be available on the connected switch.
Toggle beaconing on and off and set beaconing duration using the BCU or HCM.
where:
port_id ID of the port for which you want to enable beaconing.
duration Length of time between blinks.
where:
port_id ID of the port for which you want to run a link beacon test.
on | off Toggle on or off. If turned on, you can specify duration.
duration Length of time between blinks.
Loopback tests
Use the BCU or the HCM to perform a loopback test for a specific port. Loopback tests require that
you disable the port. The following loopback tests are available:
• Internal
Random data is sent to the HBA port, then returned without transmitting through the port. The
returned data is validated to determine port operation. Errors may indicate a failed port.
• External
For this test, a loopback connector is required for the port. Random data is sent to the HBA
port. The data transmits from the port then returns. The returned data is validated to
determine port operation. Errors may indicate a failed port.
where:
port_id ID of the port on which you want to run the test.
loopback type internal, external, serdes
speed For 8 Gbps HBA, this is 2, 4, or 8. For 4 Gbps HBA, this is 1, 2 or 4.
duration Length of time between blinks.
frame count Integer from 0 to 4,294,967,295. Default is 8192.
-p pattern Hex number. Default value is A5A5A5A5.
where:
port_id ID of the port from which you want to run the test.
pattern Hex number.
frame count Integer.
Memory test
Use the BCU or the HCM to perform a memory test for the HBA.
NOTE
Performing the Memory test disables the HBA.
where:
ad_id Serial number of adapter.
HBA temperature
Use the BCU diag --tempshow command to read the adapter’s temperature sensor registers.
diag --tempshow <ad_id>
where:
port_id ID of the HBA port from which you want to ping the remote port.
rpwwn Remote port WWN that you want to ping.
lpwwn Logical port WWN. 0 indicates the base port.
7. Click Start.
Trace route
Use the BCU and HCM to report the SAN path between the HBA and remote end point.
where:
port_id ID of the port from which you want to trace the route.
rpwwn Remote port WWN to which you want to trace the route.
lpwwn Logical port WWN. 0 indicates the base port.
Echo test
Use the BCU and HCM to initiate an echo test between the HBA port and a Fibre Channel end point.
This sends an ECHO command and response sequence between the HBA port and target port to
verify connection with the target.
where:
port_id ID of the port for which you want to perform the test.
rpwwn Remote port WWN with which you want to perform the echo test.
lpwwn Logical port WWN. 0 indicates the base port.
SCSI test
Use the fcdiag --scsitest BCU command to test the SCSI link between the HBA and remote port.
fcdiag --scsitest <port_id> <rpwwn> [-l lpwwn]
where:
port_id ID of the port for which you want to test the SCSI link.
rpwwn Remote port WWN to which you want to test the SCSI link.
lpwwn Logical port WWN. 0 indicates the base port.
SFP diagnostics
SFP diagnostics provide detailed information on the SFP transceiver for a selected port, such as its
health status, port speed, connector type, minimum and maximum distance, as well as details on
the extended link.
where:
port_id ID of the port for which you want to display SFP attributes.
Port log
Use the debug --portlog BCU command to display a log of Fibre Channel frames and other main
control messages that were sent out and received on a specific port. You can use this information
to isolate HBA and Fibre Channel protocol problems.
debug --portlog <port_id>
where:
port_id The ID of the port for which you want to display the port log.
NOTE
If the port log is disabled, a warning message displays. Use the debug --portlogctl command to
enable and disable the port log.
Port list
Use the port --list BCU command to list all physical ports on the HBA along with their physical
attributes, such as PWWN, Fibre Channel address, port type, speed, and state.
port --list <port_id>
where:
port_id ID of the port for which you want to display information.
Port query
Use the port --query BCU command to display port information, such as WWN, NWWN, state, current
and configured speed, topology, received and transmitted BB_Credits, and beacon status.
port --query <port_id>
Port speed
Use the port --speed BCU command to display the current port speed setting, such as 1, 4, or 8 Gbps.
port --speed <port_id> <1|2|4|8|auto>
where:
port_id ID of the port for which you want to display port speed.
<1|2|4|8|auto> The speed settings, with auto being autosensing mode.
Authentication settings
Use the Brocade CLI utility (BCU) or the HCM GUI to display the HBA authentication settings and
status.
where:
port_id ID of the port for which you want to display authentication settings.
BCU commands
• Use the following BCU command to determine Target Rate Limiting speed and enabled status.
ratelim --query <port_id>
where:
port_id ID of the port for which you want to display target rate limiting settings.
• Use the following BCU command to display QoS and target rate limiting enabled status and
target rate limiting default speed.
port --query <port_id>
where:
port_id ID of the port for which you want to display port information.
• Use the following command to display QoS status and information for a port.
bcu qos --query <port_id>
where:
port_id ID of the port for which you want to display QoS settings.
• Use the following command to determine operating speed of the remote port, QoS priority, and
target rate limiting enforcement:
bcu rport --query
where:
port_id Specifies the ID of the port for which you want to query attributes of a remote
port.
HCM
• Use the Port Properties panel in HCM to display configured QoS parameters.
To open the Port Properties panel:
1. Select a port in the device tree.
2. Click the Properties tab in the right pane.
• Use the Remote Port Properties panel in HCM to display information on target rate limiting and
QoS for the remote port.
To open the Remote Port Properties panel:
1. From the device tree, select a remote port (target or initiator).
2. Click the Remote Port Properties tab in the right pane.
Persistent binding
Persistent binding is a feature of Fibre Channel (FC) host bus adapters that enables you to
permanently assign a system SCSI target ID to a specific FC device, even though the device’s ID on
the FC loop may be different each time the FC loop initializes. Persistent binding is available in the
Windows and VMware environments only.
Use the HCM or BCU to display target ID mapping for an HBA port.
BCU
Use the pbind --list BCU command to query the list of mappings for persistent binding on a specific
port.
pbind --list <port_id> <pwwn>
where:
port_id ID of the port for which you want to query mappings.
pwwn Port World Wide Name.
HCM
To determine SCSI target ID mappings using the Persistent Binding dialog box, perform the
following steps:
1. Launch the HCM.
2. Select an HBA or port from the device tree.
3. Select Configure > Persistent Binding.
You can also select and right-click on an HBA or port in the device tree and select Persistent
Binding from the list.
Performance optimization 4
This chapter provides information and tools for optimizing your HBA performance.
In this chapter
• Linux tuning
• Solaris tuning
• Windows tuning
• VMware tuning
Linux tuning
Linux disk I/O scheduling reorders, delays, and merges requests for disk I/O to achieve better
throughput and lower latency than would happen if all the requests were sent straight to the disk.
Linux 2.6 has four different disk I/O schedulers: noop, deadline, anticipatory and completely fair
queuing. Enabling the “noop” scheduler avoids any delays in queuing of I/O commands. This helps
in achieving higher I/O rates by queuing multiple outstanding I/O requests to each disk.
To enable the noop scheduler, run the following commands on your system.
# Select the noop I/O scheduler for disks sdb through sdz (requires root privileges).
for i in /sys/block/sd[b-z]/queue/scheduler
do
echo noop > "$i"
done
NOTE
You must disable the default scheduler because it is not tuned for achieving the maximum I/O
performance.
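To confirm which scheduler is active after running the loop, read the same sysfs file back. The kernel marks the active scheduler with square brackets. The Python sketch below parses that text. The function name and the sample string are illustrative assumptions.

```python
def active_scheduler(scheduler_text):
    """Return the scheduler name enclosed in [brackets], or None if absent."""
    for token in scheduler_text.split():
        if token.startswith("[") and token.endswith("]"):
            return token[1:-1]
    return None


# Illustrative content of /sys/block/sdb/queue/scheduler on a 2.6 kernel.
print(active_scheduler("[noop] anticipatory deadline cfq"))
```

On a live system, pass the contents of each /sys/block/sdX/queue/scheduler file to this function and check that the result is noop.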
Solaris tuning
To increase I/O transfer performance, set the following parameters on your system:
• Set the maximum device read/write directive (maxphys).
• Set the Fibre disk maximum transfer parameter (ssd_max_xfer_size).
Refer to the Sun StorageTek SAM File System Configuration and Administration Guide for details
on these two parameters.
Windows tuning
Windows tuning involves configuring the driver and operating system tunable parameters.
Valid Range: 0 – 225 microseconds (Note that the value of 0 disables the latency monitor
timeout interrupt.)
• Interrupt Coalescing
When this feature is turned off, I/O completion requests are not coalesced by the firmware.
While this helps reduce I/O latency, the host CPU is interrupted frequently, leading to a
slower system response under heavy I/O load (more than 7000 I/Os per second).
When this feature is turned on, the HBA does not interrupt the host until the Interrupt delay
duration has elapsed. Interrupt delay, together with Interrupt latency, helps to reduce the
number of interrupts that the host CPU processes per second, leading to improved overall CPU
utilization. However, if the number of interrupts handled within the Interrupt delay period is
relatively small, performance degrades because I/O completion processing slows down.
Use the BCU ioc --intr command to configure these interrupt attributes for the desired
port.
bcu ioc --intr <ioc_id> <--coalesce|-c> {on | off} [<Latency> <Delay>]
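The syntax above makes Latency and Delay optional, but they must be supplied as a pair. The Python sketch below builds the argument list accordingly. The function name, the IOC ID, and the latency and delay values are hypothetical, and the sketch does not check value ranges.

```python
def build_intr_command(ioc_id, coalesce, latency=None, delay=None):
    """Return the argument list for the bcu ioc --intr command."""
    # Latency and Delay appear together in the syntax, so require both or neither.
    if (latency is None) != (delay is None):
        raise ValueError("Latency and Delay must be given together")
    args = ["bcu", "ioc", "--intr", str(ioc_id),
            "--coalesce", "on" if coalesce else "off"]
    if latency is not None:
        args += [str(latency), str(delay)]
    return args


# Placeholder IOC ID and timing values, for illustration only.
print(" ".join(build_intr_command(0, True, 15, 75)))
```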
OS tunable parameters
Please see the section “Storage Stack Drivers” in Disk Subsystem Performance Analysis for
Windows Server 2003 optimizations located on the following website.
http://download.microsoft.com
Please see the sections “Performance Tuning for Storage Subsystem” and “I/O Priorities” in
Performance Tuning Guidelines for Windows Server 2008 located on the following website.
http://www.microsoft.com
VMware tuning
For performance tuning on VMware, refer to the following publications on the VMware website at
www.vmware.com:
• Performance Tuning Best Practices for ESX Server 3. Refer to the following sections:
- Storage Performance Best Practices
- Related Publications
• Fibre Channel SAN Configuration Guide. Refer to “Using ESX Server with SAN: Concepts.”