NetBackup Dedupe Guide
NetBackup Dedupe Guide
Release 7.5
21220065
Legal Notice
Copyright 2012 Symantec Corporation. All rights reserved. Symantec and the Symantec Logo, Veritas, and NetBackup are trademarks or registered trademarks of Symantec Corporation or its affiliates in the U.S. and other countries. Other names may be trademarks of their respective owners. This Symantec product may contain third party software for which Symantec is required to provide attribution to the third party (Third Party Programs). Some of the Third Party Programs are available under open source or free software licenses. The License Agreement accompanying the Software does not alter any rights or obligations you may have under those open source or free software licenses. Please see the Third Party Legal Notice Appendix to this Documentation or TPIP ReadMe File accompanying this Symantec product for more information on the Third Party Programs. The product described in this document is distributed under licenses restricting its use, copying, distribution, and decompilation/reverse engineering. No part of this document may be reproduced in any form by any means without prior written authorization of Symantec Corporation and its licensors, if any. THE DOCUMENTATION IS PROVIDED "AS IS" AND ALL EXPRESS OR IMPLIED CONDITIONS, REPRESENTATIONS AND WARRANTIES, INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR NON-INFRINGEMENT, ARE DISCLAIMED, EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID. SYMANTEC CORPORATION SHALL NOT BE LIABLE FOR INCIDENTAL OR CONSEQUENTIAL DAMAGES IN CONNECTION WITH THE FURNISHING, PERFORMANCE, OR USE OF THIS DOCUMENTATION. THE INFORMATION CONTAINED IN THIS DOCUMENTATION IS SUBJECT TO CHANGE WITHOUT NOTICE. The Licensed Software and Documentation are deemed to be commercial computer software as defined in FAR 12.212 and subject to restricted rights as defined in FAR Section 52.227-19 "Commercial Computer Software - Restricted Rights" and DFARS 227.7202, "Rights in Commercial Computer Software or Commercial Computer Software Documentation", as applicable, and any successor regulations. Any use, modification, reproduction release, performance, display or disclosure of the Licensed Software and Documentation by the U.S. Government shall be solely in accordance with the terms of this Agreement.
Symantec Corporation 350 Ellis Street Mountain View, CA 94043 http://www.symantec.com Printed in the United States of America. 10 9 8 7 6 5 4 3 2 1
Technical Support
Symantec Technical Support maintains support centers globally. Technical Supports primary role is to respond to specific queries about product features and functionality. The Technical Support group also creates content for our online Knowledge Base. The Technical Support group works collaboratively with the other functional areas within Symantec to answer your questions in a timely fashion. For example, the Technical Support group works with Product Engineering and Symantec Security Response to provide alerting services and virus definition updates. Symantecs support offerings include the following:
A range of support options that give you the flexibility to select the right amount of service for any size organization Telephone and/or Web-based support that provides rapid response and up-to-the-minute information Upgrade assurance that delivers software upgrades Global support purchased on a regional business hours or 24 hours a day, 7 days a week basis Premium service offerings that include Account Management Services
For information about Symantecs support offerings, you can visit our Web site at the following URL: www.symantec.com/business/support/ All support services will be delivered in accordance with your support agreement and the then-current enterprise technical support policy.
Hardware information Available memory, disk space, and NIC information Operating system Version and patch level Network topology Router, gateway, and IP address information Problem description:
Error messages and log files Troubleshooting that was performed before contacting Symantec Recent software configuration changes and network changes
Customer service
Customer service information is available at the following URL: www.symantec.com/business/support/ Customer Service is available to assist with non-technical questions, such as the following types of issues:
Questions regarding product licensing or serialization Product registration updates, such as address or name changes General product information (features, language availability, local dealers) Latest information about product updates and upgrades Information about upgrade assurance and support contracts Information about the Symantec Buying Programs Advice about Symantec's technical support options Nontechnical presales questions Issues that are related to CD-ROMs, DVDs, or manuals
Contents
Chapter 2
Contents
How file size may affect the deduplication rate ............................ About deduplication stream handlers .............................................. Deployment best practices ............................................................. Use fully qualified domain names ............................................. About scaling deduplication ..................................................... Send initial full backups to the storage server ............................. Increase the number of jobs gradually ........................................ Introduce load balancing servers gradually ................................. Implement client deduplication gradually ................................... Use deduplication compression and encryption ........................... About the optimal number of backup streams .............................. About storage unit groups for deduplication ............................... About protecting the deduplicated data ...................................... Save the deduplication storage server configuration ..................... Plan for disk write caching ....................................................... How deduplication restores work .............................................. Replacing the PureDisk Deduplication Option with Media Server Deduplication on the same host ................................................ Migrating from PureDisk to the NetBackup Media Server Deduplication option .............................................................. Migrating from another storage type to deduplication ........................
54 54 54 54 55 55 56 56 57 57 57 57 58 59 59 59 60 61 62
Chapter 3
Chapter 4
Chapter 5
Contents
About NetBackup deduplication pools .............................................. 79 Configuring a deduplication disk pool .............................................. 81 Media server deduplication pool properties ................................. 81 Configuring a deduplication storage unit .......................................... 83 Deduplication storage unit properties ........................................ 85 Deduplication storage unit recommendations .............................. 86 Enabling deduplication encryption .................................................. 87 Configuring optimized synthetic backups for deduplication ................. 89 About configuring optimized duplication and replication bandwidth ............................................................................ 89 Configuring MSDP optimized duplication copy behavior ..................... 91 Configuring a separate network path for MSDP optimized duplication ........................................................................... 92 Configuring optimized duplication of deduplicated data ...................... 94 About the replication topology for Auto Image Replication .................. 97 Configuring a target for MSDP replication ....................................... 98 Viewing the replication topology for Auto Image Replication .............. 100 Sample volume properties output for MSDP replication ............... 101 About the storage lifecycle policies required for Auto Image Replication ......................................................................... 102 Customizing how nbstserv runs duplication and import jobs ................................................................................... 105 Creating a storage lifecycle policy .................................................. 105 Storage Lifecycle Policy dialog box settings ............................... 106 Adding a storage operation to a storage lifecycle policy ............... 108 About backup policy configuration ................................................ 110 Creating a policy using the Policy Configuration Wizard .................... 111 Creating a policy without using the Policy Configuration Wizard ........ 111 Enabling client-side deduplication ................................................. 112 Resilient Network properties ........................................................ 113 Resilient connection resource usage ........................................ 115 Specifying resilient connections .................................................... 116 Seeding the fingerprint cache for remote client-side deduplication ....................................................................... 117 Adding a deduplication load balancing server .................................. 117 About the pd.conf configuration file for NetBackup deduplication ....................................................................... 119 pd.conf file settings for NetBackup deduplication ...................... 119 Editing the pd.conf deduplication file ............................................. 126 About the contentrouter.cfg file for NetBackup deduplication ............. 127 About saving the deduplication storage server configuration .............. 128 Saving the deduplication storage server configuration ...................... 129 Editing a deduplication storage server configuration file ................... 129
10
Contents
Setting the deduplication storage server configuration ...................... About the deduplication host configuration file ............................... Deleting a deduplication host configuration file ............................... Resetting the deduplication registry .............................................. Configuring deduplication log file timestamps on Windows ................ Setting NetBackup configuration options by using bpsetconfig ...........
Chapter 6
Chapter 7
Contents
11
Viewing deduplication disk pools ............................................. Determining the deduplication disk pool state ........................... Changing the deduplication disk pool state ............................... Viewing deduplication disk pool attributes ................................ Setting deduplication disk pool attributes ................................. Changing deduplication disk pool properties ............................. Clearing deduplication disk pool attributes ............................... Determining the deduplication disk volume state ....................... Changing the deduplication disk volume state ........................... Deleting a deduplication disk pool ........................................... Deleting backup images ............................................................... Disabling client-side deduplication for a client ................................. About deduplication queue processing ........................................... Processing the deduplication transaction queue manually ................. About deduplication data integrity checking .................................... Configuring deduplication data integrity checking behavior ............... Deduplication data integrity checking configuration settings ........................................................................ About managing storage read performance ..................................... About deduplication storage rebasing ............................................ Resizing the deduplication storage partition .................................... About restoring files at a remote site .............................................. About restoring from a backup at a target master domain .................. Specifying the restore server ........................................................
162 162 162 163 164 165 170 171 171 172 173 173 174 174 175 176 178 179 180 182 183 183 184
Chapter 8
12
Contents
Errors, delayed response, hangs .............................................. Cannot delete a disk pool ....................................................... Media open error (83) ............................................................ Media write error (84) ........................................................... Storage full conditions .......................................................... Viewing disk errors and events ..................................................... Deduplication event codes and messages ........................................
Chapter 9
Chapter 10
Appendix A
Chapter
About NetBackup deduplication New features and enhancements for NetBackup 7.5
Reduce the amount of data that is stored. Reduce backup bandwidth. Reduced bandwidth can be especially important when you want to limit the amount of data that a client sends over the network. Over the network can be to a backup server or for image duplication between remote locations. Reduce backup windows. Reduce infrastructure.
14
NetBackup Media Server NetBackup clients send their backups to a NetBackup media Deduplication Option server, which deduplicates the backup data. A NetBackup media server hosts the NetBackup Deduplication Engine, which writes the data to the storage and manages the deduplicated data. See About the NetBackup Media Server Deduplication Option on page 23. NetBackup appliance deduplication Symantec provides a hardware and a software solution that includes NetBackup deduplication. The NetBackup 5200 series of appliances run the SUSE Linux operating system and include NetBackup software and disk storage. The NetBackup appliances have their own documentation set. See About NetBackup appliance deduplication on page 227. PureDisk deduplication NetBackup PureDisk is a deduplication solution that provides bandwidth-optimized backups of data in remote offices. You use the PureDisk interfaces to install, configure, and manage the PureDisk servers, storage pools, and client backups. You do not use NetBackup to configure or manage the storage or backups. PureDisk has its own documentation set. See the NetBackup PureDisk Getting Started Guide. A PureDisk storage pool can be a storage destination for both the NetBackup Client Deduplication Option and the NetBackup Media Server Deduplication Option. PureDisk appliance deduplication Symantec provides a hardware and a software solution that includes PureDisk deduplication. The NetBackup 5000 series of appliances run the PDOS operating system and include PureDisk software and disk storage. The NetBackup appliances have their own documentation set. See About NetBackup appliance deduplication on page 227.
15
Third- party vendor The NetBackup OpenStorage option lets third-party vendor appliance deduplication appliances function as disk storage for NetBackup. The disk appliance provides the storage and it manages the storage. A disk appliance may provide deduplication functionality. NetBackup backs up and restores client data and manages the life cycles of the data.
File 2
Q D L
The following list describes how NetBackup derives unique segments to store:
The deduplication engine breaks file 1 into segments A, B, C, D, and E. The deduplication engine breaks file 2 into segments A, B, Q, D, and L. The deduplication engine stores file segments A, B, C, D, and E from file 1 and file segments Q, and L from file 2. The deduplication engine does not store file
16
Introducing NetBackup deduplication New features and enhancements for NetBackup 7.5
segments A, B, and D from file 2. Instead, it points to the unique data copies of file segments A, B, and D that were already written from file 1. More detailed information is available. See Media server deduplication process on page 217.
Support for the AIX 5.3, 6.1, and 7.1 operating systems for deduplication servers and for client-side deduplication. 64-TB support for media server deduplication pools. Information about how to upgrade to more than 32 TB of storage space is available. See About support for more than 32-TB of storage on page 68. Resilient network connections provide improved support for remote office client deduplication. See About remote office client deduplication on page 31. iSCSI support. Information about the requirements for iSCSI support in NetBackup is available. See About deduplication server requirements on page 26. PureDisk 6.6.3 supports iSCSI disks in PureDisk storage pools only if the storage pool is deployed exclusively for PureDisk Deduplication Option (PDDO) use. iSCSI storage pools for PDDO use must be configured with the XFS file system and cannot be clustered. The PureDisk 6.6.3 documentation does not describe how to configure or manage a PureDisk storage pool that includes iSCSI disks. Information about how to use iSCSI disks in a PureDisk environment is in the PureDisk 6.6.1 documentation. http://www.symantec.com/docs/DOC3878 For known issues about iSCSI storage pools, a Symantec tech note is available. http://www.symantec.com/docs/TECH137146 Enhancements that improve restore and duplication performance. See About managing storage read performance on page 179. Deduplication integrity enhancements. See About deduplication data integrity checking on page 175. Windows storage server performance enhancements.
Introducing NetBackup deduplication New features and enhancements for NetBackup 7.5
17
Interprocess communication changes on Windows hosts improve performance to be similar to UNIX and Linux hosts. This change affects upgrades to NetBackup 7.5. See About shared memory on Windows deduplication storage servers on page 159.
FlashBackup performance improvements. Backup image delete and import performance improvements. NetBackup now reserves 4 percent of the storage space for the deduplication database and transaction logs rather than 10 percent. A new stream handler for EMC NDMP. Fibre Channel connections to NetBackup 5020 appliances. Fibre Channel is supported on x86-64 hosts that run the Red Hat Enterprise Linux 5 or SUSE Enterprise Linux Server 10 SP1 operating systems. See About Fibre Channel to a NetBackup 5020 appliance on page 228. The performance of the first backup of a remote client can be improved. See Seeding the fingerprint cache for remote client-side deduplication on page 117. Garbage collection now occurs during regularly scheduled queue processing. Therefore, the sched_GarbageCollection.log file no longer exists.
18
Introducing NetBackup deduplication New features and enhancements for NetBackup 7.5
Chapter
Planning your deduplication deployment About the deduplication tech note NetBackup naming conventions About the deduplication storage destination About the NetBackup Media Server Deduplication Option About NetBackup Client Deduplication About remote office client deduplication About NetBackup Deduplication Engine credentials About the network interface for deduplication About deduplication port usage About deduplication compression About deduplication encryption About optimized synthetic backups and deduplication About deduplication and SAN Client About optimized duplication and replication About deduplication performance About deduplication stream handlers Deployment best practices
20
Replacing the PureDisk Deduplication Option with Media Server Deduplication on the same host Migrating from PureDisk to the NetBackup Media Server Deduplication option Migrating from another storage type to deduplication
Deployment task
Read the deduplication tech note
Determine the storage destination See About the deduplication storage destination on page 22. Determine which type of deduplication to use See About the NetBackup Media Server Deduplication Option on page 23. See About NetBackup Client Deduplication on page 28. See About remote office client deduplication on page 31.
Step 4
See About NetBackup deduplication servers on page 25. See About deduplication server requirements on page 26. See About client deduplication requirements and limitations on page 30. See About the network interface for deduplication on page 33. See About deduplication port usage on page 33. See About scaling deduplication on page 55. See About deduplication performance on page 52.
Step 5
Determine the credentials for deduplication Read about compression and encryption Read about optimized synthetic backups
Step 6
See About deduplication compression on page 34. See About deduplication encryption on page 34. See About optimized synthetic backups and deduplication on page 35.
Step 7
21
Deployment task
Read about deduplication and SAN See About deduplication and SAN Client on page 36. Client Read about optimized duplication and replication Read about stream handlers Read about best practices for implementation Determine the storage requirements and provision the storage See About optimized duplication and replication on page 36.
Step 9
Step 10 Step 11
See About deduplication stream handlers on page 54. See Deployment best practices on page 54.
Step 12
See About provisioning the deduplication storage on page 65. See About deduplication storage requirements on page 66. See About deduplication storage capacity on page 68. See About the deduplication storage paths on page 69.
Step 13
See Replacing the PureDisk Deduplication Option with Media Server Deduplication on the same host on page 60. See Migrating from PureDisk to the NetBackup Media Server Deduplication option on page 61. See Migrating from another storage type to deduplication on page 62.
Step 14
Currently supported systems Media server and client sizing information Configuration, operational, and troubleshooting updates And more
22
Alphabetic (A-Z a-z) (names are case sensitive) Numeric (0-9) Period (.) Plus (+) Minus (-) Do not use a minus as the first character. Underscore (_)
Note: No spaces are only allowed. The naming conventions for the NetBackup Deduplication Engine differ from these NetBackup naming conventions. See About NetBackup Deduplication Engine credentials on page 32.
Media Server A NetBackup Media Server Deduplication Pool represents the disk storage that is attached Deduplication Pool to a NetBackup media server. NetBackup deduplicates the data and hosts the storage. If you use this destination, use this guide to plan, implement, configure, and manage deduplication and the storage. When you configure the storage server, select Media Server Deduplication Pool as the storage type. For a Media Server Deduplication Pool storage destination, all hosts that are used for the deduplication must be NetBackup 7.0 or later. Hosts include the master server, the media servers, and the clients that deduplicate their own data. Integrated deduplication means that the components installed with NetBackup perform deduplication.
Planning your deployment About the NetBackup Media Server Deduplication Option
23
PureDisk A NetBackup PureDisk Deduplication Pool represents a PureDisk storage pool. NetBackup Deduplication Pool deduplicates the data, and the PureDisk environment hosts the storage. If you use a PureDisk Deduplication Pool, use the following documentation: The PureDisk documentation to plan, implement, configure, and manage the PureDisk environment, which includes the storage. See the NetBackup PureDisk Getting Started Guide. This guide to configure backups and deduplication in NetBackup. When you configure the storage server, select PureDisk Deduplication Pool as the storage type.
A PureDisk Deduplication Pool destination requires that PureDisk be at release 6.6 or later. A NetBackup 5000 series appliance also provides a PureDisk storage pool to which NetBackup can send deduplicated data.
In addition to the previous storage destinations, the PureDisk Deduplication Option provides deduplication storage for a NetBackup environment. The PureDisk Storage Pool Authority provides an agent that you install on a NetBackup media server. This solution was created before NetBackup included integrated deduplication. If you use this storage option, use the PureDisk documentation to plan, implement, configure, and manage the storage and to configure NetBackup to use the agent. For a PureDisk storage pool destination, you can use NetBackup 6.5 or later NetBackup hosts. Hosts include the master server and the media servers.
24
Planning your deployment About the NetBackup Media Server Deduplication Option
The NetBackup Media Server Deduplication Option is integrated into NetBackup. It uses the NetBackup administration interfaces, commands, and processes for configuring and executing backups and for configuring and managing the storage. Deduplication occurs when NetBackup backs up a client to a deduplication storage destination. You do not have to use the separate PureDisk interfaces to configure and use deduplication. The NetBackup Media Server Deduplication Option integrates with NetBackup application agents that are optimized for the client stream format. Agents include but are not limited to Microsoft Exchange and Microsoft SharePoint Agents. Figure 2-1 shows NetBackup media server deduplication. The deduplication storage server is a media server on which the deduplication core components are enabled. The storage destination is a Media Server Deduplication Pool. Figure 2-1 NetBackup media server deduplication
NetBackup client
NetBackup client
NetBackup client
NetBackup client
Deduplication
Deduplication
plug-in
plug-in Deduplication
plug-in
Planning your deployment About the NetBackup Media Server Deduplication Option
25
A PureDisk storage pool may also be the storage destination. See About the deduplication storage destination on page 22. More detailed information is available. See Deduplication storage server components on page 215. See Media server deduplication process on page 217.
Deduplication storage One host functions as the storage server for a deduplication node; that host must be a server NetBackup media server. The storage server does the following:
Writes the data to and reads data from the disk storage. Manages that storage.
The storage server also deduplicates data. Therefore, one host both deduplicates the data and manages the storage. Only one storage server exists for each NetBackup deduplication node. See About deduplication nodes on page 26. You can use NetBackup deduplication with one media server host only: the media server that is configured as the deduplication storage server. How many storage servers you configure depends on your storage requirements. It also depends on whether or not you use optimized duplication or replication, as follows:
Optimized duplication in the same domain requires the following storage servers: One for the backup storage, which is the source for the duplication operations.
Another to store the copies of the backup images, which is the target for the duplication operations.
See About MSDP optimized duplication within the same domain on page 36. Auto Image Replication to another domain requires the following storage servers: One for the backups in the originating domain. This is the storage server that writes the NetBackup client backups to the storage. It is the source for the duplication operations. Another in the remote domain for the copies of the backup images. This storage server is the target for the duplication operations that run in the originating domain. See About NetBackup Auto Image Replication on page 47.
26
Planning your deployment About the NetBackup Media Server Deduplication Option
Load balancing server You can configure other NetBackup media servers to help deduplicate data. They perform file fingerprint calculations for deduplication, and they send the unique results to the storage server. These helper media servers are called load balancing servers. See About deduplication fingerprinting on page 223. A NetBackup media server becomes a load balancing server when two things occur: You enable the media server for deduplication load balancing duties. You do so when you configure the storage server or later by modifying the storage server properties. You select it in the storage unit for the deduplication pool.
See Introduce load balancing servers gradually on page 56. Load balancing servers also perform restore and duplication jobs. Load balancing servers can be any supported server type for deduplication. They do not have to be the same type as the storage server.
Planning your deployment About the NetBackup Media Server Deduplication Option
27
Newer SPARC processors, such as the SPARC64 VII, provide the single core throughput that is similar to AMD and Intel. Alternatively, UltraSPARC T1 and T2 single core performance does not approach that of the AMD and Intel processors. Tests show that the UltraSPARC processors can achieve high aggregate throughput. However, they require eight times as many backup streams as AMD and Intel processors to do so. Table 2-4 Component Storage server
CPU Symantec recommends at least a 2.2-GHz clock rate. A 64-bit processor is required. At least four cores are required. Symantec recommends eight cores. For 64 TBs of storage, Intel x86-64 architecture requires eight cores. RAM 4 GBs to 64 GBs. If your storage exceeds 4 TBs, Symantec recommends at least 1 GB more of memory for every terabyte of additional storage. For example, 10 TBs of back-end data require 10 GBs of RAM, 32 TBs require 32 GBs of RAM, and so on. For 64 TBs of storage, 64 GBs of RAM are required. Operating system The operating system must be a supported 64-bit The operating system must be a supported 64-bit operating system. operating system. See the operating system compatibility list for your NetBackup release on the NetBackup Enterprise Server landing page on the Symantec Support Web site. See the operating system compatibility list for your NetBackup release on the NetBackup Enterprise Server landing page on the Symantec Support Web site.
Deduplication server minimum requirements Load balancing server or PureDisk Deduplication Option host
Symantec recommends at least a 2.2-GHz clock rate. A 64-bit processor is required. At least two cores are required. Depending on throughput requirements, more cores may be helpful.
4 GBs.
A deduplication tech note provides detailed information about and examples for sizing the hosts for deduplication. Information includes the number of NICs or HBAs per server that are required to support your performance objectives. See About the deduplication tech note on page 21.
28
Note: In some environments, a single host can function as both a NetBackup master server and as a deduplication server. Such environments typically run fewer than 100 total backup jobs a day. (Total backup jobs means backups to any storage destination, including deduplication and nondeduplication storage.) If you perform more than 100 backups a day, deduplication operations may affect master server operations. See About deduplication performance on page 52. See About deduplication queue processing on page 174.
29
are stored in that deduplication node. The plug-in then sends only the unique segments to the NetBackup Deduplication Engine on the storage server. The engine writes the data to a media server deduplication pool. Client deduplication does the following:
Reduces network traffic. The client sends only unique file segments to the storage server. Duplicate data is not sent over the network. Distributes some deduplication processing load from the storage server to clients. (NetBackup does not balance load between clients; each client deduplicates its own data.)
Remote office or branch office backups to the data center. NetBackup provides resilient network connections for remote office backups. See About remote office client deduplication on page 31. LAN connected file server Virtual machine backups.
Client-side deduplication is also a useful solution if a client host has unused CPU cycles or if the storage server or load balancing servers are overloaded. Figure 2-2 shows client deduplication. The deduplication storage server is a media server on which the deduplication core components are enabled. The storage destination is a Media Server Deduplication Pool
30
Figure 2-2
plug-in
plug-in
Deduplication
plug-in
A PureDisk storage pool may also be the storage destination. See About the deduplication storage destination on page 22. More information is available. See About remote office client deduplication on page 31. See Deduplication client components on page 220. See Clientside deduplication backup process on page 220.
31
Client deduplication reduces the time that is required for WAN backups by reducing the amount of data that must be transferred. The resilient connections provide automatic recovery from network failures and latency (within the parameters from which NetBackup can recover).
When you configure a resilient connection, NetBackup uses that connection for the backups. Use the NetBackup Resilient Network host properties to configure NetBackup to use resilient network connections. See Resilient Network properties on page 113. See Specifying resilient connections on page 116. You can improve the performance of the first backup for a remote client. See Seeding the fingerprint cache for remote client-side deduplication on page 117.
32
For user names and passwords, you can use characters in the printable ASCII range (0x20-0x7E) except for the following characters:
Asterisk (*) Backward slash (\) and forward slash (/) Double quote (") Left parenthesis [(] and right parenthesis [)]
The user name and the password can be up to 63 characters in length. Leading and trailing spaces and quotes are ignored. The user name and password cannot be empty or all spaces.
Note: Record and save the credentials in case you need them in the future.
Caution: You cannot change the NetBackup Deduplication Engine credentials after you enter them. Therefore, carefully choose and enter your credentials. If you must change the credentials, contact your Symantec support representative.
33
10085
10102
34
If you enable encryption on a client that deduplicates its own data, the client encrypts the data before it sends it to the storage server. The data remains encrypted on the storage. Data also is transferred from the client over a Secure Sockets Layer to the server regardless of whether or not the data is encrypted. Therefore, data transfer from the clients that do not deduplicate their own data is also protected. If you enable encryption on a load balancing server, the load balancing server encrypts the data. It remains encrypted on storage. If you enable encryption on the storage server, the storage server encrypts the data. It remains encrypted on storage. If the data is already encrypted, the storage server does not encrypt it.
35
Note: Do not enable encryption by selecting the Encryption setting on the Attributes tab of the Policy dialog box. If you do, NetBackup encrypts the data before it reaches the deduplication plug-in that deduplicates it. Consequently, deduplication rates are very low. See Use deduplication compression and encryption on page 57. See Enabling deduplication encryption on page 87. See About the pd.conf configuration file for NetBackup deduplication on page 119. See pd.conf file settings for NetBackup deduplication on page 119.
36
37
duplication. Only the unique, deduplicated data segments are transferred. Optimized duplication reduces the amount of data that is transmitted over your network. Optimized duplication is a good method to copy your backup images off-site for disaster recovery. The following sections provide the conceptual information about optimized duplication. A process topic describes the configuration process. See Configuring optimized duplication of deduplicated data on page 94.
If the source images reside on a NetBackup Media Server Deduplication Pool, the destination can be another Media Server Deduplication Pool or a PureDisk Deduplication Pool. (In NetBackup, a PureDisk storage pool is configured as a PureDisk Deduplication Pool.) If the destination is a PureDisk storage pool, the PureDisk environment must be at release level 6.6 or later. If the source images reside on a PureDisk storage pool, the destination must be another PureDisk storage pool. Both PureDisk environments must be at release level 6.6 or later. The source storage and the destination storage must have at least one media server in common. See About the media servers for optimized MSDP duplication within the same domain on page 38. In the storage unit you use for the destination for the optimized duplication, you must select only the common media server or media servers. If you select more than one, NetBackup assigns the duplication job to the least busy media server. If you select a media server or servers that are not common, the optimized duplication job fails. For more information about media server load balancing, see the NetBackup Administrator's Guide for UNIX and Linux, Volume I or the NetBackup Administrator's Guide for Windows, Volume I. The destination storage unit cannot be the same as the source storage unit.
38
You cannot use optimized duplication from a PureDisk storage pool (a PureDisk Deduplication Pool) to a Media Server Deduplication Pool. If an optimized duplication job fails after the configured number of retries, NetBackup does not run the job again. By default, NetBackup retries an optimized duplication job three times. You can change the number of retries. See Configuring MSDP optimized duplication copy behavior on page 91. Optimized duplication does not work with storage unit groups. If you use a storage unit group as a destination for optimized duplication, NetBackup uses regular duplication. Optimized duplication does not support multiple copies. If NetBackup is configured to make multiple new copies from the (source) copy of the backup image, the following occurs:
In a storage lifecycle policy, one duplication job creates one optimized duplication copy. If multiple optimized duplication destinations exist, a separate job exists for each destination. This behavior assumes that the device for the optimized duplication destination is compatible with the device on which the source image resides. If multiple remaining copies are configured to go to devices that are not optimized duplication capable, NetBackup uses normal duplication. One duplication job creates those multiple copies. For other duplication methods, NetBackup uses normal duplication. One duplication job creates all of the copies simultaneously. The other duplication methods include the following: NetBackup Vault, the bpduplicate command line, and the duplication option of the Catalog utility in the NetBackup Administration Console.
See Optimized MSDP duplication within the same domain requirements on page 37.
About the media servers for optimized MSDP duplication within the same domain
For optimized Media Server Deduplication Pool duplication within the same domain, the source storage and the destination storage must have at least one media server in common. The common server initiates, monitors, and verifies the
39
copy operation. The common server requires credentials for both the source storage and the destination storage. (For deduplication, the credentials are for the NetBackup Deduplication Engine, not for the host on which it runs.) Which server initiates the duplication operation determines if it is a push or a pull operation. If it is physically in the source domain, it is push duplication. If it is in the destination domain, it is a pull duplication. Technically, no advantage exists with a push duplication or a pull duplication. However, the media server that initiates the duplication operation also becomes the write host for the new image copies. A storage server or a load balancing server can be the common server. The common server must have the credentials and the connectivity for both the source storage and the destination storage.
plug-in
plug-in
MSDP_R
LB_R1
Credentials: StorageServer-R
40
Figure 2-4 shows the settings for the storage unit for the normal backups for the local deduplication node. The disk pool is the MSDP_L in the local environment. Because all hosts in the local node are co-located, you can use any available media server for the backups. Figure 2-4 Storage unit settings for backups to MSDP_L
Figure 2-5 shows the storage unit settings for the optimized duplication. The destination is the MSDP_R in the remote environment. You must select the common server, so only load balancing server LB_L2 is selected.
41
Figure 2-5
If you use the remote node for backups also, select StorageServer-R and load balancing server LB_R1 in the storage unit for the remote node backups. If you select server LB_L2, it becomes a load balancing server for the remote deduplication pool. In such a case, data travels across your WAN. Figure 2-6 shows a push duplication from a Media Server Deduplication Pool to a PureDisk storage pool. The Media Server Deduplication Pool contains normal backups; the PureDisk Deduplication Pool is the destination for the optimized duplication copies. The StorageServer-A has credentials for both environments; it is the common media server.
42
Figure 2-6
Deduplication node A
StorageServer-A Deduplication Credentials: StorageServer-A Remote_MediaServer plug-in plug-in PureDisk content router and storage pool NetBackup Deduplication Engine
Get ready, here comes data
Deduplication
PureDisk_DedupePool
Figure 2-7 shows the storage unit settings for normal backups for the environment in Figure 2-6. The disk pool is the MediaServer_DedupePool in the local environment. For normal backups, you do not want a remote host deduplicating data, so only the local host is selected.
43
Figure 2-7
Figure 2-8 shows the storage unit settings for duplication for the environment in Figure 2-6. The disk pool is the PureDisk_DedupePool in the remote environment. You must select the common server, so only the local media server is selected. If this configuration were a pull configuration, the remote host would be selected in the storage unit.
44
Figure 2-8
Figure 2-9 shows optimized duplication between two PureDisk storage pools. NetBackup media server A has credentials for both storage pools; it initiates, monitors, and verifies the optimized duplication. In the destination storage unit, the common server (media server A) is selected. This configuration is a push configuration. For a PureDisk Deduplication Pool (that is, a PureDisk storage pool), the PureDisk content router functions as the storage server.
45
Figure 2-9
Deduplication
Storage pool A, send your data to storage pool B
plug-in
You can use a load balancing server when you duplicate between two NetBackup deduplication pools. However, it is more common between two PureDisk storage pools.
46
Figure 2-10
Pull duplication
Deduplication node B (duplicates) Storage server B Credentials: Host A Host B Host B Deduplication
Please verify that the data arrived
Deduplication node A (normal backups) Storage server A Credentials: Host A Host A Deduplication
plug-in
plug-in
MediaServer_DedupePool_A
MediaServer_DedupePool_B
Figure 2-11 shows the storage unit settings for the duplication destination. They are similar to the push example except host B is selected. Host B is the common server, so it must be selected in the storage unit.
47
Figure 2-11
If you use node B for backups also, select host B and not host A in the storage unit for the node B backups. If you select host A, it becomes a load balancing server for the node B deduplication pool.
One-to-one model
48
One-to-many model A single production datacenter can back up to multiple disaster recovery sites. See One-to-many Auto Image Replication model on page 49. Many-to-one model Remote offices in multiple domains can back up to a storage device in a single domain. Many-to-many model Remote datacenters in multiple domains can back up multiple disaster recovery sites.
Note: Although Auto Image Replication is a disaster recovery solution, the administrator cannot directly restore to clients in the primary (or originating) domain from the target master domain. Table 2-7 is an overview of the process, generally describing the events in the originating and target domains. Table 2-7 Event
1
The deduplication storage server in the target domain recognizes that a replication event has occurred and notifies the NetBackup master server in that domain. NetBackup imports the image immediately, based on an SLP that contains an import operation. NetBackup can import the image quickly because the metadata is replicated as part of the image. (This import process is not the same as the import process available in the Catalog utility.) After the image is imported into the target domain, NetBackup continues to manage the copies in that domain. Depending on the configuration, the media server in Domain 2 can replicate the images to a media server in Domain 3.
49
The image is created in Domain 1, and then replicated to the target Domain 2. The image is imported in Domain 2, and then replicated to a target Domain 3. The image is then imported into Domain 3.
50
Figure 2-12
Domain 1
Domain 2
Import All copies have the same Target retention, as indicated in Domain 1.
Domain 3
Import
In the cascading model, the originating master server for Domain 2 and Domain 3 is the master server in Domain 1. Note: When the image is replicated in Domain 3, the replication notification event initially indicates that the master server in Domain 2 is the originating master server. However, when the image is successfully imported into Domain 3, this information is updated to correctly indicate that the originating master server is in Domain 1. The cascading model presents a special case for the Import SLP that will replicate the imported copy to a target master. (This is the master server that is neither the first nor the last in the string of target master servers.) As discussed previously, the requirements for an Import SLP include at least one operation that uses a Fixed retention type and at least one operation that uses a Target Retention type. So that the Import SLP can satisfy these requirements, the import operation must use a Target Retention. Table 2-8 shows the difference in the import operation setup.
51
Table 2-8
Import operation difference in an SLP configured to replicate the imported copy Import operation in a cascading model
Same; no difference.
Same; no difference.
At least one operation must use the Target Here is the difference: retention. To meet the criteria, the import operation must use Target retention.
The target retention is embedded in the source image. Because the imported copy is the copy being replicated to a target master server domain, the fixed retention (three weeks in this example) on the replication to target master operation is ignored. The target retention is used instead. (See Figure 2-13.) Figure 2-13 Storage lifecycle policy configured to replicate the imported copy
Target retention of source image
In the cascading model that is represented in Figure 2-12, all copies have the same Target Retentionthe Target Retention indicated in Domain 1. For the copy in Domain 3 to have a different target retention, add an intermediary replication operation to the Domain 2 storage lifecycle policy. The intermediary replication operation acts as the source for the replication to target master. Since the target retention is embedded in the source image, the copy in Domain 3 honors the retention level that is set for the intermediary replication operation.
52
Figure 2-14
Domain 1
Domain 2
Import
Domain 3
The copy in Domain 3 has the retention indicated by the source replication in Domain 2.
Import
53
Table 2-9
When
Initial seeding
Normal operation
Normal operation is when all clients have been backed up once. Approximately 15 to 20 jobs can run concurrently and with high performance under the following conditions: The hardware meets minimum requirements. (More capable hardware improves performance.) No compression. If data is compressed, the CPU usage increases quickly, which reduces the number of concurrent jobs that can be handled. The deduplication rate is between 10% and 50%. The deduplication rate is the percentage of data already stored so it is not stored again. The amount of data that is stored is between 30% to 90% of the capacity of the storage.
Clean up periods
Clean up is when the NetBackup Deduplication Engine performs maintenance such as deleting expired backup image data segments. NetBackup maintains the same number of concurrent backup jobs as during normal operation. However, the average time to complete the jobs increases significantly.
Storage NetBackup maintains the same number of concurrent backup jobs as during approaches normal operation under the following conditions: full capacity The hardware meets minimum requirements. (More capable hardware improves performance.) The amount of data that is stored is between 85% to 90% of the capacity of the storage. However, the average time to complete the jobs increases significantly.
54
55
Select Only use the following media servers. Select all of the load balancing servers but do not select the deduplication storage server.
The deduplication storage server performs storage server tasks only: storing and managing the deduplicated data, file deletion, and optimized duplication. If you configure client deduplication, the clients deduplicate their own data. Some of the deduplication load is removed from the deduplication storage server and loading balancing servers. Symantec recommends the following strategies to scale deduplication:
For the initial full backups of your clients, use the deduplication storage server. For subsequent backups, use load balancing servers. Enable client-side deduplication gradually. If a client cannot tolerate the deduplication processing workload, be prepared to move the deduplication processing back to a server.
56
Symantec also recommends that you implement load balancing servers and client deduplication gradually. Therefore, it may be beneficial to use the storage server for backups while you implement deduplication on other hosts.
The deduplication storage server is CPU limited on any core. Memory resources are available on the storage server. Network bandwidth is available on the storage server. Back-end I/O bandwidth to the deduplication pool is available. Other NetBackup media servers have CPU available for deduplication.
Gigabit Ethernet should provide sufficient performance in many environments. If your performance objective is the fastest throughput possible with load balancing servers, you should consider 10 Gigabit Ethernet.
57
Use the storage server for the initial backup of the clients. Enable deduplication on only a few clients at a time. It may be easier to evaluate how your environment handles traffic and easier to troubleshoot any problems with fewer hosts added for deduplication.
If a client cannot tolerate the deduplication processing workload, be prepared to move the deduplication processing back to the storage server.
A storage unit group can contain the storage units that have a Media Server Deduplication Pool as the storage destination.
58
A storage unit group can contain the storage units that have a PureDisk storage pool as the storage destination.
A group must contain storage units of one storage destination type only. That is, a group cannot contain both Media Server Deduplication Pool storage units and PureDisk storage pool storage units. Storage unit groups avoid a single point of failure that can interrupt backup service. The best storage savings occur when a backup policy stores its data in the same deduplication destination disk pool instead of across multiple disk pools. For this reason, the Failover method for the Storage unit selection uses the least amount of storage. All of the other methods are designed to use different storage every time the backup runs. Note: NetBackup does not support storage unit groups for optimized duplication of deduplicated data. If you use a storage unit group as a destination for optimized duplication of deduplicated data, NetBackup uses regular duplication.
Use NetBackup optimized duplication to copy the images to another deduplication node off-site location. Optimized duplication copies the primary backup data to another deduplication pool. It provides the easiest, most efficient method to copy data off-site yet remain in the same NetBackup domain. You then can recover from a disaster that destroys the storage on which the primary copies reside by retrieving images from the other deduplication pool. See About MSDP optimized duplication within the same domain on page 36. See Configuring optimized duplication of deduplicated data on page 94. For the primary deduplication storage, use a SAN volume with resilient storage methodologies to replicate the data to a remote site. If the deduplication database is on a different SAN volume, replicate that volume to the remote site also.
The preceding methods help to make your data highly-available. Also, you can use NetBackup to back up the deduplication storage server system or program disks. If the disk on which NetBackup resides fails and you have to replace it, you can use NetBackup to restore the media server.
59
A battery backup unit that supplies power to the cache memory so write operations can continue if power is restored within sufficient time. An uninterruptible power supply that allows the components to complete their write operations.
If your devices that have caches are not protected, Symantec recommends that you disable the hardware caches. Read and write performance may decline, but you help to avoid data loss.
60
Planning your deployment Replacing the PureDisk Deduplication Option with Media Server Deduplication on the same host
The following other servers can have credentials for the NetBackup Deduplication Engine:
A load balancing server in the same deduplication node. A deduplication server in a different deduplication node that is the target of optimized duplication. Optimized duplication requires a server in common between the two deduplication nodes. See About the media servers for optimized MSDP duplication within the same domain on page 38.
You can specify the server to use for restores. See Specifying the restore server on page 184.
Replacing the PureDisk Deduplication Option with Media Server Deduplication on the same host
The PureDisk Deduplication Option provides deduplication of NetBackup backups for NetBackup release 6.5. The destination storage for PDDO is a PureDisk storage pool. The PureDisk agent that performs the deduplication is installed from the PureDisk software distribution, not from the NetBackup distribution. PDDO is not the same as integrated NetBackup deduplication. You can upgrade to 7.0 a NetBackup media server that hosts a PDDO agent and use that server for integrated NetBackup deduplication. The storage can remain the PureDisk storage pool, and NetBackup maintains access to all of the valid backup images in the PureDisk storage pool. If you perform this procedure, the NetBackup deduplication plug-in replaces the PureDisk agent on the media server. The NetBackup deduplication plug-in can deduplicate data for either integrated NetBackup deduplication or for a PureDisk storage pool. The PDDO agent can deduplicate data only for a PureDisk storage pool. Note: To use the NetBackup deduplication plug-in with a PureDisk storage pool, the PureDisk storage pool must be part of a PureDisk 6.6 or later environment.
Planning your deployment Migrating from PureDisk to the NetBackup Media Server Deduplication option
61
Task
Deactivate all backup policies Deactivate the policies to ensure that no activity occurs on the host. that use the host See the NetBackup Administrator's Guide for UNIX and Linux, Volume I See the NetBackup Administrator's Guide for Windows, Volume I..
Step 2
NetBackup deduplication components cannot reside on the same host as a PureDisk Deduplication Option (PDDO) agent. Therefore, remove the PDDO agent from the host. See the NetBackup PureDisk Deduplication Option Guide.
Step 3
Upgrade the media server to If the media server runs a version of NetBackup earlier than 7.0, upgrade 7.0 or later that server to NetBackup 7.0 or later. See the NetBackup Installation Guide for UNIX and Linux. See the NetBackup Installation Guide for Windows.
Step 4
In the Storage Server Configuration Wizard, select PureDisk Deduplication Pool and enter the name of the Storage Pool Authority. See Configuring a NetBackup deduplication storage server on page 79.
Step 5
Activate your backup policies See the NetBackup Administrator's Guide for UNIX and Linux, Volume I See the NetBackup Administrator's Guide for Windows, Volume I..
62
See Configuring NetBackup media server deduplication on page 76. Redirect your backup jobs to the NetBackup media server deduplication pool. See the NetBackup Administrator's Guide for UNIX and Linux, Volume I. See the NetBackup Administrator's Guide for Windows, Volume I.
After the PureDisk backup images expire, uninstall PureDisk. See your NetBackup PureDisk documentation.
63
After all of the backup images that are associated with the storage expire, repurpose that storage. If it is disk storage, you cannot add it to an existing media server deduplication pool. You can use it as storage for another, new deduplication node.
See Migrating from PureDisk to the NetBackup Media Server Deduplication option on page 61.
64
Chapter
About provisioning the deduplication storage About deduplication storage requirements About deduplication storage capacity About the deduplication storage paths Do not modify storage directories and files About adding additional storage About volume management for NetBackup deduplication
66
Optimized duplication within You must provision the storage for at least two deduplication the same NetBackup domain nodes in the same NetBackup domain:
Storage for the backups, which is the source for the duplication operations. Different storage in another deduplication node for the copies of the backup images, which is the target for the duplication operations.
Auto Image Replication to a You must provision the storage in at least two NetBackup different NetBackup domain domains: Storage for the backups in the originating domain. This storage contains your client backups. It is the source for the duplication operations. Different storage in the remote domain for the copies of the backup images. This storage is the target for the replication operations that run in the originating domain.
See About the deduplication storage destination on page 22. See Planning your deduplication deployment on page 20.
67
Requirements
Disk, with the following minimum requirements per individual data stream (read or write):
32 to 48 TBs of storage: 200 MB/sec. Symantec recommends that you store the data and the deduplication database on separate disk, each with 200 MB/sec read or write speed. 48 to 64 TBs of storage: 250 MB/sec. Symantec recommends that you store the data and the deduplication database on separate disk, each with 250 MB/sec read or write speed.
These are minimum requirements for single stream read or write performance. Greater individual data stream capability or aggregate capability may be required to satisfy your objectives for writing to and reading from disk. Connection Storage area network (Fibre Channel or iSCSI), direct-attached storage (DAS), or internal disks. The storage area network should conform to the following criteria: A dedicated, low latency network for storage with a maximum 0.1-millisecond latency per round trip. Enough bandwidth on the storage network to satisfy your throughput objectives. Symantec supports iSCSI on storage networks with at least 10-Gigabit Ethernet network bandwidth. Symantec recommends Fibre Channel storage networks with at least 4-Gigabit network bandwidth. The storage server should have an HBA or HBAs dedicated to the storage. Those HBAs must have enough bandwidth to satisfy your throughput objectives.
Local disk storage may leave you vulnerable in a disaster. SAN disk can be remounted at a newly provisioned server with the same name.
NetBackup requires exclusive use of the disk resources. If the storage is used for purposes other than backups, NetBackup cannot manage disk pool capacity or manage storage lifecycle policies correctly. Therefore, NetBackup must be the only entity that uses the storage. NetBackup deduplication does not support file based storage protocols, such as CIFS or NFS, for deduplication storage. The storage must be configured and operational before you can configure deduplication in NetBackup.
68
A deduplication tech note provides detailed information about and examples for hosts and environments for deduplication. See About the deduplication tech note on page 21.
Stop the deduplication services (spad and spoold) on the storage server host. Grow the disk storage to the desired size up to 64 TBs. See About adding additional storage on page 70. If more CPU and memory are required, shut down the storage server host and then upgrade the CPU and add the required amount of memory. Start the host. Upgrade NetBackup on the storage server host.
Use more than one media server deduplication node. Use a PureDisk storage pool as the storage destination. A PureDisk storage pool provides larger storage capacity; PureDisk also provides global deduplication. See About the deduplication storage destination on page 22.
69
Only one deduplication storage path can exist on a media server. You cannot add another storage path to increase capacity beyond 64 TBs.
Add files to the deduplication storage directories or database directories. Delete files from the deduplication storage directories or database directories. Modify files in the deduplication storage directories or database directories. Move files within the deduplication storage directories or database directories.
70
Chapter
Licensing deduplication
This chapter includes the following topics:
About licensing deduplication About the deduplication license key Licensing NetBackup deduplication
NetBackup supports deduplication on specific 64-bit host operating systems. If you intend to upgrade an existing media server and use it for deduplication, that host must be supported. For the supported systems, see the NetBackup Release Notes. NetBackup deduplication components cannot reside on the same host as a PureDisk Deduplication Option agent. To use a PDDO agent host for NetBackup deduplication, first remove the PDDO agent from that host. See the NetBackup PureDisk Deduplication Option (PDDO) Guide. Then, upgrade that host to NetBackup 7.0 or later. Finally, configure that host as a deduplication storage server or as a load balancing server.
72
1 2 3
On the Help menu of the NetBackup Administration Console, select License Keys. In the NetBackup License Keys dialog box, click New. In the Add a New License Key dialog box, enter the license key and click Add or OK.
73
4 5
74
Chapter
Configuring deduplication
This chapter includes the following topics:
Configuring NetBackup media server deduplication Configuring NetBackup client-side deduplication Configuring a NetBackup deduplication storage server About NetBackup deduplication pools Configuring a deduplication disk pool Configuring a deduplication storage unit Enabling deduplication encryption Configuring optimized synthetic backups for deduplication About configuring optimized duplication and replication bandwidth Configuring MSDP optimized duplication copy behavior Configuring a separate network path for MSDP optimized duplication Configuring optimized duplication of deduplicated data About the replication topology for Auto Image Replication Configuring a target for MSDP replication Viewing the replication topology for Auto Image Replication About the storage lifecycle policies required for Auto Image Replication Creating a storage lifecycle policy About backup policy configuration
76
Creating a policy using the Policy Configuration Wizard Creating a policy without using the Policy Configuration Wizard Enabling client-side deduplication Resilient Network properties Specifying resilient connections Seeding the fingerprint cache for remote client-side deduplication Adding a deduplication load balancing server About the pd.conf configuration file for NetBackup deduplication Editing the pd.conf deduplication file About the contentrouter.cfg file for NetBackup deduplication About saving the deduplication storage server configuration Saving the deduplication storage server configuration Editing a deduplication storage server configuration file Setting the deduplication storage server configuration About the deduplication host configuration file Deleting a deduplication host configuration file Resetting the deduplication registry Configuring deduplication log file timestamps on Windows Setting NetBackup configuration options by using bpsetconfig
77
Task
Install the license key Configure a deduplication storage server
Step 3
How many disk pools you configure depends on your storage requirements. It also depends on whether or not you use optimized duplication or replication. See About NetBackup deduplication pools on page 79. See Configuring a deduplication disk pool on page 81.
Step 4 Step 5
See Configuring a deduplication storage unit on page 83. Encryption is optional. See Enabling deduplication encryption on page 87.
Step 6
Optimized synthetic backups are optional. See Configuring optimized synthetic backups for deduplication on page 89. Optimized duplication is optional. See Configuring a separate network path for MSDP optimized duplication on page 92. See Configuring optimized duplication of deduplicated data on page 94.
Step 7
Step 8
Configure replication
Replication is optional. See About the replication topology for Auto Image Replication on page 97. See Configuring a target for MSDP replication on page 98. See Viewing the replication topology for Auto Image Replication on page 100. See About the storage lifecycle policies required for Auto Image Replication on page 102. See Creating a storage lifecycle policy on page 105.
78
Task
Configure a backup policy
Step 10
Advanced settings are optional. See About the pd.conf configuration file for NetBackup deduplication on page 119. See Editing the pd.conf deduplication file on page 126. See pd.conf file settings for NetBackup deduplication on page 119.
Task
Configure media server deduplication
Step 2
Step 3
Step 4 Step 5
Enable client-side deduplication Seed the fingerprint cache of a remote client-side deduplication client
See Enabling client-side deduplication on page 112. Remote client seeding is optional. See Seeding the fingerprint cache for remote client-side deduplication on page 117.
79
The type of storage server. For NetBackup media server deduplication, select Media Server Deduplication Pool for the type of disk storage. For a PureDisk deduplication pool, select PureDisk Deduplication Pool for the type of disk storage. The credentials for the deduplication engine. See About NetBackup Deduplication Engine credentials on page 32. The storage paths. See About the deduplication storage paths on page 69. The network interface. See About the network interface for deduplication on page 33. The load balancing servers, if any. See About NetBackup deduplication servers on page 25.
When you create the storage server, the wizard lets you create a disk pool and storage unit also. To configure a deduplication storage server by using the wizard
1 2 3
In the NetBackup Administration Console, expand Media and Device Management > Configure Disk Storage Servers. Follow the wizard screens to configure a deduplication storage server. After NetBackup creates the deduplication storage server, you can click Next to continue to the Disk Pool Configuration Wizard.
80
A NetBackup Media Server Deduplication Pool represents the disk storage that is attached to a NetBackup media server. NetBackup deduplicates the data and hosts the storage. A NetBackup PureDisk Deduplication Pool represents a PureDisk storage pool. NetBackup deduplicates the data, and PureDisk hosts the storage. A PureDisk Deduplication Pool destination requires that PureDisk be at release 6.6 or later.
When you configure a deduplication pool, choose PureDisk as the deduplication pool type. NetBackup requires exclusive ownership of the disk resources that comprise the deduplication pool. If you share those resources with other users, NetBackup cannot manage deduplication pool capacity or storage lifecycle policies correctly. How many deduplication pools you configure depends on your storage requirements. It also depends on whether or not you use optimized duplication or replication, as described in the following table: Table 5-3 Type Deduplication pools for duplication or replication
Requirements
Optimized duplication within Optimized duplication in the same domain requires the following deduplication pools: the same NetBackup domain At least one for the backup storage, which is the source for the duplication operations. The source deduplication pool is in one deduplication node. Another to store the copies of the backup images, which is the target for the duplication operations. The target deduplication pool is in a different deduplication node. Auto Image Replication to a Auto Image Replication deduplication pools can be either replication source or different NetBackup domain replication target. The replication properties denote the purpose of the deduplication pool.. The deduplication pools inherit the replication properties from their volumes. See About the replication topology for Auto Image Replication on page 97. Auto Image Replication requires the following deduplication pools: At least one replication source deduplication pool in the originating domain. A replication source deduplication pool is one to which you send your backups. The backup images on the source deduplication pool are replicated to a deduplication pool in the remote domain or domains. At least one replication target deduplication pool in a remote domain or domains. A replication target deduplication pool is the target for the duplication operations that run in the originating domain.
81
The type of disk pool (PureDisk). The NetBackup deduplication storage server to query for the disk storage to use for the pool. The disk volume to include in the pool. NetBackup exposes the storage as a single volume. The disk pool properties. See Media server deduplication pool properties on page 81.
Symantec recommends that disk pool names be unique across your enterprise. To create a NetBackup disk pool
1 2
In the NetBackup Administration Console, select the Media and Device Management node. From the list of wizards in the Details pane, click Configure Disk Pool and follow the wizard instructions. For help, see the wizard help.
After NetBackup creates the deduplication pool, you have the option to create a storage unit that uses the pool.
82
Refresh
Query the storage server for its replication capabilities. If the capabilities are different than the current NetBackup configuration, update the configuration. The amount of space available in the disk pool. The total raw size of the storage in the disk pool. A comment that is associated with the disk pool. The High water mark setting is a threshold that invokes the following actions: The High water mark indicates that the PureDiskVolume is full. When the PureDiskVolume reaches the High water mark, NetBackup fails any backup jobs that are assigned to the storage unit. NetBackup also does not assign new jobs to a storage unit in which the disk pool is full. NetBackup also fails backup jobs if the PureDiskVolume does not contain enough storage for its estimated space requirement. NetBackup begins image cleanup when the PureDiskVolume reaches the High water mark; image cleanup expires the images that are no longer valid. NetBackup again assigns jobs to the storage unit when image cleanup reduces the PureDiskVolume capacity to less than the High water mark.
The default is 98%. Low water mark The Low water mark has no affect on the PureDiskVolume.
83
per volume
Select or enter the number of read and write streams to allow per volume. Many factors affect the optimal number of streams. Factors include but are not limited to disk speed, CPU speed, and the amount of memory.
84
1 2
In the NetBackup Administration Console, expand NetBackup Management > Storage > Storage Units. On the Actions menu, select New > Storage Unit.
Complete the fields in the New Storage Unit dialog box. For a storage unit for optimized duplication destination, select Only use the following media servers. Then select the media servers that are common between the two deduplication nodes.
85
Storage unit name A unique name for the new storage unit. The name can describe the type of storage. The storage unit name is the name used to specify a storage unit for policies and schedules. The storage unit name cannot be changed after creation. Storage unit type Select Disk as the storage unit type. Disk type Select PureDisk for the disk type for a media server deduplication pool, a PureDisk deduplication pool, or a PureDisk Deduplication Option storage pool. Select the disk pool that contains the storage for this storage unit. All disk pools of the specified Disk type appear in the Disk pool list. If no disk pools are configured, no disk pools appear in the list. Media server The Media server setting specifies the NetBackup media servers that can deduplicate the data for this storage unit. Only the load balancing servers appear in the media server list. Specify the media server or servers as follows: To allow any server in the media server list to deduplicate data, select Use any available media server. To use specific media servers to deduplicate the data, select Only use the following media servers. Then, select the media servers to allow.
Disk pool
NetBackup selects the media server to use when the policy runs. Maximum fragment size For normal backups, NetBackup breaks each backup image into fragments so it does not exceed the maximum file size that the file system allows. You can enter a value from 20 MBs to 51200 MBs. For a FlashBackup policy, Symantec recommends that you use the default, maximum fragment size to ensure optimal deduplication performance.
86
Configure the media servers for NetBackup deduplication and configure the storage. Configure a disk pool.
87
Configure a storage unit for your most important clients (such as STU-GOLD). Select the disk pool. Select Only use the following media servers. Select two media servers to use for your important backups. Create a backup policy for the 100 important clients and select the STU-GOLD storage unit. The media servers that are specified in the storage unit move the client data to the deduplication storage server. Configure another storage unit (such as STU-SILVER). Select the same disk pool. Select Only use the following media servers. Select the other two media servers. Configure a backup policy for the 500 regular clients and select the STU-SILVER storage unit. The media servers that are specified in the storage unit move the client data to the deduplication storage server.
Backup traffic is routed to the wanted data movers by the storage unit settings. Note: NetBackup uses storage units for media server selection for write activity (backups and duplications) only. For restores, NetBackup chooses among all media servers that can access the disk pool.
You can enable encryption on all hosts that deduplicate their own data without configuring them individually. Use this procedure if you want all of your clients that deduplicate their own data to encrypt that data. See To enable encryption on all hosts on page 88. You can enable encryption on individual hosts.
88
Use this procedure to enable compression or encryption on the storage server, on a load balancing server, or on a client that deduplicates its own data. See To enable encryption on a single host on page 88. See About deduplication encryption on page 34. To enable encryption on all hosts
On the storage server, open the contentrouter.cfg file in a text editor; it resides in the following directory:
storage_path/etc/puredisk/contentrouter.cfg
Add agent_crypt to the ServerOptions line of the file. The following line is an example:
ServerOptions=fast,verify_data_read,agent_crypt
If you use load balancing servers, make the same edits to the contentrouter.cfg files on those hosts.
Use a text editor to open the pd.conf file on the host. The pd.conf file resides in the following directories:
2 3
For the line in the file that contains ENCRYPTION, remove the pound character (#) in column 1 from that line. In that line, replace the 0 (zero) with a 1. Note: The spaces to the left and right of the equal sign (=) in the file are significant. Ensure that the space characters appear in the file after you edit the file.
Ensure that the LOCAL_SETTINGS parameter is set to 1. If LOCAL_SETTINGS is 0 (zero) and the ENCRYPTION setting on the storage server is 0, the client setting does not override the server setting. Consequently, the data is not encrypted on the client host.
5 6
Save and close the file. If the host is the storage server or a load balancing server, restart the NetBackup Remote Manager and Monitor Service (nbrmms) on the host.
89
Step 2 Set the Optimizedlmage attribute on See Setting deduplication disk pool your existing deduplication pools. (Any attributes on page 164. deduplication pools that you create after you set the storage server attribute inherit the new functionality.) Step 3 Configure a Standard or MS-Windows See the administrator's guide for your backup policy. Select the Synthetic operating system: backup attribute on the Schedule NetBackup Administrator's Guide for Attribute tab of the backup policy. UNIX and Linux, Volume I. NetBackup Administrator's Guide for Windows, Volume I.
See Setting deduplication storage server attributes on page 152. See Creating a policy without using the Policy Configuration Wizard on page 111.
90
bandwidthlimit
The bandwidthlimit parameter in the agent.cfg file is the global bandwidth setting. You can use this parameter to limit the bandwidth that all replication jobs use. If bandwidthlimit is greater than zero, all of the jobs share the bandwidth. That is, the bandwidth for each job is the bandwidthlimit divided by the number of jobs. If bandwidthlimit=0, total bandwidth is not limited. However, you can limit the bandwidth that each job uses. See the following OPTDUP_BANDWIDTH description. By default, bandwidthlimit=0.
OPTDUP_BANDWIDTH
The OPTDUP_BANDWIDTH parameter in the pd.conf file specifies the per job bandwidth. OPTDUP_BANDWIDTH applies only if the bandwidthlimit parameter in the agent.cfg file is zero. If OPTDUP_BANDWIDTH and bandwidthlimit are both 0, bandwidth per replication job is not limited. By default, OPTDUP_BANDWIDTH = 0.
To specify the bandwidth, edit the configuration files on the deduplication storage server, as follows:
bandwidthlimit.
OPTDUP_BANDWIDTH.
See About the pd.conf configuration file for NetBackup deduplication on page 119. See Editing the pd.conf deduplication file on page 126. See pd.conf file settings for NetBackup deduplication on page 119. If you specify bandwidth limits, optimized duplication and replication traffic to any destination is limited. See About MSDP optimized duplication within the same domain on page 36. See About NetBackup Auto Image Replication on page 47.
91
Caution: These settings affect all optimized duplication jobs; they are not limited to optimized duplication to a Media Server Deduplication Pool or a PureDisk Deduplication Pool.
92
Configuring deduplication Configuring a separate network path for MSDP optimized duplication
See Setting NetBackup configuration options by using bpsetconfig on page 134. Alternatively on UNIX systems, add the entry to the bp.conf file on the NetBackup master server. To configure the number of duplication attempts
Create a file named OPT_DUP_BUSY_RETRY_LIMIT that contains an integer that specifies the number of times to retry the job before NetBackup fails the job. The file must reside on the master server in the following directory (depending on the operating system):
Change the wait period for retries by adding an IMAGE_EXTENDED_RETRY_PERIOD_IN_HOURS entry to the NetBackup LIFECYCLE_PARAMETERS file. The default for this value is two hours. For example, the following entry configures NetBackup to wait four hours before NetBackup tries the job again:
IMAGE_EXTENDED_RETRY_PERIOD_IN_HOURS 4
A separate, dedicated network interface card in both the source and the destination storage servers.
Configuring deduplication Configuring a separate network path for MSDP optimized duplication
93
The separate network is operational and using the dedicated network interface cards on the source and the destination storage servers.
See About MSDP optimized duplication within the same domain on page 36. To configure a separate network path for MSDP optimized duplication
On the source storage server, add the destination storage servers's dedicated network interface to the operating system hosts file. If StorageServer-A is the source MSDP and StorageServer-B is the destination MSDP, the following is an example of the hosts entry in IPv4 notation:
192.168.0.0 StorageServer-B.symantecs.org
Symantec recommends that you always use the fully qualified domain name when you specify hosts.
If the source storage server is a UNIX or Linux computer, ensure that it checks the hosts file first when resolving host names. To do so, verify that the /etc/nsswitch.conf file look-up order is first files and then dns, as in the following example:
hosts: files dns
On the destination storage server, add the source storage servers's dedicated network interface to the operating system hosts file. If StorageServer-A is the source MSDP and StorageServer-B is the destination MSDP, the following is an example of the hosts entry in IPv4 notation:
192.168.0.1 StorageServer-A.symantecs.org
Symantec recommends that you always use the fully qualified domain name when specifying hosts.
94
If the destination storage server is a UNIX or Linux computer, ensure that it checks the hosts file first when resolving host names. To do so, verify that the /etc/nsswitch.conf file look-up order is first files and then dns, as in the following example:
hosts: files dns
From each host, use the ping command to verify that the host resolves the name of the host.
StorageServer-A.symantecs.org> ping StorageServer-B.symantecs.org StorageServer-B.symantecs.org> ping StorageServer-A.symantecs.org
If the ping command returns positive results, the hosts are configured for optimized duplication over the separate network.
Step 1 Configure the storage servers One server must be common between the source storage and the destination storage. Which you choose depends on whether you want a push or a pull configuration. See About the media servers for optimized MSDP duplication within the same domain on page 38. For a push configuration, configure the common server as a load balancing server for the storage server for your normal backups. For a pull configuration, configure the common server as a load balancing server for the storage server for the copies at your remote site. Alternatively, you can add a server later to either environment. (A server becomes a load balancing server when you select it in the storage unit for the deduplication pool.) See Optimized MSDP duplication within the same domain requirements on page 37. See Configuring a NetBackup deduplication storage server on page 79.
95
Step 2 Configure the deduplication If you did not configure the deduplication pools when you configured the pools storage servers, use the Disk Pool Configuration Wizard to configure them. See Configuring a deduplication disk pool on page 81. Step 3 Configure the storage unit for backups In the storage unit for your backups, do the following:
For the Disk type, select PureDisk. For the Disk pool, select one of the following: If you back up to integrated NetBackup deduplication, select your Media Server Deduplication Pool. If you back up to a PureDisk environment, select the PureDisk Deduplication Pool.
If you use a pull configuration, do not select the common media server in the backup storage unit. If you do, NetBackup uses it to deduplicate backup data. (That is, unless you want to use it for a load balancing server for the source deduplication node.) See Configuring a deduplication storage unit on page 83. Step 4 Configure the storage unit for duplication Symantec recommends that you configure a storage unit specifically to be the target for the optimized duplication. Configure the storage unit in the deduplication node that performs your normal backups. Do not configure it in the node that contains the copies. In the storage unit that is the destination for your duplicated images, do the following:
For the Disk type, select PureDisk. For the Disk pool, the destination can be a Media Server Deduplication Pool or a PureDisk Deduplication Pool.
96
Step 6 Configure a separate network Optionally, you can use a separate network for the optimized duplication path for optimized traffic. duplication traffic See Configuring a separate network path for MSDP optimized duplication on page 92. Step 7 Configure a storage lifecycle Configure a storage lifecycle policy only if you want to use one to duplicate policy for the duplication images. The storage lifecycle policy manages both the backup jobs and the duplication jobs. Configure the lifecycle policy in the deduplication environment that performs your normal backups. Do not configure it in the environment that contains the copies. When you configure the storage lifecycle policy, do the following: For the Backup destination, select the storage unit that is the target of your backups. That storage unit may use a Media Server Deduplication Pool or a PureDisk Deduplication Pool. These backups are the primary backup copies; they are the source images for the duplication operation. For the Duplication destination, select the storage unit for the destination deduplication pool. That pool may be a Media Server Deduplication Pool or a PureDisk Deduplication Pool. If the backup destination is a PureDisk Deduplication Pool, the duplication destination also must be a PureDisk Deduplication Pool.
See Creating a storage lifecycle policy on page 105. Step 8 Configure a backup policy Configure a policy to back up your clients. Configure the backup policy in the deduplication environment that performs your normal backups. Do not configure it in the environment that contains the copies. If you use a storage lifecycle policy to manage the backup job and the duplication job: Select that storage lifecycle policy in the Policy storage field of the Policy Attributes tab. If you do not use a storage lifecycle policy to manage the backup job and the duplication job: Select the storage unit for the Media Server Deduplication Pool that contains your normal backups. These backups are the primary backup copies.
Configuring deduplication About the replication topology for Auto Image Replication
97
See the NetBackup Vault Administrators Guide. Step 10 Duplicate by using the bpduplicate command Use the NetBackup bpduplicate command to copy images manually. If you use a storage lifecycle policy or NetBackup Vault for optimized duplication, you do not have to use the bpduplicate command. Duplicate from the source storage to the destination storage. The destination storage may be a Media Server Deduplication Pool or a PureDisk Deduplication Pool. See NetBackup Commands Reference Guide.
Target
None
98
For a Media Server Deduplication Pool, NetBackup exposes the storage as a single volume. Therefore, there is always a one-to-one volume relationship for MSDP. You configure the replication relationships when you add target storage servers in the Replication tab of the Change Storage Server dialog box. NetBackup discovers topology changes when you use the Refresh option of the Change Disk Pool dialog box. See Changing deduplication disk pool properties on page 165. NetBackup includes a command that can help you understand your replication topology. Use the command in the following situations:
After you configure the MSDP replication targets. After changes to the volumes that comprise the storage.
See Viewing the replication topology for Auto Image Replication on page 100.
1 2 3
In the NetBackup Administration Console, expand Media and Device Management > Credentials > Storage Server. Select the MSDP storage server. On the Edit menu, select Change.
99
In the Change Storage Server dialog box, select the Replication tab.
Enter the Storage Server Name. Enter Username and Password credentials for the NetBackup Deduplication Engine. Click Add to add the storage server to the Replication Targets list. After you click Add, NetBackup verifies that the target storage server exists. NetBackup also configures the replication properties of the volumes in the source domain and the target domain.
All targets are considered for replication, depending on the rules of the storage lifecycle policies that control the replication.
100
Configuring deduplication Viewing the replication topology for Auto Image Replication
6 7
After all replication targets are added, click OK. For the deduplication pool in each domain, open the Change Disk Pool dialog box and click Refresh. Configuring a replication target configures the replication properties of the disk volumes in both domains. However, NetBackup only updates the properties of the disk pool when you click Refresh in the Change Disk Pool dialog box and then click OK. See Changing deduplication disk pool properties on page 165.
Run the bpstsinfo command, specifying the storage server name and the server type. The following is the command syntax:
bpstsinfo -lsuinfo -storage_server storage_server_name -stype PureDisk
The following are the options and arguments for the command:
-storage_server storage_server_name The name of the storage server. -stype PureDisk Deduplication storage servers are of type PureDisk.
Save the output to a file so that you can compare the current topology with the previous topology to determine what has changed. Example output is available. See Sample volume properties output for MSDP replication on page 101.
Configuring deduplication Viewing the replication topology for Auto Image Replication
101
All of the storage in a deduplication disk pool is exposed as one volume: PureDiskVolume. The PureDiskVolume of the deduplication storage server bit1.datacenter.symantecs.org is the source for the replication operation. The PureDiskVolume of the deduplication storage server target_host.dr-site.symantecs.org is the target of the replication operation.
> bpstsinfo -lsuinfo -storage_server bit1.datacenter.symantecs.org -stype PureDisk LSU Info: Server Name: PureDisk:bit1.datacenter.symantecs.org LSU Name: PureDiskVolume Allocation : STS_LSU_AT_STATIC Storage: STS_LSU_ST_NONE Description: PureDisk storage unit (/bit1.datacenter.symantecs.org#1/2) Configuration: Media: (STS_LSUF_DISK | STS_LSUF_ACTIVE | STS_LSUF_STORAGE_NOT_FREED | STS_LSUF_REP_ENABLED | STS_LSUF_REP_SOURCE) Save As : (STS_SA_CLEARF | STS_SA_IMAGE | STS_SA_OPAQUEF) Replication Sources: 0 ( ) Replication Targets: 1 ( PureDisk:target_host.dr-site.symantecs.org:PureDiskVolume ) Maximum Transfer: 2147483647 Block Size: 512 Allocation Size: 0 Size: 74645270666 Physical Size: 77304328192 Bytes Used: 138 Physical Bytes Used: 2659057664 Resident Images: 0 > bpstsinfo -lsuinfo -storage_server target_host.dr-site.symantecs.org -stype PureDisk LSU Info: Server Name: PureDisk:target_host.dr-site.symantecs.org LSU Name: PureDiskVolume Allocation : STS_LSU_AT_STATIC Storage: STS_LSU_ST_NONE
102
Configuring deduplication About the storage lifecycle policies required for Auto Image Replication
Description: PureDisk storage unit (/target_host.dr-site.symantecs.org#1/2) Configuration: Media: (STS_LSUF_DISK | STS_LSUF_ACTIVE | STS_LSUF_STORAGE_NOT_FREED | STS_LSUF_REP_ENABLED | STS_LSUF_REP_TARGET) Save As : (STS_SA_CLEARF | STS_SA_IMAGE | STS_SA_OPAQUEF) Replication Sources: 1 ( PureDisk:bit1:PureDiskVolume ) Replication Targets: 0 ( ) Maximum Transfer: 2147483647 Block Size: 512 Allocation Size: 0 Size: 79808086154 Physical Size: 98944983040 Bytes Used: 138 Physical Bytes Used: 19136897024 Resident Images: 0
About the storage lifecycle policies required for Auto Image Replication
To replicate images from the one NetBackup domain to another NetBackup domain requires that two storage lifecycle policies be configured:
In the first (originating) NetBackup domain: One SLP that contains at least one Backup operation and one Replication operation that is configured to replicate to a target NetBackup domain. (The Auto Image Replication SLP.) In the second, target NetBackup domain: One SLP that contains an Import operation to import the replication. (The Import SLP.) The Import SLP can be configured to create additional copies in that domain or to cascade the copies to another domain.
Note: Both SLPs must have identical names. Figure 5-1 shows how the SLP in the target domain is set up to replicate the images from the originating master server domain.
Configuring deduplication About the storage lifecycle policies required for Auto Image Replication
103
Figure 5-1
SLP on master server in the source domain
Import
Table 5-8 describes the requirements for each SLP in the pair. Table 5-8 Domain
Domain 1 (Originating domain)
The SLP must have the same name as the Import SLP in Domain 2. The SLP must be of the same data classification as the Import SLP in Domain 2. The Backup operation must be to a Media Server Deduplication Pool (MSDP). Indicate the exact storage unit from the drop-down list. Do not select Any Available.
Note: The target domain must contain the same type of storage to import the image.
At least one operation must be a Replication operation with the Target master option selected. See Figure 5-2 on page 104. Multiple Replication operations can be configured in an Auto Image Replication SLP. The master server in Domain 1 does not know which target media server will be selected. If multiple SLPs in target domains meet the criteria, NetBackup imports copies in all qualifying domains.
104
Configuring deduplication About the storage lifecycle policies required for Auto Image Replication
The following topic describes useful reporting information about Auto Image Replication jobs and import jobs. See Reporting on Auto Image Replication jobs on page 147. Figure 5-2 Replication operation with Target master option selected in Domain 1 storage lifecycle policy
105
Figure 5-3
106
1 2
In the NetBackup Administration Console, select NetBackup Management > Storage > Storage Lifecycle Policies. Click Actions > New > Storage Lifecycle Policy (UNIX) or Actions > New > New Storage Lifecycle Policy (Windows).
3 4 5
In the New Storage Lifecycle Policy dialog box, enter a Storage lifecycle policy name. Select a Data classification. (Optional.) Select the Priority for secondary operations. This number represents the priority that jobs from secondary operations have in relationship to all other jobs. See Storage Lifecycle Policy dialog box settings on page 106.
Click Add to add operations to the SLP. The operations act as instructions for the data. See Adding a storage operation to a storage lifecycle policy on page 108.
107
Figure 5-4
Description
The Storage lifecycle policy name describes the SLP. The name cannot be modified after the SLP is created. The Data classification defines the level of data that the SLP is allowed to process. The Data classification drop-down menu contains all of the defined classifications. The Data classification is an optional setting. One data classification can be assigned to each SLP and applies to all operations in the SLP. An SLP is not required to have a data classification. If a data classification is selected, the SLP stores only those images from the policies that are set up for that data classification. If no data classification is indicated, the SLP accepts images of any classification or no classification. The Data classification setting allows the NetBackup administrator to classify data based on relative importance. A classification represents a set of backup requirements. When data must meet different backup requirements, consider assigning different classifications. For example, email backup data can be assigned to the silver data classification and financial data backup may be assigned to the platinum classification. A backup policy associates backup data with a data classification. Policy data can be stored only in an SLP with the same data classification. Once data is backed up in an SLP, the data is managed according to the SLP configuration. The SLP defines what happens to the data from the initial backup until the last copy of the image has expired.
108
Table 5-9
Setting
Priority for secondary operations
Description
The Priority for secondary operations setting is the priority that secondary jobs (for example, duplication jobs), have in relationship to all other jobs. Range: 0 (default) to 99999 (highest priority). For example, the Priority for secondary operations for a policy with a gold data classification may be set higher than for a policy with a silver data classification. The priority of the backup job is set in the backup policy on the Attributes tab.
Operations
The Operations list contains all of the operations in the SLP. Multiple operations imply that multiple copies are created. The list also contains the columns that display information about each operation. Note that not all columns display by default. For column descriptions, see the following topic:
Enable Suspend secondary operations to stop the operations in the SLP. A selected SLP can also be suspended from the Actions menu and then activated again (Activate). Use this button to see how changes to this SLP can affect the policies that are associated with this SLP. The button generates a report that displays on the Validation Report tab. This button performs the same validation as the -conflict option performs when used with the nbstl command.
Arrows
Use the arrows to indicate the indentation (or hierarchy) of the source for each copy. One copy can be the source for many other copies. Many operations can be hierarchical or non-hierarchical:
1 2
In the NetBackup Administration Console, select NetBackup Management > Storage > Storage Lifecycle Policies. Click Actions > New > New Storage Lifecycle Policy (Windows) or Actions > New > Storage Lifecycle Policy (UNIX).
109
Click Add to add operations to the SLP. The operations are the instructions for the SLP to follow and apply to the data that is eventually specified in the backup policy. To create a hierarchical SLP, select an operation to become the source of the next operation, then click Add.
In the New Storage Operation dialog box, select an Operation type. The name of the operation reflects its purpose in the SLP:
Backup Backup From Snapshot Duplication See About NetBackup Auto Image Replication on page 47. Import Index From Snapshot Replication Snapshot
110
Indicate where the operation is to write the image. Depending on the operation, selections may include storage units or storage unit groups. No BasicDisk, SnapVault, or disk staging storage units can be used as storage unit selections in an SLP. Note: In NetBackup 7.5, the Any_Available selection is not available for new SLPs. In an upgrade situation, existing SLPs that use Any_Available continue to work as they did before NetBackup 7.5. However, if the NetBackup administrator edits an existing SLP, a specific storage unit or storage unit group must be selected before the SLP can be saved successfully.
6 7
If the storage unit is a tape device or virtual tape library (VTL)., indicate the Volume pool where the backups (or copies) are to be written. Indicate the Media owner if the storage unit is a Media Manager type and server groups are configured. By specifying a Media owner, you allow only those media servers to write to the media on which backup images for this policy are written.
Capacity managed Expire after copy If a policy is configured to back up to a lifecycle, the retention that is indicated in the lifecycle is the value that is used. The Retention attribute in the schedule is not used. Fixed Maximum snapshot limit Mirror Target retention
Indicate an Alternate read server that is allowed to read a backup image originally written by a different media server.
111
For a storage lifecycle policy, for the Storage unit select a storage unit that uses a deduplication pool. For VMware backups, select the Enable file recovery from VM backup option when you configure a VMware backup policy. The Enable file recovery from VM backup option provides the best deduplication rates. NetBackup deduplicates the client data that it sends to a deduplication storage unit.
1 2 3 4
In the NetBackup Administration Console, in the left pane, click NetBackup Management. In the right pane, click Create a Policy to begin the Policy Configuration Wizard. Select File systems, databases, or applications. Click Next to start the wizard and follow the prompts.
Click Help on any wizard panel for assistance while running the wizard.
112
1 2
In the NetBackup Administration Console, in the left pane, expand NetBackup Management > Policies. Type a unique name for the new policy in the Add a New Policy dialog box. See NetBackup naming conventions on page 22.
3 4 5
If necessary, clear the Use Policy Configuration Wizard checkbox. Click OK. Configure the attributes, the schedules, the clients, and the backup selections for the new policy.
1 2 3 4 5
In the NetBackup Administration Console, expand NetBackup Management > Host Properties > Master Servers. In the details pane, select the master server. On the Actions menu, select Properties. On the Host Properties General tab, add the clients that use client direct to the Clients list. Select one of the following Deduplication Location options:
Always use the media server disables client deduplication. By default, all clients are configured with the Always use the media server option. Prefer to use client-side deduplication uses client deduplication if the deduplication plug-in is active on the client. If it is not active, a normal backup occurs; client deduplication does not occur. Always use client-side deduplication uses client deduplication. If the deduplication backup job fails, NetBackup retries the job.
You can override the Prefer to use client-side deduplication or Always use client-side deduplication host property in the backup policies. See Disable client-side deduplication in the NetBackup Administrator's Guide for UNIX and Linux, Volume I.
113
See Disable client-side deduplication in the NetBackup Administrator's Guide for Windows, Volume I.
114
client, NetBackup updates the resilient network setting of the client and the media server that runs the job. NetBackup then uses a resilient connection. Figure 5-5 Master server Resilient Network host properties
Table 5-10 describes the Resilient Network properties. Table 5-10 Property
Host Name or IP Address
Description
The Host Name or IP Address of the host. The address can also be a range of IP addresses so you can configure more than one client at once. You can mix IPv4 addresses and ranges with IPv6 addresses and subnets. If you specify the host by name, Symantec recommends that you use the fully qualified domain name. Use the arrow buttons on the right side of the pane to move up or move down an item in the list of resilient networks.
Resiliency
115
Note: The order is significant for the items in the list of resilient networks. If a client is in the list more than once, the first match determines its resilient connection status. For example, suppose you add a client and specify the client IP address and specify On for Resiliency. Suppose also that you add a range of IP addresses as Off, and the client IP address is within that range. If the client IP address appears before the address range, the client connection is resilient. Conversely, if the IP range appears first, the client connection is not resilient. The resilient status of each client also appears as follows:
In the NetBackup Administration Console, select NetBackup Management > Policies in the left pane and then select a policy. In the right pane, a Resiliency column shows the status for each client in the policy. In the NetBackup Administration Console, select NetBackup Management > Host Properties > Clients in the left pane. In the right pane, a Resiliency column shows the status for each client.
Other NetBackup properties control the order in which NetBackup uses network addresses. The NetBackup resilient connections use the SOCKS protocol version 5. Resilient connection traffic is not encrypted. Symantec recommends that you encrypt your backups. For deduplication backups, use the deduplication-based encryption. For other backups, use policy-based encryption. Resilient connections apply to backup connections. Therefore, no additional network ports or firewall ports must be opened.
More socket connections are required per data stream. Three socket connections are required to accommodate the Remote Network Transport Service that runs on both the media server and the client. Only one socket connection is required for a non-resilient connection. More sockets are open on media servers and clients. Three open sockets are required rather than one for a non-resilient connection. The increased number of open sockets may cause issues on busy media servers. More processes run on media servers and clients. Usually, only one more process per host runs even if multiple connections exist.
116
The processing that is required to maintain a resilient connection may reduce performance slightly.
1 2 3 4 5
In the NetBackup Administration Console, expand NetBackup Management > Host Properties > Master Servers in the left pane. In the right pane, select the host or hosts on which to specify properties. Click Actions > Properties. In the properties dialog box left pane, select Resilient Network. In the Resilient Network dialog box, use the following buttons to manage resiliency:
Add Opens a dialog box in which you can add a host or an address range. If you specify the host by name, Symantec recommends that you use the fully qualified domain name. Add To All If you select multiple hosts in the NetBackup Administration Console, the entries in the Resilient Network list may appear in different colors, as follows: The entries that appear in black type are configured on all of the hosts. The entries that appear in gray type are configured on some of the hosts only.
For the entries that are configured on some of the hosts only, you can add them to all of the hosts. To do so, select them and click Add To All. Change Opens a dialog box in which you can change the resiliency settings of the select items. Remove the select host or address range. A confirmation dialog box does not appear.
Remove
Configuring deduplication Seeding the fingerprint cache for remote client-side deduplication
117
Move the selected item or items up or down. The order of the items in the list is significant. See Resilient Network properties on page 113.
Before the first backup of the remote client, edit the FP_CACHE_CLIENT_POLICY parameter in the pd.conf file on the remote client. Add the name of the existing similar client. Also add the name of the backup policy for that client and the last date on which to use that client's fingerprint cache. Specify the setting in the following format: clienthostmachine,backuppolicy,date The date is the last date in mm/dd/yyyy format to use the fingerprint cache from the client you specify. See Editing the pd.conf deduplication file on page 126. See pd.conf file settings for NetBackup deduplication on page 119.
118
1 2 3
In the NetBackup Administration Console, expand Media and Device Management > Credentials > Storage Server Select the deduplication storage server. On the Edit, select Change.
4 5
In the Change Storage Server dialog box, select the Media Servers tab Select the media server or servers that you want to use as a load balancing server. It must be a supported host. The media servers that are checked are configured as load balancing servers.
6 7
Click OK. For all storage units in which Only use the following media servers is configured, ensure that the new load balancing server is selected.
Configuring deduplication About the pd.conf configuration file for NetBackup deduplication
119
120
Configuring deduplication About the pd.conf configuration file for NetBackup deduplication
Description
On a client, specifies the IP address or range of addresses that the local network interface card (NIC) should use for backups and restores. Specify the value in one of two ways, as follows:
Classless Inter-Domain Routing (CIDR) format. For example, the following notation specifies 192.168.10.0 and 192.168.10.1 for traffic: BACKUPRESTORERANGE = 192.168.10.1/31 Comma-separated list of IP addresses. For example, the following notation specifies 192.168.10.1 and 192.168.10.2 for traffic: BACKUPRESTORERANGE = 192.168.10.1, 192.168.10.2
Default value: BACKUPRESTORERANGE= (no default value) Possible values: Classless Inter-Domain Routing format notation or comma-separated list of IP addresses BANDWIDTH_LIMIT Determines the maximum bandwidth that is allowed when backing up or restoring data between the deduplication host and the deduplication pool. The value is specified in KBytes/second. The default is no limit. Default value: BANDWIDTH_LIMIT = 0 Possible values: 0 (no limit) to the practical system limit, in KBs/sec CLIENT_POLICY_DATE
Note: This setting is valid only in a pd.conf file on a client that deduplicates its
own data. Specifies the client, the backup policy, and the date for which to perform client-side rebasing. Specify the client that hosts the pd.conf file. If you specify a value, NetBackup does not check for the existence of segments on the date specified. In the deduplication pool, NetBackup stores these segments next to each other within containers. Queue processing removes the file segments that were stored previously , which maintains unique segments within the deduplication pool. Specify the setting in a clienthostmachine,backuppolicy,date (mm/dd/yyyy) format. The date is the date on which to not check the existence of a segment on the deduplication storage. NetBackup automatically adds the CLIENT_POLICY_DATE parameter to Linux and UNIX NetBackup client-side deduplication clients. If rebasing is required on Windows clients, edit the pd.conf file and add the parameter manually. Default value: CLIENT_POLICY_DATE = (no default value) See About deduplication storage rebasing on page 180.
Configuring deduplication About the pd.conf configuration file for NetBackup deduplication
121
Description
Specifies whether to compress the data. By default, files are not compressed. Default value: COMPRESSION = 1 Possible values: 0 (off) or 1 (on) See About deduplication compression on page 34.
CR_STATS_TIMER
Specifies a time interval in seconds for retrieving statistics from the storage server host. The default value of 0 disables caching and retrieves statistics on demand. Consider the following information before you change this setting: If disabled (set to 0), a request for the latest storage capacity information occurs whenever NetBackup requests it. If you specify a value, a request occurs only after the specified number of seconds since the last request. Otherwise, a cached value from the previous request is used. Enabling this setting may reduce the queries to the storage server. The drawback is the capacity information reported by NetBackup becomes stale. Therefore, if storage capacity is close to full, Symantec recommends that you do not enable this option. On high load systems, the load may delay the capacity information reporting. If so, NetBackup may mark the storage unit as down.
Default value: CR_STATS_TIMER = 0 Possible values: 0 or greater, in seconds DEBUGLOG Specifies the file to which NetBackup writes the deduplication log information. NetBackup prepends a date stamp to each day's log file. On Windows, a partition identifier and slash must precede the file name. On UNIX, a slash must precede the file name. Default value:
122
Configuring deduplication About the pd.conf configuration file for NetBackup deduplication
Description
A comma-separated list of file name extensions of files not to be deduplicated. Files in the backup stream that have the specified extensions are given a single segment if smaller than 16 MB. Larger files are deduplicated using the maximum 16-MB segment size. Example: DONT_SEGMENT_TYPES = mp3,avi. This setting prevents NetBackup from analyzing and managing segments within the file types that do not deduplicate globally. Default value: DONT_SEGMENT_TYPES = (no default value) Possible values: comma-separated file extensions
ENCRYPTION
Specifies whether to encrypt the data. By default, files are not encrypted.. If you set this parameter to 1 on all hosts, the data is encrypted during transfer and on the storage. Default value: ENCRYPTION = 0 Possible values: 0 (no encryption) or 1 (encryption) See About deduplication encryption on page 34.
FIBRECHANNEL
Enable Fibre Channel for backup and restore traffic to and from a NetBackup series appliance. See About Fibre Channel to a NetBackup 5020 appliance on page 228. Default value: FIBRECHANNEL = 0 Possible values: 0 (off) or 1 (on)
FP_CACHE_LOCAL
Specifies whether or not to use the fingerprint cache for the backup jobs that are deduplicated on the storage server. This parameter does not apply to load balancing servers or to clients that deduplicate their own data. When the deduplication job is on the same host as the NetBackup Deduplication Engine, disabling the fingerprint cache improves performance. Default value: FP_CACHE_LOCAL = 0 Possible values: 0 (off) or 1 (on)
FP_CACHE_MAX_MBSIZE
Specifies the amount of memory in MBs to use for the fingerprint cache. Default value: FP_CACHE_MAX_MBSIZE = 20 Possible values: 0 to the computer limit
Configuring deduplication About the pd.conf configuration file for NetBackup deduplication
123
Description
Specifies the maximum number of images to load in the fingerprint cache. Default value: FP_CACHE_MAX_COUNT = 1024 Possible values: 0 to 4096
Note: Symantec recommends that you use this setting on the individual clients
that back up their own data (client-side deduplication). If you use it on a storage server or load balancing server, it affects all backup jobs. Specifies the client, backup policy, and date from which to obtain the fingerprint cache for the first backup of a client. By default, the fingerprints from the previous backup are loaded. This parameter lets you load the fingerprint cache from another, similar backup. It can reduce the amount of time that is required for the first backup of a client. This parameter especially useful for remote office backups to a central datacenter in which data travels long distances over a WAN. Specify the setting in the following format: clienthostmachine,backuppolicy,date The date is the last date in mm/dd/yyyy format to use the fingerprint cache from the client you specify. Default value: FP_CACHE_CLIENT_POLICY = (no default value) See Seeding the fingerprint cache for remote client-side deduplication on page 117.
124
Configuring deduplication About the pd.conf configuration file for NetBackup deduplication
Description
Specifies whether to allow the pd.conf settings of the deduplication storage server to override the settings in the local pd.conf file. Set this value to 1 if you use local SEGKSIZE, MINFILE_KSIZE, MATCH_PDRO, and DONT_SEGMENT_TYPES settings. Default value: LOCAL_SETTINGS = 0 Possible values: 0 (allow override) or 1 (always use local settings)
LOGLEVEL
Specifies the amount of information that is written to the log file. The range is from 0 to 10, with 10 being the most logging. Default value: LOGLEVEL = 0 Possible values: An integer, 0 to 10 inclusive
Configuring deduplication About the pd.conf configuration file for NetBackup deduplication
125
Description
Determines the bandwidth that is allowed for each optimized duplication and Auto Image Replication stream on a deduplication server. OPTDUP_BANDWITH does not apply to clients. The value is specified in KBytes/second. Default value: OPTDUP_BANDWITH = 0 Possible values: 0 (no limit) to the practical system limit, in KBs/sec A global bandwidth parameter affects whether or not OPTDUP_BANDWITH applies. See About configuring optimized duplication and replication bandwidth on page 89.
OPTDUP_COMPRESSION
Specifies whether to compress optimized duplication data. By default, files are compressed. To disable compression, change the value to 0. Default value: OPTDUP_COMPRESSION = 1 Possible values: 0 (off) or 1 (on) See About deduplication compression on page 34.
OPTDUP_ENCRYPTION
Specifies whether to encrypt the optimized duplication data. By default, files are not encrypted. If you want encryption, change the value to 1. If you set this parameter to 1 on all hosts, the data is encrypted during transfer and on the storage. Default value: OPTDUP_ENCRYPTION = 0 Possible values: 0 (off) or 1 (on) See About deduplication encryption on page 34.
OPTDUP_TIMEOUT
Specifies the number of minutes before the optimized duplication times out. Default value: OPTDUP_TIMEOUT = 720 Possible values: The value, expressed in minutes
PREFERRED_EXT_SEGKSIZE
Specifies the file extensions and the preferred segment sizes in KB for specific file types. File extensions are case sensitive. The following describe the default values: edb are Microsoft Exchange files; mdfare Microsoft SQL master database files, ndf are Microsoft SQL secondary data files, and segsize64k are Microsoft SQL streams. Default value: PREFERRED_EXT_SEGKSIZE = edb:32,mdf:64,ndf:64,segsize64k:64 Possible values: file_extension:segment_size_in_KBs pairs, separated by commas. See also SEGKSIZE.
126
Description
The size in bytes to use for the data buffer for restore operations. Default value: PREFETCH_SIZE = 33554432 Possible values: 0 to the computers memory limit
Warning: Changing this value may reduce capacity and decrease performance.
Change this value only when directed to do so by a Symantec representative. You can also specify the segment size for specific file types. See PREFERRED_EXT_SEGKSIZE. WS_RETRYCOUNT This parameter applies to the PureDisk Deduplication Option only. It does not affect NetBackup deduplication. Default value: WS_RETRYCOUNT = 3 Possible values: Integer WS_TIMEOUT This parameter applies to the PureDisk Deduplication Option only. It does not affect NetBackup deduplication. Default value: WS_TIMEOUT = 120 Possible values: Integer
127
See About the pd.conf configuration file for NetBackup deduplication on page 119. See pd.conf file settings for NetBackup deduplication on page 119. To edit the pd.conf file
Use a text editor to open the pd.conf file. The pd.conf file resides in the following directories:
2 3
To activate a setting, remove the pound character (#) in column 1 from each line that you want to edit. To change a setting, specify a new value. Note: The spaces to the left and right of the equal sign (=) in the file are significant. Ensure that the space characters appear in the file after you edit the file.
4 5
Save and close the file. Restart the NetBackup Remote Manager and Monitor Service (nbrmms) on the host.
The NetBackup documentation exposes only some of the contentrouter.cfg file parameters. Those parameters appear in topics that describe a task or process to change configuration settings.
128
Note: Change values in the contentrouter.cfg only when directed to do so by the NetBackup documentation or by a Symantec representative.
V7.0 represents the version of the I/O format not the NetBackup release level. The version may differ on your system. If you get the storage server configuration when the server is not configured or is down and unavailable, NetBackup creates a template file. The following is an example of a template configuration file:
V7.0 V7.0 V7.0 V7.0 V7.0 V7.0 V7.0 "storagepath" " " string "spalogin" " " string "spapasswd" " " string "spalogretention" "7" int "verboselevel" "3" int "dbpath" " " string "required_interface" " " string
129
To use a storage server configuration file for recovery, you must edit the file so that it includes only the information that is required for recovery. See Saving the deduplication storage server configuration on page 129. See Editing a deduplication storage server configuration file on page 129. See Setting the deduplication storage server configuration on page 131.
For sshostname, use the name of the storage server. For file.txt, use a file name that indicates its purpose. If you get the file when a storage server is not configured or is down and unavailable, NetBackup creates a template file.
130
V7.0 "replication_target(s)" "none" string A value for replication_target(s) is required only if you configured optimized duplication. Otherwise, do not edit this line. V7.0 "spalogin" "username" string Replace username with the NetBackup Deduplication Engine user ID. Replace password with the password for the NetBackup Deduplication Engine user ID.
See About saving the deduplication storage server configuration on page 128. See Recovering from a deduplication storage server disk failure on page 209. See Recovering from a deduplication storage server failure on page 211.
131
If you did not save a storage server configuration file, get a storage server configuration file. See Saving the deduplication storage server configuration on page 129.
Use a text editor to enter, change, or remove values. Remove lines from and add lines to your file until only the required lines (see Table 5-12) are in the configuration file. Enter or change the values between the second set of quotation marks in each line. A template configuration file has a space character (" ") between the second set of quotation marks.
On the master server, run the following command: UNIX: /usr/openv/netbackup/bin/admincmd/nbdevconfig -setconfig
-storage_server sshostname -stype PureDisk -configlist file.txt
For sshostname, use the name of the storage server. For file.txt, use the name of the file that contains the configuration.
132
The storage_server_name is the fully qualified domain name if that was used to configure the storage server. For example, if the storage server name is DedupeServer.symantecs.org, the configuration file name is DedupeServer.symantecs.org.cfg. The following is the location of the file: UNIX: /usr/openv/lib/ost-plugins Windows: install_path\Veritas\NetBackup\bin\ost-plugins
Delete the file; it's location depends on the operating system type, as follows: UNIX: /usr/openv/lib/ost-plugins Windows: install_path\Veritas\NetBackup\bin\ost-plugins The following is an example of the host configuration file name of a server that has a fully qualified domain name:
DedupeServer.symantecs.org.cfg
133
The procedure differs on UNIX and on Windows. To reset the deduplication registry file on UNIX and Linux
Enter the following commands on the storage server to reset the deduplication registry file:
rm /etc/pdregistry.cfg cp -f /usr/openv/pdde/pdconfigure/cfg/userconfigs/pdregistry.cfg /etc/pdregistry.cfg
HKLM\SOFTWARE\Symantec\PureDisk\Agent\ConfigFilePath HKLM\SOFTWARE\Symantec\PureDisk\Agent\EtcPath
Delete the storage path in the following key in the Windows key. That is, delete everything after postgresql-8.3 -D in the key.
HKLM\SYSTEM\ControlSet001\Services\postgresql-8.3\ImagePath
For example, in the following example registry key, you would delete the content of the key that is in italic type:
"C:\Program Files\Veritas\pdde\pddb\bin\pg_ctl.exe" runservice -N postgresql-8.3 -D "D:\DedupeStorage\databases\pddb\data" -w
134
The database path may be the same as the configured storage path.
2 3 4
In the line that begins with log_line_prefix, change the value from %%t to %t. (That is, remove one of the percent signs (%).) Save the file. Run the following command:
install_path\Veritas\pdde\pddb\bin\pg_ctl reload -D dbpath\databases\pddb\data
If the command output does not include server signaled, use Windows Computer Management to restart the PostgreSQL Server 8.3 service. See About deduplication logs on page 187.
Alternatively, on UNIX systems you can set configuration options in the bp.conf file. See the NetBackup Administrator's Guide for UNIX and Linux, Volume I.
135
On the host on which you want to set configuration options, invoke the bpsetconfig command, as follows: UNIX: /usr/openv/netbackup/bin/admincmd/bpsetconfig Windows: install_path\NetBackup\bin\admincmd\bpsetconfig
At the bpsetconfig prompt, enter the key and the value pairs of the configuration options that you want to set, one pair per line. Ensure that you understand the values that are allowed and the format of any new options that you add. You can change existing key and value pairs. You can add key and value pairs.
To save the configuration changes, type the following, depending on the operating system: UNIX: Ctrl + D Enter Windows: Ctrl + Z Enter
136
Chapter
Monitoring the deduplication rate Viewing deduplication job details About deduplication storage capacity and usage reporting About deduplication container files Viewing storage usage within deduplication container files Viewing disk reports Monitoring deduplication processes Reporting on Auto Image Replication jobs
To view the global deduplication ratio To view the deduplication rate for a backup job in the Activity Monitor
138
1 2 3 4
In the NetBackup Administration Console, expand Media and Device Management > Credentials > Storage Server Select the deduplication storage server. On the Edit menu, select Change. In the Change Storage Server dialog box, select the Properties tab. The Deduplication Ratio field displays the ratio.
To view the deduplication rate for a backup job in the Activity Monitor
1 2
In the NetBackup Administration Console, click Activity Monitor. Click the Jobs tab. The Deduplication Rate column shows the rate for each job.
Many factors affect deduplication performance. See About deduplication performance on page 52.
139
1 2 3 4
In the NetBackup Administration Console, click Activity Monitor. Click the Jobs tab. To view the details for a specific job, double-click on the job that is displayed in the Jobs tab pane. In the Job Details dialog box, click the Detailed Status tab. The deduplication job details are described in a different topic. See Deduplication job details on page 139.
Description
For media server deduplication, the Detailed Status tab shows the deduplication rate on the server that performed the deduplication. The following job details excerpt shows details for a client for which Server_A deduplicated the data (the dedup field shows the deduplication rate): 10/6/2010 10:02:09 AM - Info Server_A(pid=30695) StorageServer=PureDisk:Server_A; Report=PDDO Stats for (Server_A): scanned: 30126998 KB, stream rate: 162.54 MB/sec, CR sent: 1720293 KB, dedup: 94.3%, cache hits: 214717 (94.0%)
140
Client-side deduplication job For client-side deduplication jobs, the Detailed Status tab shows two deduplication details rates. The first deduplication rate is always for the client data. The second deduplication rate is for the metadata (disk image header and True Image Restore information (if applicable)). That information is always deduplicated on a server; typically, deduplication rates for that information are zero or very low. The following job details example excerpt shows the two rates. The 10/8/2009 11:58:09 PM entry is for the client data; the 10/8/2010 11:58:19 PM entry is for the metadata. 10/8/2010 11:54:21 PM - Info Server_A(pid=2220) Using OpenStorage client direct to backup from client Client_B to Server_A 10/8/2009 11:58:09 PM - Info Server_A(pid=2220) StorageServer=PureDisk:Server_A; Report=PDDO Stats for (Server_A: scanned: 3423425 KB, stream rate: 200.77 MB/sec, CR sent: 122280 KB, dedup: 96.4%, cache hits: 49672 (98.2%) 10/8/2010 11:58:09 PM - Info Server_A(pid=2220) Using the media server to write NBU data for backup Client_B_1254987197 to Server_A 10/8/2010 11:58:19 PM - Info Server_A(pid=2220) StorageServer=PureDisk:Server_A; Report=PDDO Stats for (Server_A: scanned: 17161 KB, stream rate: 1047.42 MB/sec, CR sent: 17170 KB, dedup: 0.0%, cache hits: 0 (0.0%) the requested operation was successfully completed(0)
Table 6-2 describes the deduplication activity fields. Table 6-2 Field
cache hits
Monitoring deduplication activity About deduplication storage capacity and usage reporting
141
CR sent over FC
The amount of data that is sent from the deduplication plug-in over Fibre Channel to the component that stores the data. In NetBackup, the NetBackup Deduplication Engine stores the data. The percentage of data that was stored already. That data is not stored again. The amount of data that the deduplication plug-in scanned. The speed of the scan: The kilobytes of data that are scanned divided by how long the scan takes.
dedup
Expired backups may not change the available size and the used size. An expired backup may have no unique data segments. Therefore, the segments remain valid for other backups. NetBackup Deduplication Manager clean-up may not have run yet. The Deduplication Manager performs clean up twice a day. Until it performs clean-up, deleted image fragments remain on disk.
142
Monitoring deduplication activity About deduplication storage capacity and usage reporting
If you use operating system tools to examine storage space usage, their results may differ from the usage reported by NetBackup, as follows:
NetBackup usage data includes reserved space that the operating system tools do not include. If other applications use the storage, NetBackup cannot report usage accurately. NetBackup requires exclusive use of the storage.
Table 6-3 describes the options for monitoring capacity and usage. Table 6-3 Option
Change Storage Server dialog box
The Disk Pools window of the Administration Console displays values that were stored when NetBackup polled the disk pools. NetBackup polls every 5 minutes; therefore, the value may not be as current as the value that is displayed in theChange Storage Server dialog box. To display the window, expand Media and Device Management > Devices > Disk Pools.
A command that is installed with NetBackup provides a view of storage capacity and usage within the deduplication container files. See About deduplication container files on page 143. See Viewing storage usage within deduplication container files on page 143.
License Keys dialog box The summary of active capacity-based license features in the NetBackup License Keys dialog box. The summary displays the storage capacity for which you are licensed and the capacity used. It does not display the amount of physical storage space. On the Help menu in the NetBackup Administration Console, select License Keys.
143
The Disk Logs report displays event and message information. A useful event for monitoring capacity is event 1044; the following is the description of the event in theDisk Logs report: The usage of one or more system resources has exceeded a warning level. The threshold for this message is at 96% capacity. No more data can be stored. See Viewing disk reports on page 145. See Deduplication event codes and messages on page 202.
The nbdevquery command shows the state of the disk volume and its properties and attributes. It also shows capacity, usage, and percent used. See Determining the deduplication disk volume state on page 171.
NetBackup OpsCenter
The NetBackup OpsCenter also provides information about storage capacity and usage. See the NetBackup OpsCenter Administrator's Guide.
144
Monitoring deduplication activity Viewing storage usage within deduplication container files
Use the crcontrol command and the --dsstat option on the deduplication storage server. (For help with the command options, use the --help option.) The following is an example of the command usage on a Windows deduplication storage server.
C:\Program Files\Veritas\pdde>Crcontrol.exe --dsstat ************ Data Store statistics ************ Data storage Raw Size Used Avail Use% 62.8T 60.2T 58.7T 1.5T 98% Number of containers : Average container size : Space allocated for containers : Space used within containers : Space available within containers: Space needs compaction : Reserved space : Reserved space percentage : Records marked for compaction : Active records : Total records : 239731 268120188 bytes (255.70MB) 64276720994658 bytes (58.46TB) 63904602067295 bytes (58.12TB) 372118927363 bytes (346.56GB) 154014169 bytes (146.88MB) 2865754513408 bytes (2.61TB) 4.2% 29620 912115621 912145241
Avail Use%
145
Figure 6-1
The NetBackup Deduplication Manager checks the storage space every 20 seconds. It then periodically compacts the space available inside the container files. Therefore, space within a container is not available as soon as it is free. Various internal parameters control whether a container file is compacted. Although space may be available within a container file, the file may not be eligible for compaction.
146
The Disk Storage Unit Status report displays the state of disk storage units in the current NetBackup configuration. For disk pool capacity, see the disk pools window in Media and Device Management > Devices > Disk Pools. Multiple storage units can point to the same disk pool. When the report query is by storage unit, the report counts the capacity of disk pool storage multiple times.
The Disk Pool Status report displays the state of disk pool and usage information.
1 2 3 4
In the NetBackup Administration Console, expand NetBackup Management > Reports > Disk Reports. Select the name of a disk report. In the right pane, select the report settings. Click Run Report.
147
On Windows systems, NetBackup Deduplication Manager in the Activity Monitor Services tab. On UNIX, the NetBackup Deduplication Manager appears as spad in the Administration Console Activity Monitor Daemons tab. The NetBackup bpps command shows the spad process.
The database On Windows systems, the postgres database processes appear processes (postgres) in the Activity Monitor Processes tab. The NetBackup bpps command shows the postgres processes.
Description
The job that replicates a backup image to a target master displays in the Activity Monitor as a Replication job. The Target Master label displays in the Storage Unit column for this type of job. Similar to other Replication jobs, the job that replicates images to a target master can work on multiple backup images in one instance. The detailed status for this job contains a list of the backup IDs that were replicated.
148
Description
The job that imports a backup copy into the target master domain displays in the Activity Monitor as an Import job. An Import job can import multiple copies in one instance. The detailed status for an Import job contains a list of processed backup IDs and a list of failed backup IDs. Note that a successful replication does not confirm that the image was imported at the target master. If the SLP names or data classifications are not the same in both domains, the Import job fails and NetBackup does not attempt to import the image again. Failed Import jobs fail with a status 191 and appear in the Problems report when run on the target master server. The image is expired and deleted during an Image Cleanup job. Note that the originating domain (Domain 1) does not track failed imports.
Chapter
Managing deduplication
This chapter includes the following topics:
Managing deduplication servers Managing NetBackup Deduplication Engine credentials Managing deduplication disk pools Deleting backup images Disabling client-side deduplication for a client About deduplication queue processing Processing the deduplication transaction queue manually About deduplication data integrity checking Configuring deduplication data integrity checking behavior About managing storage read performance About deduplication storage rebasing Resizing the deduplication storage partition About restoring files at a remote site About restoring from a backup at a target master domain Specifying the restore server
150
See Viewing deduplication storage servers on page 150. See Determining the deduplication storage server state on page 150. See Viewing deduplication storage server attributes on page 151. See Setting deduplication storage server attributes on page 152. See Changing deduplication storage server properties on page 153. See Clearing deduplication storage server attributes on page 154. See About changing the deduplication storage server name or storage path on page 155. See Changing the deduplication storage server name or storage path on page 155. See Removing a load balancing server on page 157. See Deleting a deduplication storage server on page 158. See Deleting the deduplication storage server configuration on page 159.
In the NetBackup Administration Console, expand Media and Device Management > Credentials > Storage Server. The All Storage Servers pane shows all configured deduplication storage servers. deduplication storage servers show PureDisk in the Disk Type column.
151
Run the following command on the NetBackup master server or a deduplication storage server: UNIX: /usr/openv/netbackup/bin/admincmd/nbdevquery -liststs
-storage_server server_name -stype PureDisk U
This example output is shortened; more flags may appear in actual output.
152
The following is the command syntax to set a storage server attribute. Run the command on the NetBackup master server or on the deduplication storage server: UNIX: /usr/openv/netbackup/bin/admincmd/nbdevquery -liststs
-storage_server server_name -stype PureDisk U
This example output is shortened; more flags may appear in actual output.
The following is the command syntax to set a storage server attribute. Run the command on the master server or on the storage server.
nbdevconfig -changests -storage_server storage_server -stype PureDisk -setattribute attribute
The following describes the options that require the arguments that are specific to your domain:
153
The attribute is the name of the argument that represents the new functionality. For example, OptimizedImage specifies that the environment supports the optimized synthetic backup method.
To verify, view the storage server attributes. See Viewing deduplication storage server attributes on page 151.
1 2 3
In the NetBackup Administration Console, expand Media and Device Management > Credentials > Storage Server Select the deduplication storage server. On the Edit menu, select Change.
154
In the Change Storage Server dialog box, select the Properties tab.
5 6 7
For the property to change, select the value in the Value column. Change the value. Click OK.
Run the following command on the NetBackup master server or on a storage server:
nbdevconfig -changests -storage_server storage_server -stype PureDisk -clearattribute attribute -storage_server storage_server -setattribute attribute The name of the storage server.
The attribute is the name of the argument that represents the functionality.
155
You want to change the host name. For example, the name of host A was changed to B or a new network card was installed with a private interface C. To use the host name B or the private interface C, you must reconfigure the storage server. See Changing the deduplication storage server name or storage path on page 155. You want to change the storage path. To do so, you must reconfigure the storage server with the new path. See Changing the deduplication storage server name or storage path on page 155. You need to reuse the storage for disaster recovery. The storage is intact, but the storage server was destroyed. To recover, you must configure a new storage server. In this scenario, you can use the same host name and storage path or use different ones. See Recovering from a deduplication storage server failure on page 211.
156
Task
Ensure that no deduplication Deactivate all backup policies that use deduplication storage. activity occurs See the NetBackup Administrator's Guide for UNIX and Linux, Volume I See the NetBackup Administrator's Guide for Windows, Volume I.
Step 2
Expire all backup images that reside on the deduplication disk storage.
Warning: Do not delete the images. They are imported back into NetBackup
later in this process. If you use the bpexpdate command to expire the backup images, use the -nodelete parameter. See the NetBackup Administrator's Guide for UNIX and Linux, Volume I See the NetBackup Administrator's Guide for Windows, Volume I. Step 3 Delete the storage units that See the NetBackup Administrator's Guide for UNIX and Linux, Volume I use the disk pool See the NetBackup Administrator's Guide for Windows, Volume I. Delete the disk pool Delete the deduplication storage server Delete the configuration See Deleting a deduplication disk pool on page 172. See Deleting a deduplication storage server on page 158.
Step 4 Step 5
Step 6
Delete the deduplication configuration. See Deleting the deduplication storage server configuration on page 159.
Step 7
Delete the deduplication host Each load balancing server contains a deduplication host configuration configuration file file. If you use load balancing servers, delete the deduplication host configuration file from those servers. See Deleting a deduplication host configuration file on page 132.
Step 8
Change the storage server See the computer or the storage vendor's documentation. name or the storage location See Use fully qualified domain names on page 54. See About the deduplication storage paths on page 69.
Step 9
When you configure deduplication, select the host by the new name and enter the new storage path (if you changed the path). You can also use a new network interface. See Configuring NetBackup media server deduplication on page 76.
Step 10
See the NetBackup Administrator's Guide for UNIX and Linux, Volume I See the NetBackup Administrator's Guide for Windows, Volume I.
157
For every storage unit that specifies the media server in Use one of the following media servers, clear the check box that specifies the media server. This step is not required if the storage unit is configured to use any available media server.
In the NetBackup Administration Console, expand Media and Device Management > Credentials > Storage Server.
158
Select the deduplication storage server, then select Edit > Change.
4 5 6
In the Change Storage Server dialog box, select the Media Servers tab. Clear the check box of the media server you want to remove. Click OK.
159
Warning: Do not delete a deduplication storage server if its storage contains unexpired NetBackup images; if you do, data loss may occur. To delete a deduplication storage server
1 2 3
In the NetBackup Administration Console, expand Media and Device Management > Credentials > Storage Server On the Edit menu, select Delete. Click Yes in the confirmation dialog box.
See Changing the deduplication storage server name or storage path on page 155.
Delete the deduplication configuration by running one of the following scripts, depending on your operation system: UNIX:
/usr/openv/pdde/pdconfigure/scripts/installers/PDDE_deleteConfig.sh
Windows:install_path\Program
Files\Veritas\pdde\PDDE_deleteConfig.bat
160
If the settings do not exist or their values differ from those in this topic, add or change them accordingly. Then, restart both the NetBackup Deduplication Manager (spad.exe) and the NetBackup Deduplication Engine (spoold.exe).
1 2 3
In the NetBackup Administration Console, expand Media and Device Management > Credentials > Storage Server. Select the storage server, then select Edit > Change. In the Change Storage Server dialog box, select the Media Servers tab. The media servers for which credentials are configured are checked.
161
On the host to which you want to add credentials, run the following command: UNIX: /usr/openv/volmgr/bin/tpconfig -add -storage_server
sshostname -stype PureDisk -sts_user_id UserID -password PassWord
On the load balancing server, run the following command: UNIX: /usr/openv/volmgr/bin/tpconfig -delete -storage_server
sshostname -stype PureDisk -sts_user_id UserID
162
See Viewing deduplication disk pools on page 162. See Determining the deduplication disk pool state on page 162. See Changing the deduplication disk pool state on page 162. See Viewing deduplication disk pool attributes on page 163. See Setting deduplication disk pool attributes on page 164. See Changing deduplication disk pool properties on page 165. See Clearing deduplication disk pool attributes on page 170. See Determining the deduplication disk volume state on page 171. See Changing the deduplication disk volume state on page 171. See Deleting a deduplication disk pool on page 172.
In the NetBackup Administration Console, expand Media and Device Management > Devices > Disk Pools.
1 2 3
In the NetBackup Administration Console, expand Media and Device Management > Device Monitor. Select the Disk Pools tab. The state is displayed in the Status column.
163
1 2 3 4
In the NetBackup Administration Console, expand Media and Device Management > Device Monitor. Select the Disk Pools tab. Select the disk pool. Select either Actions > Up or Actions > Down.
See About NetBackup deduplication pools on page 79. See Determining the deduplication disk pool state on page 162. See Media server deduplication pool properties on page 81. See Configuring a deduplication disk pool on page 81.
164
The following is the command syntax to view the attributes of a deduplication pool. Run the command on the NetBackup master server or on the deduplication storage server: UNIX: /usr/openv/netbackup/bin/admincmd/nbdevquery -listdp -dp
pool_name -stype PureDisk U
This example output is shortened; more flags may appear in actual output.
The following is the command syntax to set a deduplication pool attribute. Run the command on the master server or on the storage server.
165
The following describes the options that require the arguments that are specific to your domain:
-changedp pool_name -setattribute attribute The name of the disk pool.
The attribute is the name of the argument that represents the new functionality. For example, OptimizedImage specifies that the environment supports the optimized synthetic backup method.
To verify, view the disk pool attributes. See Viewing deduplication disk pool attributes on page 163.
1 2
In the NetBackup Administration Console, expand Media and Device Management > Devices > Disk Pools. Select the disk pool you want to change in the details pane.
166
In the Change Disk Pool dialog box, click Refresh to update the disk pool replication properties. If NetBackup discovers changes, your actions depend on the changes discovered. See How to resolve volume changes for Auto Image Replication on page 167.
Change the other properties as necessary. See Media server deduplication pool properties on page 81.
167
6 7
Click OK. If you clicked Refresh and the Replication value for the PureDiskVolume changed, refresh the view in the Administration Console.
Discuss the changes with the storage administrator. You need to understand the changes so you can change your disk pools (if required) so that NetBackup can continue to use them. If the changes were not planned for NetBackup, request that the changes be reverted so that NetBackup functions correctly again.
If these volume properties change, NetBackup can update the disk pool to match the changes. NetBackup can continue to use the disk pool, although the disk pool may no longer match the storage unit or storage lifecycle purpose. Table 7-2 describes the possible outcomes and describes how to resolve them. Table 7-2 Outcome
No changes are discovered.
Refresh outcomes
Description
No changes are required.
NetBackup discovers the new The new volumes appear in the Change Disk Pool dialog box. Text in the dialog box volumes that you can add to the changes to indicate that you can add the new volumes to the disk pool. disk pool.
168
Description
The replication properties of all A Disk Pool Configuration Alert pop-up box notifies you that the properties of all of of the volumes changed, but they the volumes in the disk pool changed, but they are all the same (homogeneous). are still consistent.
You must click OK in the alert box, after which the disk pool properties in the Change Disk Pool dialog box are updated to match the new volume properties. If new volumes are available that match the new properties, NetBackup displays those volumes in the Change Disk Pool dialog box. You can add those new volumes to the disk pool. In the Change Disk Pool dialog box, select one of the following two choices: OK. To accept the disk pool changes, click OK in the Change Disk Pool dialog box. NetBackup saves the new properties of the disk pool. NetBackup can use the disk pool, but it may no longer match the intended purpose of the storage unit or storage lifecycle policy. Change the storage lifecycle policy definitions to ensure that the replication operations use the correct source and target disk pools, storage units, and storage unit groups. Alternatively, work with your storage administrator to change the volume properties back to their original values. Cancel. To discard the changes, click Cancel in the Change Disk Pool dialog box. NetBackup does not save the new disk pool properties. NetBackup can use the disk pool, but it may no longer match the intended use of the storage unit or storage lifecycle policy.
169
Description
The replication properties of the A Disk Pool Configuration Error pop-up box notifies you that the replication properties volumes changed, and they are of some of the volumes in the disk pool changed. The properties of the volumes in the now inconsistent. disk pool are not homogeneous.
You must click OK in the alert box. In the Change Disk Pool dialog box, the properties of the disk pool are unchanged, and you cannot select them (that is, they are dimmed). However, the properties of the individual volumes are updated. Because the volume properties are not homogeneous, NetBackup cannot use the disk pool until the storage configuration is fixed. NetBackup does not display new volumes (if available) because the volumes already in the disk pool are not homogeneous. To determine what has changed, compare the disk pool properties to the volume properties. See Viewing the replication topology for Auto Image Replication on page 100. Work with your storage administrator to change the volume properties back to their original values. The disk pool remains unusable until the properties of the volumes in the disk pool are homogenous. In the Change Disk Pool dialog box, click OK or Cancel to exit the Change Disk Pool dialog box.
170
Description
NetBackup cannot find a volume A Disk Pool Configuration Alert pop-up box notifies you that an existing volume or or volumes that were in the disk volumes was deleted from the storage device: pool.
NetBackup can use the disk pool, but data may be lost. To protect against accidental data loss, NetBackup does not allow volumes to be deleted from a disk pool. To continue to use the disk pool, do the following: Use the bpimmedia command or the Images on Disk report to display the images on the specific volume. Expire the images on the volume.
Use the nbdevconfig command to set the volume state to DOWN so NetBackup does not try to use it.
The following is the command syntax to clear a deduplication pool attribute. Run the command on the master server or on the storage server.
nbdevconfig -changedp -dp pool_name -stype PureDisk -clearattribute attribute
The attribute is the name of the argument that represents the new functionality.
171
Display the volume state by using the following command: UNIX: /usr/openv/netbackup/bin/admincmd/nbdevquery -listdv -stype
PureDisk -U
172
To change the state to DOWN, the disk pool in which the volume resides must not be busy. If backup jobs are assigned to the disk pool, the state change fails. Cancel the backup jobs or wait until the jobs complete. To change the deduplication disk volume state
Determine the name of the disk volume. The following command lists all volumes in the specified disk pool:
nbdevquery -listdv -stype PureDisk -dp disk_pool_name
The nbdevquery and the nbdevconfig commands reside in the following directory:
To display the disk volumes in all disk pools, omit the -dp option.
Change the disk volume state; the following is the command syntax:
nbdevconfig -changestate -stype PureDisk -dp disk_pool_name dv vol_name -state state
1 2 3 4
In the NetBackup Administration Console, expand Media and Device Management > Devices > Disk Pools. Select a disk pool On the Edit menu, select Delete. In the Delete Disk Pool dialog box, verify that the disk pool is the one you want to delete and then click OK.
173
Expire all of the images by using the bpexpdate command and the -notimmediate option. The -notimmediate option prevents bpexpdate from calling the nbdelete command, which deletes the image. Without this option, bpexpdate calls nbdelete to delete images. Each call to nbdelete creates a job in the Activity Monitor, allocates resources, and launches processes on the media server.
After you expire the last image, delete all of the images by using the nbdelete command with the allvolumes option. Only one job is created in the Activity Monitor, fewer resources are allocated, and fewer processes are started on the media servers. The entire process of expiring images and deleting images takes less time.
1 2 3 4 5 6
In the NetBackup Administration Console, expand NetBackup Management > Host Properties > Master Servers. In the details pane, select the master server. On the Actions menu, select Properties. On the Host Properties Client Attributes General tab, select the client that deduplicates its own data. In the Deduplication Location drop-down list, select Always use the media server. Click OK.
174
175
Run the control command with the --processqueue option. The following is an example on a Windows system:
install_path\Veritas\pdde\Crcontrol.exe --processqueue
To examine the results, run the control command with the --dsstat 1 option (number 1 not lowercase letter l). The command may run for a long time; if you omit the 1, results return more quickly but they are not as accurate. See Viewing storage usage within deduplication container files on page 143.
Data consistency check Cyclic redundancy check (CRC) Storage leak check
NetBackup resolves many integrity issues without user intervention, and some issues are fixed when the next backup runs. However, a severe issue may require intervention by Symantec Support. In such cases, NetBackup writes a message to the NetBackup Disk Logs report. See Viewing disk reports on page 145. Data integrity message codes are 1047, 1057, and 1058. See Deduplication event codes and messages on page 202. NetBackup writes integrity checking activity messages to the storaged.log file. See About deduplication logs on page 187. You can configure some of the data integrity checking behaviors. See Configuring deduplication data integrity checking behavior on page 176.
176
Edit a configuration file parameter. See To configure data integrity checking behavior by editing the configuration file on page 176. Run a command. See To configure data integrity checking behavior by using a command on page 176.
More information about data integrity checking is available. See About deduplication data integrity checking on page 175. See Deduplication data integrity checking configuration settings on page 178. To configure data integrity checking behavior by editing the configuration file
Use a text editor to open the contentrouter.cfg file. The contentrouter.cfg file resides in the following directories:
To change a parameter, specify a new value. Note: The spaces to the left and right of the equal sign (=) in the file are significant. Ensure that the space characters appear in the file after you edit the file. See Deduplication data integrity checking configuration settings on page 178.
To configure behavior, specify a value for each of the data integrity checks, as follows:
177
Enable
Disable
178
Disable
UNIX: storage_path/etc/puredisk Windows: storage_path\etc\puredisk spa.cfg file parameters for data integrity checking Description
Enable or disable data consistency checking. The possible values are True or False.
EnableDataCheck
DataCheckDays
30
The number of days to check the data for consistency. The greater the number of days, the fewer the objects that are checked each day. The greater the number of days equals fewer storage server resources consumed each day.
179
EnableCRCCheck
CRCCheckSleepSeconds
The time in seconds to sleep between checking containers. The longer the sleep interval, the more time it takes to check containers.
EnableStorageLeakCheck True
Enable or disable storage leak checking. The possible values are True or False.
CheckExpirationDays
60
The number of days to test for storage leaks. Usually, set this value to double that of the DataCheckDays value. The greater the number of days, the fewer the objects that are checked each day. The greater the number of days equals fewer storage server resources consumed each day.
180
Defragmenting the NetBackup includes a process, called rebasing, which defragments the storage backup images in a deduplication pool. Read performance improves when the file segments from a client backup are close to each other on deduplication storage. If you upgrade from a NetBackup release earlier than 7.5, rebasing may affect your deduplication performance temporarily. See About deduplication storage rebasing on page 180. The number of prefetching threads The PrefetchThreadNum parameter in the contentrouter.cfg file specifies the number of threads to use to preload segments during storage read operations. The default value is 1. Depending on your disks, a value as high as 4 may improve performance. However, Symantec recommends that you test higher values thoroughly to ensure that a value greater than 1 yields better performance. Higher values may actually decrease performance. See About the contentrouter.cfg file for NetBackup deduplication on page 127. The prefetch buffer size The PREFETCH_SIZE parameter in the pd.conf file specifies the size in bytes to use for the data buffer for read operations. See About the pd.conf configuration file for NetBackup deduplication on page 119. See pd.conf file settings for NetBackup deduplication on page 119. Decrypting the data on the client rather than the server The RESTORE_DECRYPT_LOCAL parameter in the pd.conf file specifies on which host to decrypt and decompress the data during restore operations. See About the pd.conf configuration file for NetBackup deduplication on page 119. See pd.conf file settings for NetBackup deduplication on page 119.
181
NetBackup includes a process, called rebasing, which defragments the backup images in a deduplication pool. Read performance improves when the file segments from a client backup are close to each other on deduplication storage. NetBackup consumes less time finding and reassembling files when their segments are near each other. Table 7-6 Rebasing type
Server-side rebasing
Types of rebasing
Description
NetBackup automatically rebases deduplicated data in a NetBackup Media Server Deduplication Pool. The rebasing process is configured to occur when the storage server is not otherwise busy. The following parameters in the contentrouter.cfg file control server-side rebasing behavior:
RebaseScatterThreshold This parameter specifies the average data size threshold per container for a given backup image to be considered for rebasing. By default, this parameter is RebaseScatterThreshold=64MB.
RebaseQuota This parameter specifies the amount of data that the rebasing operation can relocate (or move) each day. By default, this parameter is RebaseQuota=500GB.
Usually, rebasing has little affect on normal operations. However, immediately after an upgrade to NetBackup 7.5, rebasing may overlap into your backup windows. If performance degrades unacceptably during rebasing, change the parameter values in the contentrouter.cfg file. A lower value for each parameter reduces the resources that the storage server uses for rebasing. See About the contentrouter.cfg file for NetBackup deduplication on page 127.
182
Description
Client-side deduplication only. Client-side rebasing can yield further performance increases. Segments from a backup of the specified client with the specified policy are sent to the storage server without checking for the existence of a segment on the specified date. In the deduplication pool, these segments are stored next to each other within containers. Queue processing removes previously stored, duplicated segments to keep unique segments within the container. Some limitations exist, as follows: The deduplication rate per job may not accurately reflect the actual storage deduplication rate. Performance may not improve for the remote clients that have limited network bandwidth.
The CLIENT_POLICY_DATE parameter in the pd.conf file controls rebasing on a NetBackup client. See About the pd.conf configuration file for NetBackup deduplication on page 119. See pd.conf file settings for NetBackup deduplication on page 119. NetBackup automatically adds the CLIENT_POLICY_DATE parameter to Linux and UNIX NetBackup client-side deduplication clients. If rebasing is required on Windows clients, edit the pd.conf file and add the parameter manually.
1 2
Stop all NetBackup jobs on the storage on which you want to change the disk partition sizes and wait for the jobs to end. Deactivate the media server that hosts the storage server. See the NetBackup Administrator's Guide for UNIX and Linux, Volume I or the NetBackup Administrator's Guide for Windows, Volume I.
Stop the NetBackup services on the storage server. Be sure to wait for all services to stop.
Use the operating system or disk manager tools to dynamically increase or decrease the deduplication storage area.
183
5 6
Restart the NetBackup services. Activate the media server that hosts the storage server. See the NetBackup Administrator's Guide for UNIX and Linux, Volume I or the NetBackup Administrator's Guide for Windows, Volume I.
184
Table 7-7
Description
Configure the client in another domain and restore directly to the client. Create the client in the recovery domain and restore directly to the client. This is the most likely scenario. Perform an alternate client restore in the recovery domain.
Scenario 2
No
Scenario 3
No
The steps to recover the client are the same as any other client recovery. The actual steps depend on the client type, the storage type, and whether the recovery is an alternate client restore. For restores that use Granular Recovery Technology (GRT), an application instance must exist in the recovery domain. The application instance is required so that NetBackup has something to recover to.
Use NetBackup Host Properties to specify a Media host override server. All restore jobs for any storage unit on the original backup server use the media server you specify. Specify the same server for the Restore server as for the Original backup server. See Forcing restores to use a specific server in the NetBackup Administrator's Guide for UNIX and Linux, Volume I or the NetBackup Administrator's Guide for Windows, Volume I. This procedure sets the FORCE_RESTORE_MEDIA_SERVER option. Configuration options are stored in the bp.conf file on UNIX systems and the registry on Windows systems. Create the touch file USE_BACKUP_MEDIA_SERVER_FOR_RESTORE on the NetBackup master server in the following directory:
185
UNIX: usr/openv/netbackup/db/config Windows: install_path\veritas\netbackup\db\config This global setting always forces restores to the server that did the backup. It applies to all NetBackup restore jobs, not just deduplication restore jobs. If this touch file exists, NetBackup ignores the FORCE_RESTORE_MEDIA_SERVER and FAILOVER_RESTORE_MEDIA_SERVER settings.
Always use a different server. Use NetBackup Host Properties to specify a Media host override server. See the previous explanation about Media host override, except: Specify the different server for the Restore server. A single restore instance. Use the bprestore command with the -disk_media_server option. Restore jobs for each instance of the command use the media server you specify. See NetBackup Commands Reference Guide.
186
Chapter
Troubleshooting
This chapter includes the following topics:
About deduplication logs Troubleshooting installation issues Troubleshooting configuration issues Troubleshooting operational issues Viewing disk errors and events Deduplication event codes and messages
188
Description
The client deduplication proxy plug-in on the media server runs under bptm, bpstsinfo, and bpbrm processes. Examine the log files for those processes for proxy plug-in activity. The strings proxy or ProxyServer embedded in the log messages identify proxy server activity. They write log files to the following directories:
For bpstsinfo: Windows: /usr/openv/netbackup/logs/admin UNIX: /usr/openv/netbackup/logs/bpstsinfo Windows: install_path\Veritas\NetBackup\logs\admin Windows: install_path\Veritas\NetBackup\logs\stsinfo
The deduplication proxy server nbostpxy on the client writes messages to files in an eponymous directory, as follows: UNIX: /usr/openv/netbackup/logs/nbostpxy Windows: install_path\Veritas\NetBackup\logs\nbostpxy.
The following is the path name of the log file for the deduplication configuration script:
NetBackup creates this log file during the configuration process. If your configuration succeeded, you do not need to examine the log file. The only reason to look at the log file is if the configuration failed. If the configuration process fails after it creates and populates the storage directory, this log file identifies when the configuration failed. Deduplication database The deduplication database log file (postgresql.log) is in the storage_path/databases/pddb directory. You can configure log parameters. For more information, see the following: http://www.postgresql.org/docs/current/static/runtime-config-logging.html See Configuring deduplication log file timestamps on Windows on page 133.
189
Description
The NetBackup Deduplication Engine writes several log files, as follows:
The spoold.log file is the main log file The storaged.log file is for queue processing. A log file for each connection to the engine is stored in a directory in the storage path spoold directory. The following describes the pathname to a log file for a connection: hostname/application/TaskName/MMDDYY.log For example, the following is an example of a crcontrol connection log pathname on a Linux system: /storage_path/log/spoold/server.symantecs.org/crcontrol/Control/010112.log
Usually, the only reason to examine these connection log files is if a Symantec support representative asks you to. A VxUL log file for the events and errors that NetBackup receives from polling. The originator ID for the deduplication engine is 364. See About VxUL logs for deduplication on page 190. NetBackup Deduplication Manager The log files are in the /storage_path/log/spad directory, as follows:
spad.log sched_QueueProcess.log SchedClass.log A log file for each connection to the manager is stored in a directory in the storage path spad directory. The following describes the pathname to a log file for a connection: hostname/application/TaskName/MMDDYY.log For example, the following is an example of a bpstsinfo connection log pathname on a Linux system: /storage_path/log/spoold/server.symantecs.org/bpstsinfo/spad/010112.log Usually, the only reason to examine these connection log files is if a Symantec support representative asks you to.
You can set the log level and retention period in the Change Storage Server dialog box Properties tab. See Changing deduplication storage server properties on page 153. Deduplication plug-in You can configure the location and name of the log file and the logging level. To do so, edit the DEBUGLOG entry and the LOGLEVEL in the pd.conf file. See About the pd.conf configuration file for NetBackup deduplication on page 119. See Editing the pd.conf deduplication file on page 126.
190
N/A
The bpbrm backup and restore manager The bpdbm database manager The bptm tape manager for I/O operations
117
Device 111 configuration and monitoring Device 178 configuration and monitoring Device 202 configuration and monitoring Device 230 configuration and monitoring
The Disk Service Manager process that runs in the Enterprise Media Manager (EMM) process.
The storage server interface process that runs in the Remote Manager and Monitor Service. RMMS runs on media servers. The Remote Disk Service Manager interface (RDSM) that runs in the Remote Manager and Monitor Service. RMMS runs on media servers.
191
192
See Storage server configuration fails on page 192. See Database system error (220) on page 192. See Server not found error on page 193. See License information failure during configuration on page 193. See The disk pool wizard does not display a volume on page 194.
Diagnosis
The PDDE_initConfig script was invoked, but errors occurred during the storage initialization. First, examine the deduplication configuration script log file for references to the server name. See About deduplication logs on page 187.. Second, examine the tpconfig command log file errors about creating the credentials for the server name. The tpconfig command writes to the standard NetBackup administrative commands log directory.
193
Diagnosis
Possible root causes: When you configured the storage server, you selected a media server that runs an unsupported operating system. All media servers in your environment appear in the Storage Server Configuration Wizard; be sure to select only a media server that runs a supported operating system. If you used the nbdevconfig command to configure the storage server, you may have typed the host name incorrectly. Also, case matters for the storage server type, so ensure that you use PureDisk for the storage server type.
1 2 3 4 5
In the NetBackup Administration Console, expand NetBackup Management > Host Properties > Master Servers. In the details pane, select the master server. On the Actions menu, select Properties. In the Master Server Properties dialog box, select the Network Settings properties. Select one of the following options:
194
For a description of these options, see the NetBackup online Help or the administrator's guide. To prohibit reverse host name lookup by using the bpsetconfig command
Enter the following command on each media server that you use for deduplication:
echo REVERSE_NAME_LOOKUP = PROHIBITED | bpsetconfig -h host_name
The bpsetconfig command resides in the following directories: UNIX: /usr/openv/netbackup/bin/admincmd Windows: install_path\Veritas\NetBackup\bin\admincmd
195
Error 800: Disk Volume Examine the disk error logs to determine why the volume was marked DOWN. is Down If the storage server is busy with jobs, it may not respond to master server disk polling requests in a timely manner. A busy load balancing server also may cause this error. Consequently, the query times out and the master server marks the volume DOWN. If the error occurs for an optimized duplication job: verify that source storage server is configured as a load balancing server for the target storage server. Also verify that the target storage server is configured as a load balancing server for the source storage server. See Viewing disk errors and events on page 202.
196
Description
Windows servers only. The NetBackup Deduplication Manager (spad.exe) and the NetBackup Deduplication Engine (spoold.exe) have different shared memory configuration values. This problem can occur when you use a command to change the shared memory value of only one of these two components. To resolve the issue, specify the following shared memory value in the configuration file: SharedMemoryEnabled=1 Then, restart both components. Do not change the values of the other two shared memory parameters. The SharedMemoryEnabled parameter is stored in the following file: storage_path\etc\puredisk\agent.cfg
media manager - system If the job details also includes errors similar to the following, it indicates that an error occurred (174) image clean-up job failed: Critical failed: Critical failed: bpdm (pid=610364) sts_delete_image error 2060018 file not found bpdm (pid=610364) image delete error 2060018: file not found
This error occurs if a deduplication backup job fails after the job writes part of the backup to the Media Server Deduplication Pool. NetBackup starts an image cleanup job, but that job fails because the data necessary to complete the image clean-up was not written to the Media Server Deduplication Pool. Deduplication queue processing cleans up the image objects, so you do not need to take corrective action. However, examine the job logs and the deduplication logs to determine why the backup job failed. See About deduplication queue processing on page 174.
197
Also, Symantec recommends that you use fully-qualified domain names for your NetBackup environment. See Use fully qualified domain names on page 54.
Invoke the following command on the master server or the media server that functions as the deduplication storage server: UNIX: /usr/openv/netbackup/bin/admincmd/nbdevquery -listdv -stype
PureDisk -U
Mount the file system After a brief period of time, the volume state changes to UP. No further action is required.
198
Set the memory size of each virtual machine to double the physical memory of the host. Set the minimum and the maximum values of each virtual machine to the same value (double the physical memory of the host). These memory settings prevent the virtual memory from becoming fragmented on the disk because it does not grow or shrink.
These recommendations may not be the best configuration for every virtual machine. However, Symantec recommends that you try this solution first when troubleshooting performance issues.
To delete the disk pool, you must first delete the image fragments. The nbdelete command deletes expired image fragments from disk volumes. To delete the fragments of expired images
199
The -allvolumes option deletes expired image fragments from all volumes that contain them. The -force option removes the database entries of the image fragments even if fragment deletion fails.
Determine if incomplete SLP duplication jobs exist by running the following command on the master server: UNIX: install_path\NetBackup\bin\admincmd\nbstlutil stlilist
-incomplete
Cancel the incomplete jobs by running the following command for each backup ID returned by the previous command (xxxxx represents the backup ID): UNIX: install_path\NetBackup\bin\admincmd\nbstlutil cancel
-backupid xxxxx
200
Diagnosis
Examine what caused the core media server deduplication processes (spad and spoold) to be unresponsive. Were they temporarily busy (such as queue processing in progress)? Do too many jobs run concurrently? See About deduplication performance on page 52. Status 83 is a generic error for the duplications. The NetBackup bpdm log provides additional information for determining the specific issue.
201
Client-side deduplication can fail if the client cannot resolve the host name of the server. More specifically, the error can occur if the storage server was configured with a short name and the client tries to resolve a fully qualified domain name. To determine which name the client uses for the storage server, examine the deduplication host configuration file on the client. See About the deduplication host configuration file on page 132. To fix this problem, configure your network environment so that all permutations of the storage server name resolve. Symantec recommends that you use fully qualified domain names. See Use fully qualified domain names on page 54.
202
The Disk Logs report. See Viewing disk reports on page 145. The NetBackup bperror command with the -disk option reports on disk errors. The command resides in the following directories: UNIX: /usr/openv/netbackup/bin/admincmd Windows: install_path\Veritas\NetBackup\bin\admincmd
1001
Error
1002
Warning
1003
Error
1004
Critical
203
1008
Error
1009
Authorization Authorization request from <IP> for user <USER> denied (<REASON>). Error Task initialization on server PureDisk:server1.symantecs.org on host server1.symantecs.org got an unexpected error. Task ended on server PureDisk:server1.symantecs.org on host server1.symantecs.org. A request for agent task was denied on server PureDisk:server1.symantecs.org on host server1.symantecs.org. Task session start request on server PureDisk:server1.symantecs.org on host server1.symantecs.org got an unexpected error. Task creation failed, could not initialize task class on server PureDisk:server1.symantecs.org on host server1.symantecs.org.
1010
1011
Error
1012
Error
1014
Critical
1015
Critical
204
1018
16
Info
1019
Critical
1020
Critical
1028
Critical
1029
Critical
205
1031
Critical
1032
Critical
1036
Warning
1037
Warning
1040
Error
1043
Error
1044
multiple
multiple
1047
Error
206
Deduplication event codes and messages (continued) Event Severity NetBackup Message example Severity
A data corruption has been detected. The data consistency check detected a data loss or data corruption in the Media Server Deduplication Pool (MSDP) and reported the affected backups. Search storaged.log on the server for the affected backups and contact technical support. A data inconsistency has been detected and corrected automatically. Explanation: The data consistency check detected a potential data loss and fixed it automatically in the Media Server Deduplication Pool (MSDP). Search the storaged.log file on the pertinent media server. Contact support to investigate the root cause if the problem persists. Error Low space threshold exceeded on the partition containing the storage database on server PureDisk:server1.symantecs.org on host server1.symantecs.org.
1058
2000
Chapter
Replacing the deduplication storage server host computer Recovering from a deduplication storage server disk failure Recovering from a deduplication storage server failure Recovering the storage server after NetBackup catalog recovery About uninstalling media server deduplication Removing media server deduplication
Use the same host name or a different name. Use the same network interface (if the original server used a specific network interface) or a different network interface. Alternatively, you do not have to use a specific network interface.
208
Host replacement, recovery, and uninstallation Replacing the deduplication storage server host computer
Use the same storage path or a different storage path. If you use a different storage path, you must move the deduplication storage to that new location.
Warning: The new host must use the same operating system and the same byte order as the old host. If it does not, you cannot access the deduplicated data. In computing, endianness describes the byte order that represents data: big endian and little endian. For example, SPARC processors and Intel processors use different byte orders. Therefore, you cannot replace an Oracle Solaris SPARC host with an Oracle Solaris host that has an Intel processor. Table 9-1 Step
Step 1
Task
Expire the backup images
Warning: Do not delete the images. They are imported back into NetBackup
later in this process. If you use the bpexpdate command to expire the backup images, use the -nodelete parameter. See the NetBackup Administrator's Guide for UNIX and Linux, Volume I See the NetBackup Administrator's Guide for Windows, Volume I. Step 2 Delete the storage units that See the NetBackup Administrator's Guide for UNIX and Linux, Volume I use the disk pool See the NetBackup Administrator's Guide for Windows, Volume I. Delete the disk pool Delete the deduplication storage server See Deleting a deduplication disk pool on page 172. See Deleting a deduplication storage server on page 158.
Step 3 Step 4
Step 5
Delete the deduplication host Each load balancing server contains a deduplication host configuration configuration file file. If you use load balancing servers, delete the deduplication host configuration file from those servers. See Deleting a deduplication host configuration file on page 132.
Step 6
If you have load balancing servers, delete the NetBackup Deduplication Engine credentials on those media servers. See Deleting credentials from a load balancing server on page 161.
Host replacement, recovery, and uninstallation Recovering from a deduplication storage server disk failure
209
Task
Configure the new host so it When you configure the new host, you can do the following: meets deduplication Use the same host name or a different name. requirements Use the same network interface (if the original server used a specific network interface) or a different network interface. Alternatively, you do not have to use a specific network interface. Use the same storage path or a different storage path. If you use a different storage path, you must move the deduplication storage to that new location. See About NetBackup deduplication servers on page 25. See About deduplication server requirements on page 26.
Step 8
Use the storage path that you configured for this replacement host. See the computer or the storage vendor's documentation.
Step 9
Install the NetBackup media See the NetBackup Installation Guide for UNIX and Linux. server software on the new See the NetBackup Installation Guide for Windows. host Reconfigure deduplication Import the backup images See Configuring NetBackup media server deduplication on page 76. See the NetBackup Administrator's Guide for UNIX and Linux, Volume I See the NetBackup Administrator's Guide for Windows, Volume I.
Step 10 Step 12
210
Host replacement, recovery, and uninstallation Recovering from a deduplication storage server disk failure
Symantec recommends that you use NetBackup to protect the deduplication storage server system or program disks. You then can use NetBackup to restore that media server if the disk on which NetBackup resides fails and you have to replace it. Table 9-2 Step
Step 1
Task
Replace the disk.
Step 2
Ensure that the storage and database are mounted at the same locations. See the storage vendor's documentation.
Step 3
See the NetBackup Installation Guide for UNIX and Linux. See the NetBackup Installation Guide for Windows. See About the deduplication license key on page 72.
Step 4
Delete the deduplication host Each load balancing server contains a deduplication host configuration configuration file file. If you use load balancing servers, delete the deduplication host configuration file from those servers. See Deleting a deduplication host configuration file on page 132.
Step 5
If you have load balancing servers, delete the NetBackup Deduplication Engine credentials on those media servers. See Deleting credentials from a load balancing server on page 161.
Step 6
Add the credentials to the storage server Get a configuration file template
Add the NetBackup Deduplication Engine credentials to the storage server. See Adding NetBackup Deduplication Engine credentials on page 160. If you did not save a storage server configuration file before the disk failure, get a template configuration file. See Saving the deduplication storage server configuration on page 129.
Step 7
Step 8 Step 9
Configure the storage server Configure the storage server by uploading the configuration from the file you edited. See Setting the deduplication storage server configuration on page 131.
Step 10
If you use load balancing servers in your environment, add them to your configuration. See Adding a deduplication load balancing server on page 117.
Host replacement, recovery, and uninstallation Recovering from a deduplication storage server failure
211
Use the same host name or a different name. Use the same network interface (if the original server used a specific network interface) or a different network interface. Alternatively, you do not have to use a specific network interface. Use the same storage path or a different storage path. If you use a different storage path, you must move the deduplication storage to that new location.
Warning: The new host must use the same operating system and the same byte order as the old host. If it does not, you cannot access the deduplicated data. In computing, endianness describes the byte order that represents data: big endian and little endian. For example, SPARC processors and Intel processors use different byte orders. Therefore, you cannot replace an Oracle Solaris SPARC host with an Oracle Solaris host that has an Intel processor. Table 9-3 Step
Step 1
Task
Expire the backup images
Warning: Do not delete the images. They are imported back into NetBackup
later in this process. If you use the bpexpdate command to expire the backup images, use the -nodelete parameter. See the NetBackup Administrator's Guide for UNIX and Linux, Volume I See the NetBackup Administrator's Guide for Windows, Volume I. Step 2 Delete the storage units that See the NetBackup Administrator's Guide for UNIX and Linux, Volume I use the disk pool See the NetBackup Administrator's Guide for Windows, Volume I. Delete the disk pool Delete the deduplication storage server See Deleting a deduplication disk pool on page 172. See Deleting a deduplication storage server on page 158.
Step 3 Step 4
212
Host replacement, recovery, and uninstallation Recovering the storage server after NetBackup catalog recovery
Task
Delete the deduplication host Each load balancing server contains a deduplication host configuration configuration file file. If you use load balancing servers, delete the deduplication host configuration file from those servers. See Deleting a deduplication host configuration file on page 132.
Step 6
If you have load balancing servers, delete the NetBackup Deduplication Engine credentials on those media servers. See Deleting credentials from a load balancing server on page 161.
Step 7
Configure the new host so it When you configure the new host, you can do the following: meets deduplication Use the same host name or a different name. requirements Use the same network interface (if the original server used a specific network interface) or a different network interface. Alternatively, you do not have to use a specific network interface. Use the same storage path or a different storage path. If you use a different storage path, you must make the storage available over that storage path. See About NetBackup deduplication servers on page 25. See About deduplication server requirements on page 26.
Step 8
Use the storage path that you configured for this replacement host. See the computer or the storage vendor's documentation.
Step 9
Install the NetBackup media See the NetBackup Installation Guide for UNIX and Linux. server software on the new See the NetBackup Installation Guide for Windows. host Reconfigure deduplication You must use the same credentials for the NetBackup Deduplication Engine. See Configuring NetBackup media server deduplication on page 76.
Step 10
Step 11
See the NetBackup Administrator's Guide for UNIX and Linux, Volume I See the NetBackup Administrator's Guide for Windows, Volume I.
Host replacement, recovery, and uninstallation About uninstalling media server deduplication
213
See Setting the deduplication storage server configuration on page 131. Symantec recommends that you save your storage server configuration. See Save the deduplication storage server configuration on page 59. Information about recovering the master server is available. See the NetBackup Troubleshooting Guide.
Reconfigure an existing deduplication environment. See Changing the deduplication storage server name or storage path on page 155. Deactivate deduplication and remove the configuration files and the storage files. See Removing media server deduplication on page 213.
Task
Remove client deduplication Remove the clients that deduplicate their own data from the client deduplication list. See Disabling client-side deduplication for a client on page 173.
214
Task
Delete the storage units that See the NetBackup Administrator's Guide for UNIX and Linux, Volume I use the disk pool See the NetBackup Administrator's Guide for Windows, Volume I.. Delete the disk pool Delete the deduplication storage server See Deleting a deduplication disk pool on page 172. See Deleting a deduplication storage server on page 158. Deleting the deduplication storage server does not alter the contents of the storage on physical disk. To protect against inadvertent data loss, NetBackup does not automatically delete the storage when you delete the storage server. Delete the deduplication configuration. See Deleting the deduplication storage server configuration on page 159.
Step 3 Step 4
Step 5
Step 6
Delete the deduplication host Each load balancing server contains a deduplication host configuration configuration file file. If you use load balancing servers, delete the deduplication host configuration file from those servers. See Deleting a deduplication host configuration file on page 132.
Step 7
Delete the storage directory Delete the storage directory and database directory. (Using a separate and the database directory database directory was an option when you configured deduplication.)
Warning: If you delete the storage directory and valid NetBackup images
reside on the deduplication storage, data loss may occur. See the operating system documentation.
Chapter
10
Deduplication architecture
This chapter includes the following topics:
Deduplication storage server components Media server deduplication process Deduplication client components Clientside deduplication backup process About deduplication fingerprinting Data removal process
216
Figure 10-1
Deduplication plug-in
Storage path
Catalog plug-in
Database application
Table 10-1 describes the components. Table 10-1 Component NetBackup deduplication components Description
Deduplication plug-in The deduplication plug-in is the data interface to the NetBackup Deduplication Engine on the storage server. The deduplication plug-in does the following:
Separates the files metadata from the files content. Deduplicates the content (separates files into segments ). Controls the data stream from NetBackup to the NetBackup Deduplication Engine and vice versa.
The plug-in runs on the deduplication storage server. The plug-in also runs on load balancing servers and on the clients that deduplicate their own data. NetBackup Deduplication Engine The NetBackup Deduplication Engine is one of the storage server core components. It stores and manages deduplicated file data. The binary file name is spoold, which is short for storage pool daemon; do not confuse it with a print spooler daemon. The spoold process appears as the NetBackup Deduplication Engine in the NetBackup Administration Console.
217
Catalog plug-in
The catalog plug-in implements a standardized catalog API, which lets the NetBackup Deduplication Engine communicate with the back-end database process. The catalog plug-in translates deduplication engine catalog calls into the calls that are native to the back-end database application. The database application communicates with the catalog plug-in. The database application writes data to and reads data from the database. The deduplication database stores and manages the metadata of deduplicated files. The metadata includes a unique fingerprint that identifies the files content. The metadata also includes information about the file such as its owner, where it resides on a client, when it was created, and other information. NetBackup uses the PostgresSQL database for the deduplication database. You can use the NetBackup bpps command to view the database process (postgres). The deduplication database is separate from the NetBackup catalog. The NetBackup catalog maintains the usual NetBackup backup image information.
Database application
Deduplication database
218
Figure 10-2
Master server
nbjm bpdbm
bpbrm
bptm
Client
The following list describes the backup process when a media server deduplicates the backups and the destination is a Media Server Deduplication Pool:
The NetBackup Job Manager (nbjm) starts the Backup/Restore Manager (bpbrm) on a media server. The Backup/Restore Manager starts the bptm process on the media server and the bpbkar process on the client. The Backup/Archive Manager (bpbkar) on the client generates the backup images and moves them to the media server bptm process. The Backup/Archive Manager also sends the information about files within the image to the Backup/Restore Manager (bpbrm). The Backup/Restore Manager sends the file information to the bpdbm process on the master server for the NetBackup database.
The bptm process moves the data to the deduplication plug-in. The deduplication plug-in retrieves a list of fingerprints from the last full backup for the client from the NetBackup Deduplication Engine. The list is used as a cache so the plug-in does not have to request each fingerprint from the engine. The deduplication plug-in performs file fingerprinting calculations. The deduplication plug-in compares the file fingerprints and the segment fingerprints against the fingerprint list in its cache.
219
The deduplication plug-in sends only unique data segments to the NetBackup Deduplication Engine on the storage server. The NetBackup Deduplication Engine writes the data to the Media Server Deduplication Pool.
Figure 10-3 shows the backup process when a media server deduplicates the backups. The destination is a PureDisk Deduplication Pool. A description follows. Figure 10-3
Master server nbjm bpdbm
bpbrm
bptm
bpbkar
The following list describes the backup process when a media server deduplicates the backups and the destination is a PureDisk Deduplication Pool:
The NetBackup Job Manager (nbjm) starts the Backup/Restore Manager (bpbrm) on a media server. The Backup/Restore Manager starts the bptm process on the media server and the bpbkar process on the client). The Backup/Archive Manager (bpbkar) generates the backup images and moves them to the media server bptm process. The Backup/Archive Manager also sends the information about files within the image to the Backup/Restore Manager (bpbrm). The Backup/Restore Manager sends the file information to the bpdbm process on the master server for the NetBackup database.
The bptm process moves the data to the deduplication plug-in. The deduplication plug-in retrieves a list of fingerprints from the last full backup for the client from the PureDisk storage pool authority. The list is used as a cache so the plug-in does not have to request each fingerprint individually.
220
The deduplication plug-in compares the file fingerprints and the segment fingerprints against the fingerprint list in its cache. The deduplication plug-in performs file fingerprinting calculations. The deduplication plug-in sends only unique data segments to the PureDisk Deduplication Pool.
Description
The deduplication plug-in is the data interface to the NetBackup Deduplication Engine on the deduplication storage server. The deduplication plug-in does the following:
Deduplicates the content (separates files into segments ). Controls the data stream from NetBackup to the NetBackup Deduplication Engine and vice versa. Proxy server Client The OpenStorage proxy server (nbostpxy) manages control communication with the media server.
Proxy plugin
Media server The proxy plug-in manages control communication with the client.
221
Figure 10-4
Master server nbjm
plug-in
bpbrm
Proxy plug-in
bptm
The following list describes the backup process for a deduplication client to a media server deduplication pool:
The NetBackup Job Manager (nbjm) starts the Backup/Restore Manager (bpbrm) on a media server. The Backup/Restore Manager probes the client to determine if it is configured and ready for deduplication. If the client is ready, the Backup/Restore Manager starts the following processes: The OpenStorage proxy server (nbostpxy) on the client and the data moving processes (bpbkar) on the client and bptm on the media server). NetBackup uses the proxy plug-in on the media server to route control information from bptm to nbostpxy.
The Backup/Archive Manager (bpbkar) generates the backup images and moves them to the client nbostpxy process by shared memory. The Backup/Archive Manager also sends the information about files within the image to the Backup/Restore Manager (bpbrm). The Backup/Restore Manager sends the file information to the bpdbm process on the master server for the NetBackup database.
The client nbostpxy process moves the data to the deduplication plug-in.
222
The deduplication plug-in retrieves a list of fingerprints from the last full backup for the client from the NetBackup Deduplication Engine. The list is used as a cache so the plug-in does not have to request each fingerprint from the engine. The deduplication plug-in performs file fingerprinting calculations. The deduplication plug-in sends only unique data segments to the storage server, which writes the data to the media server deduplication pool.
Figure 10-5 shows the backup process of a client that deduplicates its own data. The destination is a PureDisk storage pool. A description follows. Figure 10-5
Master server nbjm bpdbm bpbkar Proxy server (nbostpxy) Deduplication
plug-in
bpbrm
Proxy plug-in
The following list describes the backup process for a deduplication client to a media server deduplication pool:
The NetBackup Job Manager (nbjm) starts the Backup/Restore Manager (bpbrm) on a media server. The Backup / Restore Manager probes the client to determine if it is configured and ready for deduplication. If the client is ready, the Backup/Restore Manager starts the following processes: The OpenStorage proxy server (nbostpxy) on the client and the data moving processes (bpbkar on the client and bptm on the media server).
223
NetBackup uses the proxy plug-in on the media server to route control information from bptm to nbostpxy.
The Backup/Archive Manager (bpbkar) generates the backup images and moves them to the client nbostpxy process by shared memory. The Backup/Archive Manager also sends the information about files within the image to the Backup/Restore Manager (bpbrm). The Backup/Restore Manager sends the file information to the bpdbm process on the master server for the NetBackup database.
The client nbostpxy process moves the data to the deduplication plug-in. The deduplication plug-in retrieves a list of fingerprints from the last full backup for the client from the NetBackup Deduplication Engine. The list is used as a cache so the plug-in does not have to request each fingerprint from the engine. The deduplication plug-in performs file fingerprinting calculations. The deduplication plug-in sends only unique data segments to the PureDisk storage pool.
The deduplication plug-in reads the backup image and separates the image into files. The plug-in separates files into segments. For each segment, the plug-in calculates the hash key (or fingerprint) that identifies each data segment. To create a hash, every byte of data in the segment is read and added to the hash. The plug-in compares its calculated fingerprints to the fingerprints that the NetBackup Deduplication Engine stores on the media server. Two segments that have the same fingerprint are duplicates of each other. The plug-in sends unique segments to the deduplication engine to be stored. A unique segment is one for which a matching fingerprint does not exist in the engine already.
224
The first backup may have a 0% deduplication rate; however, a 0% deduplication rate is unlikely. Zero percent means that all file segments in the backup data are unique.
The NetBackup Deduplication Engine saves the fingerprint information for that backup.
The deduplication plug-in retrieves a list of fingerprints from the last full backup for the client from the NetBackup Deduplication Engine. The list is used as a cache so the plug-in does not have to request each fingerprint from the engine. The deduplication plug-in reads the backup image and separates the image into files. The deduplication plug-in separates files into segments and calculates the fingerprint for each file and segment. The plug-in compares each fingerprint against the local fingerprint cache. If the fingerprint is not known in the cache, the plug-in requests that the engine verify if the fingerprint already exists. If the fingerprint does not exist, the segment is sent to the engine. If the fingerprint exists, the segment is not sent.
The fingerprint calculations are based on the MD5 algorithm. However, any segments that have different content but the same MD5 hash key get different fingerprints. So NetBackup prevents MD5 collisions.
NetBackup removes the image record from the NetBackup catalog. NetBackup directs the NetBackup Deduplication Manager to remove the image. The deduplication manager immediately removes the image entry and adds a removal request for the image to the database transaction queue. From this point on, the image is no longer accessible. When the queue is next processed, the NetBackup Deduplication Engine executes the removal request. The engine also generates removal requests for underlying data segments At the successive queue processing, the NetBackup Deduplication Engine executes the removal requests for the segments.
225
Storage is reclaimed after two queue processing runs; that is, in one day. However, data segments of the removed image may still be in use by other images. If you manually delete an image that has expired within the previous 24 hours, the data becomes garbage. It remains on disk until removed by the next garbage collection process. See About deduplication queue processing on page 174. See Deleting backup images on page 173.
226
Appendix
About NetBackup appliance deduplication About Fibre Channel to a NetBackup 5020 appliance Enabling Fibre Channel to a NetBackup 5020 appliance Disabling Fibre Channel to a NetBackup 5020 appliance Displaying NetBackup 5020 appliance Fibre Channel port information
NetBackup 5200 series of enterprise backup appliances that is based on the NetBackup backup platform. The 5200 series provides between 4 TB to 32 TB of deduplication storage. A NetBackup 5200 series appliance can be a destination for optimized duplication from a NetBackup Media Server Deduplication Pool.
228
NetBackup 5000 series of scalable deduplication appliances that is based on the PureDisk backup platform. The 5000 series is scalable from 16 TB to 192 TB of storage. A NetBackup 5000 series can be a storage destination for both the NetBackup Client Deduplication Option and the NetBackup Media Server Deduplication Option.
Easy to install, configure, and use. Modular capacity to fulfill your storage needs. A solution for the datacenter, remote office and branch office, and virtual machine backups. Source or target deduplication. Optimized synthetic backup to minimize data movement. Tape support for long-term data retention. Built in disk to disk replication for disaster recovery and an alternative solution to tape based vaulting Enterprise-class hardware and software. Hardware monitoring with a call home feature.
A target for NetBackup client backups. A target for optimized duplication from a NetBackup Media Server Deduplication Pool. A source for optimized duplication to another deduplication destination.
For any operation that involves NetBackup PureDisk in this guide, PureDisk means both a traditional PureDisk environment and a NetBackup 5020 appliance. Requirements for the NetBackup media server:
229
See the NetBackup operating system compatibility list on the NetBackup landing page on the Symantec support Web site.
Command traffic travels over the IP network. If no Fibre Channel connection is available, backup data travels over the IP network. For each job, the job details show the amount of data that is transferred over Fibre Channel. See Viewing deduplication job details on page 138. For information about configuring the appliance and zoning the Fibre Channel SAN, see the appliance documentation.
As the root user, run the dedup_fcmanager.sh script with the -e option as in the following example:
/usr/openv/pdde/pdconfigure/scripts/support/dedup_fcmanager.sh -e WARNING: Enabling/disabling Fibre Channel transport may require spad to be restarted. Do you want to continue? [y/n] y FC transport enabled
230
As the root user, run the dedup_fcmanager.sh script with the -d option as in the following example:
/usr/openv/pdde/pdconfigure/scripts/support/dedup_fcmanager.sh -d WARNING: Enabling/disabling Fibre Channel transport may require spad to be restarted. Do you want to continue? [y/n] y Restarting services FC transport disabled
Port information. See To display Fibre Channel port information on a NetBackup 5020 appliance on page 231. Statistics. See To display Fibre Channel statistics on a NetBackup 5000 series appliance on page 232.
By default, the top port (port number 1) of the FC HBA in the appliance is configured in the target mode. Before you display the port information, ensure that the target ports of the desired NetBackup 5000 series appliance are the only target ports in the Fibre Channel zone. See About Fibre Channel to a NetBackup 5020 appliance on page 228.
NetBackup appliance deduplcation Displaying NetBackup 5020 appliance Fibre Channel port information
231
As the root user, run the dedup_fcmanager.sh script with the -r option as in the following example:
/usr/openv/pdde/pdconfigure/scripts/support/dedup_fcmanager.sh -r **** Ports **** Bus ID Port WWN 06:00.0 06:00.1 06:00.0 06:00.1 21:00:00:24:FF:xx:xx:xx 21:00:00:24:FF:xx:xx:xx 21:00:00:24:FF:xx:xx:xx 21:00:00:24:FF:xx:xx:xx
Dev Num 3 4 5 6
Remote Ports
**** VLAN **** The result is based on the scan at Sun, Jan 1 00:00:01 CST 2012 /dev/sg3 192.168.0.2 /dev/sg8 192.168.1.2 /dev/sg5 192.168.0.3 **** Fibre Channel Transport **** Replication over Fibre Channel is disabled Backup/Restore over Fibre Channel is disabled
This output shows the target mode Fibre Channel ports and the hosts to which Fibre Channel traffic can travel.
232
NetBackup appliance deduplcation Displaying NetBackup 5020 appliance Fibre Channel port information
As the root user, run the dedup_fcmanager.sh script with the -t option and interval and repeat arguments. The following command example lists the statistics five times with a one second interval between them:
usr/openv/pdde/pdconfigure/scripts/support/dedup_fcmanager.sh -t 1 5 Port 5 6 Port 5 6 Port 5 6 Port 5 6 Port 5 6 I/O R(count/s) 0 0 I/O R(count/s) 0 2823 I/O R(count/s) 0 2105 I/O R(count/s) 0 2130 I/O R(count/s) 0 2108 I/O W(count/s) 0 0 I/O W(count/s) 0 12702 I/O W(count/s) 0 9557 I/O W(count/s) 0 9597 I/O W(count/s) 0 9632 I/O R(KB/s) 0 0 I/O R(KB/s) 0 0 I/O R(KB/s) 0 0 I/O R(KB/s) 0 0 I/O R(KB/s) 0 0 I/O W(KB/s) 0 0 I/O W(KB/s) 0 17144 I/O W(KB/s) 0 13070 I/O W(KB/s) 0 13161 I/O W(KB/s) 0 13136
Index
A
about NetBackup deduplication 13 about NetBackup deduplication options 13 about the deduplication host configuration file 132 appliance deduplication 1415 attributes clearing deduplication pool 170 clearing deduplication storage server 154 OptimizedImage 35 setting deduplication pool 164 setting deduplication storage server 152 viewing deduplication pool 163 viewing deduplication storage server 151 Auto Image Replication nbstserv 105 overview 47 using MSDP 98
B
backup client deduplication process 220 big endian 208, 211 bpstsinfo command 100 byte order 208, 211
C
cache hits field of the job details 140 capacity adding storage 70 capacity and usage reporting for deduplication 141 Capacity managed retention type 110 changing deduplication server hostname 155 changing the deduplication storage server name and path 155 CIFS 67 clearing deduplication pool attributes 170 client deduplication about 28 components 220 disabling for a specific client 173
client deduplication (continued) host requirements 30 limitations 30 sizing the systems 21 Common Internet File System 67 compacting container files 145 compression and deduplication 34 pd.conf file setting 121 configuring a deduplication pool 81 configuring a deduplication storage server 79 configuring a deduplication storage unit 83 configuring deduplication 76, 78 container files about 143 compaction 145 viewing capacity within 143 contentrouter.cfg file about 127 parameters for data integrity checking 179 RebaseQuota parameter 181 RebaseScatterThreshold parameter 181 ServerOptions for encryption 88 CR sent field of the job details 141 credentials 32 adding NetBackup Deduplication Engine 160 changing NetBackup Deduplication Engine 161
D
data classifications in storage lifecycle policies 107 data integrity checking about deduplication 175 configuring behavior for deduplication 176 configuring settings for deduplication 178 data removal process for deduplication 224 database system error 192 deactivating media server deduplication 213 dedup field of the job details 141
234
Index
deduplication about credentials 32 about fingerprinting 223 about the license key 72 adding credentials 160 cache hits field of the job details 140 capacity and usage reporting 141 changing credentials 161 client backup process 220 compression 34 configuration file 119 configuring 76, 78 configuring optimized synthetic backups 89 container files 143 CR sent field of the job details 141 data removal process 224 dedup field of the job details 141 encryption 34 event codes 202 how it works 15 license key for 72 licensing 72 limitations 28 media server process 217 network interface 33 node 26 performance 52 planning deployment 20 requirements for optimized within the same domain 37 scaling 55 scanned field of the job details 141 storage capacity 68 storage destination 22 storage management 70 storage paths 69 storage requirements 66 storage unit properties 85 stream rate field of the job details 141 supported systems 21 deduplication configuration file editing 126 settings 119 deduplication data integrity checking about 175 configuring behavior for 176 configuring settings 178 deduplication database about 217
deduplication database (continued) log file 188 deduplication deduplication pool. See deduplication pool deduplication disk volume changing the state 171 determining the state 171 deduplication encryption enabling 87 deduplication host configuration file about 132 deleting 132 deduplication hosts and firewalls 33 client requirements 30 load balancing server 26 server requirements 26 storage server 25 deduplication logs about 187 client deduplication proxy plug-in log 188 client deduplication proxy server log 188 configuration script 188 deduplication database 188 deduplication plug-in log 189 NetBackup Deduplication Engine 189 NetBackup Deduplication Manager 189 VxUL deduplication logs 190 deduplication node about 26 adding a load balancing server 117 removing a load balancing server 157 deduplication optimized synthetic backups about 35 deduplication plug-in about 216 log file 189 deduplication pool. See deduplication pool about 79 changing properties 165 changing the state 162 clearing attributes 170 configuring 81 deleting 172 determining the state 162 properties 81 setting attributes 164 viewing 162 viewing attributes 163
Index
235
deduplication port usage about 33 troubleshooting 200 deduplication processes do not start 195 deduplication rate how file size affects 54 monitoring 137 deduplication registry resetting 132 deduplication servers about 25 components 215 host requirements 26 deduplication storage capacity about 68 viewing capacity in container files 143 deduplication storage destination 22 deduplication storage paths 69 deduplication storage requirements 66 deduplication storage server about 25 change the name 155 changing properties 153 clearing attributes 154 components 215 configuration failure 193 configuring 79 defining target for Auto Image Replication 49 deleting 158 deleting the configuration 159 determining the state 150 editing configuration file 129 getting the configuration 129 recovery 211 replacing the host 207 setting attributes 152 setting the configuration 131 viewing 150 viewing attributes 151 deduplication storage server configuration file about 128 deduplication storage server name changing 155 deduplication storage type 22 Deduplication storage unit Only use the following media servers 85 Use any available media server 85 deleting backup images 173 deleting deduplication host configuration file 132
disaster recovery protecting the data 58 recovering the storage server after catalog recovery 212 disk failure deduplication storage server 209 disk logs 146 disk logs report 143 disk pool cannot delete 198 disk pool status report 143, 146 disk storage unit report 146 Disk type 85 disk volume changing the state 171 determining the state of a deduplication 171 volume state changes to down 197 domains replicating backups to another. See Auto Image Replication
E
Enable file recovery from VM backup 111 encryption and deduplication 34 enabling for deduplication 87 pd.conf file setting 122 endian big 208, 211 little 208, 211 event codes deduplication 202 Expire after copy retention type 110
F
file system CIFS 67 NFS 67 Veritas File System for deduplication storage 70 fingerprinting about deduplication 223 firewalls and deduplication hosts 33 Fixed retention type 110 FlashBackup policy Maximum fragment size (storage unit setting) 85 FQDN or IP Address property in Resilient Network host properties 114
236
Index
G
garbage collection. See queue processing
H
host requirements 26 how deduplication works 15
I
images on disk report 145 Import operation 104 initial seeding 53 iSCSI 67
media server deduplication (continued) sizing the systems 21 Media Server Deduplication Option about 23 Media Server Deduplication Pool 98 media server deduplication pool. See deduplication pool migrating from PureDisk to NetBackup deduplication 61 migrating to NetBackup deduplication 62 Mirror retention type 110 MSDP replication about 36
N L
license information failure for deduplication 193 license key for deduplication 72 licensing deduplication 72 limitations media server deduplication 28 little endian 208, 211 load balancing server about 26 adding to a deduplication node 117 for deduplication 26 removing from deduplication node 157 logs about deduplication 187 client deduplication proxy plug-in log 188 client deduplication proxy server log 188 deduplication configuration script log 188 deduplication database log 188 deduplication plug-in log 189 disk 146 NetBackup Deduplication Engine log 189 NetBackup Deduplication Manager log 189 VxUL deduplication logs 190 nbstserv process 105 NetBackup naming conventions 22 NetBackup 5000 series appliance as a storage destination 23 NetBackup appliance deduplication 14 NetBackup Client Deduplication Option 14 NetBackup deduplication about 13 license key for 72 NetBackup Deduplication Engine about 216 about credentials 32 adding credentials 160 changing credentials 161 logs 189 NetBackup Deduplication Manager about 217 logs 189 NetBackup deduplication options 13 NetBackup Media Server Deduplication Option 14 network interface for deduplication 33 NFS 67 node deduplication 26
M
maintenance processing. See queue processing Maximum concurrent jobs 86 Maximum fragment size 85 Maximum snapshot limit retention type 110 media server deduplication process 217
O
OpenStorage appliance deduplication 15 optimized deduplication configuring bandwidth 89 optimized deduplication copy configuring 94
Index
237
optimized deduplication copy (continued) guidance for 39 limitations 38 push configuration 41 separate network for 92 optimized MSDP deduplication about the media server in common within the same domain 39 pull configuration within the same domain 45 push configuration within the same domain 39 requirements 37 within the same domain 37 optimized MSDP duplication about 36 optimized synthetic backups configuring for deduplication 89 deduplication 35 OptimizedImage attribute 35
P
pd.conf file about 119 editing 126 settings 119 pdde-config.log 188 performance deduplication 52 monitoring deduplication rate 137 policies changing properties 111112 creating 111112 port usage and deduplication 33 troubleshooting 200 Priority for secondary operations 108 provisioning the deduplication storage 65 PureDisk appliance deduplication 14 PureDisk deduplication 14 PureDisk Deduplication Option replacing with media server deduplication 60
recovery deduplication storage server 211 from deduplication storage server disk failure 209 Red Hat Linux deduplication processes do not start 195 replacing PDDO with NetBackup deduplication 60 replacing the deduplication storage server 207 replication between NetBackup domains. See Auto Image Replication for MSDP 36 to an alternate NetBackup domain. See Auto Image Replication Replication to remote master. See Auto Image Replication reports disk logs 143, 146 disk pool status 143, 146 disk storage unit 146 resetting the deduplication registry 132 Resiliency property in Resilient Network host properties 114 Resilient connection Resilient Network host properties 113 resilient network connection log file 191 Resilient Network host properties 113 FQDN or IP Address property in 114 Resiliency property in 114 restores at a remote site 183 how deduplication restores work 59 specifying the restore server 184 retention periods lifecycle and policy-based 110 reverse host name lookup prohibiting 193 reverse name lookup 193
S
scaling deduplication 55 scanned field of the job details 141 seeding initial 53 server not found error 193 setting deduplication pool attributes 164 spa.cfg file parameters for data integrity checking 178
Q
queue processing 174 invoke manually 174
R
rebasing 180
238
Index
storage capacity about 68 adding 70 for deduplicaton 68 viewing capacity in container files 143 storage lifecycle policies creating 105 Data classification setting 107 operations 108 Priority for secondary operations 108 retention type 110 Storage lifecycle policy name 107 Suspend secondary operations 108 Validate Across Backup Policies button 108 storage paths about reconfiguring 155 changing 155 for deduplication 69 storage requirements for deduplication 66 storage server about the configuration file 128 change the name 155 changing properties for deduplication 153 changing the name 155 components for deduplication 215 configuring for deduplication 79 deduplication 25 define target for Auto Image Replication 49 deleting a deduplication 158 deleting the deduplication configuration 159 determining the state of a deduplication 150 editing deduplication configuration file 129 getting deduplication configuration 129 recovery 211 replacing the deduplication host 207 setting the deduplication configuration 131 viewing 150 viewing attributes 151 storage server configuration getting 129 setting 131 storage server configuration file editing 129 storage type for deduplication 22 storage unit configuring for deduplication 83 properties for deduplication 85
storage unit (continued) recommendations for deduplication 86 Storage unit name 85 Storage unit type 85 storage units selection in SLP 110 stream handlers NetBackup 54 stream rate field of the job details 141 supported systems 21 Suspend secondary operations 108
T
Target retention type 110 topology of storage 97, 100 troubleshooting database system error 192 deduplication backup jobs fail 195 deduplication processes do not start 195 general operational problems 198 host name lookup 193 installation fails on Linux 191 no volume appears in disk pool wizard 194 server not found error 193
U
uninstalling media server deduplication 213
V
Validation Report tab 108 viewing deduplication pool attributes 163 viewing storage server attributes 151 VM backup 111 volume manager Veritas Volume Manager for deduplication storage 70
W
wizards Policy Configuration 111