Data Domain Integration Guide
Data Domain Integration Guide
Data Domain Integration Guide
Integration Guide
P/N 300-011-692
REV A04
EMC Corporation
Corporate Headquarters:
Hopkinton, MA 01748-9103
1-508-435-1000
www.EMC.com
Copyright © 2010-2011 EMC Corporation. All rights reserved.
EMC believes the information in this publication is accurate as of its publication date. The information is subject to change
without notice.
THE INFORMATION IN THIS PUBLICATION IS PROVIDED “AS IS.” EMC CORPORATION MAKES NO
REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION,
AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR
PURPOSE.
Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.
For the most up-to-date regulatory document for your product line, go to the Technical Documentation and Advisories section
on EMC Powerlink.
For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com.
All other trademarks used herein are the property of their respective owners.
2 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Contents
Preface
Chapter 1 Introduction
Revision history ...................................................................................................... 14
NetWorker Data Domain integration features ................................................... 14
Integrated deduplication backup with DD Boost software ....................... 14
Storage on NetWorker Data Domain devices .............................................. 15
Cloning and remote storage of backups ....................................................... 16
Support for client-side deduplication backup and recovery ..................... 16
Data Domain configuration and monitoring................................................ 17
Features not supported by the integration ................................................... 17
Version requirements.............................................................................................. 17
License requirements.............................................................................................. 18
NetWorker licenses .......................................................................................... 18
Data Domain licenses....................................................................................... 19
Hosts and software ................................................................................................. 19
Data Domain storage system .......................................................................... 19
Data Domain Archiver system ....................................................................... 19
NetWorker Management Console server ..................................................... 19
NMC portal ....................................................................................................... 19
NetWorker server ............................................................................................. 20
NetWorker client .............................................................................................. 20
NetWorker storage node ................................................................................. 20
NetWorker application modules.................................................................... 20
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 3
Contents
4 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Contents
Glossary
Index
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 5
Contents
6 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Figures
Title Page
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 7
Figures
8 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Preface
As part of an effort to improve and enhance the performance and capabilities of its product
lines, EMC periodically releases revisions of its hardware and software. Therefore, some
functions described in this document may not be supported by all versions of the software or
hardware currently in use. For the most up-to-date information on product features, refer to
your product release notes.
If a product does not function properly or does not function as described in this document,
please contact your EMC representative.
Audience
This guide is written for NetWorker 7.6 Service Pack 2 and Data Domain operating
system 4.9 and later. Earlier versions should refer to the NetWorker Data Domain
7.6 SP1 integration guide.
This document is part of the NetWorker documentation set, and is intended for use
by system administrators. It contains planning, practices, and configuration
information for the use of NetWorker Data Domain devices within an EMC
NetWorker backup and storage management environment.
Readers of this document should be familiar with the following tasks:
◆ Identify the different hardware and software components that make up the
NetWorker datazone.
◆ Follow procedures to configure storage management operations.
◆ Follow guidelines to locate problems and implement solutions.
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 9
Preface
10 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Preface
◆ Technical notes and white papers provide in-depth technical reviews of products
regarding business requirements, applied technologies, and best practices.
These resources are available at http://Powerlink.EMC.com. You must have a service
agreement to use the Powerlink site.
IMPORTANT
An important notice contains information essential to software or hardware
operation.
Typographical conventions
EMC uses the following type style conventions in this document:
<> Angle brackets enclose parameter or variable values supplied by the user
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 11
Preface
Your comments
Your suggestions will help us continue to improve the accuracy, organization, and
overall quality of the user publications. Please send your opinion of this document to:
[email protected]
If you have issues, comments, or questions about specific information or procedures,
please include the title and, if available, the part number, the revision (for example,
A01), the page numbers, and any other details that will help us locate the subject that
you are addressing.
12 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
1
Introduction
Introduction 13
Introduction
Revision history
E-mail your clarifications or suggestions for this document to:
[email protected]
The following table lists the revision history of this document.
A04 May 18, 2011 “Data Domain storage folder structure and limits” on page 22
“Data Domain Archiver system” on page 19
“DD Boost devices on Data Domain Archiver systems” on page 22
A02 May 2, 2011 “Support for client-side deduplication backup and recovery” on page 16
“NetWorker application modules” on page 20
“Dedicated storage node environment” on page 34
“Configure a backup client with the client wizard” on page 59
“Configure a backup client with the property windows” on page 60
14 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Introduction
Backup support
The DFA feature enables supported NetWorker clients to deduplicate their backup
data locally and store it directly on a NetWorker Data Domain device, thereby
bypassing the NetWorker storage node and reducing network bandwidth usage.
Because multiple clients with DFA backup support can share a device by using
multiple sessions, DFA can reduce the number of devices used, thereby reducing the
impact on the Data Domain system performance and maintenance.
Recovery support
If a supported DFA client has access to its NetWorker Data Domain storage device, it
will recover data directly from the device, regardless of whether DFA was used for
the backup. Because DFA bypasses the storage node, performance is improved. If the
DFA client cannot access the data, then the recovery process reverts to the traditional
method that uses the storage node. The Data Domain system converts the stored data
to its original non-deduplicated state for the recovery.
Table 1 on page 17 lists DFA support requirements.
16 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Introduction
Table 1 DFA support with NetWorker 7.6 SP1 and later software
NMM, specifically: NMM 2.3 with one of the following: NMM 2.3
• Microsoft Exchange 2010 • NW 7.6 SP1 server with DD OS 4.9
• Microsoft SharePoint 2010 • NW 7.6 SP2 server with DD OS 4.9 or 5.0
• Microsoft SQL Server 2008 R2
* All DFA recovery clients require NetWorker 7.6 SP1 or later client software.
The EMC NetWorker Module for Microsoft Applications Administration Guide provides
details on how to configure an NMM Client resource for Data Domain Boost backups.
Version requirements
The hosts in a NetWorker Data Domain backup and recovery datazone should meet
the following minimum version requirements:
◆ The NetWorker server must be installed with NetWorker 7.6 SP2 or later software.
This software includes the DD Boost 2.3.1 library.
◆ The Data Domain operating system (DD OS) must be version 4.9 or later.
◆ NMC must be release 7.6 SP2 or later.
◆ Any storage node that uses NetWorker Data Domain devices must be running
NetWorker 7.6 SP2 or later.
◆ NetWorker clients may be earlier releases, although release 7.5 SP3 or 7.6 and later
is recommended. Clients that use NetWorker application modules with direct file
access (DFA) and DD Boost should be upgraded to NetWorker 7.6 SP2 or later.
◆ Other NetWorker storage nodes and clients that are not directly managing or
sending data to the Data Domain system can be earlier releases although release
7.5 SP 3 or 7.6 and later is recommended.
Appendix A, “Upgrading to NetWorker 7.6 SP2,” provides upgrade details.
Version requirements 17
Introduction
License requirements
A fresh installation of a NetWorker release (not an upgrade) enables evaluation of all
features, including Data Domain features with no enabler (license key) required for
30 days. You can extend this evaluation period by 15 additional days by typing grace
as an authorization code before the 30-day expiry period. After expiration, backups
cannot be performed unless permanent enablers are applied.
NetWorker licenses
Two types of license keys must be enabled for the NetWorker server to interact with a
Data Domain or Data Domain Archiver system, the Data Domain Device Type
Enabler and a Data Domain Device Type Capacity Enabler. Upgrade enablers may be
available for earlier releases.
The EMC NetWorker Licensing Guide and your EMC sales representative can provide
licensing details.
18 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Introduction
NMC portal
The NMC server may be accessed and operated from an Internet portal on a remote
computer with a supported Internet web browser and Java Runtime Environment
(JRE) software installed. Multiple users can access the NMC server concurrently from
different portals.
NetWorker server
NetWorker 7.6 SP2 or later software must be installed on the NetWorker server that
stores or clones data on Data Domain systems.
The NetWorker software provides services to back up and recover data for client
computers within a NetWorker datazone. Deduplicated backup data is stored on a
Data Domain system by the use of special NetWorker Data Domain devices, which
are created and managed by the NetWorker software. The NetWorker software also
provides features to create clone copies of stored deduplicated data.
The EMC NetWorker Release Notes provides details on supported Data Domain
features.
NetWorker client
A NetWorker client is a supported host whose data requires backup and restore
services. The NMC server, NetWorker server, and NetWorker storage nodes are all
NetWorker clients.
NetWorker client software must be installed and running before NetWorker Data
Domain devices are configured. Clients may use a supported earlier release of
NetWorker client software than the NetWorker server or storage node.
The EMC Information Protection Software Compatibility Guide provides information on
supported releases.
20 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
2
Planning and
Practices
CAUTION
Data Domain 5.0 or later systems with more than 14 active SU folders for
NetWorker Data Domain devices can suffer impaired performance and hang.
22 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Planning and Practices
◆ An mtree’s data movement policy determines when when data will be moved
from devices on the active tier to devices on the archive tier. These policies and
movements are internal to the Data Domain Archiver system and are transparent
to the NetWorker software.
◆ An mtree applies a single data movement policy to all of the devices it serves
within its corresponding NetWorker datazone, whether they are NetWorker
AFTD or DD Boost devices.
◆ Alternative data movement policies may be configured for the client data in the
same datazone by additional mtrees, which may be manually created in the
Device Configuration Wizard.
◆ DD OS 5.0 supports up to 14 active and 99 defined mtrees on a Data Domain
Archiver system. There is no recommended limit on the number of NetWorker
Data Domain devices that may be created on the system.
◆ The DFA option is supported for clients to write directly to NetWorker Data
Domain archive devices, provided that the devices are assigned to a NetWorker
storage node for data indexing purposes. “Client direct file access (DFA)
deduplication environment” on page 35 provides details.
The Data Domain documentation provides additional details on Data Domain
Archiver features and use.
NetWorker Data Domain devices. Each device may also use two interface
connections, either 1 GbE or 10 GbE.
“Network connectivity and firewall” on page 28 provides details on network
bandwidth and connections.
Note: Currently, the NetWorker Data Domain devices support only an IP network and do not
support SAN (Fibre Channel) data transport.
The EMC Data Domain Storage system with EMC NetWorker 7.6.0 and earlier Best
Practices Planning white paper provides details on deployments of NetWorker with
Data Domain in a VTL or CIFS/NFS AFTD environment.
Note: If a NetWorker Data Domain device becomes full during a backup, the backup fails
immediately. It does not pause or wait for space to become available.
24 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Planning and Practices
The DD OS filesys clean command describes all the available options for managing
how the storage space for redundant or expired data (such as expired NetWorker
save sets) can be reclaimed and recycled.
IMPORTANT
Contact Technical Support for assistance to remove a NetWorker Data Domain
device.
Note: If you need to remove a device from the NetWorker software, both the read-write and
read-only components of the device must be unmounted or an error results. To toggle the
display of these components, in the NetWorker Administration application Devices view, select
View > Display read-only devices and volumes.
Data types
Some types of user data such as text documents, slide presentations, spreadsheets,
most database types, source code, and e-mail are known to contain redundant data
and are good deduplication candidates.
Some other types of data such as audio, video, and scanned images already consist of
compressed data. Typically, the first full deduplication backup of these data types
yields low reductions, but subsequent backups will typically produce high
deduplication ratios if the data has a low change rate.
Change rate
Data that does not change much between backups (low change rate) produces high
deduplication ratios and is a good candidate for deduplication. Data that has already
been stored will not be stored again.
When a completely new save set is deduplicated, the number of unique blocks within
it can vary widely depending on the data type, and often there is little that can be
deduplicated. Yet because the Data Domain system compresses the data blocks, there
is typically a 2:1 to 3:1 (50–75 percent) data reduction.
The storage savings begin to increase significantly with each subsequent backup of
the save set because only those data blocks that are unique to each backup need to be
written to disk. In conventional business operations, the data change rate is typically
low and unique data may represent only 1–2 percent of the data present in each
additional backup set. The remainder of the backup is deduplicated against what is
already stored on the system.
Retention period
The deduplication ratio increases with longer data retention periods. The longer the
stored save sets are retained, the greater the chances that identical data already exists
in storage that can be used to deduplicate each subsequent backup, and the greater
the efficiency of the storage usage.
26 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Planning and Practices
◆ Use a single hostname associated with each NIC or IP interface within the same
NetWorker datazone. Each new hostname or IP address used by NetWorker will
require its own enabler.
◆ Include source and target abbreviations in names to quickly identify whether
network connections are correct. For example, add an abbreviation of the storage
node hostname in the Data Domain name and an abbreviation of the Data
Domain hostname in the storage node name. Ensure that these names are
included in the Data Domain /etc/hosts file.
◆ Specify all aliases, such as long and short names, IP address, and so on for the
NetWorker server and storage nodes in their respective Client resources (Globals
1 of 2 tab).
◆ Ensure that all hostnames in the network can be consistently resolved from
multiple locations in both directions. For example: shortname to IP, longname to
IP, IP to shortname, IP to longname.
◆ In general, use short, easy to identify, descriptive names instead of IP addresses or
fully qualified name strings for devices and storage nodes. Long names might not
fit into some views. Examples of a long and short name:
NWDD365-1.burloak.lab.mycorp.com:/NWDZ_Dr1
NWDD365-1:/NWDZ_Dr1
◆ Use standard alphanumeric characters, including “-” and “_” with no spaces or
special characters, except that pool names may not use underscores “_”.
◆ Use fixed formats (text field length and case) and include leading 0s in numbers,
with a maximum of 50 characters.
◆ Avoid the use of dates in names where the dates might change or become
meaningless in the future.
IP addresses
Avoid IP addresses because numbers are more difficult to identify and troubleshoot
than descriptive names.
However, there are exceptions:
◆ The Data Domain system requires IP addresses to be used to interface with an
ifgroup for Advanced Load Balancing and Link Failover features.
◆ For clone-controlled replications, the hosts file on the source Data Domain system
should list the IP address of the target Data Domain system. Otherwise, the
clone-controlled replication will use the same network that is used to access the
source and target systems for backup and restore operations.
The Data Domain documentation provides details.
Examples
The following are some example name formats.
Firewall requirements
Regardless of the network connections used, communication through a firewall
requires the use of specific ports and protocols for backup, monitoring, and
replication across sites. Ensure that the following firewall ports are open between the
Data Domain, NetWorker, and NMC servers:
28 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Planning and Practices
Note: Clone-controlled replication across datazones or to Data Domain devices that are not
managed by NetWorker are not supported.
3. The storage nodes communicate with the primary Data Domain system and store
deduplicated backup data on the system’s devices over dedicated 10 GbE
network connections.
An ifgroup software configuration of NICs on the Data Domain system enables
the two storage nodes to share the same Data Domain network identity and use a
single Data Domain Storage System Enabler.
“Data Domain Device Type Enabler” on page 18 provides details on the DD
system enabler and ifgroup configuration. “Network connectivity and firewall”
on page 28 provides suggestions on network connections.
4. Clone-controlled replication is used to store optimized clone copies of backups
from the primary Data Domain system over a network to a geographically distant
secondary Data Domain system for the purpose of disaster recovery.
5. As a further option, a NetWorker storage node, attached to the secondary Data
Domain system, is then able to create an additional NetWorker clone copy of the
data for one of the backup groups and stores it on traditional disk or tape. The
data in this copy is reverted to its native non-deduplicated format, which is
necessary for storage on traditional disk or tape.
Servers
IFGROUP
DD Device 09
DD Device 10
Storage Node Tape Library
Admin Network
Backup Network
Servers
DD Device 01 DD Device 05
WAN connection
GEN-001471
30 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Planning and Practices
Note: The performance required for the concurrent method depends on many factors
and would need to be validated by testing at the individual sites.
For example, the original backup on the primary Data Domain system is used
as the source to create simultaneous clones on two target Data Domain
systems.
Figure 2 on page 32 illustrates data paths 2a and 2b for this method.
Backup IP Network
Clients
Storage Node Storage Node Storage Node
10 GbE connection
WAN connection
GEN-001637
32 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Planning and Practices
“Data Domain storage folder structure and limits” on page 22 provides details on
save stream usage and memory requirements. “Data Domain Device Type
Enabler” on page 18 provides details on Data Domain system enablers:
3. The storage nodes store the deduplicated backup data on the NetWorker Data
Domain devices over a shared 10 GbE connection.
“Network connectivity and firewall” on page 28 describes network connections.
4. Additional backup to tape storage may be made either directly from a storage
node or by a NetWorker clone operation from the Data Domain system.
DZ-A datazone
Clients
NetWorker Server (DZ-A)
Clients
Backup Network
DZ-B datazone
Clients
DD Device 21 DD Device 01 DD Device 11
DD Device 22 DD Device 02 DD Device 12
Tape Library DD Device 03 DD Device 13
DD Device 04 DD Device 14
10 GbE connection
GEN-001472
34 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Planning and Practices
IFGROUP
IP Network
NetWorker Server
Clients Clients
DD Device 01 DD Device 21
Clients Clients
DD Device 02 DD Device 22
DD Device 03
DD Device 04
10 GbE connection
GEN-001473
Figure 4 Single datazone with dedicated storage nodes and one high bandwidth link
Network
Deduplication
process on
storage node
36 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
3
Data Redirection and
Migration
38 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Data Redirection and Migration
Note: This also stops the NetWorker Backup and Recovery Service and the EMC GST
service.
g. Right-click the NetWorker Backup and Recovery Service and click Start. This
also starts the NetWorker Remote Exec Service.
h. If the NetWorker server is also the NMC server, right-click the EMC GST
Service and click Start.
5. Run the NMC Device Configuration Wizard. The following tasks are specific to
NetWorker Data Domain devices:
a. Select or create NetWorker Data Domain devices on the Data Domain system
where the backup save sets will be stored.
b. Select or create a Pool resource that will target the save sets to the devices.
Note: The wizard enforces that a NetWorker Media Pool created for the Data Domain
system uses only NetWorker Data Domain devices. This can be verified in the NMC
property windows (Media Pool, Selection Criteria tab) for Target Device > Devices.
The wizard also sets Target Media > Media type required to Data Domain.
c. Select or create a NetWorker storage node on which to label and mount the
new devices.
d. Complete the wizard.
“Creating a NetWorker Data Domain device with the wizard” on page 51
provides details.
6. In the NMC NetWorker Administration window:
a. In the Media view, open the pool created or modified in step 5.
b. In the Data Source attribute, ensure that the groups selected for this pool are
intended for NetWorker Data Domain devices.
c. Ensure that these backup groups are removed from their legacy pools.
“Create pools to target NetWorker Data Domain devices” on page 57 provides
details.
Note: Existing individual NetWorker Client resources should not require any changes to
their settings, including the Data Domain backup attribute. The redirection is more
efficiently configured in the Pool resource.
7. Test the backup environment to ensure that the new configuration operates
correctly and existing backups that should be unaffected continue to run as
expected. For backups to new devices, the test should also include a restore from
those devices.
8. Start your redirection with a full backup to the new devices. This avoids a
dependency on the last full backup stored with the legacy storage environment
and the potential need to restore from two different environments.
9. Monitor backup performance and adjust the backup schedule to optimize the
configuration for maximum throughput or additional clients. “Monitoring
NetWorker Data Domain devices” on page 74 provides details.
Migration methods
Data migration is essentially a one-time NetWorker clone operation that is tailored to
the different device types and time scales. The migration can include all the data or be
limited to data selected from a specific time frame or backup type, such as weekly full
backups.
40 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Data Redirection and Migration
The details of the migration procedure depend on the method to be used and the
granularity of the data to be migrated:
◆ A NetWorker scheduled clone operation. “Migration scenarios” on page 42
provides details.
◆ A NetWorker nsrclone script to be run from a command line. The EMC NetWorker
Administration Guide provides details.
◆ A NetWorker staging (data moving) operation. The EMC NetWorker
Administration Guide provides details.
Note: When existing deduplicated VTL or CIFS/NFS AFTD save sets are migrated, the
save sets are first reverted to their native non-deduplicated format, which the storage node
reads and stores in deduplicated format on the new NetWorker Data Domain devices. This
reversion process occurs for both Data Domain and non-Data Domain storage.
3. Create a clone pool to target NetWorker Data Domain devices for the migration:
• In the Data Source attribute, select groups for the migration.
Typically, you will migrate the same groups that you selected for the
redirection of backups. “Redirect backups to NetWorker Data Domain
devices” on page 38 provides details.
• In the Target Devices attribute, select the NetWorker Data Domain devices
where the migrated data will be stored.
“Create pools to target NetWorker Data Domain devices” on page 57 provides
details.
4. Configure a NetWorker scheduled clone operation with the Write Clone Data to
Pool attribute selected for your clone pool.
“Scheduling a clone operation” on page 68 provides details on the scheduled
clone option.
5. Allow the clone operation to proceed, either according to its schedule or by
manual start.
“Starting a scheduled clone operation manually on demand” on page 71 provides
details.
6. After the clone operation completes, verify the cloned data on the target devices.
Migration scenarios
This section offers some typical scenarios to help you migrate existing backup data
from legacy devices or filesystems to NetWorker Data Domain devices. The best
scenario for you will depend on the configuration of your current storage
environment and the capacities and bandwidth available.
Some example migration scenarios are as follows:
◆ “Migrating to a new DD system from traditional tape or disk storage” on page 42:
• “Migrate to new devices on a different storage node” on page 42
• “Migrate to new devices on the same storage node” on page 43
◆ “Migrating to the existing DD system from non-integrated storage” on page 44:
• “Migrate to a new devices on a different storage node” on page 44
• “Migrate to new devices on the same storage node” on page 45
42 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Data Redirection and Migration
In this scenario, the IP network is used to transfer the data from the original storage
node CASSIE to the new storage node ERIC. The time required for the transfer will
depend on the capacity and bandwidth available on the IP network, irrespective of
the fact that the tape library is on a SAN. If this same IP network is used for backup
and restore operations, then additional effort and time will be required to ensure that
the data transfer does not impact the regular backup operations.
IP Network
2Gb
DDS1Device1
DD
R
4Gb
DDS1Device2
DD
Data Domain System
(DDS1) ERIC OLIVE BUSTER
Rd=Cassie:\tape0
SAN Rd=Cassie:\tape1
SSID 3034
SSID 3452
Data Read Path CASSIE
SSID 1261 Tape Library (SAN)
GEN-001566
Figure 6 Migration from traditional storage to NW-DD devices on a different storage node
Migration scenarios 43
Data Redirection and Migration
IP Network
NW Storage NW NW
DDS1Device1 Node Client Server
DD
R
2Gb
4Gb
DDS1Device2
Data Migration DD
Path
(Internal)
Rd=Cassie:\tape0
Data Domain System
(DDS1) Rd=Cassie:\tape1 CASSIE OLIVE BUSTER
SAN
SSID 3034
Figure 7 Migration from traditional storage to NW-DD devices on the same storage node
44 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Data Redirection and Migration
During this migration, the SAN and the IP network are used in two separate stages.
First, the original storage node CASSIE reads the non-deduplicated data from the
Data Domain system over the SAN connection. Then the new storage node DOVE
reads this data and concurrently stores only unique data, in this case the storage
metadata, across the IP network to the Data Domain system. The speed of the transfer
of non-deduplicated data across the IP network would be the limiting factor.
IP Network
2Gb
DDS1Device1
DD
R
4Gb
DDS1Device2
DD
Data Domain System
(DDS1) DOVE OLIVE BUSTER
Rd=Cassie:\tape0
VTL
Rd=Cassie:\tape1
Migration scenarios 45
Data Redirection and Migration
IP Network
2Gb
R
4Gb
Figure 9 Migration from VTL to NW-DD devices on the same storage node
46 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
4
Software
Configuration
Software Configuration 47
Software Configuration
Note: The DD Boost functionality requires NFS service to be enabled on the Data Domain
system, even if no users or shares are configured. The NetWorker server, storage nodes, or
clients are not required to be enabled for NFS.
4. If required, confirm the version number of the currently installed Data Domain
operating system:
# system show version
5. Create the backup user and set the OST or DD Boost user name:
• For DD OS 4.9:
# user add username
# ost set user-name username
• For DD OS 5.0:
# user add username
# ddboost set user-name username
Note: Use the same username in both of these commands. Only one OST or DD Boost user
may have access to Data Domain at a time.
48 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Software Configuration
• For DD OS 5.0:
# ddboost disable
# ddboost enable
9. Specify and enable users to monitor backup events captured by SNMP traps and
add the host to receive traps:
# snmp add ro-community community_name
# snmp enable
# snmp add trap-host hostname[:port]
The typical value for community_name is “public,” which allows all users to
monitor events.
10. Verify that distributed segment processing (DD Boost) is enabled. This is required
for deduplication operations on the NetWorker storage nodes:
• To verify DD Boost on a DD OS 4.9 system:
# ost option show boost
• To verify DD Boost on a DD OS 5.0 system:
# ddboost option show
If DD Boost is disabled:
• To enable DD Boost on a DD OS 4.9 system:
# ost option set boost enabled
• To enable DD Boost on a DD OS 5.0system:
# ddboost option set distributed-segment-processing enabled
Note: The boost (distributed segment processing) option must be enabled. The disabled
setting is not supported for production use.
11. (Optional) Use the Device Configuration Wizard instead of this step. The wizard
automatically creates a Data Domain storage unit (SU) named after the
NetWorker server hostname if it does not already exist. This SU folder will
contain the NetWorker Data Domain devices (device folders) for that host.
This step is included for power users and for special configurations that require
the manual creation of storage units, for example, to create additional storage
units for the same NetWorker server.
To create an SU folder:
• On a DD OS 4.9 system:
# ost lsu create hostfolder_name
• On a DD OS 5.0 system:
# ddboost storage-unit create hostfolder_name
where as a best practice, hostfolder_name should be named after the short
hostname of the NetWorker server. NetWorker Data Domain devices (device
folders) for this NetWorker server should be located under this folder.
Do not create any folders deeper in the folder path than these device folders.
To verify storage unit names, use the ost lsu show (DD OS 4.9) or ddboost
storage-unit show (DD OS 5.0) command.
“Creating a NetWorker Data Domain device with the wizard” on page 51 provides
details.
IMPORTANT
Although NFS service data is stored in the /backup directory, clients that are
configured for NFS access have the ability to view, change, and delete the
/backup/ost or /data/col1 directories that contain the NetWorker Data Domain
devices. If NFS devices are used, additional configurations may be needed to avoid
potential problems.
Methods
The Device Configuration Wizard, which is part of the NMC software, is designed to
reduce user errors and should be used to create and configure the NetWorker Data
Domain devices. It also creates and configures the required volume labels and storage
pools that are necessary to store client backups on NetWorker Data Domain devices.
After a device has been created with the wizard, you may modify its configuration, if
required, by using the individual NMC property windows:
◆ “Creating a NetWorker Data Domain device with the wizard” on page 51
◆ “Configuring a NetWorker Data Domain device with the property windows” on
page 53.
50 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Software Configuration
Note: There is no recommended limit on the number of NetWorker Data domain devices that
may be created. However, increasing the number of devices can impact Data Domain
performance and maintenance. Avoid the removal of devices. “Removing a NetWorker Data
Domain device” on page 25 provides details.
Note: To navigate to visited wizard pages to modify them, click the links in the steps panel.
The number of steps may vary according to the type of configuration chosen.
Note: Only one Data Domain OST username and password may be defined. All
NetWorker storage nodes and servers that access the Data Domain system must use the
same username and password. “Configure the Data Domain system for NetWorker” on
page 48 provides details on setting the OST username.
Note: The wizard does not show existing device folders that were previously created by
NetWorker 7.6 SP1.
a. Select a top-level folder (the Data Domain system) and click New Folder.
A new device folder appears in the folder tree.
Note: For NetWorker 7.6 SP2, the folder tree (not updated from NetWorker 7.6 SP1),
does not show the storage unit (SU) folder under the Data Domain system folder. The
wizard names this SU folder after the short hostname of the NetWorker server and
places devices within this SU folder. The SU folder is verifiable in the Review
Configurations Settings wizard page.
Note: For NetWorker 7.6 SP2, the Storage Unit label (not updated from NetWorker
7.6 SP1) actually refers to the device name.
Note: Implicit in the path to the SU folder on the Data Domain system are the hidden
mount point folders: /backup/ost (DD OS 4.9) or /data/col1 (DD OS 5.0 and later). This
folder structure is used by all NetWorker server hosts and should not be modified. The
Review Configurations Settings wizard page shows this location.
Note: For NetWorker 7.6 SP2, the Storage Unit name (not updated from NetWorker
7.6 SP1) is actually the path for the device folder in the format: SU/device_name, where SU
is named after the short hostname of the NetWorker server.
The NetWorker Data Domain device will be configured, mounted, and labeled for
the specified pool.
13. The final Device Configuration Results page is informational only. Click Finish
to exit the wizard.
52 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Software Configuration
14. In the NMC Devices view, verify that the device is labelled and mounted, ready
for use. This view should list the device with its appropriate Volume Name.
To show or hide the read-only components of the devices, click View > Display
read-only devices and volumes.
Note: If you manually create a device with this procedure, NMC will list the device but no
corresponding device folder will be created on the Data Domain system. If you try to label such
a device, an error appears. “Configure the Data Domain system for NetWorker” on page 48
provides an optional step to manually create a storage unit on the Data Domain system.
54 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Software Configuration
c. In the Media type attribute, select Data Domain from the list.
Note: Each Data Domain system NIC that is used as device with the Media type
attribute set to Data Domain will require a Data Domain Storage System Enabler.
“License requirements” on page 18 provides details.
6. Do not select Auto media management. It does not apply to NetWorker Data
Domain devices.
7. In the Remote user and Password attributes, type the Data Domain OST
username and password, respectively. This allows NetWorker interfaces to
connect to the Data Domain interface.
Only one OST user may be defined. All NetWorker storage nodes and servers that
access the Data Domain system must use the same username and password.
8. On the Configuration tab, set the number of concurrent save or clone streams
(sessions) the device may handle:
• Target sessions are the number of sessions that the NetWorker storage node
may send to this device before the storage node looks for another available
device. This should be set to a low value for best performance. The default
setting is 1 and it may not be set to a value greater than 60.
• Max sessions is set by default to a value of 10, which in most cases provides
best performance. It may not be set to a value greater than 60.
9. On the Advanced tab, ensure that common device interface (CDI) is set to Not
used with No reserve release. This setting pertains to tape volumes and does not
apply to NetWorker Data Domain devices.
Note: The Device Wizard automatically creates a label template for the volumes and this
procedure does not apply if the wizard is used.
Each NetWorker Data Domain device must be mounted with a storage volume,
which associates it with a specific Volume Name and membership within a pool. A
label template must be created to provide a Volume Name and numbering to all
storage volumes that belong to the same pool.
A label template defines the components of a volume label. All the volumes in the
same pool will have the same label name (for example, “dd_myvol”), but different
volume numbers (for example, .001–.003).
For example, a Data Domain system may have three devices, each of which is
mounted with a storage volume (Volume Name). If each device/volume is associated
with the same pool, the Volume Names would be as follows:
◆ dd_myvol.001
◆ dd_myvol.002
◆ dd_myvol.003
To create a volume label:
1. From the NetWorker Administration window, click Media.
2. In the browser tree, select Label Templates, and from the File menu, click New to
open the Create Label Template window.
3. In the Name and Comment attributes, type a name and description for the label
template. The label will associate a storage pool to a device.
4. In the Fields attribute, type components of the label. Place each label component
on a separate line. The template must include at least one volume number range
component. The label template will be applied to the volumes mounted on
NetWorker Data Domain devices in a Data Domain system.
For example:
dd_myvol
001-999
5. Select a Separator and click OK.
6. In the Next attribute, specify the next Volume Label in the sequence to be to be
applied during the next label and mount operation. For example, dd_myvol.001.
7. Click OK.
56 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Software Configuration
Note: Dynamic Drive sharing (DDS) is not supported for NetWorker Data Domain devices.
To manually create a pool for Data Domain backups by using the property windows:
1. Ensure that the devices that will target the pool have already been created. If you
do not use the Device Configuration Wizard to create the devices, “Configure the
Data Domain system for NetWorker” on page 48 provides details.
2. Ensure a label template has been created for the pool. “Create a volume label
template for NetWorker Data Domain devices” on page 56.
3. From the NetWorker Administration window, click Media.
4. In the browser tree, select Media Pools, and from the File menu, select New to
open the Create Media Pool window with the Basic tab selected.
5. In the Name attribute, type a name for each pool. Create names that clearly
indicate whether the pool is for Data Domain backup or clone operations.
For example:
DDsite1
DDCLsite2
For example, a pool name that starts with DD would be a Data Domain pool, and
a pool name that starts with DDCL would be a Data Domain clone pool. The pool
name can also include the physical location where the backup will be stored.
These conventions make the name easier to use for scripting and reporting.
6. In the optional Comment attribute, type a description of the pool.
7. Select Enabled.
8. Select the Pool type:
• To use the pool for backups, select Backup.
• To use the pool for clone copies, select Backup Clone.
An alternative way to configure a client to use a pool is to specify the Target Pool
attribute of the client resource. “Configure a backup client with the client wizard”
on page 59 provides details.
Note: A client may belong to multiple groups through multiple client definitions.
Note: As a best practice, do not include different media types in a single pool. “Backup fails
for older NetWorker application modules” on page 82 provides further details.
58 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Software Configuration
Note: For Microsoft Windows systems, the SYSTEM or Volume Shadow Copy Service
(VSS) SYSTEM save sets should be backed up on a regular basis for disaster recovery of
the client system.
The EMC NetWorker Administration Guide provides details for this step.
5. In the Group attribute, select a backup group. A backup group consists of clients
that are backed up according to the schedule for the group.
This attribute may show groups pre-selected based on the pool selected. It is
recommended to not separate groups from their associated pools at the client
level.
60 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Software Configuration
Note: Groups that are specifically devoted to deduplication backups help you monitor
deduplication operations and configure backup schedules for either deduplication or
traditional storage.
6. In the Pool attribute, select a pool that targets the NetWorker Data Domain
devices you want to use. The pool selected in this attribute overrides any other
pool that might be configured for the client or its save sets.
An alternative way to configure a client to use a pool is to specify the client or its
group in the Data Source attribute of the pool resource. “Create pools to target
NetWorker Data Domain devices” on page 57 provides details.
7. (Optional) A Data Domain backup applies only to the save sets (the data portion
of backups), and does not back up the NetWorker server’s bootstrap and index
files to a Data Domain device. Normally, the bootstrap and index files are backed
up to a traditional disk or tape device that is locally attached to the NetWorker
server.
If you want to back up the bootstrap and index files to a NetWorker Data Domain
device, create a Pool resource specifically for them. In the pool’s save sets
attribute, specify the following:
bootstrap*
index*
8. In the Schedule attribute, select a backup schedule.
The schedule selected in this attribute overrides any other schedules that might
be configured for the client or its save sets.
9. Select the Apps & Modules tab.
10. In the Deduplication attributes group, select Data Domain backup. This ensures
that the client data will be backed up only to NetWorker Data Domain devices,
even if the pool selected for the backups contains a mix of other device types,
although pools with mixed devices is not recommended.
11. For NDMP deduplication clients, ensure that the -M option is used with the
nsrndmp_save command, which is specified in the Backup Command attribute.
The -M parameter will look for NetWorker Data Domain devices. The EMC
NetWorker Administration Guide provides details.
12. Click the Globals (2 of 2) tab.
13. In the Storage nodes attribute, specify storage nodes that the client will back up
to, especially if this client backs up to a remote storage node.
14. In the Clone storage nodes attribute, if this client is a source storage node for
clone operations, specify the target storage node where the clones will be stored.
15. When you have completed the client configuration, click OK.
In the NetWorker Administration window, the configured client shows a
checkmark in the Scheduled backup column to indicate that scheduled backup is
enabled.
Deactivating a device
To deactivate a device so it does not interfere with normal backup operations, use any
of the following options:
◆ “Convert a read-write device to be read-only” on page 62
◆ “Disable a device” on page 62
◆ “Remove a device” on page 62
Disable a device
Disabling a device prevents further operation of the device. The device may be
re-enabled to restore old data, which is retained but is not active. To disable a device:
1. In the NetWorker Administration window, click the Devices view and select the
Devices folder in the navigation tree.
2. In the View menu, ensure Display Read Only Devices and Volumes is selected.
Both the read-write and the corresponding read-only components should appear
in the Devices table.
3. Perform the following steps for both the read-write and read-only components:
a. In the Devices table, right-click the device and select Unmount.
b. Right-click this device and select Enable/Disable to disable.
4. Inspect the Enabled column of the table to verify the device is disabled.
Remove a device
Special procedures are required to remove NetWorker Data Domain devices.
“Removing a NetWorker Data Domain device” on page 25 provides details.
62 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
5
Cloning and
Replicating Data
Datazone requirement
A clone of stored data may not be created in a different NetWorker datazone. For the
NetWorker server to manage and monitor clone operations, the storage nodes at both
the source and target locations must be clients of the same NetWorker server. The
NetWorker server maintains browse and retention policies for all cloned copies and
can monitor and report on their storage operations.
Clone pool
In order for the NetWorker software to copy save sets from one device to another, a
special pool must be created. This pool, known as a “clone pool,” is assigned to
devices on the target system. It must be properly configured and available for use.
“Create pools to target NetWorker Data Domain devices” on page 57 provides
details. The EMC NetWorker Administration Guide provides additional details.
64 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Cloning and Replicating Data
Clone reports
NetWorker Data Domain clone operation reports can be viewed in the NMC Reports
view.
“Backup, recovery, and clone reports” on page 77 provides details.
Clone formats
Copies of data stored on a NetWorker Data Domain device may be created by the
NetWorker clone process. The type of clone produced depends on the type of media
on which the clone copy will be stored, whether a clone-controlled replication or a
regular clone.
Note: A NetWorker Data Domain device that is used to store clone-controlled replications
should not also be used to store normal backups.
Clone formats 65
Cloning and Replicating Data
Clone methods
NetWorker clone operations may be configured by several methods, which are
suitable to different environment and storage needs. In some cases, it may be
necessary to use multiple or mixed approaches to achieve the desired control and
flexibility.
As choices, you can configure clone operations to be run by automatic start
(auto-clone), by a schedule, or by a customized script.
Immediate clone
Also known as “auto-clone.” Save sets can be automatically cloned when the backup
group that contains them is completed. This clone method is suitable for smaller
environments, or a small number of clients, where the clone operations need to be
completed quickly and immediately within the backup window:
1. In the NetWorker Administration window, select Configuration.
2. Create a Group resource and specify the Clones option and Clone pool.
“Create pools to target NetWorker Data Domain devices” on page 57 provides details
on creating a clone pool.
Scheduled clone
NetWorker scheduled clone operations can be configured and run in NMC according
to a schedule for predetermined clients, pools, save sets, and devices. This method is
suitable for environments where copies of save sets need to be regularly provided,
typically as part of a well-defined maintenance cloning window, which runs
independently of the main backup operation.
“Scheduling a clone operation” on page 68 provides details.
nsrclone command
A NetWorker nsrclone script can be created and used to run clone operations and be
launched either manually or as a scheduled task run from the operating system or an
external scheduler. This method is typically used in larger environments where
flexibility and conditional controls are required.
One example would be to start Clone Job B, which clones to tape storage, if and only
if Clone Job A successfully completes. Another example would be to select multiple
specific save sets to be cloned to specific devices or storage nodes.
Scripted solutions require additional knowledge and have external dependencies,
such as operating system, scripting tool, and so forth.
The EMC NetWorker Administration Guide and the EMC NetWorker Command Reference
Guide provides details. EMC Professional Services are also available.
66 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Cloning and Replicating Data
Clone requirements
To clone data from one Data Domain device to another by NetWorker
clone-controlled replication (optimized cloning), ensure that the following
requirements are met. These requirements assume the previous creation of a clone
target pool, for example, named newclonepool:
1. Ensure that both the source and target storage nodes are clients of the same
NetWorker server.
2. Ensure that the Data Domain systems are properly licensed, including a
Replication license, which is required to create optimized clones.
“Configure the Data Domain system for NetWorker” on page 48 provides details.
3. Ensure that the Client resource for the NetWorker server and both storage nodes
specify, in their Aliases attribute (Globals 1 of 2 tab), all of their names in use
(fully qualified name, short name, aliases, and IP address).
Note: If a nsrclone command or script is used to perform an optimized clone from a host
that is not the NetWorker server, then this command must specify the NetWorker server by
its primary hostname (as listed in the NMC Enterprise view). Otherwise, a regular clone
might be produced instead of an optimized clone.
4. Ensure that a target pool (for example, newclonepool) has been created for Backup
Clone type with the Media type required attribute set to Data Domain.
With this setting, if a Data Domain device is not available for a clone operation in
the specified target pool, then NMC displays a “Media waiting” message.
Note: The Default Clone pool does not allow any modification and Media type required
cannot be set in that pool. The use of the default clone pool is not recommended.
Clone requirements 67
Cloning and Replicating Data
8. Verify that the target clone pool (for example, newclonepool) is properly specified
or selected:
• For CLI clone operations, use the nsrclone -b newclonepool command.
• For scheduled clone operations, in the Write clone data to pool attribute of the
Clone resource, select newclonepool.
• For immediate clone operations for a group, in the Clone pool attribute of the
Group resource, select newclonepool.
• For clones of entire volumes, “Cloning by pools” on page 70 provides details.
Note: The NetWorker software allows only one unique save set clone to be stored on a
single volume. If multiple clones of the same save set are created, each will be stored on a
different volume in the pool.
68 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Cloning and Replicating Data
10. Configure the schedule for the clone operation. Figure 11 on page 69 shows the
settings for a schedule.
Typically, to reduce the use of resources, you would schedule clone operations to
be performed after the backup window has completed:
a. Select Enable to enable the clone operation to run at its scheduled times.
b. In the Start Time attribute, click the spin arrows, or type values, to set the start
time.
c. Select either Weekly by day or Monthly by day to display either a calendar
week or a calendar month. In the calendar, select the days on which you want
the clone operation to run.
d. To repeat the clone operation within a day, specify an Interval time in hours.
For example, if the start time is 6 a.m., and the interval is 6 hours, then the
clone operation will run at 6 a.m., 12 p.m., and 6 p.m.
If the Limit the number of save set clones value is set, then the repeat clone
operation will fail after the limit is reached.
11. Select the Save Set Filters tab and specify the source save sets to include in your
scheduled clone operation.
There are two main options as follows:
• Select Clone save sets that match selections and limit the save sets to be
cloned by specifying the following details:
– save groups
– NetWorker clients
– pools
– save set levels
Cloning by pools
To copy save sets from Data Domain storage to a device, a special pool may be
specified. This pool is know as a “clone pool.” A clone pool must be assigned to a
device on the target Data Domain system, where it will be available for use. There are
two main purposes for a clone pool:
◆ To copy existing deduplicated VTL or CIFS/NFS AFTD save sets to a NetWorker
Data Domain device.
◆ To copy the existing save sets from one NetWorker Data Domain device to
another NetWorker Data Domain device, typically at a remote location for
disaster recovery purposes.
Note: Selection of the volumes, pools, or save sets for a clone operation must ensure that all
appropriate save sets are identified and successfully cloned. Failure to clone the entire backup
set may result in a failure to perform a recovery from that clone copy. Expiration or deletion of
the primary save sets should be performed only when you are confident that all save sets have
been successfully cloned.
70 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Cloning and Replicating Data
7. Once the pool is defined, the clone copies may be configured and run as a
scheduled clone operation or manually started. “Scheduling a clone operation”
on page 68 provides details.
72 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
6
Monitoring,
Reporting, and
Troubleshooting
Note: On NMC servers with HP-UX operating systems, status monitoring (SNMP) in the
Enterprise view is supported, but event monitoring (SNMPTRAP) in the Events view is not
supported.
To configure NMC to monitor Data Domain backup status and events (SNMP traps):
1. Provided you have viewing privileges, the NMC Enterprise view should list the
Data Domain systems as network hosts. “Add a host Data Domain system to
NMC Enterprise view” on page 53 provides details.
2. Ensure that SNMP is enabled on the Data Domain system and Data Domain
system is configured to send traps to NMC server. “Configure the Data Domain
system for NetWorker” on page 48 provides details.
3. In the NMC Enterprise view left panel, right-click the Data Domain system to be
monitored and select Properties. The Properties window opens.
4. On the Manage Data Domain tab, select Capture Events.
If the box is not checked, NMC will monitor the status of the NetWorker Data
Domain devices, but will not monitor Data Domain SNMP traps, required to
monitor event.
5. On the Configure SNMP monitoring tab, type a value for SNMP Community
String. The typical setting is “public,” which allows all users to monitor events.
Figure 12 on page 75 shows an example.
6. Type a value for the SNMP Process Port. The default value is 162. This setting
should agree with the firewall setting on the Data Domain system.
7. Select the appropriate SNMP Traps that you want to monitor. Some traps are
pre-selected. Figure 12 on page 75 shows an example for Data Domain 4.8 alerts.
Other versions may differ.
8. Click OK.
74 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Monitoring, Reporting, and Troubleshooting
• /ddvar indicates log file space used on the Data Domain filesystem.
• Compression (Reduction), which is represented by two values:
– Pre-Compression ÷ Post-comp Used
– (1 - Post-comp Used) ÷ Pre-Compression) x 100%
76 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Monitoring, Reporting, and Troubleshooting
Configure a report
To configure and display a Data Domain backup or clone report:
1. From the NetWorker Management Console window, click Reports.
2. Expand the Reports folder, expand the Data Domain Statistics folder, and then
select a summary or statement report type to view.
The Configure tab for the selected report type appears in the right panel.
3. In the Configure tab, customize the items that you want to include in the report
‹
by selecting the item parameters and clicking the Remove ( ), Add ( ), Remove ›
« »
All ( ), or Add All ( ) buttons as required.
If Save Time values are not specified, the report will display all the available data.
Table 2 on page 77 lists details of report configuration parameters. The specific
parameters available depend on the type of report selected.
Figure 14 on page 78 shows an example report configuration.
4. To display the report, select the View Report tab.
Server Name Selects managed hosts within the enterprise. Selected server names
Save Set Name Selects one or more save sets. Values are Selected save set names
case-sensitive and wild cards cannot be used.
Save Time Limits the report to a specified time range. Save time (range)
View a report
Backup reports are available in various formats. Most are basic reports. The Backup
Summary and Monthly Client Statement are drill-down reports:
◆ “Basic reports” on page 78 describes details of basic reports.
◆ “Table 3 on page 79 describes the basic reports available for Data Domain.” on
page 79 describes details of drill-down reports.
◆ “Advanced Reporting” on page 80 describes advanced reporting functionality
with the optional EMC Data Protection Advisor (DPA).
◆ For clone operations, there is no specific report. You can query and list the copies
of save sets in the NetWorker Administration, Media view, under Save Sets.
Basic reports
A basic report collects statistics for a specific datazone component, time span, or
attribute. You can modify the scope of a report by adjusting the parameters on the
Configure tab.
78 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Monitoring, Reporting, and Troubleshooting
Table 3 on page 79 describes the basic reports available for Data Domain.
Client Summary For all or specified clients, provides the following statistics:
• Amount of data: The amount of the data that would have been moved by using
traditional NetWorker backup (protected data).
• Target size: Size of the data after deduplication has taken place on the Data
Domain System (stored data).
• Deduplication ratio: Percentage of savings by using Data Domain
deduplication.
• Number of Save Sets: The number of save sets included in the backup.
• Number of Files: The number of files included in the backup.
Save Set Summary For all or specified save sets, provides deduplication statistics on the following:
• Amount of data: The amount of the data that would have been moved by using
traditional NetWorker backup.
• Target size: Size of the data after deduplication has taken place on the Data
Domain System.
• Deduplication ratio: Percentage of savings by using deduplication.
• Number of Save Sets: The number of save sets included in the backup.
• Number of Files: The number of files included in the save set.
Save Set Details Displays details about each save set, including backup duration.
The Save Set Details report provides statistics on the following:
• Save Set ID
• Save time
• Backup level
• Save Set size (protected data size)
• Target size: Size of the data after deduplication has taken place on the Data
Domain System (stored data size).
• Deduplication ratio: Percentage of savings by using deduplication.
• Number of Files: The number of files included in the save set.
Drill-down reports
A drill-down report consists of multiple basic reports, connected as layers and all
configured with the same parameters used in the top layer.
Reports can be run for groups, clients, or save sets. You can modify the scope of a
report by adjusting the parameters on the Configure tab.
Table 4 on page 80 lists the drill-down reports available for Data Domain Statistics.
Backup Summary Reports backup statistics over a period of time, 1. Client Summary
starting from client summary down to individual 2. Save Set Summary
save sets level. 3. Save Set Details
Advanced Reporting
NMC provides reports for only the recent backup history in a specific datazone. The
optional EMC DPA software can provide extended reports of backups, trends, and
analysis for one or multiple datazones, including reports of Data Domain systems.
DPA is recommended for larger environments where additional analysis with
forecasts and trends are required.
Troubleshooting
The following sections will help you identify and resolve common configuration and
operation issues.
80 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Monitoring, Reporting, and Troubleshooting
Troubleshooting 81
Monitoring, Reporting, and Troubleshooting
82 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Monitoring, Reporting, and Troubleshooting
IMPORTANT
Ensure the Label and Mount checkbox is not selected. If the volume is
re-labeled all the data will be lost. This action cannot be undone.
e. On the Select the Storage Nodes page, select a storage node to handle the new
devices by doing one of the following:
– Select an existing storage node
– Create a replacement storage node
– Use the NetWorker server itself, which will handle the devices
The storage node must be running on the appropriate network and its
hostname must be resolvable by DNS.
f. Complete the wizard.
6. Manually mount each new device:
a. From the NetWorker Administration window, click Devices.
b. Click View > Display read-only devices and volumes to unhide the
associated read-only devices.
c. In the navigation tree, select the Data Domain system.
d. In the right panel, right-click each device and each corresponding read-only
(RO) device to be mounted, and select Mount.
The device mounts on the storage node and will use the label associated with the
pool you have specified.
7. Review the NMC log for any error messages.
If no errors are reported from this procedure, then the device and its volume should
be available for use. Backup and recovery operations may require further
configuration depending on the original settings and the purpose of the device
recovery.
84 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
A
Upgrading to
NetWorker 7.6 SP2
IMPORTANT
Data Domain 5.0 or later systems with more than 14 active storage unit folders for
NetWorker Data Domain devices can suffer impaired performance and hang.
An exception where more storage units might be configured could be for service
providers to allow quota enforcements, or for reporting on actual capacity used and
deduplication ratios for each application group or end-customer. However, this
practice should not be configured where it is not explicitly required.
Note: Data Domain system with more than 99 NetWorker Data Domain SU folders will fail the
upgrade process from DD OS 4.9 to 5.0
86 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Upgrading to NetWorker 7.6 SP2
DD OS 4.9 Upgrade to NetWorker 7.6 SP2 (except if DFA will be used) Upgrade to DD OS 5.0
as required.
Upgrade to DD OS 5.0 then upgrade to NetWorker 7.6 SP2
Note: On Data Domain OS 4.9 systems, all DD Boost devices use a common root under the
/backup/ost directory. On Data Domain OS 5.0 systems, each DD Boost device has a
separate root under the /data/col1 directory.
87
Upgrading to NetWorker 7.6 SP2
88 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Upgrading to NetWorker 7.6 SP2
c. Create a new directory on the mount point to represent the new Data Domain
device that will be used with NetWorker 7.6 SP2:
mkdir /ddmount/shortname/nw_device_name
Where shortname is the short hostname of the NetWorker server and
nw_device_name is the new DD Boost device that the NetWorker server will
use.
For example:
mkdir /ddmount/gidrathy/DZBurl_BACK_DD01
d. Change user:group ownership of this folder to be the same as the actual SU.
ls -l /ddmount/shortname
chown user:group /ddmount/shortname/nw_device_name
For example:
ls -l /ddmount/gidrathy
chown 501:100 /dmount/gidrathy/DZBurl_BACK_DD01
5. If the NetWorker storage node uses a Microsoft Windows operating system, map
an available drive and create a NetWorker Data Domain device folder on the Data
Domain system as follows:
a. Using Windows Explorer or a command line, map an available drive to the
CIFS share folder on the Data Domain system (\\dd_hostname\sharename). For
example:
net use Z: \\dd_hostname\sharename password /user:username
Where Z: is the available drive letter, dd_hostname is the hostname of the Data
Domain system, and sharename is the CIFS sharename created on the Data
Domain system such as backup_ost or data_col1.
Note: A username and password may be required to map this drive. Specify an account
that has write access to the share folder, such as the sysadmin or the OST user account.
b. In Windows Explorer, expand the mapped drive and expand the SU (which is
named after the short hostname of the NetWorker server).
c. Create a new folder within the SU that will represent the new NetWorker Data
Domain device to be used with NetWorker 7.6 SP2. For example:
DZBurl_BACK_DD01.
6. Move the data from the old SU path to the new SU path by using one of the
following methods:
• On a UNIX storage node (or a Microsoft Windows system that is mounted to
the Data Domain system):
mv /ddmount/pre762_nw_device_name/* /ddmount/shortname/
nw_device_name/
mv /ddmount/pre762_nw_device_name/.nsr /ddmount/shortname/
nw_device_name/
• On a Microsoft Windows storage node:
Use Windows Explorer to move files from:
Z:\ddmount\pre762_nw_device_name
to
Z:\ddmount\shortname\nw_device_name
89
Upgrading to NetWorker 7.6 SP2
Where:
– pre762_nw_device_name is the name of the existing 7.6 SP1 Data Domain SU.
– shortname is the short hostname of the NetWorker server.
– nw_device_name is the name of the newly created NetWorker Data Domain
device.
For example, move files from:
Z:\ddmount\My_Old_BACK_DD01
to
z:\ddmount\gidrathy\DZBurl_BACK_DD01
7. Unmount the export from the NetWorker storage node:
• On a UNIX storage node:
umount /ddmount
• On a Microsoft Windows storage node:
In Windows Explorer, right-click the mapped drive and select Disconnect, or
from a command prompt type:
net use Z: /delete
8. On the Data Domain system, remove the NFS or CIFS export that was created.
• For NFS and Data Domain OS 4.9:
nfs del /backup/ost *
• For NFS and Data Domain OS 5.0:
nfs del /data/col1 *
• For CIFS and Data Domain OS 4.9:
cifs destroy sharename
• For CIFS and Data Domain OS 5.0:
cifs destroy sharename
9. Delete the pre 7.6 SP2 SU on the Data Domain system, which is now empty:
• For Data Domain OS 4.9:
ost lsu delete pre762_nw_device_name
• For Data Domain OS 5.0:
ddboost storage-unit delete pre762_nw_device_name
10. Configure the device access information:
a. Connect to the NetWorker server using NMC and click the Devices view.
b. In the View menu, select Display Read Only Devices.
The Devices table displays both read-write and related read-only devices.
c. Click Devices in the navigation tree.
d. In the Devices table, right-click on the DD Boost device and select Properties.
e. Update the Device Access Information field from the pre 7.6 SP2 device
configuration to the new SU and Data Domain device configuration in the
following format:
dd_hostname:shortname/nw_device_name
Where:
– dd_hostname is the name of the Data Domain system.
90 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Upgrading to NetWorker 7.6 SP2
91
Upgrading to NetWorker 7.6 SP2
92 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Glossary
A
administrator Person who normally installs, configures, and maintains software on network
computers, and who adds users and defines user privileges.
AFTD (advanced file Disk storage device that uses a volume manager to enable multiple concurrent
type device) backup and recovery operations and dynamically extend available disk space.
B
backup 1. Duplicate of database or application data, or entire computer system, stored
separately from the original, which can be used to recover the original if it is
destroyed or damaged.
2. Operation that saves data to a volume for use as a backup.
bootstrap Save set that is essential for NetWorker disaster recovery procedures with the
NetWorker server. The bootstrap consists of three components that reside on the
NetWorker server: the media database, the resource database, and a server index.
browse policy NetWorker policy that specifies the period of time during which backup entries are
retained in the client file index. The index makes the associated backed-up data
readily accessible for recovery by desktop users. See “retention policy.”
C
client Computer, workstation, or fileserver whose data can be backed up and restored with
NetWorker software.
client file index Database maintained by the NetWorker server that tracks every database object, file,
or filesystem backed up. The NetWorker server maintains a single index file for each
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 93
Glossary
client computer. The tracking information is purged from the index after the browse
time of each backup expires.
Client resource NetWorker server resource that identifies the save sets to be backed up on a client.
The Client resource also specifies information about the backup, such as the schedule,
browse policy, and retention policy for the save sets.
clone Duplicate copy of backed-up data, which is indexed and tracked by the NetWorker
server. Single save sets or entire volumes can be cloned.
clone-controlled Creation of a replica of deduplicated data copied from one NetWorker Data Domain
replication device to another, which can be scheduled by the NMC clone feature and is indexed
and tracked by the NetWorker server.
clone volume Exact duplicate of a backup or archive volume. NetWorker software can track four
types of volumes (backup, archive, backup clone, and archive clone). Save sets of
these different types may not be intermixed on one volume. Clone volumes may be
used in exactly the same way as the original backup or archive volume.
D
database 1. A collection of data arranged for ease and speed of update, search, and retrieval
by computer software.
2. An instance of a database management system (DBMS), which in a simple case
might be a single file containing many records, each of which contains the same
set of fields.
datazone Group of clients, storage devices, and storage nodes that are administered by a
NetWorker server.
DD Boost An optimized library and communication framework with a special Data Domain
API that allows the NetWorker software to define and interact with storage devices
on the Data Domain system.
deduplication Process of detecting and identifying the redundant variable-length blocks (data
segments) within a given set of data to eliminate redundancy.
deduplication Type of backup in which redundant data blocks are replaced by metadata pointers
backup and only unique blocks of data are stored. When the deduplicated data is restored,
the data is returned to its original native format.
deduplication ratio Reduction in storage space required to store data as a result of deduplication
technology, usually combined with data compression, for example, a 20:1 space
reduction.
device 1. Storage folder or storage unit that can contain a backup volume. A device can be a
tape device, optical drive, autochanger, or disk connected to the server or storage
node.
94 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Glossary
2. Access path to the physical drive, when dynamic drive sharing (DDS) is enabled.
3. General term that refers to storage hardware.
disaster recovery Recovery from any disruptive situation, such as hardware failure or software
corruption, in which ordinary data recovery procedures are not sufficient to restore a
system and its data to normal day-to-day operations.
distributed segment Part of the DD Boost interface, which enables data deduplication to be performed on
processing (DSP) a host before the data is sent to the Data Domain system for storage.
E
enabler code Special code that activates the software. The enabler code that unlocks the base
features for software is called a base enabler. Enabler codes for additional features or
products (for example, library support) are called add-on enablers.
G
group Client computer or group of clients that are configured to back up files during a
NetWorker scheduled backup, according to a single designated schedule or set of
conditions.
H
host Computer on a network.
L
label Electronic header on a volume used for identification by NetWorker or other data
mover application.
M
managed application Program that can be monitored or administered, or both from the Console server.
media Physical storage, such as magnetic tape, optical disk, or filesystem, to which backup
data is written. See also “volume.”
media index Database that contains indexed entries of storage volume location and the lifecycle
status of all data and volumes managed by the NetWorker server. Also known as
media database.
N
NetWorker Data A logical storage device created on a Data Domain server, which is used to store
Domain device deduplicated NetWorker backups. Each device appears as a folder on the Data
Domain system and is listed with a storage volume name in NMC.
NetWorker Software product that is used to manage NetWorker servers and clients. The NMC
Management server also provides reporting and monitoring capabilities for all NetWorker
Console (NMC) processes.
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 95
Glossary
NetWorker server Computer on a network that runs the NetWorker server software, contains the online
indexes, and provides backup and restore services to the clients and storage nodes on
the same network.
notification Message sent to the NetWorker administrator about important NetWorker events.
O
online indexes Databases located on the NetWorker server that contain all the information
pertaining to the client backups (client file index) and backup volumes (media
database).
P
pathname Set of instructions to the operating system for accessing a file. An absolute pathname
indicates how to find a file starting from the root directory and working down the
directory tree. A relative pathname indicates how to find a file starting from the
current location.
policy Set of constraints that specify how long the save sets for a client are available for
recovery. Each NetWorker client has a browse policy and a retention policy. When the
browse policy expires, the save sets associated with that policy are no longer readily
available for browsing in an index. When the retention policy expires, the save sets
associated with that policy are marked recyclable.
pool 1. NetWorker feature that assigns specific backup data to be stored on selected
media volumes.
2. Collection of NetWorker backup volumes to which specific data has been backed
up.
R
recover To restore data files from a backup volume to a client disk and apply transactional
(redo) logs to the data to make it consistent with a given point in time.
remote device 1. Storage device that is attached to a storage node that is separate from the
NetWorker server.
2. Storage device located at an offsite location that stores a copy of data from a
primary storage device for disaster recovery.
replication Process of creating an exact copy of an object or data. Examples in this guide include
NetWorker clone-controlled replication and Data Domain collection replication.
resource Software component that describes details of the NetWorker server or its clients.
Clients, devices, schedules, groups, and policies are all NetWorker resources. Each
resource has configurable attributes that define its properties.
restore To retrieve individual data files from backup media and copy the files to disk,
without applying transaction logs. See also “recover.”
96 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Glossary
retention policy NetWorker setting that determines the minimum period of time that backup data is
retained on a volume available for recovery. Once this time is exceeded, the data is
eligible to be overwritten. See also “browse policy.”
S
save NetWorker command that backs up client files to backup media volumes and makes
data entries in the online index.
save set Group of files or a filesystem that has been backed up on storage media by using the
NetWorker software.
save stream Data and save set information that is written to a storage volume during a backup. A
save stream originates from a single save set.
scheduled backup Type of backup that is configured to start automatically at a specified time for a group
of one or more NetWorker clients. A scheduled backup generates a bootstrap save set.
storage node Computer with NetWorker software installed whose primary purpose is to receive a
backup stream from one or more backup clients and then write to save sets on storage
media.
storage unit (SU) Logical unit of disk storage on a Data Domain system.
T
trap Unsolicited notification sent from the SNMP agent to the network manager’s SNMP
event manager.
V
VTL Virtual tape library.
volume 1. Unit of physical storage medium, such as a magnetic tape, optical disk, or
filesystem used to store data. Backup data must be stored on a backup volume
and cannot be stored on an archive volume or a clone volume.
2. Identifiable unit of data storage that may reside on one or more computer disks.
volume ID (volid) Internal identification that NetWorker software assigns to a backup volume.
volume name Name that you assign to a backup volume when it is labeled. See also “label.”
VTL Virtual tape library. Software emulation of a physical tape library storage system.
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 97
Glossary
98 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Index
A D
Add New Host Wizard 53 daily summary report 79
AFTD disk configurations 24, 41 Data Domain
authorization code 18 backup attribute 39, 59, 61
Device Type Capacity Entitlement Enabler 18
replication 40, 64
B Retention Lock not supported 17
backup failure
server 19
older NetWorker application modules 59, 61, 82
Data Domain Archiver 22
backup, redirect 38
DD Boost
bootstrap 61
enabling 49
feature 14
C license 19
cascaded replication configuraiton 31 DD format 65
CIFS formats 24, 41 DDBoost username 48
cleaning storage space 24 deactivate a device 62
client dedicated storage nodes 34
configuration wizard 59 deduplication
non-wizard configuration 60 feature 14
clone ratio 25
auto 66 device
by pool 70 deactivate 62
feature 64 Device access information 55
immediate 66 Device Configuration Wizard 51
manually started 71 device identity 54
native non-deduplicated 65 devices
nsrclone 66 creating 51
pool 64 mounting 58
regular 65 non-wizard configuration 54
requirements 64, 67 removal 25, 62
scheduled 66, 68 renaming 25
clone-controlled replication disable a device 62
cascade configuration 31 disaster recovery configuation 29
feature 16 distributed segment processing 14, 49
for disaster recovery 29 DPA 80
process 65 dynamic drive sharing 57
scheduling 68
shared datazones 32 E
compression not supported 17
encryption not supported 17
configuring
evaluation period 18
Data Domain for NetWorker 48
NetWorker for Data Domain 50
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 99
Index
M R
max sessions 23, 55 redirect backups 38
memory requirement, storage node 23 regular clone 65
migration remote device 55
legacy save sets 40, 41 REPLICATION license 19, 48
scenarios 42 replication, Data Domain 64
tape to local devices 43 reports
tape to remote devices 42 advanced 80
VTL to local devices 45 basic 78
VTL to remote devices 44 configuring 77
monitoring drill-down 79
backup operations 75 reviewing 78
backup statistics 76
devices 74 S
monthly summary report 79 SAN support 24
mounting a device 58 save streams 23
mtree 22 seed
multiple datazones 32 as a data migration step 44
before redirected backups begin 38, 41
N by native Data Domain replication 40
naming guidelines 26 diminishing value of 42
native non-deduplicated format 41 new device with legacy data 40
NDMP deduplication backup 61 not required on existing system 44
net hosts add command 26 sessions 23
network connectivity 28 shared datazones 32
NetWorker shared storage units 86
client 20 short names, preferred over long names 27
server 20 snmp add ro-community command 53
storage node 20 snmp add trap-host command 53
NetWorker Data Domain SNMP traps 49, 53
100 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide
Index
storage node 23
storage space, cleaning 24
storage unit 22
storage units
number of 22
storage units, creating 49
T
target devices 58
target sessions 23, 55
throughput, maximize 28
troubleshooting
connectivity issues 80
IP addresses 27
network connections 81
network issues 81
V
version requirements 17
virtual tape libraries 41
volume name 56
Volume Shadow Copy Service 60
VTL 24
W
wizard
Add New Host 53
Client Configuration 59
device configuration 51
EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide 101
Index
102 EMC NetWorker Data Domain Deduplication Devices Release 7.6 Service Pack 2 Integration Guide