IBM Total Storage SAN File System As An Infrastructure For Multimedia Servers Redp4098
IBM Total Storage SAN File System As An Infrastructure For Multimedia Servers Redp4098
IBM Total Storage SAN File System As An Infrastructure For Multimedia Servers Redp4098
ibm.com/redbooks
Why SAN?
To answer the question of why SAN, consider the scenario in Figure 1.
Disk1
Server1 Clients Server2
Disk2
Here, Server1 is a multimedia server or streaming server serving some clients. Server1 can serve a maximum of 10 clients, assuring the required QoS to each one. If more client requests come in, we need to add a new server named Server2. In addition, one needs to copy the data that Disk1 has to Disk2. The job of copying the data and maintaining two consistent copies is very tedious, because the media data is so large in size. A storage area network (SAN) can give us relief from this problem. As shown in Figure 2, Server1 and Server2 can share the data using a SAN. Therefore, using a SAN enables an easy capacity upgrade of your environment.
Disk
S A N
In a traditional environment, each server has its own file system that must be individually configured and managed. This means that as you scale the number of servers and file systems, you often have to scale the administrative staff. This silo approach to file systems also makes it difficult to set and audit enterprise-wide policies for how corporate information is to be stored and protected. Data sharing between applications that run on different servers is difficult. Traditional SAN infrastructures are just not very flexible, in and of themselves. Some major file and data management issues that exist in todays SANs include: File tasks must be done on each server. Difficult to migrate applications to other servers. Application downtime is required for file system changes. No single view or single access to files or data. No ability to pool files based on quality of service. Therefore, we need a file management system that exploits all the functionality provided by a SAN and also provides an easy interface to manage the data stored in a SAN. This system needs to resolve all concurrency-related issues for a SAN environment and provide a global namespace that will work on multiple platforms. IBM TotalStorage SAN File System is designed to simplify file and data management in your storage area network by consolidating file systems across UNIX, Microsoft Windows, and Linux servers. With SAN File System, files and file systems are viewed and managed as a centralized IT resource with a single point of administrative control. By providing a common SAN-wide file system, SAN File System can support secure, heterogeneous data sharing and centralized policy-based storage management in an open environment. SAN File System is designed to help you to achieve data lifecycle management efficiencies through policy-driven automation and tiered storage management. User-defined policies provide the ability to better match the cost of your storage resources to the value of your data. SAN File System is also designed as a highly scalable solution, supporting both very large files and very large numbers of files without the limitations normally associated with Network File System (NFS) or Common Internet File System (CIFS) implementations. The result is reduced storage management costs, enhanced productivity, and optimized storage resource utilization. SAN File System has been designed to help: Simplify file and database management and improve administrator productivity through policy-based data management Enhance data sharing and collaboration through a single, global namespace across heterogeneous server platforms, with high performance and full locking support Reduce storage needs by pooling available and temporary file space across heterogeneous servers and storage platforms and reducing the need to maintain duplicate data for sharing Improve application and data availability, and lower failover costs by reducing or eliminating application downtime for storage and data management tasks Simplify and lower the cost of data backups through application server free backup and leveraging built-in, file-based FlashCopy functions Provide an on demand storage environment for fast and simple deployment through file level virtualization, policy-based automation, and support of open standards
Out-of-band virtualization
SAN File System is an out-of-band virtualization implementation, where the data flow is separated from the control flow. This is achieved by separating the data and metadata (data about the data) into different places. Out-of-band virtualization involves moving all mapping and locking tables to a separate server (the metadata server) that contains the metadata of the files. In an out-of-band solution, servers (such as application servers, file servers, and so on) request authorization to data from the metadata server, which grants it, handles locking, and so on. After they are authorized, the servers access the data directly without any metadata server intervention. After a server has obtained access to a file, all I/O will go directly over the SAN to the storage devices. For many operations, the metadata server does not even intervene. Separating the flow of control and data in this manner enables the I/O to use the full bandwidth that a SAN provides, while control can go over a separate network or routes in the SAN that are isolated for this purpose.
How it works
SAN File System is intended to provide a common file system specifically designed for storage networks. A key element of its design is the way that it manages the metadataoverall information about files, such as file location, security permissions, and locks. In many traditional file systems, the metadata resides within individual servers, which can create limitations in sharing and accessing data across servers or across file systems. By managing the metadata on the storage network using a metadata server, instead of individually within all the servers, the design of SAN File System helps move intelligence out from individual servers to the storage network so that it can be available to any server in the network. Figure 3 illustrates this concept.
NFS CIFS
External Clients
AIX
VFS w/Cache
Solaris
VFS w/Cache
Windows 2000
VFS w/Cache
Linux
VFS w/Cache
Metadata Server
Metadata Store
Storage Network
Metadata Server
Metadata Server
Computers that want to share data and have their storage centrally managed are all connected to the SAN. In SAN File System terms, these are known as clients, because they access SAN File System services; although in the enterprise context, they would most likely
be, for example, database servers, application servers, or file servers. The SAN File System client code installed on each attached server captures file requests (such as open, close, create, or lock) and passes this information to the metadata server, which maintains the master catalog of file metadata, notifies the client if a file address changes in flight, handles the file placement policies and information, and handles the locking for file sharing. After SAN File System client software has the metadata, it accesses the file data directly over the SAN. The SAN File System client software enables users and applications to access the global namespace through a virtual file system (VFS) on UNIX and Linux systems and an installable file system (IFS) on Microsoft Windows systems. This layer (VFS/IFS) is built by the operating system (OS) vendors specifically for special-purpose or newer file systems. Metadata servers are clustered for scalability and availability of metadata operations and are often referred to as the metadata server cluster. In a SAN File System server cluster, there is one master metadata server and one or more subordinate metadata servers. Each MDS runs on a separate physical engine in the cluster. Additional metadata servers can be added as required if the workload grows, providing solution scalability. In this manner, SAN File System is designed to provide high-performance access to data and enable sharing across heterogeneous application servers with full-locking and security. SAN File System is designed to allow applications on any server within the SAN to access any file in the network. No changes to the applications typically need to be made. Storage volumes that store SAN File System clients user data (user pools) are separated from storage volumes that store metadata (system pool), as shown at the bottom of Figure 3.
Extensive caching
SAN File System does extensive caching of data, as well as metadata at the client side. It separately maintains the metadata cache. The metadata cache is about 32 MB for 32-bit 6
IBM TotalStorage SAN File System as an Infrastructure for Multimedia Servers
machines and 256 MB for 64-bit machines. The data cache is maintained by the normal file system-related operations through the virtual file system (VFS) layer. As the metadata is cached, the client does not need to contact the metadata server for the metadata frequently, so even if the data is not available in the data cache, it can be retrieved very quickly because of the faster Fibre Channel infrastructure support. Therefore, extensive caching of metadata plays an instrumental role in multimedia servers that serve popular files where there are multiple requests for the same file, because the file can be served from the cache itself, which cuts the delay of retrieving the data from storage multiple times.
IP Network
SAN
1 2 3
MDS Cluster
Volume1
Volume2
Volume3
Figure 4 shows a part of a very large file represented by the blocks in pink. Each file part is then divided into sections (three in this case), which are striped across the available storage volumes. Therefore, while retrieving data (for example, when playing a media file), these
sections can be read concurrently to improve throughput. SAN File System does all this transparently. This placement strategy is called the round-robin method. This method is best suited for when data is going to be retrieved in a parallel fashion.
movie1.avi menu.txt
project.log
IP Network
Rules for File Provisioning *.avi, *.mp3 go into Storage Pool1 *.txt, *.doc go into Storage Pool2 *.log goes into storage Pool3
SFS Client
MDS Cluster
Here, we have a tiered storage system with storage pools of different performance characteristics. Storage Pool 1 provides the best throughput. Storage Pool 2 provides medium performance, and Storage Pool 3 is the slowest. We have a policy file that shows the rules we have set. The rules say that all AVI and MP3 files should go in Storage Pool 1, because these are all multimedia files and have deadlines associated with them. Therefore, we need to give the best storage pool to these types of files. Then, all TXT and DOC files go to Storage Pool 2 and all LOG files go to Storage Pool 3. Figure 5 shows the execution of the policy with three example files named movie1.avi, menu.txt, and project.log. SAN File System also provides information lifecycle management capabilities, enabling administrators to specify how files should be automatically moved among storage pools during their lifetime. This feature can potentially lower the overall costs of storage and improve storage space utilization, enabling a balanced use of premium and inexpensive storage matching the objectives of the enterprise. This feature also has the potential to reduce
manual storage administration for managing space utilization and reduce the cost of storage management. Therefore, this policy-based storage management can prove helpful in the task of managing media files of large sizes according to their popularity or hit rate. You can write various rules in the policy according to your requirements.
High availability
SAN File System has some self-healing features that make it highly available for multimedia servers: If one metadata server from the metadata cluster fails, SAN File System has a fail-over mechanism with which all the responsibilities of the failed metadata server can be transferred to another, running metadata server. SAN File System has a volume drain feature that provides nondisruptive volume data transfers. This technique is useful if one of the disks fails in the underlying storage and needs to be replaced with a new one. SAN File System also has an automatic call home reporting feature, thus if there is any problem in the system, it automatically reports the problem to an IBM Support Engineer.
Conclusion
In this study of the requirements of multimedia data, we discussed unique characteristics of media data and its special QoS requirements from a file system. We proposed SAN as a hardware solution and SAN File System as a software solution on top of SAN for a typical multimedia data center. In this proposition, we touched on SAN File System architecture, and at the end, we described the features of SAN File System that make it a perfect fit in this media environment. In summary, we emphasized the following features: Better performance with large files Extensive caching Prefetching (read-ahead) to meet QoS Data placement strategy for better throughput Policy-based, efficient storage management High availability Another point that this paper stresses is that, along with meeting the QoS demands of media data, SAN File System provides a complete storage management solution, taking care of the data during its entire life cycle (generally termed information lifecycle management).
10
Notices
This information was developed for products and services offered in the U.S.A. IBM may not offer the products, services, or features discussed in this document in other countries. Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any non-IBM product, program, or service. IBM may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to: IBM Director of Licensing, IBM Corporation, North Castle Drive Armonk, NY 10504-1785 U.S.A. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to non-IBM Web sites are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this IBM product and use of those Web sites is at your own risk. IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements or other publicly available sources. IBM has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. COPYRIGHT LICENSE: This information contains sample application programs in source language, which illustrates programming techniques on various operating platforms. You may copy, modify, and distribute these sample programs in any form without payment to IBM, for the purposes of developing, using, marketing or distributing application programs conforming to the application programming interface for the operating platform for which the sample programs are written. These examples have not been thoroughly tested under all conditions. IBM, therefore, cannot guarantee or imply reliability, serviceability, or function of these programs. You may copy, modify, and distribute these sample programs in any form without payment to IBM for the purposes of developing, using, marketing, or distributing application programs conforming to IBM's application programming interfaces.
11
Send us your comments in one of the following ways: Use the online Contact us review redbook form found at: ibm.com/redbooks Send your comments in an email to: [email protected] Mail your comments to: IBM Corporation, International Technical Support Organization Dept. QXXE Building 80-E2 650 Harry Road San Jose, California 95120-6099 U.S.A.
Trademarks
The following terms are trademarks of the International Business Machines Corporation in the United States, other countries, or both:
AIX 5L AIX developerWorks FlashCopy IBM Lotus Notes Lotus Notes Redbooks (logo) Redbooks TotalStorage
The following terms are trademarks of other companies: Solaris, Sun, and all Java-based trademarks are trademarks of Sun Microsystems, Inc. in the United States, other countries, or both. Microsoft, Windows, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both. UNIX is a registered trademark of The Open Group in the United States and other countries. Linux is a trademark of Linus Torvalds in the United States, other countries, or both. Other company, product, or service names may be trademarks or service marks of others.
12