By Joab Jackson, GCN Staff

The World of Global File Systems
CXFS (Clustered XFS): CXFS is a an extension of SGIs XFS file system, which was developed for the SGI IRIX operating system. CXFS is optimized for large computer clusters that work together on a single-system image, such as the NASA Ames Rsearch Centers Columbia supercomputer, where it is deployed. (http://www.sgi.com/products/storage/tech/file_systems.html )
GPFS (General Parallel File System): GPFS is a file system for clustering developed by IBM Corp. First developed for IBMs AIX Unix operating system, GPFS now works for Linux implementations as well. It can support over 1,000 disks within a single file system. (http://www.ibm.com/systems/clusters/software/gpfs.html )
GFS (GLOBAL FILE SYSTEM): GFS is an open-source file system developed for Linux clusters. Red Hat Inc. of Raleigh, N.C., incorporates GFS into its Red Hat Enterprise Linux operating system. (http://sources.redhat.com/cluster/gfs /)
NFS RDMA (NFS Remote Direct Memory Access): NFS RDMA fuses the widely used Network File System with the RDMA protocol, which can be used to offload work from the server CPU to the network card, thereby increasing the potential amount of data that can be downloaded. It is being incorporated into Version 4 of NFS. (http://sourceforge.net/projects/nfs-rdma )
PanFS (The Panasas ActiveScale File System): An object-based file system for Linux clusters that is used with storage arrays from Panasas Inc. of Fremont, Calif. (http://www.panasas.com/panfs.html )
pNFS (Parallel NFS): pNFS is a version of NFS that allows data to be spaced across multiple storage arrays, which could speed downloads and writes of large data sets. It is being incorporated into Version 4 of NFS. (www.pdl.cmu.edu/pNFS/) pVFS (The Parallel Virtual File System): pVFS is a dedicated open-source file system for parallel environments. Development of pVFS is being funded by the Energy Department, as well as by NASA and the National Science Foundation. (http://www.parl.clemson.edu/pvfs /)
ZFS (Zettabyte File System): The next-generation file system developed by Sun Microsystems Inc. as a successor for NFS. It is packaged in the Sun Solaris 10 operating system. ZFS is the first file system built to support 128-bit addressing, allowing systems to manage a virtually unlimited amount of storage. (http://www.opensolaris.org/os/community/zfs
For Gary Grider, group leader of Los Alamos National Laboratorys High Performance Computing Systems Integration Group, saving data is of the highest importance. He is part of a team that is developing what may well be the worlds fastest supercomputer, a petascale machine called Roadrunner with more than 32,000 processors. IBM Corp. is leading the effort.
Jobs simulating nuclear-weapon degradation could take months to run. If a single processor failed a statistical probability given the sheer number of CPUs used the work would be corrupted. So, naturally, the lab wants to save often, just as you might do with your PC. But in this case, the procedure involves frequently saving terabytes of data as quickly as possible no small feat.
Thats why Los Alamos specified that data must be able to flow back from the processors to the storage arrays at an unprecedented 50 Gbps, far beyond the capability of any single storage cluster. Running multiple storage arrays in parallel would do the trick, but that approach requires advanced techniques for coordinating the storage and management of data.
Roadrunner isnt alone in facing this challenge. You can easily put a lot of CPU power in the room, but to do useful work, you also need very good I/O, said Mike Gigante, SGIs engineering director of file-serving technologies. Unfortunately, many people dont think about the I/O until the CPU is set up, and they realize that the overall utilization efficiency of their computer is very low.
File here
Managing a computers data is the job of the file system, and agencies, volunteer bodies and industry are working on a new generation of file systems, often called global or parallel file systems, that can support machines such as Roadrunner. The challenge is picking the right one for the job.
In many ways, Energy Department laboratories have been a driving force behind the development of global file systems. In 1994, Energy labs banded together to develop Lustre, a file system designed specifically for the upcoming supercomputer deployments. We didnt see anyone out there who had what we wanted, Grider said.
More news on related topics: Communications / Networks
| Latest News | |||
| Latest News | |||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
© 1996-2008 1105 Media, Inc. All Rights Reserved.



