Linux Storage Guide

© Dennis Leeuw dleeuw at made-it dot com
Last updated: 29 Feb 2012
License: GPL

1. Introduction

System usage / /home iSCSI

Logical Volume(s) /dev/vg-1/lv-1 /dev/vg-1/lv-2 /dev/vg-1/lv-3 unused

Volume Group(s) /dev/vg-1

Physical Volume(s) /dev/md0 /dev/sdc1 /dev/sdd1

/dev/sda1 /dev/sdb1

iSCSI iqn...-1 iqn...-2 iqn...-3

Hardware SCSI disk NAS

System usage	/	/home	iSCSI
Logical Volume(s)	/dev/vg-1/lv-1	/dev/vg-1/lv-2	/dev/vg-1/lv-3	unused
Volume Group(s)	/dev/vg-1
Physical Volume(s)	/dev/md0	/dev/sdc1	/dev/sdd1
/dev/sda1	/dev/sdb1
iSCSI		iqn...-1	iqn...-2	iqn...-3
Hardware	SCSI disk	NAS

1.1 SAN vs. NAS

Let us first look at the terms SAN and NAS. SAN stands for Storage Attached Network and NAS for Network Attached Storage. A NAS is often refered to when storage is provided to a network like the Microsoft™ shares that are offered through SMB or CIFS, but also NFS shares.

SANs are disks that are offered from a network to a server. This is most often an iSCSI network or a Fibre Channel network.

The line between a SAN and a NAS was when the terms where invented more clear then they are now. An iSCSI share can be offered to the network and a CIFS share can be provided to a server. The most clear destinction between the two is still that a share offered by a NAS can be mounted by more then one host at the same time. Some kind of locking prevents access problems to files, while a share offered by a SAN can only be used by a single host in read/write mode.

2. The Filesystem Hierarchy

See http://www.pathname.com/fhs/

5. Some real file systems

5.1. A non-journaling filesystem example: Ext2

http://www.linuxhq.com/guides/TLK/fs/filesystem.html

5.2. Commands

When formatting a device for an ext2 filesystem you can use the -b option to tell the system the blocksize:

mke2fs -b 4096 <device>

To see the blocksize of a certain filesystem use:

stat -f [<device>|<mount-point>]

dumpe2fs -h <device>

5.2. A journaling filesystem example: Ext3

http://en.wikipedia.org/wiki/Ext3

10. Backup

It's worth nothing that you can store all your data on your server, when a simple crash of a disk destroys it all. So backups are needed. But there is more to back up then just backing up data so you can restore it in case of an emergency. Another scenario is that you might want to move data from your primary storage disks to tape, because you do not need them online, but can not throw the data away. So you store the data on cheaper storage, like tapes. The last backup scenario is to back up data where you want to be able to retrieve older versions of a single file. Meaning you do some version control on a document by making for example daily copies, so you can later on always go back to an earlier version.

All these different scenarios have an impact on how you do your back ups. Another important aspect is the safety of the backed up data. If for example your back up machine is next to the machine holding the data you have a copy, but when a fire breaks out it is of no use, because most likely both machines will be lost.

There are different solutions to this problem, like in the case that you do back ups to tape, you could move the tape with the back up off site, meaning to restore the data after a fire you might need to buy a new tapestreamer, but your data is safe. You could also decide to place the entire tapestreamer off site and do your back up across the network, or even across the Internet. It all depends on the amount of data, and how much security you need, and how much money you are will to spent.

Bacula

http://www.bacula.org/2.4.x-manuals/en/developers/developers.pdf

10.1 Before you make backups

The design of your backup system is dependant on your local situation. The simplest form of backup is to copy your data to another harddisk. Before you you act it is always good to think first. This section discusses some points to think about before diving into a backup design.

What might go wrong

The first thing to realize is the possible failure scenarios, and probably more importantly how to avoid them. A couple of examples:

A cable might break: make your system redundant
A fire might occur: make sure your back ups are off site, and make sure a fire is stopped as soon as possible
Data can be stolen: prevent unauthorized access to data

Of course there are always cases where you can not prevent disaster. There is, for example, no scenario to prevent user stupidity, where a user just accidentally removes some very important data that he needs now!

Recovering from disaster

If disaster strikes it is good to have a plan according to which you can recover. Important to note is that the plan should be accessable after the disaster, so an online version might not be the best solution. In this plan the following points should be addressed:

What are the most important systems to bring up and in which order should they be brought up
What and who is needed to accomplish the tasks, who should be contacted and how
Who is responsible for the plan, and who maintains it
Where are the backups?
Is the plan tested, so it will work when disaster strikes

10.2 Backup Design

Notes before you start your design:

Do regular checks to see if your backup medium (tapes) are still in shape
Do regular restore to see if your backups are good
Check regularly if you are still backing up the right pieces, or that you need to do more or less

Something about filesystems

Windows users are used to the archive bit, which can be set if a backup is needed, or unset if the backup is done. GNU/Linux and Unix in general have another scheme for this. The ext2 and ext3 file systems provide you with a atime, ctime or mtime. The atime is the time the file was last accessed (read), the ctime is the time the inode was last changed and the mtime is the time the file was last modified. The last two probably need a little bit more explanation. When the mtime changes, the ctime changes. Meaning when a file is written to both change, but when you change the rights on a file, or change the ownership, which means you only adjust the information that is stored in the inode only the ctime changes. Which means for backing up files back up software will use the ctime. (see stat and touch)

The Emergency Backup

The emergency-backup consists of a full backup followed by a couple of incrementals, after which this cycle repeats itself.

Characterization:

Original is direct online
The copy is on tape
Tapes can be reused after the cycle is complete
Limited amount of tapes

Requirements are:

data can be easily restored
data is off-site

Questions that need to be answered:

How often do you want to make a full backup?
How far off-site does the data need to be?

Archive Backup

The archive-backup consists only of a full backup of the data after which the data can be removed from the primary disks. To make sure this data is redundant it needs to be written to two tapes, which should be stored separate from one another.

Characterization:

Original is on tape
Copy of the original tape is needed
Tape is only used once
Amount of tapes grows (slow or no reuse of tapes)

Requirements:

The copy should be done two fold
A management system is needed to keep track of the data
Since this is long term storage, checks should be done once in a while
Data should be tagged for how long it needs to be stored
Provisions should be taken to make sure the tape can be read after for example 30 years. Is there still hardware to read it, is the software still available and is there an OS to run it on

Question that need to be answered:

How many time periods do you have and how long are they? (for example a place to store 5 years, 30 years, 150 years)
How often do we run the archive-backup? and what do you do in between?
How do you remove backed up data?
What kind of management system do are you going to use?
Who is maintaining the management system?
Where do you store the tapes? And under what conditions

10.4 About tapes

Tape

If you have dropped a tape cardridge, restore its content and rebackup. Since dropped tapes have a shorter lifespan.

To maximize tape life, tape cartridges should be kept in an atmosphere free of contaminating dust particles and corrosive gases or chemicals. Cartridges should always be acclimated to the operating environment prior to mounting the cartridge on the drive. A minimum of 24 hours of acclimation time is generally recommended to make sure the cartridge is at the same humidity and temperature as the drive for newly received tapes.

The National Bureau of Standards publication, Care and Handling of Computer Magnetic Storage Media, recommends that magnetic tape be stored at 65 +/- 3 degrees Fahrenheit and 40% +/- 5% Relative Humidity.

National Media Laboratory = NML

Studies by the NML indicate that magnetic media, properly cared for, should have a lifetime which equals or exceeds that of the recording technology (10 to 20 years).

Commands

tar mt st

The shoeshine effect

Take notes from: http://www.backupcentral.com/phpBB2/two-way-mirrors-of-external-mailing-lists-3/emc-networker-19/recommendations-for-new-tape-library-62149/index-15.html
http://searchstorage.techtarget.co.uk/news/column/0,294698,sid181_gci1295968,00.html
http://mailman.eng.auburn.edu/pipermail/veritas-bu/2009-April/103851.html
http://forums11.itrc.hp.com/service/forums/questionanswer.do?admit=109447626+1267114354407+28353475&threadId=1212744
http://storage.ittoolbox.com/groups/vendor-selection/storage-select/tape-autoloader-selection-1832275

LTO

See: http://en.wikipedia.org/wiki/Linear_Tape-Open

10.5 About optical disks

common optical media formats have a storage life of 30 years or more

Store discs in a cool, dry environment away from direct light. Discs stored between 23 degrees F (-5 degrees C) and 86 degrees F (30 degrees C) can last up to 100 years

Do not leave the disc in direct sunlight or in a hot, humid environment--like your car on a summer day--as these conditions could warp and damage the disc

Do not allow moisture to condense on the disc.

Appendix A harddisks

A.1. IDE

hdparm -i gives you more information about the hardware, like disk manufacturer, serial number and disk geometry.

A.1.1 SATA

Serial ATA is ATA over serial lines. SATA uses smaller cables, then parallel ATA, which leaves more room and thus better cooling in the computer housing. SATA also does not use the master/slave setup anymore and is hotplugable.

A.2. SCSI

A.2.1. SAS

Serial-attached SCSI has thinner cables, less bulky connectors and allows for longer cables. The hardware is cheaper and less prone to crosstalk.

Appendix D Fibre Channel

FCoE

http://www.open-fcoe.org/ http://en.wikipedia.org/wiki/Fibre_Channel_over_Ethernet http://www.phfactor.net/fc/ http://www.linuxjournal.com/article/4499

Appendix X References

http://linuxvfs.googlepages.com/linuxvfs.html
http://www.coda.cs.cmu.edu/doc/talks/linuxvfs/sld001.htm
http://www.linuxhq.com/guides/TLK/fs/filesystem.html
http://www.howtoforge.com/linux_lvm
http://www.drbd.org/
http://www.openfiler.com/
http://tldp.org/LDP/sag/html/disk-usage.html
http://en.wikipedia.org/wiki/Serial_ATA
http://cool.conservation-us.org/bytopic/electronic-records/electronic-storage-media/bogart.html
http://findarticles.com/p/articles/mi_m0BRZ/is_8_23/ai_109665179/?tag=content;col1