Network file system: Safe and efficient data storage

With the evolution of the Internet and services it provides, the need of sharing and distributing of data appeared. Obviously, a local filesystem doesn’t fit this need, that’s why a number of different network file systems appeared.

A network file system is a kind of a network abstraction over a file system. It provides remote access to the client over a network in a way a local file system does on a computer. In other words, it acts as a client for a remote file access protocol, providing access to files on a server. Programs using local interfaces can transparently create, manage and access hierarchical directories and files located on remote network-connected computers.

Here are some types of network file systems in terms of data sharing.

Distributed file system (DFS)

Distributed file system (DFS) means that system components run on multiple servers and are organized in a global directory in such a way that remote data access is not location-specific but is identical for any client. Technically, DFS is a file system that can be accessed anywhere on a network. All users have the access to the global file system where data is organized in hierarchical order and is directory-based. It provides scalability and protection against failure of one of the servers.

DFS was originated to ensure the fact that the processes of locating files, transporting data, modifying files remain clearly organized to client programs.

Today it is the most common way of storing data. Among distributed file systems are GlusterFS, Lustre, and many others.

DFS Design Goals:

Network transparency
Location transparency
Location independence
User mobility
Fault tolerance
Scalability
File mobility

Clustered file system (CFS)

Clustered file system (CFS) allows concurrent access to the data stored on a shared block device mounted on multiple computers. In other words, CFS was designed so that a certain number of computers could work with the same files at the same time. The most common usage scenario for them would be block device located on SAN and mounted via ISCSI on several cluster nodes. Clustered file system takes care of metadata distribution and concurrency locks. Well-known examples of clustered file systems are GFS/GFS2/OCFS2.

CFS Design Goals:

Getting rid of 3rd party file sync software like lsync and delays caused by it
Concurrency control
Usage simplicity

Network File System (NFS)

In 1985 Sun Microsystems created Network File System (NFS) which became the first widely-used Internet Protocol-based network file system. NFS allows to access files on a client device over a computer network and at the same time to boost scalability by adding support for parallel access across distributed clients. It works by a combination of kernel functionality on the client (uses the remote file system) and an NFS service on the server (provides the file data). Such access to files is transparent on the client side and works across a variety of servers and host architectures.

Сentral management resides among the major NFS advantages. Working with a centrally managed server helps to decrease the workload for the system administrator in terms of back-ups, adding software that will be shared, and computer repair. NFS is available for every Linux distribution worldwide and it can be installed from either the command line or the distribution’s package manager.

NFS Design Goals:

Not restricted to Unix
Hardware-independent protocol
Simple recovery
Application transparency
UNIX semantics for UNIX clients
Good performance
Transport-independent

File Systems Overview

01.

GlusterFS

GlusterFS is a scalable distributed file system originally released by Gluster, but in 2012 acquired by Red Hat. It is an open source product which helps you to create large, distributed file storage solutions for media streaming, data analysis, etc., simply using standard hardware. GlusterFS allows systems operating on the file system level to ensure that the data is copied to another location no matter when it is written to the disk, unlike other software and databases that only give the possibility to spread data out in the context of a single application.

Different kinds of storage configurations are possible with GlusterFS. Many of those configurations are functionally similar to RAID levels, such as striping data across different nodes in the cluster, or implementation of redundancy for better data availability.

02.

GFS

GFS (Google File System) is a proprietary distributed file system which purpose is to provide efficient, reliable access to data using large clusters of commodity hardware. GFS is designed to run on clusters of computers. GFS allows to organize and manipulate huge files giving the possibility to developers to research and develop resources they require.

Scalability is a top priority for this FS so its performance won’t suffer as it grows.

While designing GFS, an easy control of the system was also considered. That is why basic file commands (as open, create, read, write and close files) were implemented along with specialized commands such as append and snapshot.

03.

GFS2

GFS2 (Global File System 2) is a clustered file system powered by Red Hat. GFS2 differs from distributed file systems such as GlusterFS because it allows all nodes to have direct parallel access to the same shared block storage. In addition, GFS2 can also be used as a local file system.

GFS2 is a journaled file system. Journaling is designed to prevent data loss and data corruption from crashes and unexpected power cut. When your system is halfway from writing a file to the disk and it unexpectedly loses power, a journal identifies whether the file was completely written to disk or not. Linux can check the file system’s journal when it re-starts and resumes any partially completed job.

04.

Ceph

Also it worth to mention Ceph, a scale-out storage platform that is “designed for excellent performance, reliability, and scalability.” It is more complex technology than just a simple FS, since it provides distributed low-level storage, on top of which different ways to access data are possible:

Object storage with S3-compatible API
Block based storage
Ceph file system

In terms of object storage, Ceph’s libraries provide client applications with direct access to the RADOS object-based storage system. It is prefered for large scale storage systems, since it guarantees storing data more efficiently. Object-based storage systems separate the object namespace from the underlying storage hardware which simplifies data migration.

In terms of block based storage, when you use a block device while writing data to Ceph, it automatically stripes and replicates the data across the cluster.

In terms of file system, Ceph provides a traditional file system interface with POSIX semantics. Object storage systems are a significant innovation, but they complement rather than replace traditional file systems.

The most remarkable Ceph’s feature is that it does not rely on a central metadata service to locate data, but uses CRUSH algorithm to calculate the location of data. It replicates and re-balance data within the cluster dynamically – eliminating this tedious task for administrators, while delivering high-performance and infinite scalability.

Benefits of Ceph:

Stronger data safety for mission-critical applications.
Virtually unlimited storage to file systems.
No integration or customization required as applications that use file systems can use Ceph FS with POSIX semantics.
Automatically balancing of the file system to deliver maximum performance.
Integration with OpenStack. Ceph provides a reliable storage backend for OpenStack. The Ceph block storage has capabilities like thin provisioning, snapshot, cloning, which helps to spin up VM’s quickly and makes backing up and cloning of VM’s easy.
Disaster recovery.

Do not mix up!

DRBD (Distributed Replicated Block Device) a distributed replicated storage system for the Linux platform. It is not a file system, but rather RAID1-over-the-network block device. The DRBD’s purpose is the formation of a fault-tolerant cluster environment on Linux.

The software was designed with Linux security standard in mind that at the same time offers excellent reliability with little expenses. DRBD usually goes with all common flavors of Linux for synchronous replication of stored data between a passive system and an active system (there is also a possibility to read/write data on both systems simultaneously using one of clustered FS we described above). DRBD supports resource-level fencing as well. On 8 December 2009 DRBD became a part of the official Linux kernel.

Highly Available NFS with DRBD

Having single NFS server is not good for high availability since it becomes a single point of failure. But DRBD could be a solution here, it allows to replicate the actual block device on which NFS is hosted. With some significant configuration implied, it gives the possibility to successfully failover an NFS mount without stale file handles on the client side. Such configuration is difficult to get correct but it is one of the few means to achieve HA for NFS.

Setting up a Highly Available NFS using DRBD usually goes in conjunction with such software as Pacemaker or Heartbeat.

Heartbeat/Pacemaker and DRBD can be used effectively to maintain high availability. They are network-oriented tools for maintaining high availability and managing failover. Scenario example: We have 2 nodes with DRBD in active/passive mode, and Heartbeat/Pacemaker setup to mount DRBD on active node, start NFS server and add a failover IP. In this case the first node goes down, the second one notices this and starts migration process, which includes switching DRBD resource to active mode, mounting it and bringing up all needed services.

File systems differ in terms of their own ways of organizing their data. They also can differ in terms of features such as speed, security, drives support with large/small storage capacities. Some file systems are more robust and resistant to file corruption, while others sacrifice that robustness for additional speed. There isn’t only one correct choice.

Network File Systems