0% found this document useful (0 votes)

57 views

Design and Implementation of The Sun Network Filesystem: R. Sandberg, D. Goldberg S. Kleinman, D. Walsh, R. Lyon

NFS was the first commercially successful network file system developed by Sun Microsystems in the 1980s. It was designed to be robust, maintain UNIX semantics transparently across a network, and provide adequate performance. The key aspects of NFS included its stateless protocol design for easy crash recovery, use of file handles and lookups to access remote files, and caching at the client for performance with consistency protocols to maintain semantics. While NFS had limitations, it succeeded due to its robustness, reasonable efficiency through tuning, and ability to evolve over time.

Uploaded by

michael.ferraris

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

57 views

Design and Implementation of The Sun Network Filesystem: R. Sandberg, D. Goldberg S. Kleinman, D. Walsh, R. Lyon

Uploaded by

michael.ferraris

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 34

DESIGN AND IMPLEMENTATION

OF THE
SUN NETWORK FILESYSTEM

R. Sandberg, D. Goldberg
S. Kleinman, D. Walsh, R. Lyon
Sun Microsystems
What is NFS?
• First commercially successful network file system:
– Developed by Sun Microsystems for their
diskless workstations
– Designed for robustness and “adequate
performance”
– Sun published all protocol specifications
– Many many implementations
Paper highlights
• NFS is stateless
– All client requests must be self-contained
• The virtual filesystem interface
– VFS operations
– VNODE operations
• Performance issues
– Impact of tuning on NFS performance
Objectives (I)
• Machine and Operating System Independence
– Could be implemented on low-end machines of the mid-
80’s
• Fast Crash Recovery
– Major reason behind stateless design
• Transparent Access
– Remote files should be accessed in exactly the same
way as local files
Objectives (II)
• UNIX semantics should be maintained on client
– Best way to achieve transparent access
• “Reasonable” performance
– Robustness and preservation of UNIX
semantics were much more important
• Contrast with Sprite and Coda
Basic design
• Three important parts
– The protocol
– The server side
– The client side
The protocol (I)
• Uses the Sun RPC mechanism and Sun eXternal
Data Representation (XDR) standard
• Defined as a set of remote procedures
• Protocol is stateless
– Each procedure call contains all the
information necessary to complete the call
– Server maintains no “between call” information
Advantages of statelessness
• Crash recovery is very easy:
– When a server crashes, client just resends
request until it gets an answer from the
rebooted server
– Client cannot tell difference between a server
that has crashed and recovered and a slow
server
• Client can always repeat any request
Consequences of statelessness
• Read and writes must specify their start offset
– Server does not keep track of current position in
the file
– User still use conventional UNIX reads and writes
• Open system call translates into several
lookup calls to server
• No NFS equivalent to UNIX close system call
The lookup call (I)
• Returns a file handle instead of a file descriptor
– File handle specifies unique location of file
• lookup(dirfh, name) returns (fh, attr)
– Returns file handle fh and attributes of named
file in directory dirfh
– Fails if client has no right to access directory
dirfh
The lookup call (II)
– One single open call such as
fd = open(“/usr/joe/6360/list.txt”)
will be result in several calls to lookup
lookup(rootfh, “usr”) returns (fh0, attr)
lookup(fh0, “joe”) returns (fh1, attr)
lookup(fh1, “6360”) returns (fh2, attr)
lookup(fh2, “list.txt”) returns (fh, attr)
The lookup call (III)
• Why all these steps?
– Any of components of /usr/joe/6360/list.txt
could be a mount point
– Mount points are client dependent and mount
information is kept above the lookup() level
Server side (I)
• Server implements a write-through policy
– Required by statelessness
– Any blocks modified by a write request
(including i-nodes and indirect blocks) must
be written back to disk before the call
completes
Server side (II)
• File handle consists of
– Filesystem id identifying disk partition
– I-node number identifying file within partition
– Generation number changed every time
i-node is reused to store a new file
• Server will store
– Filesystem id in filesystem superblock
– I-node generation number in i-node
Client side (I)
• Provides transparent interface to NFS
• Mapping between remote file names and remote
file addresses is done a server boot time through
remote mount
– Extension of UNIX mounts
– Specified in a mount table
– Makes a remote subtree appear part of a local
subtree
Remote mount
Client tree
/
Server subtree

usr
rmount
bin

After rmount, root of server subtree

can be accessed as /usr
Client side (II)
• Provides transparent access to
– NFS
– Other file systems (including UNIX FFS)
• New virtual filesystem interface supports
– VFS calls, which operate on whole file system
– VNODE calls, which operate on individual files
• Treats all files in the same fashion
Client side (III)
User interface
UNIX system calls is unchanged

VNODE/VFS Common
interface

Other FS NFS UNIX FS

RPC/XDR disk

LAN
File consistency issues
• Cannot build an efficient network file system
without client caching
– Cannot send each and every read or write to
the server
• Client caching introduces consistency issues
Example
• Consider a one-block file X that is concurrently
modified by two workstations
• If file is cached at both workstations
– A will not see changes made by B
– B will not see changes made by A
• We will have
– Inconsistent updates
– Non respect of UNIX semantics
Example

A B
Server

x’ x’’ x

Inconsistent updates
UNIX file access semantics (I)
• Conventional timeshared UNIX semantics
guarantee that
– All writes are executed in strict sequential
fashion
– Their effect is immediately visible to all other
processes accessing the file
• Interleaving of writes coming from different
processes is left to the kernel discretion
UNIX file access semantics (II)
• UNIX file access semantics result from the use
of a single I/O buffer containing all cached
blocks and i-nodes
• Server caching is not a problem
• Disabling client caching is not an option:
– Would be too slow
– Would overload the file server
NFS solution (I)
• Stateless server does not know how many users
are accessing a given file
– Clients do not know either
• Clients must
– Frequently send their modified blocks to the
server
– Frequently ask the server to revalidate the
blocks they have in their cache
NFS solution (II)

A B
? ?
Server

x’ x

Better to propagate my updates

and refresh my cache
Implementation
• VNODE interface only made the kernel 2%
slower
• Few of the UNIX FS were modified
• MOUNT was first included into the NFS protocol
– Later broken into a separate user-level RPC
process
Hard issues (I)
• NFS root file systems cannot be shared:
– Too many problems
• Clients can mount any remote subtree any way they
want:
– Could have different names for same subtree by
mounting it in different places
– NFS uses a set of basic mounted filesystems on
each machine and let users do the rest
Hard issues (II)
• NFS passes user id, group id and groups on
each call
– Requires same mapping from user id and
group id to user on all machines
– Achieved by Yellow Pages (YP) service
• NFS has no file locking
Hard issues (III)
• UNIX allows removal of opened files
– File becomes nameless
– Processes that have the file opened can continue
to access the file
– Other processes cannot
• NFS cannot do that and remain stateless
– NFS client detecting removal of an opened file
renames it and deletes renamed file at close time
Hard issues (IV)
• In general, NFS tries to preserve UNIX open
file semantics but does not always succeed
– If an opened file is removed by a process on
another client, file is immediately deleted
Tuning (I)
• First version of NFS was much slower than Sun
Network Disk (ND)
• First improvement
– Added client buffer cache
– Increased the size of UDP packets from 2048 to
9000 bytes
• Next improvement reduced the amount of buffer to
buffer copying in NFS and RPC (bcopy)
Tuning (II)
• Third improvement introduced a client-side
attribute cache
– Cache is updated every time new attributes
arrive from the server
– Cached attributes are discarded after
• 3 seconds for file attributes
• 30 seconds for directory attributes
• These three improvements cut benchmark run
time by 50%
Tuning (III)

These three improvements

had the biggest impact on
NFS performance
My conclusion
• NFS succeeded because it was
– Robust
– Reasonably efficient
– Tuned to the needs of diskless workstations

In addition, NFS was able to evolve and

incorporate concepts such as close-to-open
consistency (see next paper)

NFS
No ratings yet
NFS
27 pages
Sun NFS Overview: Network File System (NFS) Is A Protocol Originally Developed by
No ratings yet
Sun NFS Overview: Network File System (NFS) Is A Protocol Originally Developed by
4 pages
Networked File System: CS 537 - Introduction To Operating Systems
No ratings yet
Networked File System: CS 537 - Introduction To Operating Systems
23 pages
Network File System
No ratings yet
Network File System
30 pages
Presentation ON Distributed File System: Institute of Engineering and Technology Bundelkhand University
No ratings yet
Presentation ON Distributed File System: Institute of Engineering and Technology Bundelkhand University
51 pages
File Systems 2
No ratings yet
File Systems 2
43 pages
Network File System (NFS)
No ratings yet
Network File System (NFS)
31 pages
03 Nfs PDF
No ratings yet
03 Nfs PDF
48 pages
Distributed File Systems
No ratings yet
Distributed File Systems
11 pages
Sun Network File System
No ratings yet
Sun Network File System
6 pages
Distributed File Systems
No ratings yet
Distributed File Systems
56 pages
Case Study On Network File System
No ratings yet
Case Study On Network File System
8 pages
Case Study On Network File System
No ratings yet
Case Study On Network File System
8 pages
Distributed File Systems: Arvind Krishnamurthy Spring 2001
No ratings yet
Distributed File Systems: Arvind Krishnamurthy Spring 2001
3 pages
Operating Systems: Vfs/Nfs
No ratings yet
Operating Systems: Vfs/Nfs
17 pages
Lecture 25: Distributed File Systems: Indranil Gupta (Indy)
No ratings yet
Lecture 25: Distributed File Systems: Indranil Gupta (Indy)
27 pages
Why NFS Sucks
No ratings yet
Why NFS Sucks
16 pages
PHPR OZ12 B
No ratings yet
PHPR OZ12 B
31 pages
Distributed File Systems
No ratings yet
Distributed File Systems
31 pages
3Distributed File System
No ratings yet
3Distributed File System
42 pages
Other File Systems: LFS, NFS, and Afs
No ratings yet
Other File Systems: LFS, NFS, and Afs
37 pages
Linux UNIT III
No ratings yet
Linux UNIT III
29 pages
Distributed File Systems in Unix
No ratings yet
Distributed File Systems in Unix
6 pages
Andrew - Cmu.edu: Let's Start With A Familiar Example: Andrew 10,000s of People Terabytes of Disk
No ratings yet
Andrew - Cmu.edu: Let's Start With A Familiar Example: Andrew 10,000s of People Terabytes of Disk
7 pages
The-Linux-File-System
No ratings yet
The-Linux-File-System
4 pages
Linux NFS
100% (1)
Linux NFS
11 pages
04 en Network File Systems
No ratings yet
04 en Network File Systems
57 pages
CS2510_00_Distributed_Storage_Overview
No ratings yet
CS2510_00_Distributed_Storage_Overview
53 pages
Kleiman 86 V Nodes
No ratings yet
Kleiman 86 V Nodes
10 pages
Distributed File Systems
No ratings yet
Distributed File Systems
38 pages
02 NFS
No ratings yet
02 NFS
9 pages
Distributed File Systems
No ratings yet
Distributed File Systems
28 pages
nfs
No ratings yet
nfs
39 pages
NFS Server Conf in Security Patch
No ratings yet
NFS Server Conf in Security Patch
18 pages
Issues in Distributed File Systems
No ratings yet
Issues in Distributed File Systems
10 pages
Distributed File System
100% (1)
Distributed File System
17 pages
Itro To NFS
No ratings yet
Itro To NFS
39 pages
06 dfs2
No ratings yet
06 dfs2
50 pages
10 Distributed File Systems
No ratings yet
10 Distributed File Systems
27 pages
Protocol Components: Remote Procedure Call (RPC) Protocol: Rpcbind
No ratings yet
Protocol Components: Remote Procedure Call (RPC) Protocol: Rpcbind
25 pages
chap6
No ratings yet
chap6
54 pages
A4 Presentation3
No ratings yet
A4 Presentation3
17 pages
Usc Csci555 f12 Part2
No ratings yet
Usc Csci555 f12 Part2
222 pages
L6 DFS
No ratings yet
L6 DFS
27 pages
Unit 2. Sun Network File System.
No ratings yet
Unit 2. Sun Network File System.
1 page
Sun's Network File System (NFS) : Client0 - Client1 - / - / - Network - Server+disks / - Client2 - / - Client3
No ratings yet
Sun's Network File System (NFS) : Client0 - Client1 - / - / - Network - Server+disks / - Client2 - / - Client3
22 pages
Network File System (NFS) : Chandan Padalkar
No ratings yet
Network File System (NFS) : Chandan Padalkar
16 pages
Requirements For Distributed File Systems
No ratings yet
Requirements For Distributed File Systems
4 pages
Distributed Systems U4
No ratings yet
Distributed Systems U4
8 pages
Distributed-File Systems Background
No ratings yet
Distributed-File Systems Background
9 pages
Lec 14 File Systems 4
No ratings yet
Lec 14 File Systems 4
27 pages
18-Distributed File Systems Study On Operating Systems
No ratings yet
18-Distributed File Systems Study On Operating Systems
24 pages
Week5 Dfs
No ratings yet
Week5 Dfs
13 pages
DFS Design and Implementation
No ratings yet
DFS Design and Implementation
40 pages
DFS Design and Implementation: Brent R. Hafner
No ratings yet
DFS Design and Implementation: Brent R. Hafner
40 pages
Distributed File System Implementation
100% (1)
Distributed File System Implementation
30 pages
ds_2016_17_lec17
No ratings yet
ds_2016_17_lec17
32 pages
The Google File System: Kenneth Chiu
No ratings yet
The Google File System: Kenneth Chiu
40 pages
Lec24 Distfiles
No ratings yet
Lec24 Distfiles
26 pages
Linux for Beginners: Linux Command Line, Linux Programming and Linux Operating System
From Everand
Linux for Beginners: Linux Command Line, Linux Programming and Linux Operating System
Steve Will
4.5/5 (3)
114 Handy Formulae For Quantitative Aptitude Problems
91% (69)
114 Handy Formulae For Quantitative Aptitude Problems
12 pages
Technical Aptitude Test
No ratings yet
Technical Aptitude Test
175 pages
1) What Is The Most Common Issue? 2) Who Is The President of The US?
No ratings yet
1) What Is The Most Common Issue? 2) Who Is The President of The US?
1 page
Mobile Computing
100% (1)
Mobile Computing
5 pages
Final Pub110
No ratings yet
Final Pub110
30 pages
System Software Lab
100% (2)
System Software Lab
49 pages
Three Dimensional Concepts
No ratings yet
Three Dimensional Concepts
52 pages
Java Database Connectivity
No ratings yet
Java Database Connectivity
4 pages
T8-Basic Event Handling, Mapping Modes
No ratings yet
T8-Basic Event Handling, Mapping Modes
35 pages
T7-The Visual C++ Components
100% (7)
T7-The Visual C++ Components
13 pages
Object File Network Binary
No ratings yet
Object File Network Binary
6 pages
Introduction To Visal Programming
No ratings yet
Introduction To Visal Programming
24 pages
Graphics & Multimedia
No ratings yet
Graphics & Multimedia
42 pages
Mapping Modes
No ratings yet
Mapping Modes
50 pages
Diaolog Boxes and Controls: M.Sushmera Lecturer in IT Department
No ratings yet
Diaolog Boxes and Controls: M.Sushmera Lecturer in IT Department
14 pages
Diaolog Boxes and Controls: M.Sushmera Lecturer in IT Department
No ratings yet
Diaolog Boxes and Controls: M.Sushmera Lecturer in IT Department
14 pages
Windows and Message Processing
No ratings yet
Windows and Message Processing
33 pages
A Rectangle With Zooming Facility: Project Name: Zoom
No ratings yet
A Rectangle With Zooming Facility: Project Name: Zoom
5 pages
Project Name: Mdi in Mdiview.H
No ratings yet
Project Name: Mdi in Mdiview.H
6 pages
Project Name: Multithread in Multithreadview - CPP
No ratings yet
Project Name: Multithread in Multithreadview - CPP
7 pages
GDI and Device Context
No ratings yet
GDI and Device Context
17 pages
Activex - Own
No ratings yet
Activex - Own
6 pages
Isapi Server Extensions
No ratings yet
Isapi Server Extensions
13 pages
Database ODBC Without File Support
No ratings yet
Database ODBC Without File Support
3 pages
PROJECT NAME: Dllformula in Dllformula - CPP
No ratings yet
PROJECT NAME: Dllformula in Dllformula - CPP
5 pages
Database ODBC With File Support
No ratings yet
Database ODBC With File Support
7 pages
Program: Class Name: CDLG - CPP Project Name: Toolbar in View.H
No ratings yet
Program: Class Name: CDLG - CPP Project Name: Toolbar in View.H
6 pages
Program: Class Name: CDLG - CPP Project Name: Statusbar in Mainfrm.H
No ratings yet
Program: Class Name: CDLG - CPP Project Name: Statusbar in Mainfrm.H
7 pages
F C S M: in Doc.H: Project Name: Menubar
No ratings yet
F C S M: in Doc.H: Project Name: Menubar
9 pages
Challenges of Computerization
No ratings yet
Challenges of Computerization
2 pages
Distributed DBMS Architectures
No ratings yet
Distributed DBMS Architectures
9 pages
Quiz
No ratings yet
Quiz
10 pages
An IoT-Based School Bus and Vehicle Tracking System Using RFID Technology and Mobile Data Networks
No ratings yet
An IoT-Based School Bus and Vehicle Tracking System Using RFID Technology and Mobile Data Networks
11 pages
Chapter 26: Network Security
No ratings yet
Chapter 26: Network Security
70 pages
Data Communication
100% (2)
Data Communication
114 pages
Medium Enterprise Design Profile (MEDP) - LAN Design
No ratings yet
Medium Enterprise Design Profile (MEDP) - LAN Design
87 pages
UserGuideGP2v2 (Spanish)
No ratings yet
UserGuideGP2v2 (Spanish)
63 pages
Internet Users in France
No ratings yet
Internet Users in France
16 pages
De La Philosophie Gourinat Tome 1
0% (1)
De La Philosophie Gourinat Tome 1
3 pages
Presentation - Html5 and CSS3
No ratings yet
Presentation - Html5 and CSS3
33 pages
UGRD IT6223B DATA COMMUNICATIONS and NETWORK2
No ratings yet
UGRD IT6223B DATA COMMUNICATIONS and NETWORK2
8 pages
Linux Cheat Sheet EN
No ratings yet
Linux Cheat Sheet EN
3 pages
Aedp 06c1
No ratings yet
Aedp 06c1
141 pages
Sat Lite Tech 103 Accessories rc3000 Antenna Controller
No ratings yet
Sat Lite Tech 103 Accessories rc3000 Antenna Controller
1 page
Annex A Technical Standards For Outside Plant (OSP) Installations
No ratings yet
Annex A Technical Standards For Outside Plant (OSP) Installations
11 pages
232-001765-00 Rev A SonicOS 5.5 Data Forensics With Solera
No ratings yet
232-001765-00 Rev A SonicOS 5.5 Data Forensics With Solera
4 pages
RER620 Product Guide 1MAC301920-PG Rev D
No ratings yet
RER620 Product Guide 1MAC301920-PG Rev D
44 pages
Skripta - Mehanika 2
No ratings yet
Skripta - Mehanika 2
18 pages
The Latest Open Source Software Available and The Latest Development in ICT
No ratings yet
The Latest Open Source Software Available and The Latest Development in ICT
7 pages
Niraja AWSDevOps
No ratings yet
Niraja AWSDevOps
6 pages
FAQ For Huawei 3G USB Modem and Asterisk
No ratings yet
FAQ For Huawei 3G USB Modem and Asterisk
23 pages
Security Hardening Checklist Guide For Cisco Routers - Switches in 10 Steps
No ratings yet
Security Hardening Checklist Guide For Cisco Routers - Switches in 10 Steps
14 pages
Networking With Cisco CCNA 200 301 Course Content
No ratings yet
Networking With Cisco CCNA 200 301 Course Content
8 pages
Claroty Sample Risk Assessment Report
100% (1)
Claroty Sample Risk Assessment Report
29 pages
LT-1023 Device Compatiblity Guide - Mircom Addressable Panels Rev 15 Oct 2021
No ratings yet
LT-1023 Device Compatiblity Guide - Mircom Addressable Panels Rev 15 Oct 2021
69 pages
Deep Security As A Service Best Practice Guide
No ratings yet
Deep Security As A Service Best Practice Guide
52 pages
MPC LED Indication PDF
No ratings yet
MPC LED Indication PDF
2 pages
How To Download Articles From Scribd
100% (1)
How To Download Articles From Scribd
7 pages
ABRITES Commander For Nissan Manual
No ratings yet
ABRITES Commander For Nissan Manual
19 pages

Uploaded by

Uploaded by

DESIGN AND IMPLEMENTATION

After rmount, root of server subtree

Other FS NFS UNIX FS

Better to propagate my updates

These three improvements

In addition, NFS was able to evolve and

You might also like