Scale and Performance in a Distributed File System

Howard, Kazar, Menees, Nichols, Satyanarayanan, Sidebotham, West (1988)

Goal: compare local and remote execution times to understand the impact of scale and distribution, i.e. "to quantify the performance penalty due to remote access."
Dataset: about 70 source files totaling about 200 KB.
Five Phases:
1. MakeDir: Construct a target subtree.
2. Copy: Copy each file into target subtree.
3. ScanDir: Traverse hierarchy, obtaining stat information.
4. ReadAll: Read every byte.
5. Make: Compile and link the application.
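
The five phases above can be sketched as a small timing harness. This is only an illustrative reconstruction, not the original benchmark: the function name run_phases, the make_step callback (a stand-in for the compile-and-link phase), and the 64 KB read size are all assumptions made for the sketch.

```python
import os
import shutil
import time


def run_phases(source_dir, target_dir, make_step=None):
    """Run five benchmark-style phases; return elapsed seconds per phase."""
    timings = {}

    def timed(name, fn):
        start = time.perf_counter()
        fn()
        timings[name] = time.perf_counter() - start

    def target_path(root, *names):
        rel = os.path.relpath(root, source_dir)
        return os.path.normpath(os.path.join(target_dir, rel, *names))

    def make_dir():
        # Phase 1: recreate the source directory skeleton under the target.
        for root, _dirs, _files in os.walk(source_dir):
            os.makedirs(target_path(root), exist_ok=True)

    def copy():
        # Phase 2: copy every file into the target subtree.
        for root, _dirs, files in os.walk(source_dir):
            for name in files:
                shutil.copyfile(os.path.join(root, name), target_path(root, name))

    def scan_dir():
        # Phase 3: stat every entry without reading file contents.
        for root, dirs, files in os.walk(target_dir):
            for name in dirs + files:
                os.stat(os.path.join(root, name))

    def read_all():
        # Phase 4: read every byte of every file.
        for root, _dirs, files in os.walk(target_dir):
            for name in files:
                with open(os.path.join(root, name), "rb") as f:
                    while f.read(65536):
                        pass

    def make():
        # Phase 5: hypothetical hook; the caller supplies the real build step.
        if make_step is not None:
            make_step(target_dir)

    timed("MakeDir", make_dir)
    timed("Copy", copy)
    timed("ScanDir", scan_dir)
    timed("ReadAll", read_all)
    timed("Make", make)
    return timings
```

Running run_phases once with target_dir on a local disk and once with target_dir on a remote file system would give a local versus remote comparison of the kind the notes describe. The target must not lie inside source_dir.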
Benchmark Results:
- Benchmark took about 70% longer on the shared tree than on a local tree.
- TestAuth time rose sharply beyond a load of about 5 load units, indicating server saturation.
- Server CPU utilization peaked above 75%.
Conclusion: the overall architecture is sound, but the implementation needs work.
The benchmark results motivate the VICE-I to VICE-II redesign.

NFS is a remote-open system (i.e. not whole-file caching).
Run the Andrew benchmark on both AFS and NFS.
Applications handle NFS time-outs improperly, which results in errors.
The reported results show AFS outperforming NFS except at very low load.
The authors claim superior scalability for Andrew.
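
The difference between remote-open and whole-file caching can be illustrated with a toy model that counts server requests. This is a deliberately simplified sketch, not a model of real NFS or AFS: all class names are hypothetical, and it ignores client-side block caching, cache validation, and writes.

```python
class ToyServer:
    """Holds file contents and counts every request it serves."""

    def __init__(self, files):
        self.files = dict(files)
        self.requests = 0

    def fetch_whole(self, name):
        self.requests += 1
        return self.files[name]

    def read_block(self, name, offset, size):
        self.requests += 1
        return self.files[name][offset:offset + size]


class WholeFileClient:
    """Fetches the entire file on first use, then reads from the local copy."""

    def __init__(self, server):
        self.server = server
        self.cache = {}

    def read_file(self, name):
        if name not in self.cache:
            self.cache[name] = self.server.fetch_whole(name)
        return self.cache[name]


class RemoteOpenClient:
    """Asks the server for every block on every read (assumption: no caching)."""

    def __init__(self, server):
        self.server = server

    def read_file(self, name, block_size):
        chunks = []
        offset = 0
        while True:
            chunk = self.server.read_block(name, offset, block_size)
            if not chunk:  # an empty block marks end of file
                break
            chunks.append(chunk)
            offset += len(chunk)
        return b"".join(chunks)
```

In this model, reading the same file twice costs the whole-file client one server request in total, while the remote-open client pays for every block on every read. That gap is the intuition behind the server-load difference, under the stated simplifications.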

Volumes: small groupings of files.
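Volumes are mapped to users, typically one volume per user.
Multiple volumes reside on one disk partition.
Volumes can be moved between servers; clients find the new location through an update to the volume location database.
A move creates a clone, ships the clone, and repeats until there are no more updates.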
Quotas enforced per volume.
Backups handled via clones.
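
The volume mechanisms above (per-volume quotas, clones, and moving by repeated cloning) can be sketched as follows. This is a minimal in-memory illustration under stated assumptions, not the Andrew implementation: the names Volume, QuotaExceeded, move_volume, and the during_ship callback are hypothetical, a plain dict stands in for the volume location database, and a dict copy stands in for a copy-on-write clone.

```python
class QuotaExceeded(Exception):
    pass


class Volume:
    def __init__(self, name, quota_bytes, files=None):
        self.name = name
        self.quota_bytes = quota_bytes
        self.files = dict(files or {})
        self.version = 0  # bumped on every successful update

    def used_bytes(self):
        return sum(len(data) for data in self.files.values())

    def write(self, path, data):
        # Quota is checked against the whole volume, not per file.
        new_total = self.used_bytes() - len(self.files.get(path, b"")) + len(data)
        if new_total > self.quota_bytes:
            raise QuotaExceeded(f"{self.name}: {new_total} > {self.quota_bytes}")
        self.files[path] = data
        self.version += 1

    def clone(self):
        # Frozen snapshot; a real system would use copy-on-write instead of a copy.
        return self.version, dict(self.files)


def move_volume(volume, location_db, new_server, during_ship=None):
    """Clone and ship repeatedly until no update arrives while shipping."""
    rounds = 0
    while True:
        snap_version, snap_files = volume.clone()
        rounds += 1
        if during_ship is not None:
            # Hypothetical hook simulating updates that arrive during shipment.
            during_ship(volume, rounds)
        if volume.version == snap_version:
            break  # the shipped clone is up to date
    moved = Volume(volume.name, volume.quota_bytes, snap_files)
    moved.version = snap_version
    location_db[volume.name] = new_server  # record the new location
    return moved, rounds
```

The same clone call could serve as the basis for a backup, since a frozen snapshot can be dumped while the original volume keeps accepting updates. The sketch ships a full clone each round for simplicity.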