Get rid of DEBUG macro, supporting code, and documentation,

[pintos-anon] / doc / filesys.texi
diff --git a/doc/filesys.texi b/doc/filesys.texi

index 36c21a39748c8e61be73094318b5991a7e74f293..6ebc4d9707e1cb4da43a44058beb2d9fdd353974 100644 (file)
--- a/doc/filesys.texi
+++ b/doc/filesys.texi
@@ -21,17 +21,15 @@ parts work together so that you can run VM and your filesystem at the
  same time.  Plus, keeping VM is a great way to stress-test your
  filesystem implementation.
  
-Your submission should define @code{THREAD_JOIN_IMPLEMENTED} in
-@file{constants.h} (@pxref{Conditional Compilation}).
-
  @menu
  * File System New Code::        
-* Problem 4-1 Large Files::     
+* File System Synchronization::  
+* Problem 4-1 Indexed Files::     
  * Problem 4-2 File Growth::     
  * Problem 4-3 Subdirectories::  
  * Problem 4-4 Buffer Cache::    
  * File System Design Document Requirements::  
-* File System FAQ::            
+* File System FAQ::             
  @end menu
  
  @node File System New Code
@@ -91,7 +89,7 @@ prepared for interactions with all previous parts (as usual).
  
  The provided file system requires external synchronization, that is,
  callers must ensure that only one thread can be running in the file
-system code at once.  Your submission should use a more finely granular
+system code at once.  Your submission should use a finer-grained
  synchronization strategy.  You will need to consider synchronization
  issues for each type of file system object.  The provided code uses the
  following strategies:
@@ -123,19 +121,22 @@ file system doesn't have to maintain full serialization as long as it
  follows the rules above.
  @end itemize
  
-@node Problem 4-1 Large Files
-@section Problem 4-1: Large Files
+@node Problem 4-1 Indexed Files
+@section Problem 4-1: Indexed Files
+
+The basic file system allocates files as a single extent, making it
+vulnerable to external fragmentation.  Eliminate this problem by
+modifying the inode structure.  In practice, this probably means using
+an index structure with direct, indirect, and doubly indirect blocks.
+(You are welcome to choose a different scheme as long as you explain the
+rationale for it in your design documentation, and as long as it does
+not suffer from external fragmentation.)
  
-Modify the file system to allow the maximum size of a file to be as
-large as the disk.  You can assume that the disk will not be larger
-than 8 MB.  In the basic file system, each file is limited to a file
-size of just under 64 kB.  Each file has a header called an index node
-or @dfn{inode} (represented by @struct{inode}) that is a table of
-direct pointers to the disk blocks for that file.  Since the inode is
-stored in one disk sector, the maximum size of a file is limited by
-the number of pointers that will fit in one disk sector.  Increasing
-the limit to 8 MB will require you to implement doubly-indirect
-blocks.
+You can assume that the disk will not be larger than 8 MB.  You must
+support files as large as the disk (minus metadata).  Each inode is
+stored in one disk sector, limiting the number of block pointers that it
+can contain.  Supporting 8 MB files will require you to implement
+doubly-indirect blocks.
  
  @node Problem 4-2 File Growth
  @section Problem 4-2: File Growth
@@ -145,16 +146,20 @@ is specified when the file is created.  One advantage of this is that
  the inode data structure, once created, never changes.  In UNIX and
  most other file systems, a file is initially created with size 0 and
  is then expanded every time a write is made off the end of the file.
-Modify the file system to allow this.  As one test case, allow the
-root directory file to expand beyond its current limit of ten files.
+Modify the file system to allow this.  
  Make sure that concurrent accesses to the inode remain properly
  synchronized.
  
+There should be no predetermined limit on the size of a file, except
+that a disk cannot exceed the size of the disk (minus metadata).  This
+also applies to the root directory file, which should now be allowed
+to expand beyond its initial limit of ten files.
+
  The user is allowed to seek beyond the current end-of-file (EOF).  The
  seek itself does not extend the file.  Writing at a position past EOF
  extends the file to the position being written, and any gap between the
  previous EOF and the start of the write must be filled with zeros.  A
-read past EOF returns zero bytes.
+read starting from a position past EOF returns no bytes.
  
  Writing far beyond EOF can cause many blocks to be entirely zero.  Some
  file systems allocate and write real data blocks for these implicitly
@@ -167,28 +172,39 @@ your file system.
  @section Problem 4-3: Subdirectories
  
  Implement a hierarchical name space.  In the basic file system, all
-files live in a single directory.  Modify this to allow directories to
-point to either files or other directories.  To do this, you will need
-to implement routines that parse path names into a sequence of
-directories, as well as routines that change the current working
-directory and that list the contents of the current directory.  For
-performance, allow concurrent updates to different directories, but
-use mutual exclusion to ensure that updates to the same directory are
-performed atomically (for example, to ensure that a file is deleted
-only once).
+files live in a single directory.  Modify this to allow directory
+entries to point to files or to other directories.  You will need
+routines to parse a path name into a sequence of directories, to
+change the current working directory, and to list the contents of the
+current directory.  For performance, allow concurrent updates to
+different directories, but use mutual exclusion to ensure that updates
+to the same directory are performed atomically (for example, to ensure
+that a file is deleted only once).
  
  Make sure that directories can expand beyond their original size just
-as any other file can.
+as any other file can.  
+
+The basic file system has a 14-character limit on file names.  You may
+retain this limit for individual file name components, or may extend
+it, at your option.  In any case you must allow full path names to be
+much longer than 14 characters.
  
-Each process has its own current directory.  When one process starts
-another with the @code{exec} system call, the child process inherits its
-parent's current directory.  After that, the two processes' current
-directories are independent, so that either changing its own current
-directory has no effect on the other.
+The current directory is maintained separately for each process.  At
+startup, the initial process has the root directory as its current
+directory.  When one process starts another with the @code{exec}
+system call, the child process inherits its parent's current
+directory.  After that, the two processes' current directories are
+independent, so that either changing its own current directory has no
+effect on the other.
  
  Update the existing system calls so that, anywhere a file name is
  provided by the caller, an absolute or relative path name may used.
-Also, implement the following new system calls:
+
+Update the @code{remove} system call so that it can delete empty
+directories in addition to regular files.  Directories can only be
+deleted if they do not contain files or subdirectories.
+
+Implement the following new system calls:
  
  @table @code
  @item SYS_chdir
@@ -201,18 +217,27 @@ successful, false on failure.
  @itemx bool mkdir (const char *dir)
  Attempts to create the directory named @var{dir}, which may be either
  relative or absolute.  Returns true if successful, false on failure.
+Fails if @var{dir} already exists or if any directory name in
+@var{dir}, besides the last, does not already exist.  That is,
+@code{mkdir("/a/b/c")} succeeds only if @file{/a/b} already exists and
+@file{/a/b/c} does not.
  
  @item SYS_lsdir
  @itemx void lsdir (void)
  Prints a list of files in the current directory to @code{stdout}, one
-per line.
+per line, in no particular order.
  @end table
  
-Also write the @command{ls} and @command{mkdir} user programs.  This
-is straightforward once the above syscalls are implemented.  In Unix,
+We have provided @command{ls} and @command{mkdir} user programs, which
+are straightforward once the above syscalls are implemented.  In Unix,
  these are programs rather than built-in shell commands, but
  @command{cd} is a shell command.  (Why?)
  
+The @code{pintos} @option{put} and @option{get} commands should now
+accept full path names, assuming that the directories used in the
+paths have already been created.  This should not require any extra
+effort on your part.
+
  @node Problem 4-4 Buffer Cache
  @section Problem 4-4: Buffer Cache
  
@@ -265,7 +290,11 @@ Note that write-behind makes your filesystem more fragile in the face
  of crashes.  Therefore, you should
  periodically write all cached blocks to disk.  If you have
  @func{timer_sleep} from the first project working, this is an
-excellent application for it.
+excellent application for it.  (If you're still using the base
+implementation of @func{timer_sleep}, be aware that it busy-waits, which
+is not an acceptable solution.) If @func{timer_sleep}'s delays seem too
+short or too long, reread the explanation of the @option{-r} option to
+@command{pintos} (@pxref{Debugging versus Testing}).
  
  Likewise, read-ahead is only really useful when done asynchronously.
  That is, if a process wants disk block 1 from the file, it needs to
@@ -355,9 +384,9 @@ pintos -ci shell 12345
  pintos -ex "shell"
  @end example
  
-If you don't change the filesystem interface, then this should already
-be implemented properly in @file{threads/init.c} and
-@file{filesys/fsutil.c}.
+If you don't change the filesystem interface, none of this should
+require any special effort on your part.  They are already implemented
+properly in @file{threads/init.c} and @file{filesys/fsutil.c}.
  
  You must also implement the @option{-q} option and make sure that data
  gets flushed out to disk properly when it is used.
@@ -415,17 +444,6 @@ easier to change the current working directory of the shell process if
  user process.  In fact, Unix-like systems don't provide any way for
  one process to change another process's current working directory.
  
-@item
-@b{When the spec states that directories should be able to grow beyond
-ten files, does this mean that there can still be a set maximum number
-of files per directory that is greater than ten, or should directories
-now support unlimited growth (bounded by the maximum supported file
-size)?}
-
-We're looking for directories that can support arbitrarily large
-numbers of files.  Now that directories can grow, we want you to
-remove the concept of a preset maximum file limit.
-
  @item
  @b{When should the @code{lsdir} system call return?}
  
@@ -443,7 +461,7 @@ This code should create the following output:
  
  @example
  Start of directory
-...  directory contents ...
+@dots{}directory contents@dots{}
  End of directory
  @end example
  
@@ -451,13 +469,6 @@ End of directory
  @b{Do we have to implement both absolute and relative pathnames?}
  
  Yes.  Implementing @file{.} and @file{..} is extra credit, though.
-
-@item
-@b{Should @func{remove} also be able to remove directories?}
-
-Yes.  The @code{remove} system call should handle removal of both
-regular files and directories.  You may assume that directories can
-only be deleted if they are empty, as in Unix.
  @end enumerate
  
  @node Problem 4-4 Buffer Cache FAQ
@@ -476,6 +487,9 @@ the 64-block limit.  The same rule applies to anything that's
  ``similar'' to a block of disk data, such as a @struct{inode_disk}
  without the @code{length} or @code{sector_cnt} members.
  
+You can keep a cached copy of the free map in memory permanently if you
+like.  It doesn't have to count against the cache size.
+
  That means you'll have to change the way the inode implementation
  accesses its corresponding on-disk inode right now, since it currently
  just embeds a @struct{inode_disk} in @struct{inode} and reads the