Assignment: The Voyeur Shell

CS230/330 - Operating Systems (Winter 2003).

MWF 1:30pm Ry251

Due : Friday, January 17, 11:59 p.m.

Overview

The Department of Homeland Security has given you a million dollars to write a new Unix shell that allows John Ashcroft to keep a lustful eye on everyone's Unix commands. Your task is to write a new Unix shell known as "vsh" (Voyeur Shell) that allows a third party to remotely monitor shell commands. Your shell will have to support the usual shell features of I/O redirection, pipes, signals, and background processes. However, it also has to provide this clandestine monitoring feature. The primary goal of this assignment is to familiarize yourself with the basic functionality of the Unix operating system including processes, files, threads, networking, and interprocess communication. In addition, you will be learning how to use some of the development tools needed for the class project.

Get an account

Your first step is to get a CS account. All of your work in this class must take place on a dedicated Sun Solaris machine (stonecrusher.cs.uchicago.edu) as it has been configured for the OS project and provide some additional stability when compared to other machines in the department. Access the machine using 'ssh' from any of the department machines.

Description

The input to your shell is a sequence of commands, each provided on a separate line of input text typed interactively at the keyboard. The following commands must be supported:

progname [args]

Runs the program progname with the given, possibly optional, arguments. For example:
vsh % ls
foo.c
bar.c
vsh % cp foo.c foo1.c
vsh % rm -f foo.c

exit

Forces the shell to exit.

I/O Redirection

In addition to the above commands, your shell must support I/O redirection. I/O redirection is specified using the < and > operators at the end of a command line. For example:

progname [args] >file.out

Directs the standard output of progname to the file file.out.

progname [args] <file.in

Uses the contents of the file file.in as the standard input to program progname.

Both input and output redirection may be specified for a single command so your shell will have to check for both.

Pipes

Your shell also needs to support pipes. A pipe is nothing more than a way of hooking up the standard output of one program to the standard input to another. A pipe is indicated using the | operator as follows:

progname1 [args] | progname2 [args]

Pipes the output of program progname1 to the input of program progname2. For example:
vsh % ls -l | wc
       7      56     366
vsh % foo <infile | bar >outfile

Your shell only needs to support a single pipe. You do NOT need to support commands with multiple pipes, such as

vsh % foo | bar | spam

Background Jobs

When a program runs, it normally blocks you from performing any other operations until it has completed. However, you can put a program into the background using the & operator. For example:

progname [args] &

Detaches the program progname and runs it in the background. Control is immediately returned to the command shell where additional commands can be executed. Background jobs should continue to run even if you quit the shell before they have finished.

Signals

Finally, your shell needs to ignore a single signal, SIGINT. This signal is generated when a user presses Control-C on the keyboard. When received, it should be passed to the currently running program, but it should not cause your shell to terminate. Control-C has no effect on background jobs.

Exit codes

When programs terminate, they return an integer exit code to your shell. If this code is non-zero, your shell should print the returned value. For example:

vsh % cp foo bar
cp: cannot access foo
[ Program returned exit code 1 ]
vsh %

The Voyeur Interface

To allow secret monitoring, the shell should listen for connections on a network port that is equal to the process id (pid) + 10000. Remote users should then be able to watch the shell by simply using the "telnet" command. For example, suppose that a user launches "vsh" and it has a process ID of 11538 (use 'ps' to find the process id):

% ps
   PID TTY      TIME CMD
 23808 pts/2    0:04 netscape
 11538 pts/2    0:00 vsh           <-- Running vsh shell
 23804 pts/2    0:04 emacs

To watch the shell, use telnet like this:

% telnet localhost 21538           # Note: 21538 = 11538 + 10000
Trying 127.0.0.1...
Connected to localhost.
Escape character is '^]'.
[ Welcome to the vsh shell ]
ls -l
cp foo bar spam
... commands typed by user follow ...

When a user first connects, a list of all commands typed so far should be displayed. After that, new commands typed by the user are displayed as long as the telnet connection stays open. The actual screen output of commands is not displayed---only the command itself.

Extra credit: Extend the Voyeur interface so that it allows a remote user to inject a command into the shell without the user's knowledge.

How to get started

Make sure you understand the assignment before beginning any work. Now, consider the following steps as a rough guide.

Step 1 : Create a CVS project

All projects in this class must be submitted using CVS. You should create a CVS project for your shell project BEFORE WRITING ANY CODE!

To create a project, follow these steps:

Create a directory called vsh. For example:

unix % mkdir vsh

In this directory, create two files, Makefile and vsh.c. Makefile should look like this:

# Makefile for vsh project
all:
       gcc vsh.c -o vsh

must

Your vsh.c file should look like this:

/* vsh.c */
/* Your Name, CS230 (or330), Winter 2003 */

Now, in the SAME directory as your Makefile and vsh.c files, type the following to create your CVS project:

unix % cvs import -m "Vsh shell" vsh vsh start

Leave the vsh directory you created and remove it. For example:

unix % cd ..
unix % rm -rf vsh

Check out your vsh project from CVS as follows:

unix % cvs checkout vsh

Do not proceed to step 2 until you successfully create a CVS project for your shell. Contact the TAs if you are unable to create a project for some reason.

Step 2 : Command line parsing

Write a function that takes a line of input text and parses it into some sort command structure containing information about the program name, arguments, and options for I/O redirection, pipes, and background jobs. If it helps, the syntax for the the shell is roughly as follows (optional fields are in brackets) :

command    :   program
           |   program | program
           |   "exit"
           ;
            
program    :   identifier [ arglist ] [ <infile ] [ >outfile ] [ & ]

Tokens and arguments are separated by white space. To simplify parsing, Your shell does NOT need to support quoted strings such as the following:

vsh % foobar "This is a quoted argument"

Furthermore, you can assume that no whitespace separates the < and > operators from the filename that follows.

After you've got your command line parser working, write an infinite loop that does nothing but print the shell prompt ("vsh % "), read a line of input, and pass it to your command line parser. Check the data returned from the parsing function to make sure it looks reasonable.

Note: writing the command parser should be easy. Just use the strtok() function from the C library. There is no need to write a lex/yacc based parser or anything of comparable complexity (your parser should only be around 50 lines of code).

Commit your changes to CVS. For example:

unix % commit -m "parsing" vsh.c

Step 3 : Make your shell run programs

Once you're satisfied with the parser, modify the command loop to execute programs. You will need to use the fork() and exec() system calls to do this. While running, the shell process should wait for the program to complete by calling wait(). The shell should also check the exit code returned by the program and print a message if it is nonzero. Note : the exit code is placed into the lower 8-bits of the status code set by wait(). Your code will look roughly like this:

while (1) {
    read a line of input
    cmd = parse command line
    pid = fork();
    if (pid == 0) {
        extract the program name from cmd
        ...
        exec( ... args ...);   /* Execute the command */
    } else {
        wait(&status);         /* Wait for command termination */
        check return code placed in status;
    }
}

At this point you should have a working shell. Try it out by running some of your favorite Unix commands such as "ls", "cp", "cat" and so forth. If it doesn't work, you have done something wrong.

Commit your changes to CVS.

Step 4 : Add I/O direction

To add I/O redirection, modify the child process created by fork() by adding some code to open the input and output files specified on the command line. This should be done using the open() system call. Next, use the dup2() system call to replace the standard input or standard output streams with the appropriate file that was just opened. Finally, call exec() to run the program.

Commit your changes to CVS.

Step 5 : Add Background Jobs

This is a little more tricky. When a job is put into the background, the shell just starts it and forgets about it (the shell should return to the command prompt and allow more commands to be typed). However, this presents two problems. First, the background job should keep running even if the shell terminates. Thus, this means that the background job can't be a child of the shell process. Second, when the background job finishes, it needs to have its exit code collected--otherwise it turns into a zombie.

Modify your shell to run background jobs in a way that solves both of these problems. Hint : the solution involves the fork() function.

Commit your changes to CVS.

Step 6 : Add pipes

To support a pipe, you need to execute two separate programs and play some funny games with I/O to make the output of one program go to the input of the other program. To do this, you'll need to use the pipe() and the dup2() system calls.

Commit your changes to CVS.

Step 7 : Make control-C work

Modify your shell so that the SIGINT signal causes the currently running program to terminate while the shell continues to run (i.e., your shell should ignore SIGINT).

Commit your changes to CVS.

Step 8 : Implement the Voyeur feature

Modify your shell so it keeps a history of all commands entered. Open a TCP/IP network socket using the socket() system call. Use a port number that is equal to the value of getpid()+10000. Make the shell listen for incoming connections on this port. When connections are received, send the complete history buffer and then make the shell display new commands as they are entered. The user of the shell should have no awareness of the remote connection.

Implementation suggestion: Implement the Voyeur feature as a separate thread using thr_create() or pthread_create().

Step 9: Create a README file

Within the 'vsh' directory created in Step 4, create a README file that contains your name and any other pertinent information about your solution that we should know about.

Add the README file to the CVS project as follows:

% cvs add README
% cvs commit -m "" README

Step 10 : Sit back and relax.

By now, you should be ready for the kernel project. "Ha, bring it on!", you say.

Did you remember to commit your changes to CVS?

Other Odds and Ends

Header files

You will probably need to use the following header files in your solution.

#include <stdio.h>
#include <string.h>
#include <stdlib.h>
#include <unistd.h>
#include <fcntl.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <errno.h>

You will need to add more header files to support the Voyeur feature.

Error handling

This is not a compilers course so we are not going to test your shell against every possible bad command. Your shell should not crash, but it does not need to do anything other than print "Bad command" if the user supplies bad input.

Getting Help

Since this is an upper division/graduate computer science course, you are expected to do your own research regarding the usage of various system calls, header files, and libraries. Information is readily available in the man pages, Unix reference books, or on the web.

Otherwise, do not hesistate to ask a question if you are unclear about how some part of the assignment is supposed to work.

Handin Procedure

The shell project will be automatically collected from CVS at 11:59 p.m. on the due date. Your project must be checked into a CVS project named 'vsh'. If you made it this far and skipped step 1, shame on you! Go back and read the instructions.

Your final solution to the shell project should be a directory of files that look like this:

vsh/
     Makefile
     README
     vsh.c

To grade your shell, we will perform the following steps:

unix % cvs checkout vsh
unix % cd vsh
unix % make
unix % vsh

If your shell fails to check out of CVS, does not build using make, or fails to run, you will receive no credit!

Make sure you test your shell by typing the above commands in some kind of junk directory---do not assume that your shell will work until you have tested it yourself!

Grading

Your shell will primarily be graded for correctness. We will run your shell on a series of simple commands that exercise all of its features. Your solution should not deviate from the specifications described in this handout (i.e., don't change the name of the commands or the shell syntax).

Your grade will consist of the following:

Correctness. 80%
Programming style and efficiency. 20%.

No late handins are accepted!