1
SOFTWARE ARCHITECTURE
3. SHELL
Tatsuya Hagino
hagino@sfc.keio.ac.jp
slides URL
https://vu5.sfc.keio.ac.jp/sa/login.php
2
Software Layer
Application Shell
Library
MIddleware
Shell
Operating System
Hardware
3
Functions of Shell
• Start programs
• Control running programs
• background Starting and Controlling
• foreground Programs
• Input/output redirection for programs
• pipe
• Set environment variables and shell Setting up execution
variables environment
• Expanding wild cards for command line
• History of commands
• Alias for commands Useful functions
• File name completion
• Repeat and conditional execution
• Shell script Running scripts
4
Shells
• Windows
• Command Prompt User
• PowerShell
App
• Explorer
Shell App
• Mac
• Finder
• Launcher OS
Kernel
• UNIX
• sh
• Bourne Shell App
• Korn Shell
• Z Shell
• csh
• C Shell
• TENEX C Shell
Shell between OS and user
5
UNIX Shell Commands
• Shell communicates your instruction to OS
Shell OS
• Very few built-in commands
• set, alias, cd, setenv, etc.
• Most of commands are ordinary programs.
• ls is a program to list files in a directory.
• cat is a program to output contents of files.
6
Shell Command Processing
shell() {
char buf[512], char *argv[512];
for (;;) {
1 printf("% ");
2 if (!fgets(buf, sizeof(buf), stdin)) break;
3 parse(buf, argv);
4 execute(argv[0], argv);
}
}
1. Write a prompt.
2. Read one line from a terminal.
3. Separate the line into a command and its arguments
separated by spaces.
4. Execute the command with the arguments.
5. Back to the next input.
7
Execute a Program
• Create a child process and execute a program.
execute(char *cmd, char *argv[]) {
int pid, status;
pid = fork();
if (pid == 0) {
execve(cmd, argv, NULL);
fprintf(stderr, "command %s not found¥n", cmd);
exit(1);
}
while (wait(&status) != pid);
}
• Shell is the parent process and waits for the child process to finish.
8
Process
• Running state of a program
• Multiple processes may run the same program.
CPU state
• Each process consists of: file descriptors
etc.
• program
• CPU state (registers, PC, SP) program code
• data
• memory space data
• file descriptors
• other process related states Heap
stack
memory space
9
Process related System calls
• fork parent
• Create a child process. process
• Exact copy of the parent running abc wait
• exec
• Specify a program to execute.
fork
• Current program is destroyed and child child
replaced with the new one. process process
• wait running abc running xyz
exec
• Wait until a child process terminates.
exit
• exit
• Terminate the current program. destroyed
• other system calls
• signal
• Specify a handler for each interrupt.
parent-child relation in processes
• kill
• Send an interrupt.
10
fork and exec
• fork • exec
• Inherit all from the parent. • Independence from the parent
• memory space • Destroy the current content of
• program the memory.
• data • Replace it with the new program
• stack image.
• file descriptors • Start the new program from
• environment variables its entry point.
• main
• Memory spaces are not • What inherits.
shared, but copied. • file descriptors
• copy-on-write • arguments to exec
• If either one changes, it is • including environment variables
actually copied.
• The program image is not
loaded into the memory
immediately.
• demand paging
• loaded when necessary
11
How fork works
1. Create a new process and make it a child.
2. Copy CPU state.
• registers, PC, SP, etc.
3. Copy memory contents.
• use copy-on-write sharing
4. Copy file descriptors.
CPU
5. Make the new process ready-to-run (or runnable).
6. Return to where fork is called.
parent
process
CPU CPU CPU CPU
file descriptors
CPU
parent parent child parent child
process process process process process
child
process
file descriptors file descriptors file descriptors file descriptors file descriptors
12
How exec works
1. Allocate a temporary memory space.
2. Copy the command arguments and the environment variables to the temporary
memory space.
3. Release all the memory of the current process and create an empty memory
space.
4. Set up demand paging of the given program on the new memory space.
5. Copy the command arguments and the environment variables from the
temporary memory space to the stack in the new memory space.
6. Release the temporary memory space.
7. Set CPU PC to the entry point.
HDD・SSD
entry
CPU point CPU
program
PC
program program
process
command command command
arguments arguments arguments
environment
environment environment
variables
variables variables
file descriptors file descriptors file descriptors file descriptors
13
Command PATH
• The first argument of execve is the path name of the command.
• execve("ls", ...) does not work.
• execve("/bin/ls", ...) does work.
• Writing full path name is tedious.
• Use an environment variable ‘PATH’ which contains a list of directories where commands are.
/bin:/usr/bin:/sbin:/usr/sbin:/usr/local/bin
• Try each directory to find the command.
execute(char *cmd, char *argv[]) {
int pid, status;
pid = fork();
if (pid == 0) {
execve("/bin/" + cmd, args, NULL);
execve("/usr/bin/" + cmd, args, NULL);
execve("/sbin/" + cmd, args, NULL);
execve("/usr/sbin/" + cmd, args, NULL); After a successful execve
execve("/usr/local/bin/" + cmd, args, NULL); the rest will not be executed.
fprintf(stderr, "command %s not found¥n", cmd);
exit(1);
}
while (wait(&status) != pid);
}
14
Background and Foreground
• Foreground process
• Usual execution of a program
• Execute one by one.
• Background process
• Put ‘& ’ at the end of command.
• Execute the next command without waiting termination of the current
one.
• Multiple commands can be executed as background.
Shell
Foreground Background Background Background
process process process process
Editor Compiler Server
15
Implementation of Background
• Shell waits termination of child process with ‘wait’ system call.
• For background processes, shell does not wait for the
termination.
execute(char *cmd, char *argv[], int foreground) {
int pid, status;
pid = fork();
if (pid == 0) {
execve(cmd, argv, NULL);
fprintf(stderr, "command %s not found¥n", cmd);
exit(1);
}
if (foreground) {
while (wait(&status) != pid);
}
}
16
Job Control
• A job may consists of more than one process.
• Connect commands with pipe.
• A command may fork to create children.
• A job is a group of processes.
• Foreground and background are controlled for jobs (not for each process).
• UNIX uses process group.
• Each process belongs to a process group.
• Child processes belong to the same process group.
• Shell creates a new process group for each command.
process group process group
process group
process process
process process
child
child
process process
17
Job Control for a Terminal
• Each terminal holds one process group.
• can be set by ioctl with TIOCSPGRP parameter.
• Foreground
• terminal process group = process process group
• Terminal switches process group.
• Input from terminal
• Sent to foreground processes.
• Background processes stop when they try input from terminal.
• Output to terminal
• Foreground and background processes can output. shell
• Possible to prohibit background processes to output.
group 123 group 234 group 345
foreground background background
process process process
input/output
output
output
group 123 terminal
18
Redirection
• Redirect standard input/output/error to files.
% cat /etc/passwd > /tmp/aaa
% wc < /etc/passwd
• Pipe can combine two commands.
• Output of one command is connected to input of the other.
% grep abc /etc/passwd | wc
terminal file
terminal terminal
cat grep pipe
wc
output input
redirect standard output connect commands using pipe
19
Implementation of Redirection
• File descriptors are inherited by execve.
• Set file descriptors before execve.
execute(char *cmd, char *argv[]) {
int pid, status;
pid = fork();
if (pid == 0) {
fd = open("/etc/passwd", O_RDONLY);
dup2(fd, 0);
close(fd);
execve(cmd, argv, NULL);
...
}
while (wait(&status) != pid);
}
• dup2 copies file descriptors.
• If changed before fork, shell's file descriptors are also changed.
• fork and exec need to be separated.
20
Shell variables and Environment varaibles
• Shell variables
• Used by shell
• Change shell behavior
shell
shell
• Often used in shell scripts. variables
environment
variables
• Environment variables
• Inherited to commands
fork copies
• User name
• Home directory command
• PATH (process)
environment
variables
21
Wild Card
• Wide card‘*.c’ can be used to specify multiple files.
• ‘*.c’ is expanded by shell
int match(char *pattern, char *p) {
% ls *.c char ch;
% cat a???.b? while (ch = *pattern++) {
if (ch == ’*’) {
while (*p) {
DIR *dp; if (match(pattern, p)) return 1;
struct dirent *de; p++;
dp = opendir(dir); }
while (de = readdir(dp)) { return (*pattern == 0);
if (match(name, de->d_name)) { }
strcpy(argv[argc++], de->d_name); else if (ch == ’?’) {
} if (*p++ == 0) return 0;
} }
closedir(dp); else if (*p++ != ch) return 0;
}
return (*p == 0);
}
• matching of ‘*’ and ‘?’
22
Shell Script
test
echo Hello chmod a+x test
date
ls Mark it as executable
a new command ‘ test’
• Create a new command combining some commands:
• Create a text file with commands.
• Mark it as executable using chmod.
• OS execute a shell if the given file is not binary.
• Conditional braches and repetitions are allowed.
• Shell can be specified in the first line.
test
#!/bin/csh -f
echo Hello
date
ls
23
Summary
• Functions of shell
• Execute commands
• Job control
• Redirection
• Environment variables
• Wide card
• Shell script
• Other functions
• Command alias
• File name completion
• Command history