Bash Scripting for Unix Users
Bash Scripting for Unix Users
Scott T. Milner
September 9, 2020
1 Introduction
The shell is the program we interact with when we type at a Unix command line prompt. There
are actually several different Unix shell programs; the most commonly used is bash. bash and
other shells include facilities for writing programs, called “shell scripts”. (Different shells have
overlapping but distinct capabilities and syntax; in this tutorial, we assume the shell is bash.)
Shell scripts can be used to automate tasks involving Unix, as well as command-line programs that
run under Unix. The shell scripting language reflects an accretion of good ideas and convenient
tricks, in contrast to coherently designed, full-featured programming languages such as Python,
which can also be used to write scripts for automating tasks. However, shell scripting is convenient
because of its direct relationship to the well-conceived set of command-line tools that Unix com-
prises. All Unix commands can be used directly in shell scripts; indeed, many useful shell scripts
consist of little more than a sequence of commands that could be typed at the command line.
Shell scripting benefits immensely from the Unix concept of small, well-defined command line
utilities that can be combined to perform larger tasks by use of pipes and redirection of standard
input and output. Familiarity with this approach is a prerequisite for writing good scripts. Likewise,
powerful and versatile command line tools such as grep, sed, and awk greatly extend the range
of what simple scripts can do.
The disadvantages of shell scripting for tasks beyond a certain complexity arise as well from its
simple design. First among these limitations is the lack of convenient facilities for mathematical
operations on real numbers, or data structures beyond integers, strings, and simple arrays. As a
result, shell scripts are preferred for automation tasks, but not for “number crunching”.
Of course, other shell scripting tutorials exist; many are rather brief and thus of limited use. One
extensive tutorial is http://tldp.org/LDP/abs/html/, which is a useful reference but formidable
to digest. This tutorial presents those features of shell scripting that commonly appear in scripts
written for my research group.
The elements of the language are introduced here with brief but functional sample code. By them-
selves, these examples are too limited to give a clear indication of how real scripts are constructed.
1
However, similar tasks and consequently similar techniques occur again and again in our scripts.
Therefore, in the final part of this tutorial we present several complete scripts from real applications,
annotated to highlight techniques we have found useful.
2 Language elements
2.1 Variables
The traditional first program in any language prints out “Hello world”. Here is a shell script that
illustrates the use of variables:
#!/bin/bash
In bash scripts, # denotes a comment, and blank lines are allowed. The first line above is a comment
to Unix, that the script should be executed with bash.
In the script above, sayIt is first assigned the value “Hello”. Then, sayIt is assigned its old value,
concatenated with “ world”. The value of a variable is accessed with $.
Shell variables need not be “declared” or “defined” first; just start using them. Shell variables do
not have “types” (integer, real, string, etc.). More precisely, all shell variables are strings, but can
be interpreted and used as integers (discussed below).
2.2 Arguments
Values can be passed to shell scripts by command-line arguments. Here is an example (script1.sh):
#!/bin/bash
echo Hello $1
script1.sh Scott prints out “Hello Scott”. Up to 9 command-line arguments are accessed with
$1 and so forth. $# is the number of arguments passed.
2
2.3 Strings
Shell variables hold strings, which are assigned with = (no space on either side):
myString=Hello
myString="Hello world"
To assign a string containing special characters that should not be “processed”, enclose it in single
quotes:
myString=’echo $1’
string1="Hello "
string2="world!"
bothStrings=$string1$string2
Shell scripts can contain any Unix command. Often, we want to save the output of the command
in a variable. This is done with backquotes (‘...‘), as follows:
myFiles=‘ls‘
myFiles now contains a string with all the filenames separated by spaces.
myFiles=$(ls)
As usual in Unix, the command inside ‘...‘ or $(...) can be a sequence of commands separated
by semicolons, or a compound command involving pipes.
3
2.5 Integers
Shell variables are fundamentally strings; however, shell scripting includes syntax that allows us to
treat variables as integers.
One way is to use let, which allows us to do math on variables that contain integers, using
operations +, -, *, /, ** (exponentiation), and parentheses:
z=4
let z=($z/2 + 4**2)*$z
which replaces z with 72. Note that division (/) here is integer, i.e., 5/2=2. Spaces are not
permitted around the =, but can appear in the expression.
A more convenient way to do math on integer-valued variables is with ((...)). Expressions inside
double parentheses act as they would in C/C++, with no need to use $ to obtain the value of
variables. For example:
((e=4*e+3**2))
The right side evaluates without typing $e, and the assignment is made. Operators like ++ (incre-
ment) and % (mod) also work inside ((...)).
2.6 Loops
Automation scripts typically involve looping. We may loop over values of some parameter, or a list
of files to be processed. Or, we may continue a process while or until some condition is met.
Shell scripting includes three constructs for looping: for, while, and until. All have a similar
syntax, with the body of the loop enclosed by do and done. while and until loops are controlled
by a logical test (described below):
for loops in shell scripts are different from those in C/C++; they iterate over a list of choices:
4
for fruit in apple pear orange
do
echo $fruit
done
To get a C-like for loop, use the shell command seq to generate a sequence to iterate over:
The general form of seq is seq <start> <step> <end>, where <start> and so forth are integers
(or expressions that evaluate to integers), and <step> is optional (with default 1).
To compare two variables, we use a logical test (note the spaces between the brackets, variables,
and ==):
[ $string1 == $string2 ]
If the variables contain integers (see below), comparison operators -eq, -ne, -le, -ge, -lt, -gt
can be used.
A more convenient form of logical test for integer-valued variables is to use ((...)). In the same
way as arithmetic operations on integers (see above), expressions inside ((...)) using the usual
comparison operators <, >, <=, >=, != ==, && (and), and || (or) will evaluate to “true” or
“false”.
5
while ((x**2+y**2 < 100))
...
In addition to looping, shell scripting provides control statements to direct the execution. The
simplest is the if-statement:
if [ $string1 == $string2 ];
then
echo "equal"
else
echo "not equal"
fi
The logical test [...] takes the form described in the previous section, and must be followed by a
semicolon. The then and else sections can contain multiple commands. The else clause may be
omitted.
if [ $string1 == $string2 ];
then
echo "equal"
elif [ $string1 == "default" ];
then
echo "default"
fi
In the above example, for simplicity the else cases have been omitted from the primary and
secondary tests.
6
2.9 Random choices
The shell provides a facility for generating 16-bit random integers (ranging from 0 to 215 -1 = 32767).
Each time the shell variable $RANDOM is evaluated, a different pseudorandom number is obtained.
These can be used to make random choices among a relatively small number of alternatives, in
conjunction with the mod (%) operator and integer arithmetic.
The mod operator computes the remainder of an integer divided by the modulus: echo $((5 % 2))
gives 1, while echo $((-5 % 2)) gives -1.
if (($RANDOM % 2 == 0));
...
Multiway branching can be performed with a case statement. The comparison is different from
if; here, the argument is matched to a succession of “patterns”, each of which is terminated with
a closing parenthesis ):
case $percent in
[1-4]*)
echo "less than 50 percent" ;;
[5-8]*)
echo "less than 90 percent" ;;
*)
echo "90 percent or greater" ;;
esac
Note the double semicolon terminating each command (which as usual can be multiple commands
on multiple lines).
The patterns to be matched use the same syntax as in other Unix operations (commonly used in
vi, grep, ls, rm, and the like). For completeness, patterns are described briefly in the next
section.
2.11 Patterns
Patterns (or “regular expressions”) are used by several Unix command-line programs and utilities.
File management programs like ls, cp, mv, rm, and tar use patterns to select files to operate on.
7
The Unix utility grep selects lines from its input (stdin) that match a pattern, and prints those
lines to stdout. The Unix visual editor vi, and the stream editor sed use patterns in its searching
and substitution.
Patterns are strings, possibly containing special characters, which match part or all of other strings.
A simple example is ls *.sh, which lists all files with names ending in .sh (i.e., all shell scripts,
if we adhere to that naming convention). Because effective use of patterns extends the capabilities
of Unix tools like grep and sed that frequently appear in scripts, we include here a table of the
special characters used in patterns and their meanings.
pattern matches
^ beginning of string
$ end of string
. any single character
* zero or more repeats of previous character
[...] any one of the enclosed characters
[a-k] any one of the range of characters
[2-5] any one of the range of digits
[^...] any character not in . . .
\<...\> pattern . . . only matches a word
\(...\) saves pattern matched by . . .
\1, \2, ... the first, second, . . . pattern saved
Pattern matching is useful in many contexts, but can be tricky — it is often a good idea to test
patterns in a safe situation before relying on them where errors can do damage (as with rm).
2.12 Functions
In longer scripts it is often convenient to define functions, to perform a well-define subtask that is
called upon multiple times.
Command-line arguments can be passed to functions in the same way as for scripts generally, with
variables $1, $2 and so forth. Variables declared with local are protected from meddling by the
main program. Shell script functions do not return values, so functions must communicate with
the calling program through global variables.
function newFcn {
...
}
8
in which ... are one or more commands.
Functions can return a value that indicates the status of the function call; by convention, 0 indicates
success and nonzero values indicate some error. For a user-defined function, the return value is set
with return <value>.
All built-in shell functions return such values. The return value of the most recently called function
can be accessed with $?.
reads successive lines and echoes them until the file is empty, at which point read returns a nonzero
value, and the while loop ends.
Another way to use return status of functions and commands, is together with the operators &&
(logical and) and || (logical or). The syntax
executes the second command only if the first one succeeds (i.e., returns status 0). This construct
can be used to run a “check” (for the existence of a file) before executing a command.
Similarly,
<cmd1> || <cmd2>
executes the second command only if the first one fails (i.e., returns status 1). This construct can
be used to run a “fallback” command if the first one fails.
Output to files can be written using Unix command line tools, including echo, cat, and printf.
echo writes strings; cat concatenates one or more files; and printf prints one or more variables in
a specified format, similar to print statements in C/C++.
9
For example,
writes the concatenated string consisting of the text strings and variable.
printf writes one or more variables using a C-style format string, which provides a convenient way
of controlling the output format:
writes string $s1 in 5 spaces, followed by floating-point variable $f1 in 10 spaces with 5 decimal
places, $f2 in scientific notation in 12 spaces with 4 decimal places, and integer $i1 in five spaces.
Finally, cat concatenates a list of files into a single file directed to stdout, as in:
All three utilities echo, cat, and print write to stdout (standard output). However, this output
can be redirected to write or append to a file with > or >>. If a script is invoked with redirection,
as in
all commands in the script that write to stdout are redirected to myFile.out. This is convenient
if the script only produces one output file, since redirection can be used to save the file with any
desired name.
Sometimes, a script writes more than one output file. Then, it is convenient to redirect output of
commands within the script. This is done with the same > and >> syntax. For example,
10
2.16 “here” documents
Sometimes, it is convenient to store a small file within a script, for use with a utility like cat that
requires a file as input. This can be accomplished with a “here” document:
In this example, cat will write the file (consisting of three lines) to stdout. Here documents can be
combined with output redirection, by replacing the first line with
Here documents can be used with any command that requires a file as input, as an alternative to
maintaining a separate “database file” that is redirected to input, like this:
The Unix utility read is used to get input from files. By default, read takes input from stdin. To
read a single line of text:
read newLine
Often, we want to read all the lines in a file, processing each line in some way until the file is empty.
This can be done with a while loop:
11
The read command returns status 0 (“true”) if a line is read, and nonzero (”false”) if the end of
file is reached. This serves as a logical test to control the while loop. If input is read from stdin
(the keyboard), ctrl-D acts as an end-of-file to terminate input.
Sometimes it is convenient to read from a string; for example, we can use read to parse a string
into arguments. This may be regarded as a form of input redirection, or a variation on a “here”
document.
all read commands in the script that have not been redirected then take input from myFile.in.
This is convenient if the script only reads from one file, which we specify when we invoke the script.
Sometimes, a script reads from more than one file. Then, we can also invoke input redirection
within the script. For example:
Note how the redirection follows the entire while loop construct; all reads inside the while loop
come from myFile.txt.
However, if we want to read two successive lines from a file, the obvious code
12
does not work as intended; both reads take the first line from myFile.txt. To achieve the desired
result requires a different approach, described in the next section.
In languages like C/C++ or Python, facilities exist for “opening” a file, which provides a file
“pointer” that advances with each successive read. When reading is done, the file must be “closed”.
Here 3 is a file “channel”, where 0 is standard input (stdin), 1 is standard output (stdout), and 2 is
standard error (stderr). If more than one file must be opened at once, channels 4, 5, and so forth
can be used.
2.21 Arrays
Full-featured programming languages offer data structures more complex than a single variable.
The simplest of these are arrays. The shell provides array variables, which are lists of simple
variables that can be accessed by index and iterated over.
An array can be created by reading a list of arguments separated by spaces from a single line (the
flag -a directs read to create an array):
read -a myArray
13
firstElement=${myArray[0]}
(The array index starts at 0.) The length of the array is accessible with the same syntax as the
length of a string:
${#myArray[@]}
The entire array can be accessed as a single string (with elements separated by spaces):
${myArray[@]}
for i in ${myArray[@]}
do
...
done
2.22 Substrings
myString="abcdefg"
newString=${myString:2:3}
If the 2 above is omitted, the substring starts at 1; thus ${myString::3} equals abc.
Likewise, if the :3 is omitted, the substring goes to the end; thus ${myString:2} equals bcdefg.
${myString%<pattern>}
where <pattern> is a string possibly including “wildcard” characters. For example, ? matches any
single character, so that
14
${myString%???}
${#myString}
Floating point math operations in shell scripting are limited and awkward, which constrains its use
in programs that crunch numbers. However, many shell scripts require only a few floating-point
calculations. For this purpose, the Unix utility bc (basic calculator) can be used.
bc is normally an interactive program; launched from the command line, it responds to typed input.
To use bc in a script, input can be supplied from echo via a pipe:
echo "c(3.14159/2)" | bc -l
echo "$var1*$var2" | bc -l
The number of output digits can be controlled with the command scale=<n>. This only works if
the expression includes division, which can be worked around by dividing by 1 somewhere.
echo "scale=3;3.14*7/2+1" | bc -l
15
2.24 Automating interactive programs
Using echo to supply what would have been typed input to bc is an example of a general approach
to make interactive programs “scriptable”.
Another way to do this is with input redirection from a string, which we encountered with read in
constructions like read arg1 arg2 <<< $inputLine.
The same syntax can be used with bc, and any other program or utility that expects typed input
from stdin:
Unix provides a number of very powerful utility programs, chief among them grep, sed, and awk,
which can be used to great advantage in shell scripts.
grep (Globally search for Regular Expression and Print) searches for patterns (“regular expres-
sions”) on each line of its input, and writes the lines matching the pattern to stdout. The basic
format is
in which <pattern> is any string, possibly containing one or more regular expressions (see “Pat-
terns” above). If <file> is not specified, grep takes its input from stdin.
awk (named for its co-creators Aho, Weinberger, Kernihan) acts on each line of its input, splitting
it into fields, doing simple calculations, and printing output.
Finally, sed (Stream EDitor) acts with editing commands on each line of its input, producing a
stream of output lines. An important use of sed is to produce a sequence of files from a common
“template file”, in which only a few parameters are replaced by some iterated value. A typical use
of sed looks like this:
Here all occurrences in infile of the “template parameter” ANG1 are replaced by the value of the
variable $myAng1, and likewise for ANG2 and $myAng2. The commands within quotes are standard
vi substitution commands.
16
3 Example scripts
In this section, we present several scripts that automate various tasks arising in my research.
Although the particular applications pertain to my group, the categories of tasks they represent
and the scripting techniques they employ are relevant to any computational researcher.
17
3.1 Extracting results from files
1 ...¬
2 SCF Done: E(RB3LYP) = -552.938806292 A.U. after 10 cycles¬
3 ...¬
4 Alpha occ. eigenvalues -- -88.90122 -10.22991 -10.22991 -10.21007 -10.20983¬
5 Alpha occ. eigenvalues -- -7.97827 -5.94170 -5.93770 -5.93284 -0.87872¬
6 Alpha occ. eigenvalues -- -0.73873 -0.73631 -0.57462 -0.56304 -0.52033¬
7 Alpha occ. eigenvalues -- -0.43087 -0.40782 -0.38726 -0.37642 -0.34632¬
8 Alpha occ. eigenvalues -- -0.25864 -0.24890¬
9 Alpha virt. eigenvalues -- -0.02966 0.00606 0.01137 0.03129 0.03259¬
10 Alpha virt. eigenvalues -- 0.03474 0.04011 0.06137 0.06196 0.06943¬
11 Alpha virt. eigenvalues -- 0.09110 0.09179 0.10201 0.10545 0.11620¬
12 Alpha virt. eigenvalues -- 0.14021 0.14641 0.15230 0.15333 0.16807¬
13 Alpha virt. eigenvalues -- 0.16835 0.20141 0.20394 0.21203 0.21387¬
14 Alpha virt. eigenvalues -- 0.25377 0.26555 0.27470 0.31453 0.36792¬
15 Alpha virt. eigenvalues -- 0.45975 0.49534 0.51152 0.53956 0.58259¬
16 Alpha virt. eigenvalues -- 0.61709 0.67887 0.69572 0.69799 0.70991¬
17 Alpha virt. eigenvalues -- 0.73149 0.74013 0.77902 0.87953 0.92346¬
18 Alpha virt. eigenvalues -- 0.97386 0.98760 1.00561 1.07732 1.08930¬
19 Alpha virt. eigenvalues -- 1.17629 1.19135 1.22912 1.32897 1.42423¬
20 ...
A common task for scripts is to extract values of interest from the log file of some computational
program. Such log files are typically long text files, full of comments and information. Shell text
processing commands are helpful in finding the relevant lines and extracting the desired fields from
those lines.
Above is an example of log file output from the quantum chemistry program Gaussian, where
the energy levels are printed sequentially from most to least strongly bound. The occupied and
unoccupied orbitals are denoted Alpha occ. and Alpha virt. respectively. The HOMO (highest
occupied molecular orbital) is thus the last Alpha occ. entry, and the LUMO (lowest unoccupied
molecular orbital) is the first Alpha virt. entry. Separately, the ground state energy is reported
on the line beginning SCF Done:.
The script below (analyz.sh) extracts the HOMO, LUMO, and ground state energy from a list of
files read from stdin ((CHANGE THIS IN SCRIPT)), and computes the difference of these values
from those in a reference calculation. (The purpose here is to analyze how the HOMO, LUMO,
and ground state energy change with molecular distortion corresponding to different amplitudes of
a normal mode. The set of Gaussian jobs with different imposed normal mode amplitudes were
generated manually from a reference job in which normal modes were computed, by examining the
normal mode, setting the amplitude manually, and saving a new compute job.)
The script uses command-line tools grep, head, and tail to extract the lines containing the ground
state energy, HOMO, and LUMO ((FIX THIS — no need to use head and tail both on each line.
And put SCF Done first.)). Briefly, grep searches stdin for all lines containing a given pattern.
Usually, the desired line in a file can be uniquely specified by a well-chosen grep (as here for the
ground state). But for the HOMO and LUMO, the obvious grep (of “Alpha occ” and “Alpha virt”)
finds multiple lines. So we use head -n 1 and Alpha occ. to print the first and last lines of stdin,
to obtain the relevant line in each case. The commands are strung together with pipes |), which
directs the output (stdout) of one command into the input (stdin) of the next.
18
The relevant field is extracted and printed with awk, which is a command-line tool useful for this
purpose. The same thing could be accomplished using read -a to fill an array args with the
successive fields in the extracted line, and extracting the desired field by indexing the array, as
indicated in the comments.
The difference between the extracted and reference values are computed using bc, and printed to
stdout in a desired format with printf.
1 #!/bin/bash¬
2 ¬
3 # awk is convenient here to select fields; this could also be done as¬
4 # read -a args <<< `grep "Alpha occ" thiophene.log | tail -1`¬
5 # eh0=${args[5]}¬
6 ¬
7 eg0=`grep "SCF Done" thiophene.log | awk '{ print $5 }'`¬
8 eh0=`grep "Alpha occ" thiophene.log | tail -n 1 | awk '{ print $6 }'`¬
9 el0=`grep "Alpha virt" thiophene.log | head -n 1 | awk '{ print $5 }'`¬
10 printf "%12s%12s%12s\n" "Delta E_H" "Delta E_L" "Delta E_g"¬
11 while read file¬
12 do¬
13 eg=`grep "SCF Done" $file | awk '{ print $5 }'`¬
14 eh=`grep "Alpha occ" $file | tail -n 1 | awk '{ print $6 }'`¬
15 el=`grep "Alpha virt" $file | head -n 1 | awk '{ print $5 }'`¬
16 deh=`echo "$eh - $eh0" | bc`¬
17 del=`echo "$el - $el0" | bc`¬
18 deg=`echo "$eg - $eg0" | bc`¬
19 printf "%12.5f%12.5f%12.5f\n" $deh $del $deg¬
20 done¬
21 ¬
22
Figure 2: analyz.sh, a script to extract results from a Gaussian log file.
The script is used with input redirection, with a list of files like mode11.txt below. (The names
indicate the mode amplitude: “m4” equals minus 0.4Å, and so forth.)
1 mode11m4.log¬
2 mode11m3.log¬
3 mode11m2.log¬
4 mode11m1.log¬
5 thiophene.log¬
6 mode11p1.log¬
7 mode11p2.log¬
8 mode11p3.log¬
9 mode11p4.log¬
10
19
3.2 Batch job
1 #PBS -A open¬
2 #PBS -l nodes=1:ppn=8¬
3 #PBS -l walltime=8:00:00¬
4 #PBS -l pmem=1gb¬
5 #PBS -m aeb¬
6 #PBS -M stm9@psu.edu¬
7 #PBS -j oe¬
8 ¬
9 # load group local copy of Gromacs¬
10 source /gpfs/group/stm9/default/SOFTWARES/gromacs-5.1.4STM/single/bin/GMXRC¬
11 ¬
12 # cd to working directory, define directories¬
13 cd $PBS_O_WORKDIR¬
14 MDP=$PBS_O_WORKDIR/MDP¬
15 SYSTEM=$PBS_O_WORKDIR/System¬
16 ¬
17 # energy minimization¬
18 mkdir EM ¬
19 cd EM¬
20 gmx grompp -f $MDP/em.mdp -c $SYSTEM/system.gro -p $SYSTEM/topol.top -o em.tpr -maxwarn 20¬
21 gmx mdrun -nt 8 -deffnm em¬
22 ¬
23 # NVT equilibration¬
24 cd ../¬
25 mkdir NVT¬
26 cd NVT¬
27 gmx grompp -f $MDP/nvt.mdp -c ../EM/em.gro -p $SYSTEM/topol.top -o nvt.tpr -maxwarn 20¬
28 gmx mdrun -nt 8 -deffnm nvt¬
29 ¬
30 # NPT equilibration¬
31 cd ../¬
32 mkdir NPT¬
33 cd NPT¬
34 gmx grompp -f $MDP/npt.mdp -c ../NVT/nvt.gro -p $SYSTEM/topol.top -o npt.tpr -maxwarn 20¬
35 gmx mdrun -nt 8 -deffnm npt¬
36 ¬
37 # MD production run¬
38 cd ../¬
39 mkdir MD¬
40 cd MD¬
41 gmx grompp -f $MDP/md.mdp -c ../NPT/npt.gro -p $SYSTEM/topol.top -o md.tpr -maxwarn 20¬
42 gmx mdrun -nt 8 -deffnm md
A very common use of shell scripts is for submitting batch jobs to PBS (Portable Batch System).
Batch scripts are ordinary shell scripts, plus a set of comments at the beginning that contain direc-
tives to PBS. These directives request resources (nodes, cores, memory, and time), direct the job to
a particular queue or allocation, request that mail be sent when the job starts or ends, and so forth.
For details on PBS directives, see https://albertsk.files.wordpress.com/2011/12/pbs.pdf.
The script below (job.sh) is a typical Gromacs batch job, with no control statements or other
automation features. It consists essentially of a set of commands that could have been entered
sequentially at the command line.
The only difference is the line cd $PBS_O_WORKDIR, which navigates to the folder from which the
job was launched. The shell variable $PBS_O_WORKDIR (Original WORKing DIRectory) is the full
20
path to the directory from which the command qsub job.sh was executed. For convenience,
variables $MDP and $SYSTEM are defined as paths to subfolders of $PBS_O_WORKDIR.
21
3.3 Morphing simulation scripts
1 #!/bin/bash¬
2 #PBS -A open¬
3 #PBS -l nodes=1:ppn=8¬
4 #PBS -l walltime=24:00:00¬
5 #PBS -l pmem=1gb¬
6 #PBS -j oe¬
7 ¬
8 source /gpfs/group/stm9/default/SOFTWARES/gromacs-5.1.4STM/single/bin/GMXRC¬
9 ¬
10 echo $PBS_O_WORKDIR¬
11 cd $PBS_O_WORKDIR¬
12 MDP=$PBS_O_WORKDIR/MDP¬
13 SYSTEM=$PBS_O_WORKDIR/System¬
14 ¬
15 TEMP=/gpfs/scratch/stm9/vanish¬
16 mkdir $TEMP¬
17 ¬
18 INIT=$SYSTEM/equil.gro¬
19 ¬
20 for LAMBDA in `seq 0 20`¬
21 do¬
22 mkdir Lambda_$LAMBDA¬
23 cd Lambda_$LAMBDA¬
24 ¬
25 cp ../Tables/table_${LAMBDA}.xvg md_${LAMBDA}_A_B.xvg¬
26 cp ../Tables/table_0.xvg md_${LAMBDA}_A_A.xvg¬
27 cp ../Tables/table_0.xvg md_${LAMBDA}_B_B.xvg¬
28 cp ../Tables/table_0.xvg md_${LAMBDA}.xvg¬
29 gmx grompp -f $MDP/md.mdp -c $INIT -p $SYSTEM/system.top -n $SYSTEM/system.ndx -o
… md_$LAMBDA.tpr -maxwarn 20¬
30 gmx mdrun -nt 8 -deffnm md_$LAMBDA¬
31 mv md_$LAMBDA.trr $TEMP¬
32 ¬
33 cd ..¬
34 INIT=../Lambda_$LAMBDA/md_$LAMBDA.gro¬
35 done¬
36 ¬
37
Figure 5: morphAll.sh, a typical “morphing” simulation batch script.
The above script (morphAll.sh) is a typical batch script with automation. Here a sequence
of gromacs simulation jobs are prepared and executed, each placed in its own folder Lambda_1,
Lambda_2 and so forth.
(In this “morphing” simulation, the interactions between groups A and B are progressively weak-
ened. This is achieved by using a series of tabulated potentials table_1.xvg (original), table_1.xvg,
table_2.xvg and so forth, which are copied into each job as needed.)
The initial configuration for the ith job is obtained from the final configuration of the previous job,
the path to which is stored in the variable $INIT.
22
The second part of the “morphing” simulation uses a similar automated workflow, in which previ-
ously saved trajectory files md_<k>.trr are “rerun” with slightly different interactions, to produce
energy data files rerun_<k-1>.edr and rerun_<k+1>.edr.
1 #PBS -A open¬
2 #PBS -l nodes=1:ppn=8¬
3 #PBS -l walltime=48:00:00¬
4 #PBS -l pmem=1gb¬
5 #PBS -j oe¬
6 ¬
7 source /gpfs/group/stm9/default/SOFTWARES/gromacs-5.1.4STM/single/bin/GMXRC¬
8 ¬
9 cd $PBS_O_WORKDIR¬
10 SYSTEM=$PBS_O_WORKDIR/System¬
11 MDP=$PBS_O_WORKDIR/MDP¬
12 TEMP=/gpfs/scratch/stm9/vanish¬
13 ¬
14 for LAMBDA in `seq 1 20`¬
15 do¬
16 let LP=$LAMBDA+1¬
17 let LM=$LAMBDA-1¬
18 ¬
19 cd Lambda_$LAMBDA¬
20 ¬
21 cp ../Tables/table_${LP}.xvg rerun_${LP}_A_B.xvg¬
22 cp ../Tables/table_0.xvg rerun_${LP}_A_A.xvg¬
23 cp ../Tables/table_0.xvg rerun_${LP}_B_B.xvg¬
24 cp ../Tables/table_0.xvg rerun_${LP}.xvg¬
25 gmx grompp -f $MDP/md.mdp -c md_$LAMBDA.gro -p $SYSTEM/system.top -n $SYSTEM/system.ndx
… -o rerun_$LP.tpr -maxwarn 20¬
26 gmx mdrun -nt 8 -deffnm rerun_$LP -rerun $TEMP/md_$LAMBDA.trr¬
27 rm rerun_$LP.trr¬
28 ¬
29 cp ../Tables/table_${LM}.xvg rerun_${LM}_A_B.xvg¬
30 cp ../Tables/table_0.xvg rerun_${LM}_A_A.xvg¬
31 cp ../Tables/table_0.xvg rerun_${LM}_B_B.xvg¬
32 cp ../Tables/table_0.xvg rerun_${LM}.xvg¬
33 gmx grompp -f $MDP/md.mdp -c md_$LAMBDA.gro -p $SYSTEM/system.top -n $SYSTEM/system.ndx
… -o rerun_$LM.tpr -maxwarn 20¬
34 gmx mdrun -nt 8 -deffnm rerun_$LM -rerun $TEMP/md_$LAMBDA.trr¬
35 rm rerun_$LM.trr¬
36 ¬
37 cd ..¬
38 done¬
39
Figure 6: remorphAll.sh, a “morphing” simulation batch script for rerunning the trajectory files.
The .edr files produced by the script remorphAll.sh are analyzed in turn by a third script
getDhDl.sh, which uses gmx energy to extract time series for the potential energy, computes
the difference between the two time series, and uses these data to evaluate the average h∆H/∆λi
and its uncertainty.
getDhDl.sh takes a single argument, the value of $LAMBDA to be processed. A small driver script
(not shown) loops over all the desired $LAMBDA values. Separating this looping leads to a simpler
and more versatile main script.
In the script, note the use of echo to automate the interactive utility gmx energy. echo ’8’
supplies the input that would otherwise be typed in.
23
1 #!/bin/bash¬
2 # argument is LAMBDA¬
3 ¬
4 # adjacent lambda values¬
5 LAMBDA=$1¬
6 LP=$((LAMBDA+1))¬
7 LM=$((LAMBDA-1))¬
8 ¬
9 # go to directory¬
10 cd Lambda_$LAMBDA¬
11 ¬
12 # run gmx energy on LP and LM output¬
13 # field 8 is potential energy (rerun has no KE)¬
14 echo '8' | gmx energy -f rerun_${LP}.edr -o energy_${LP}.xvg &> energy_${LP}.out¬
15 echo '8' | gmx energy -f rerun_${LM}.edr -o energy_${LM}.xvg &> energy_${LM}.out¬
16 ¬
17 # read comments atop .xvg files¬
18 exec 3< energy_${LP}.xvg¬
19 exec 4< energy_${LM}.xvg¬
20 for n in `seq 1 12`¬
21 do¬
22 read 0<&3 line1¬
23 read 0<&4 line2¬
24 done¬
25 ¬
26 # read the xvg header, echo to diff.xvg¬
27 for n in `seq 1 11`¬
28 do¬
29 read 0<&3 line1¬
30 read 0<&4 line2¬
31 echo $line1 > diff.xvg¬
32 done¬
33 ¬
34 # read 100 data lines, compute avg and variance¬
35 # write difference to diff.xvg¬
36 avg=0.¬
37 var=0.¬
38 for n in `seq 1 100`¬
39 do¬
40 read 0<&3 t e1¬
41 read 0<&4 t e2¬
42 diff=`echo "$e1 - $e2" | bc -l`¬
43 printf "%15.5f%15.5f\n" $t $diff > diff.xvg¬
44 avg=`echo "$avg + $diff" | bc -l`¬
45 var=`echo "$var + $diff*$diff" | bc -l`¬
46 done¬
47 # normalize¬
48 avg=`echo "$avg/100." | bc -l`¬
49 var=`echo "$var/100." | bc -l`¬
50 ¬
51 # write avg and error estimate to stdout¬
52 # /0.1 because dH/d lambda¬
53 dhdlam=`echo "$avg/0.1" | bc -l`¬
54 err=`echo "sqrt($var/100.)/0.1" | bc -l`¬
55 printf "%15.5f%15.5f" $dhdlam $err¬
56 ¬
57 # close files¬
58 exec 3<&-¬
59 exec 4<&-¬
60
Figure 7: getDhDl.sh, a script for extracting and analyzing the morphing results.
24
The other noteworthy feature of this script is the use of exec 3< and so forth to open multiple
files for reading, The file is then read with read 0<&3; successive reads from the same “file handle”
read successive lines in the file. The file is finally closed with exec 3<&-.
25
3.4 Index file construction
1 #!/bin/bash --login¬
2 ¬
3 chains=32 #Total chains in the box¬
4 monomers=20 #Residues in a given chains¬
5 backbone="aC1|aC2" #List all atoms that are part of the monomer backbone separated by |
… and prefix with a¬
6 ¬
7 nGroup=5 # index on group created (groups 0-4 exist a priori)¬
8 chain=1 # index on chain ¬
9 for chain in `seq 1 $chains`¬
10 do¬
11 r1=$((monomers*(chain-1)+1))¬
12 r2=$((monomers*chain))¬
13 ¬
14 echo $backbone"&r$r1-$r2" >> make_ndx_cmds¬
15 echo "name ${nGroup} chain${chain}" >> make_ndx_cmds¬
16 nGroup=$((nGroup+1))¬
17 ¬
18 echo $backbone"&!r$r1-$r2" >> make_ndx_cmds¬
19 echo "name ${nGroup} notchain${chain}" >> make_ndx_cmds¬
20 nGroup=$((nGroup+1))¬
21 done¬
22 echo "q" >> make_ndx_cmds¬
23 ¬
24 cat make_ndx_cmds | gmx make_ndx -f newBox.pdb -o PS.ndx¬
25 rm "#"*"#"¬
26
Figure 8: packing ndx.sh, a script to drive the interactive utility gmx make ndx to produce an
index file of backbone carbons for a polymer melt.
gmx make_ndx ordinarily responds to a sequence of typed queries and commands to create groups
of atoms satisfying certain criteria, rename those groups as desired, and ultimately write an index
file. Here we create a group for each chain, consisting of those atoms of type C1 or C2 (i.e., the
main-chain carbons) with atom numbers within range for that chain.
The automation approach here is to assemble a file make_ndx_cmds that contains all the input we
would ordinarily type. When the file is complete, we cat the file piped to gmx make_ndx.
26
3.5 Building a polymer .pdb file
1 #!/bin/bash¬
2 # reads monomer list from stdin¬
3 # arguments $1 $2 $3 are box dimensions¬
4 read monomer¬
5 gmx editconf -f $monomer.pdb -box $1 $2 $3 -noc -o newChain.pdb &> insert.log¬
6 read a b dx dy dz < $monomer.pdb¬
7 zOff=`echo "scale=4;$dz/10" | bc -l`¬
8 echo "0. 0. $zOff" > pos.dat¬
9 ¬
10 # loop over monomers¬
11 while read monomer¬
12 do¬
13 mv newChain.pdb oldChain.pdb¬
14 gmx insert-molecules -f oldChain.pdb -ci $monomer.pdb -ip pos.dat \¬
15 -rot none -scale 0. -o newChain.pdb &> insert.log¬
16 read a b dx dy dz < $monomer.pdb¬
17 zOff=`echo "scale=4;$zOff + $dz/10" | bc -l`¬
18 echo "0. 0. $zOff" > pos.dat¬
19 done¬
20 rm oldChain.pdb
The above script (buildChain.sh) uses the utility gmx insert-molecules to construct a .pdb file
for a polymer chain from .pdb files for the constituent monomers. The list of monomers is read
from stdin.
The script leverages the ability of gmx insert-molecules to renumber the atoms and residues of
each monomer it adds, so that the entire molecule is appropriately labeled.
The script needs to know where to place each successive monomer. The monomer .pdb files for
this purpose are specially constructed in “standard orientation” along the chain (z) axis, with the
first atom to be bonded at the origin.
Furthermore, the displacement vector from the bonded atom to the next bonded atom is included
as a comment at the top of the file. This comment is read (lines 6 and 16) and converted to a z
offset, which is added to the variable zOff. The current value of zOff is written to a short file
pos.dat, which tells gmx insert-molecules where to insert the next monomer.
27
1 #!/bin/bash¬
2 ¬
3 nMon=$1¬
4 echo "PSB"¬
5 for i in `seq 1 $((nMon-2))`¬
6 do¬
7 if (($RANDOM % 2));¬
8 then¬
9 echo "PSML"¬
10 else¬
11 echo "PSMR"¬
12 fi¬
13 done¬
14 echo "PST"¬
15
The above script (buildChainPS.sh) can be used with buildChain.sh to build an atactic polystyrene
chain. The script simply constructs a list of monomer names to be piped to buildChain.sh:
buildChainPS.sh 25 | buildChain.sh
The number of monomers nMon is passed as a command-line argument. The random tacticity is
accomplished by making a random choice in the loop over the number of monomers, with the use
of $RANDOM % 2, which randomly evaluates to 0 or 1 each time it is accessed. The structure of the
end monomers is different from the internal monomers, so they are written separately outside the
loop.
28
3.6 Building a polymer .itp file
The script below (itpMake.sh) assembles an .itp file for a polymer from a set of specially con-
structed “monomer .itp files”. itpMake.sh reads a list of monomer names from stdin, in the same
way as buildChain.sh above. This script serves as an alternative to gmx pdb2gmx, which works
well for proteins (and all-atom polymers, for which new residues can be defined and added to
aminoacids.rtp).
The work is done by a (long) function addMonITP, defined at the top of the script. It is often
convenient to define the function to be iterated separately from the iteration (as we did for the
script getDhDl.sh above).
The actual body of the script is rather short (lines 108–129); it consists of writing headers to
temporary files for the different declaration sections, looping over the monomers (lines 119–124),
and using cat to write the completed declaration sections to stdout.
addMonITP reads through the monomer .itp file, incrementing the atom numbers, residue numbers,
and charge group numbers throughout, and using printf to write the incremented values in a
convenient format to a set of temporary output files for the various declaration sections ([ atoms ],
[ bonds ], and so forth). Global variables cAtom, cRes, and cChgrp store the current “offset
values” of these numberings.
Finally, note again the use of exec 3&< $line.itp to open the monomer .itp files for sequential
access, so that the multiple read commands within addMonITP access successive lines in the file.
29
1 #!/bin/bash¬
2 ¬
3 # function to add one monomer¬
4 # all variables are global¬
5 function addMonITP {¬
6 itpFile=$1¬
7 # read header line; read counts¬
8 read 0<&3 line ¬
9 read 0<&3 nAtom nChgrp nCons nVsite nExcl nBond nAng nDih ¬
10 # read atom header¬
11 read 0<&3 line ¬
12 read 0<&3 line ¬
13 # copy atoms (iAtom is 1, res is 3, chgrp is 6)¬
14 for i in `seq 1 $nAtom`¬
15 do¬
16 read 0<&3 f0 f1 f2 f3 f4 f5 f6 f7 ¬
17 let f0=$f0+$cAtom¬
18 let f2=$f2+$cRes¬
19 let f5=$f5+$cChgrp¬
20 printf "%5s%5s%5s%5s%5s%5s%10s%10s\n" $f0 $f1 $f2 $f3 $f4 $f5 $f6 $f7 >> fAtom.temp¬
21 done¬
22 ¬
23 # read constraint header¬
24 read 0<&3 line ¬
25 read 0<&3 line ¬
26 # copy constraints (atom nums are 1,2)¬
27 for i in `seq 1 $nCons`¬
28 do¬
29 read 0<&3 f0 f1 f2 f3 ¬
30 let f0=$f0+$cAtom¬
31 let f1=$f1+$cAtom¬
32 printf "%5s%5s%5s%10s\n" $f0 $f1 $f2 $f3 >> fCons.temp¬
33 done¬
34 ¬
35 #read vsites header¬
36 read 0<&3 line ¬
37 read 0<&3 line ¬
38 # copy vsites (atom nums are 1, 2, 3, 4)¬
39 for i in `seq 1 $nVsite`¬
40 do¬
41 read 0<&3 f0 f1 f2 f3 f4 f5 f6 f7 ¬
42 let f0=$f0+$cAtom¬
43 let f1=$f1+$cAtom¬
44 let f2=$f2+$cAtom¬
45 let f3=$f3+$cAtom¬
46 printf "%5s%5s%5s%5s%5s%10s%10s\n" $f0 $f1 $f2 $f3 $f4 $f5 $f6 $f7 >> fVsite.temp¬
47 done¬
48 ¬
49 # read exclusions header¬
50 read 0<&3 line ¬
51 read 0<&3 line ¬
52 # copy exclusions (atom nums are 1, 2)¬
53 for i in `seq 1 $nExcl`¬
54 do¬
55 read 0<&3 f0 f1 ¬
56 let f0=$f0+$cAtom¬
57 let f1=$f1+$cAtom¬
58 printf "%5s%5s\n" $f0 $f1 >> fExcl.temp¬
59 done¬
60 ¬
61 # read bonds header¬
62 read 0<&3 line ¬
63 read 0<&3 line ¬
64 # copy bonds (atom nums are 1, 2)¬
65 for i in `seq 1 $nBond`¬
66 do¬
67 read 0<&3 f0 f1 ¬
68 let f0=$f0+$cAtom¬
69 let f1=$f1+$cAtom¬
70 printf "%5s%5s\n" $f0 $f1 >> fBond.temp¬
71 done¬
Figure 11: First page of itpMake.sh, a script to assemble a polymer .itp file from .itp files for
the constituent monomers.
30
72 ¬
73 # read angles header¬
74 read 0<&3 line ¬
75 read 0<&3 line ¬
76 # copy angles (atom nums are 1, 2, 3)¬
77 for i in `seq 1 $nAng`¬
78 do¬
79 read 0<&3 f0 f1 f2 ¬
80 let f0=$f0+$cAtom¬
81 let f1=$f1+$cAtom¬
82 let f2=$f2+$cAtom¬
83 printf "%5s%5s%5s\n" $f0 $f1 $f2 >> fAng.temp¬
84 done¬
85 ¬
86 # read dihedrals header¬
87 read 0<&3 line ¬
88 read 0<&3 line ¬
89 # copy dihedrals (atom nums are 1, 2, 3, 4)¬
90 for i in `seq 1 $nDih`¬
91 do¬
92 read 0<&3 f0 f1 f2 f3 ¬
93 let f0=$f0+$cAtom¬
94 let f1=$f1+$cAtom¬
95 let f2=$f2+$cAtom¬
96 let f3=$f3+$cAtom¬
97 printf "%5s%5s%5s%5s\n" $f0 $f1 $f2 $f3 >> fDih.temp¬
98 done¬
99 ¬
100 # increment counts¬
101 let cAtom=$cAtom+$nAtom¬
102 let cRes=$cRes+1¬
103 let cChgrp=$cChgrp+$nChgrp ¬
104 }¬
105 ¬
106 # main program¬
107 # write headers into each temp file¬
108 echo "[ atoms ]" > fAtom.temp¬
109 echo "[ constraints ]" > fCons.temp¬
110 echo "[ virtual_sites3 ]" > fVsite.temp¬
111 echo "[ exclusions ]" > fExcl.temp¬
112 echo "[ bonds ]" > fBond.temp¬
113 echo "[ angles ]" > fAng.temp¬
114 echo "[ dihedrals ]" > fDih.temp¬
115 ¬
116 cAtom=0; cRes=0; cChgrp=0 ¬
117 # read monomers from stdin;¬
118 # open file descriptor for each monomer¬
119 while read line¬
120 do¬
121 exec 3< $line.itp¬
122 addMonITP¬
123 exec 3<&-¬
124 done¬
125 ¬
126 # concatenate files¬
127 cat fAtom.temp fCons.temp fVsite.temp fExcl.temp fBond.temp fAng.temp fDih.temp¬
128 # delete temp files¬
129 rm fAtom.temp fCons.temp fVsite.temp fExcl.temp fBond.temp fAng.temp fDih.temp¬
130
31
3.7 Building a salt solution
As a final example of a job script, buildSystem.sh prepares the set of files needed to simulate
a salt solution in which the ions are confined by a harmonic “umbrella potential”. (The purpose
of the simulation is to measure the resulting concentration profile; from the profile, the osmotic
pressure as a function of concentration can be obtained, by analyzing the balance of forces on each
slice of the concentration profile.) The script takes as input parameters the number of ions of each
kind, the linear dimension of the system, and the “spring constant” (curvature) of the confining
potential.
The script is unremarkable until the final portion (lines 37–60), in which the index (.ndx) and
MD parameters (.mdp) files are generated. Because the confining potential acts independently on
every ion, the index file must contain a group for every ion, and the .mdp file must contain a set of
“pulling options” specifying the (identical) force acting on each of these groups.
The entries to the index file are generated by a brief script writeIndex.sh, which iterates over
the number of ions to write the necessary lines to stdout. The main script buildSystem.sh calls
writeIndex.sh, and appends its output by redirection (>>) to the existing index file.
The repetitive “pulling options” for all the ion groups are generated by a script writePulls.sh.
This script uses the stream editor sed to modify a “template file”, replacing keywords within the
file with the corresponding variable values. Here, pullHeader.mdp is the template file for the initial
lines in the section of .mdp that describe the pulling, with NIONS as the variable to be replaced.
Similarly, pullTemplate.mdp is the template file for the pulling options for each ion group, with
ION (the ion number), CTR (the center of the harmonic potential), and KVAL (the potential “spring
constant”) to be replaced by the corresponding variable values $i, $kVal, and $ctr.
1 #!/bin/bash¬
2 # arguments are # ions, spring constant, center position¬
3 ¬
4 nIons=$1¬
5 kVal=$2¬
6 ctr=$3¬
7 ¬
8 # copy the non-pull .mdp lines¬
9 cat mdTop.mdp ¬
10 # copy the top pull .mdp lines, replace NIONS¬
11 sed "s/NIONS/${nIons}/g" < pullHeader.mdp¬
12 ¬
13 # loop over the ions¬
14 for i in `seq 1 $nIons`¬
15 do¬
16 # replace the template .mdp lines with correct values¬
17 sed "s/ION/${i}/g;s/KVAL/${kVal}/g;s/CTR/${ctr}/g" < pullTemplate.mdp¬
18 done¬
19
Figure 13: writePulls.sh, a script to construct the repetitive “pulling options” for all the ions.
32
1 ; pull settings¬
2 pull = yes¬
3 pull-ngroups = NIONS¬
4 pull-ncoords = NIONS¬
5 pull-nstxout = 0¬
6 pull-nstfout = 0
1 pull-groupION-name=ionION¬
2 pull-coordION-type = umbrella¬
3 pull-coordION-geometry = direction-periodic¬
4 pull-coordION-vec = 0 0 1¬
5 pull-coordION-groups = 0 ION¬
6 pull-coordION-origin = CTR CTR CTR¬
7 pull-coordION-dim = N N Y¬
8 pull-coordION-k = KVAL
1 #!/bin/bash¬
2 ¬
3 nOffset=$1¬
4 nIons=$2¬
5 for i in `seq 1 $nIons`¬
6 do¬
7 echo "[ ion"$i" ]"¬
8 echo $((i+nOffset))¬
9 done¬
10
Figure 16: writeIndex.sh, a script to construct the repetitive “index file” entries for all the ions.
33
1 #!/bin/bash¬
2 # nEach = # of + and - ions, lBox = box dimension, kVal = spring constant¬
3 nEach=$1¬
4 lBox=$2¬
5 kVal=$3¬
6 ¬
7 # make job folder¬
8 JOB=${nEach}ions_${lBox}box¬
9 mkdir $JOB¬
10 ¬
11 # copy job scripts into folder¬
12 cp equil.sh job.sh $JOB¬
13 ¬
14 # make the system folder¬
15 mkdir $JOB/System¬
16 ¬
17 # copy .mdp files¬
18 cp em.mdp nvt.mdp npt.mdp mdTop.mdp $JOB/System¬
19 ¬
20 # start the .top file¬
21 cp systemTop.top $JOB/System/system.top¬
22 cd $JOB/System¬
23 ¬
24 # add water¬
25 gmx solvate -box $lBox $lBox $lBox -cs spc216.gro -o solvate.gro -p system.top &>
… solvate.log¬
26 ¬
27 # make a tpr file for genion to use¬
28 gmx grompp -f em.mdp -c solvate.gro -p system.top -o system.tpr -maxwarn 2 &> grompp.log¬
29 ¬
30 # add ions¬
31 cp solvate.gro system.gro¬
32 echo "2" | gmx genion -s system.tpr -o system.gro -p system.top -neutral -pname NA -pq 1
… -np $nEach -nname CL -nq -1 -nn $nEach -rmin 0.45 &> genion.log¬
33 ¬
34 #create ndx file #¬
35 echo "q" | gmx make_ndx -f system.gro -o system.ndx &> make_ndx.log¬
36 ¬
37 # compute nOffset (index of last non-ion atom)¬
38 # get second line from solvate.gro¬
39 inputLine=`head -n 2 solvate.gro | tail -n 1`¬
40 # parse as input (eliminates stray spaces)¬
41 read -a args <<< "$inputLine"¬
42 nAtoms=${args[0]}¬
43 let nOffset=$nAtoms-2*$nEach¬
44 ¬
45 # write index file lines¬
46 cd ../..¬
47 writeIndex.sh $nOffset $nIons >> $JOB/System/system.ndx¬
48 ¬
49 # write pullCount.txt¬
50 let nIons=2*$nEach¬
51 newLine="pull-ngroups = "$nIons¬
52 echo $newLine > pullCount.txt¬
53 newLine="pull-ncoords = "$nIons¬
54 echo $newLine >> pullCount.txt¬
55 ¬
56 # write pull lines¬
57 writePulls.sh $nIons $kVal > pullLines.txt¬
58 ¬
59 # write .mdp file¬
60 cat mdTop.mdp pullCount.txt pullLines.txt > $JOB/System/md.mdp¬
61 ¬
62 # clean up¬
63 rm pullCount.txt pullLines.txt¬
64 rm $JOB/System/"#"*"#"¬
65 ¬
66
Figure 17: buildSystem.sh, a script to construct the set of input files to simulate a salt solution
confined by an external “umbrella” potential.
34