KEMBAR78
PERL - complete_Training_Modules_Ref.ppt
Introduction to
Perl
Programming
Module 1
Perl Basics
What is Perl?
• Perl is a general-purpose programming language,
and can be used for practically any programming
task any other high-level language can be used for.
However, Perl is usually thought of as a “glue”
language, so called because it binds things
together (such as tying databases to Web pages,
converting files from one format to another, and
so on).
• Perl is very flexible and is currently available on
over two dozen operating system platforms
Perl
• The name Perl comes from “Practical
Extraction and Report Language”. Perl has
many features borrowed from other
programming languages.
• The Perl system uses an interpreter, called
“perl”. Usually Perl and perl are considered
to be the same thing for practical purposes.
Installing Perl
Versions of Perl
• The current versions of Perl are all in the 5.X and
6.X series (6.X was released in 2001). If you have
an older version of Perl (such as Perl 4.X), you
should upgrade it as many changes were made
between releases.
• Perl 4.X was a very buggy release of Perl and
should not be used. Also, many Perl programs
designed for 5.X will not work with 4.X.
Maybe Perl is already installed
• Many operating systems (Linux and UNIX
notably, but also Windows NT Resource Kit)
come with Perl installed. You can easily check
whether Perl is loaded on your system by opening
a console or terminal window and issuing the
command:
perl –v
If you get a version number, Perl is installed. If
you get an error message about command not
found (or something similar), Perl is not installed.
Where to get Perl
• Perl is available free of charge from many public
sites.There are several releases of Perl available for
different operating systems, so make sure you get a
current release.
• For Linux or UNIX systems, visit perl.com for the
latest releases
• For Windows systems, you can compile the Perl
source code yourself (a hassle) or download a
preconfigured Windows release at activestate.com
• For Macintosh, visit macperl.com for MacPerl
Perl documentation
• Every release of Perl comes with documentation in a
set of files. Most releases have over 1,700 pages of
documentation included in reference books, user
guides, FAQs, and so on.
• On most operating systems, a utility called perldoc is
installed as part of the Perl system. The perldoc utility
can search for and format Perl documentation for you.
To use perldoc to look up the basic syntax for perl,
open a terminal or console and issue the command:
perldoc perl
More on perldoc
• The Perl documentation is divided into parts by
purpose:
– perlfunc (Perl functions)
– perlfaq (Perl FAQs)
– perlop (Perl operators)
• To search for a particular keyword, use the –tf
options. For example to look up the print keyword:
perldoc –tf print
• To search the FAQs use –q as an option:
perldoc –q free
A first Perl program
What you need
• When you have installed Perl on your system, all
you need to use the language is a text editor that
can save ASCII files. All Perl scripts are written
and saved in ASCII characters.
• On some operating systems that do not have a Perl
GUI front end, you will need to use a console or
terminal window to interact with Perl. Some GUI-
based Perl front ends are available for Linux,
UNIX, Macintosh and Windows.
Comments in Perl
• All comments in Perl are written starting with a #
sign. Anything after the # sign through to the end
of the line is ignored by the interpreter.
• Comments can be placed anywhere on the line, but
commands cannot follow a comment on the same
line
• Multiline comments should have a # symbol as the
first character on every line
The #! directive
• The sole exception to # indicating a comment is
on the first line of a Perl program (or “script”).
All Perl programs can begin with the line:
#!/usr/bin/perl
• The #! is a hold-over from UNIX that instructs the
operating system to use the /usr/bin/perl program
to run whatever is in this file
• The path may be different for your system, and
many environments such as Windows do not need
this line. However, it will not cause errors.
Semicolons
• All valid Perl command lines end in semicolons.
Without a semicolon, Perl continues to read onto
the next line and doesn’t assume a carriage-return
is the end of a statement.
• You can break Perl commands over multiple lines
because of this, as long as a semicolon is the end
character in the complete statement.
• Perl uses semicolons in the same way as C/C++
and Java
Whitespace
• Whitespace is ignored by the Perl
intepreter. You can use whitespace (spaces
and tabs) anywhere in your programs to
make them more readable.
• You should use whitespace to help format
your scripts to show loops, logic layout, and
continuation of statements, as you will see
later in this course
The print command
• The print function tells Perl to display whatever
follows, such as a string, variable name, and so on.
You’ll see how to build complex print statements
later.
• The print statement allows the C or Java escape
characters to be used for line feeds, backspace,
tabs, and so on. For example, the command:
print “Hellon”;
will print “Hello” followed by a newline.
A Hello World script
• We can write a simple Perl script for the
traditional Hello World application:
#!/usr/bin/perl
print “Hello World!n”;
• These two lines can be save in a file as
ASCII and then run by perl by issuing the
command:
perl filename
Perl scalars
Scalars
• Scalars are the Perl term for basic units,
including strings and numbers of different
forms, as well as constants (which are often
called “literals”)
• There are several types of data supported by
Perl, and you will see most of them in this
and the next module
Numeric Scalar Variables
• Perl uses the dollar sign to indicate scalar
variables, followed by the name of the variable.
For example:
$date
is a variable called “date”. The dollar sign is a
type identifier that tells Perl this is scalar. Arrays
use a different identifier, as you will see later.
• Variable names are case sensitive, so $Date and
$date are different variables
Strings
• String types in Perl are like those in other
programming language. Strings are treated
literally when enclosed in quotation marks (either
single or double). Escape sequences can be used
with Perl strings. These are the most common:
– n newline
– r carriage return
– t tab
– b backspace
Special escape sequences
• Some escape sequences for strings have
special meaning to Perl:
– l change next character to lower case
– u change next character to upper case
– ’ literal single quotation mark
– ” literal double quotation mark
–  backslash
The q and qq operators
• Perl allows you to use these structures:
– q( )
– qq( )
Instead of single and double quotes, respectively. So,
qq(This is a test)
is the same as
“This is a test”
• These can be handy when embedding marks that
would otherwise need escaping:
qq(He said “Help!” then “Now!”)
Single and double quotes
• Double quotation marks allow expansion of
variables within them. Single quotes do not.
• For example:
“This is from $name1”;
is not the same as
‘This is from $name1’;
as the second will literally display ‘$name1”
which the first will substituted the value in the
variable name1.
Declaring variables
• Unlike many programming languages, variables
do not need to be declared prior to use with Perl.
When the variable is assigned an initial value, Perl
can figure out the data type.
• If you try to use an uninitialized variable, Perl will
use the value zero for a numeric, or Null for a
string. Avoid uninitialized variables as much as
possible, as results can be unpredictable.
Assigning values
• Variables are assigned values using the
equal sign:
$string1=“This is a test”;
$var1=6;
$var2=3.14159;
• You can assign operation results, as you
would expect:
$var3=$var2 + $var1;
The $_ variable
• The $_ variable is used by Perl as a default
variable. You can use it in place of variable names
in your scripts:
$_=“This is a test”;
print;
• This will print the default variable $_ and display
the string. Use the default operator carefully as it
can easily be confusing, but some operators and
functions work best with default variables, as you
will see later in this course.
Perl operators
Standard operators
• Perl supports the standard mathematical operators
+, -, *, /, % (modulus) and ** (exponent)
• Operator order of precedence applies according to
standard rules; parentheses group operations
• Operators can be combined in statements:
$num1=$num2 * ((3 + 8)*$num3/2);
• Multiple assignments can be made on one line:
$num1=$num2=$num3=7;
Exercise
• Write a program that simulates rolling five dice.
Assign values between 1 and 6 to five different
variables and display all five on the screen as well as
the sum of the five numbers. Later you’ll see how to
use random numbers, but for now just assign the
values.
• If you want to display a string and a variable together
in a print statement, separate them with periods:
print “The sum is ” . $sum;
You’ll see this in the next few slides.
Positive and negative
• Numbers are assumed to be positive unless you
specify a negative sign in front:
$num1=6; # positive
$num2=-6; # negative
$num3=-(-6); # positive
• You can convert positive to negative any time
using the minus sign in front of the variable:
$num4=-$num4;
Increment and decrement
• Like C/C++ and Java, Perl supports autoincrement
and autodecrement operators, which increase or
decrease a value by one:
$var1++;
is the same as
$var1=$var1+1;
and
$var2--;
is the same as
$var2=$var2-1;
Shortform assignment
• As with autoincrement and autodecrement,
Perl supports shortform assignments like
this:
$var1+=5;
which is the same as
$var1=$var1 + 5;
• This can be performed for all four basic
mathematical operators.
Operators and strings
• Strings can be used with the “.” operator for
concatenation:
$str1=“Hello ”;
$str2=“World!”;
$str3=$str1 . $str2;
print $str3;
would print “Hello World!”
• You can also concatenate on output in most cases
by specifying the string variables together:
print $str1 . $str2;
The x operator
• The repetition operator, x, is used with strings to
indicate a number of repeats for a string. For
example, the code:
$str1= “x” x 20;
will string 20 “x”s together and assign them to
$str1.
• You can use the repetition operator with existing
strings:
$str1=“x”;
$str2=$str1 x 20;
Other operators
• Perl supports several other operators:
– int: returns the integer portion
– cos: cosine
– sin: sine
– rand: random number between 0 and argument
– length: length of argument
– lc: converts to lowercase
– uc: converts to uppercase
Exercise
• Rewrite the last program to randomly assign
a number to each of the five dice. To use
rand, you need to specify the upper limit:
rand(5) will generate a number between
zero and 5. Remember all random numbers
have a lower limit of zero. Have the
program roll five dice and display results
between 1 and 6, as well as the sum of the
dice.
Converting strings to numbers
• Perl is flexible when using string types as
numbers, as long as the conversion makes sense.
For example, this works:
$str1=“6”;
$num1=10-$str1;
print $num1;
will display the value 4. Perl can convert the
string to a number if the string looks like a
number. This applies to decimal strings as well.
Converting numbers to strings
• Perl can also convert numbers to strings when the
conversion makes sense:
$num1=3;
$str1=“I did it” . $num1 . “
times”;
print $str1;
will display the message “I did it 3 times.”
• If the conversion doesn’t make sense to Perl, it
will use a zero instead when you try to call the
number.
Code blocks
• Perl statements can be grouped together into
blocks, each block surrounded by braces (like with
Java)
• Code blocks can be nested many deep
• Each code block is treated as a unit by Perl,
although execution is still always top to bottom
unless moderated by control structures
• Usually blocks will be associated with other
statements, such as if conditions
Exercise
• Modify the last program to display strings
explaining that you are going to throw the
five dice, generate five numbers, show them
one at a time like this:
The first dice was X.
The second dice was Y.
and so on, and display the sum on a
separate line.
Module 2
Control Structures
Comparison operators
• Perl supports the standard comparison
operators:
> greater than
< less than
>= greater than or equal to
<= less than or equal to
== exactly equal to
!= not equal to
True and false
• In Perl, any condition that evaluate to false is assigned
a value of zero. Anything that is non-zero is true. This
applies to conditions in statements (such as the if you’ll
see in a moment) as well as for numeric evaluation:
0 false
3 true (non-zero)
5-5 false (evaluates to zero)
0.00 false (it’s zero with precision)
“” Null string is false
“ ” String with a space or anything not null is
true
The if statement
The if statement
• Perl’s if statement is similar to those of other high-
level languages:
if (condition)
{ do if true }
else
{ do if false }
• The blocks of code can contain many statements.
The else and its statements are optional.
• Only one block of code is executed, depending on
whether the condition is true or false
Example of if
• A simple if statement is:
if ( $num1 > 5 )
{print “It is greater than 5.”;}
else
{print “It is less than or equal
to 5.”;}
• If there is only one statement in a block, you can
leave the curly braces off as long as a semicolon is
used, however many programmer use curly braces
to make the code more readable
Exercise
• Write a program that rolls six dice, all with
values between 1 and 6. Add up the result
of the die. If the total is greater than 20,
display a message that the user wins the
game. If the total is not greater than 20,
they lose.
• Modify the program to determine if the total
die roll is odd or even and display the result
Nested if-elses
• The if statement can be nested within other if
statements, either inside the blocks or as part of the
else:
if (cond1)
{ if (cond2)
{statements}}
else { if (cond3)
{ statements}
else
{statements}
}
Exercise
• Write a program that generates three
random numbers between 1 and 10.
Display a message telling the user whether
the sum of the three numbers is greater
than, less than, or equal to 15. Also, tell the
user whether the sum is even or odd.
Reading input
Input from the keyboard
• The easiest way to get input into a Perl
program is to read from the keyboard. To
do this, use the <STDIN> structure (the
standard input, or STDIN, is the keyboard
by default).
• You use <STDIN> to read a value and store
it in a variable like this:
$var1=<STDIN>;
The chomp operator
• When reading input from the keyboard, the entire
string entered by the user, including the RETURN is
saved in the assigned variable
• If you want to eliminate the newline character at the
end of the input, use the chomp operator to remove it:
$str1=<STDIN>;
chomp $str1;
print “You said ” . $str1 . “.”;
without the chomp operator, the print statement
would have moved down a line at $str1
Exercise
• Write a program that randomly chooses a
number between 1 and 100. Let the user
enter a guess, and tell them whether they
got the number correct, or whether the
random number is higher or lower than the
guessed number, and by how much. Only
do this guess once.
String relationships
• There are relational operators for strings (actually,
for non-numbers):
eq equal to
ne not equal to
gt greater than
lt less than
ge greater than or equal to
le less than or equal to
• Comparisons are left-to-right, using ASCII values
Example of string comparisons
$str1=“Hello”;
$str2=<STDIN>;
chomp $str2;
if ($str1 eq $str2)
{print “You guessed the string!
n”;}
else
{print “Wrong guess!n”;}
Exercise
• Write a program that creates three different
variables, all with names of animals
assigned to them (one animal per variable).
Display the three animals for the user. Have
a random number generator pick one of the
three animals, and ask the user to guess
which animal was chosen. Let the user
know whether they guessed correctly or not.
Booleans
Boolean operators
• Perl supports to Boolean AND and OR operators
in the same way as other high-level languages:
&& AND
|| OR
• These operators are often used for conditions:
if (($num1 < 10) && ($num1 > 5))
• Perl also allows the use of the words “and” and
“or”:
if ($num1 < 10 and $num1 > 5)
The NOT operator
• Perl allows the negation NOT operator to be
specified either as “!” or as the word “not”:
if (($x < 5) && !($x > 0))
if ($x < 5 and not $x > 0)
• Boolean conditions are always evaluated left to
right. Keep in mind some complex combinations
may not make sense. Check the logic carefully
with compound statements.
Exercise
• Write a program that asks the user for a
number between 1 and 100. Check to see if
the number is even or greater than 10. If it
is, display a message to that effect. Also
check to see if the number is between 15
and 25 and display a message showing the
result.
Shortform ifs
The shortform if
• A shortform if statement is popular in Perl, and is
applicable if there is only one statement that
would reside inside the code block. Instead of
writing:
if (condition)
{statement;}
you can write:
statement if (condition);
• This may look confusing, and many programmers
do not use it, but it is legal
Example of shortform if
• This is an example of the shortform if
statement:
$flag = 1 if ($num1 > 10);
• This is the same as writing
if ($num1 > 10)
{$flag = 1;}
• Shortform ifs can only be used when a
single statement is to be executed
The elsif construct
• Instead of using if-else structures to nest ifs, you
can also use the shortform elsif. This removes the
need for a set of curly braces for the else. Instead
of writing:
else { if (cond) {statements…}}
you can write:
elsif (cond) {statements…}
• The use of elsif simplifies the number of braces,
but some find it awkward to read easily
Exercise
• Write a program that asks the user for the outside
temperature in degrees Fahrenheit. Display the
equivalent in degrees Celsius. The conversion
formula is:
F=32+9C/5
where F is degrees Fahrenheit and C is degrees
Celsius. Then, if the temperature is going to be
below 40F tell the user to take a coat. If the
temperature is above 80F tell them to avoid sunburn.
If it’s in between, tell them it will be a great day!
Module 3
Looping
Perl loops
• As with all other high-level languages, Perl
supports loops. The for and while loops are
similar to those in languages like C/C++.
• Loops allow you to run a block of code as
many times as you want, as long as some
condition evaluates to true. Loops always
have some condition attached to them.
The for loop
The for loop
• The for loop in Perl is similar to that in C/C++ and
Java. It consists of three components:
for (initial;condition;increment)
where initial is the code to run prior to starting the
loop, condition is tested prior to each loop and
must be true for the loop to continue, and
increment is performed after every loop.
• The three parts of the for loop must exist
(although they can be empty) and are separated by
semicolons
The parts of the for loop
• The initial statements in a for loop are always
executed, whether the condition is true or not.
The initial statements are executed prior to the
condition being examined, and are never executed
again (unless inside a larger loop)
• The continuation condition is evaluated prior to
the code block execution, and revaluated each
loop. At the end of the execution of the code
block and prior to the condition being tested, the
increment conditions are executed.
Example of a for loop
for ($x=1; $x<=10; $x++)
{
print $x . “n”;}
• This program will print the numbers from one to
ten in a column and then exit. The initial condition
sets the value of $x to 1, each loop increments it
by one, and the condition is true until x in
incremented to 11.
Exercise
Write a program that throws six dice,
generating random numbers between one and
six. Display each number as it is thrown, and
after six throws, show the sum. Use a for loop
to throw the dice six times, using the same
variable for each throw.
The while loop
The while loop
• The while loop uses a condition to test
whether to loop or not. When the condition
is true, the loop executes. When the
condition is false, the loop terminates. The
syntax is:
while (condition)
{statements…}
The condition can be anything that
evaluates to true or false.
Example of a while loop
$num1=6;
while ($num1 < 12)
{
print $num1;
$num1++;
}
• This loop will print the values of $num1 starting at
6 and going through to 11. The loop fails when
$num1 is incremented to 12.
Exercise
• Modify the last program you wrote to use a
while loop instead of a for loop.
The last and next statements
The last statement
• The last statement can be used to terminate a loop,
regardless of the condition’s value. The last statement
is similar to break in other programming languages.
The last statement is usually used with some condition:
while ($x < 10)
{ if ($x == 6) {last;}
statements…}
• In this case, when $x is exactly 6, the last statement
terminates the loop and execution continues past the
closing curly brace.
The next statement
• The next statement causes execution of a loop to
restart. It is similar to the continue statement in some
languages. Any statements below the next statement
are not executed. The next statement is usually used
with a conditional:
while ($x < 10)
{ if ($x == 6) {next;}
statements…}
• When the next is encountered when $x is 6, the loop is
restarted again, and the statements below the if are not
executed on this pass
Exercise
• Write a program that prompts the user for
numbers, adding the numbers the user
enters together as they are entered. If the
user enters the number 13, terminate the
loop at that point and display the total so
far. If the user enters the numbers 9 or 18,
ignore the numbers in the running total, but
continue asking for other numbers.
Code block labels
Labels
• Blocks of code and the for and while statements
can all be labeled, and identified by that label. To
use a label, place it before the statement block
followed by a colon:
BLOCK1: {statements…}
• The name of the label can be any valid ASCII
characters. The convention is to use uppercase for
labels so there is no conflict with existing
keywords.
Labels and loops
• When you are labeling a loop, use the same
convention as a code block:
OUTERLOOP: for ( …;…;…)
BIGCOND: while (cond)
• Labels allow you to specify which loop is affected
using last and next:
if (cond) { last OUTERLOOP;}
will terminate the loop called OUTERLOOP, even
if the statement is nested levels deep inside
OUTERLOOP
The exit statement
The exit statement
• The exit statement is used to terminate a Perl
script at any time. Whenever the exit statement is
encountered, the script is terminated. You can
send a return status code back to the calling
program with exit, if you want, by appending the
return code after the exit statement:
if ($val1 == 0) { exit 0;}
• This will exit the program with a return code of 0
is $val1 is zero.
Exercise
• One of the traditional exercises to show
loops is finding prime numbers. Write a
program that displays all the primes
between 1 and 1,000. Show each prime as it
is found. Also show how many primes you
found at the end of the program.
Some Perl functions and
operators
Perl functions and operators
• Perl is a very flexible language. As you will see,
there are several ways to accomplish the same
tasks in Perl, sometimes quite easily if you know
all the functions or operators.
• There are several useful functions that work with
scalars and strings, and we can look at these in the
next few slides
• A Perl reference book is a handy resource for
looking up functions and operators
The index and rindex functions
The index function
• The index function is used to find one string inside
another. For example, if you have the string “A
stitch in time” and want to find out if the string
“itch” occurs inside it, you could use the index
function.
• The syntax of index is:
index string, search_string;
where string is the string to be examined for
search_string. Of course, you can use variables for
either string component.
Perl functions and parentheses
• Perl functions can be written with or without
parentheses in most cases, so the statements:
index string, search_string;
and
index(string, search_string);
are identical. It is up to you whether you use
parentheses.
Example of index
• If index finds the substring, it returns the position of
the start of the substring in the string, counting from
zero. If not found, -1 is returned.
• To find “itch” in “A stitch in time”, you would issue
the command:
index “A stitch in time”, “itch”;
or you could use variables:
$str1 = “A stitch in time”;
$str2 = “itch”;
$foo=index $str1, $str2;
which will return $foo a value of 4.
Modifications of index
• You can specify a starting position for the
substring search, allowing you to skip a known
substring if you want. To specify a starting
position, give the number after the substring:
index $str1, $str2, 6;
this will start looking for $str2 starting at the 6th
position of $str1.
• This could also be written:
index($str1, $str2, 6);
The rindex function
• The rindex function is the same as index, except it
starts at the right of a string and works towards to
left. As with index, it will return the position of
the first match, or –1 if no match is found:
$str1 = “A stitch in time”;
$str2 = “itch”;
$foo=rindex $str1, $str2;
Again, $foo would have the value of 4.
• You can specify a starting position with rindex in
the same way as index
Exercise
• Prompt the user for a long string, followed
by a shorter one. Use both index and rindex
to locate the short string inside the longer
one and compare the results.
The printf function
The printf function
• The printf function is a more talented version of
print, and is similar to the printf in other languages
like C and C++
• The printf function is a “formatted print” and
allows better control of the output from a print
statement. The syntax of the printf statement is the
same as that in C/C++:
printf format, list;
where format is the format specifier and list is
what is to be formatted.
The printf format specifiers
• The format specifiers for printf have the general
format of:
%-w.dl
where % is the identifier used for a specifier, - is an
optional minus sign used for justification, w is the
width of the field, the period is an optional part
followed by the number of decimals, and l is the field
type. The field type must be specified. Examples are:
%20s
%-6.2f
Field types
• The most common field types for the printf
statement are:
c character
s string
d integer number (no fraction)
f floating number
• There are some other field types, but they are
rarely used in programming.
Examples
• To display the variable $num1 with 6 digits total,
two to the right of the decimal, you would use a
printf like this:
printf “%6.2f”, $num1;
or
printf(“%6.2f”, $num1);
• To print leading zeros if the number does not have
enough digits, add a zero in front of the width:
printf “%06.2f”, $num1;
String examples
• Strings can be displayed with format specifiers,
too:
printf “%10s”, “Tim”;
This will right-justify “Tim” into a field 10
characters wide.
• To left-justify a string, use a minus sign in front of
the specifier:
printf “%-10s”, “Tim”;
Multiple variables
• If you are displaying more than one value, you
need a format specifier for each. They are read
from left to right:
printf “%6d %5.4f %3d”,$x1,$x2,$x3;
• The format specifiers are matched up with the
variables in order. Multiple format specifiers can
be inside the quotations marks, as can any string:
printf “Total is %8.4f”, $total;
Exercise
• Modify the prime number program you
wrote earlier to search primes between 2
and 1000. List the primes in a column,
stacked one on top of each other so the
column of digits are correct, like this:
001
011
111
etc…
Module 4
Lists and arrays, Part 1
Scalars, lists, arrays and hashes
• The last few modules have dealt with scalars:
variables with a single value. There are times you
want to work with a collection of data, all the
same type of different types.
• With Perl, data that is not a scalar is one of three
different types: list, hash, or array. We’ll look at
all three types in this module.
Perl lists
Lists
• Lists are groups of scalars and usually reflect
multiple values of the same type of data. There are
several ways to represent lists, but the most
common is to put the list in parentheses, separated
by commas:
(“Tim”, 1234567, 16.65)
• This list has a string value and two numbers, one
an integer and one a float (although you don’t care
about the numeric types). Each part of this list is a
“list element” so there are three list elements here.
List contents
• Perl allows lists to have as many elements
as you would like in them: there is no size
limitation. Lists can therefore be very large.
• You can use variables inside lists, too:
(“Tim”, $var1, $num1, 65)
• Lists can refer to other lists, holding one list
inside another list as a variable
Creating list variables
• List variables are similar to scalar variables except
the $ used as the first letter of a scalar variable
name is replaced by an @ sign for array variables.
Lists are a type of array. The array is the variable
that had a list assigned to it.
• The same rules apply for array variable naming as
with scalar variables, although $foo and @foo are
two different variables as far as Perl is concerned
because the first letters are different
Assigning lists
• You can assign a list to a variable the same way as
any other value. This creates an array variable
containing the list:
@list1=(“Tim”, 45, 56.0);
• When one list contains another list, you use the
same syntax:
@list2=(@list1, “sample”, 5.3);
• You can combine lists and variables:
@list2=(@list1, “sample”, $num1);
The qw operator
• When you are adding several scalars to a list and
don’t want to bother with quotation marks, you
can use the qw operator like this:
qw(red green 45 $x);
• The qw operator has a whitespace separated list in
parentheses, and it converts everything to literals
(with no expansion of variables, so $x is a literal).
The line above is equivalent to:
(‘red’, ‘green’, ‘45’, ‘$x’);
Getting elements from an array
• You can retrieve list (array) elements by using
their element index value. All list elements are
numbered, starting from zero, left to right. To
access an array element, use the element number
in square brackets:
@test=(0,1,2,3,4,5);
print @test[3];
• This will display the fourth element (because of
zero-origin) which will be 3
A slice of a list
• You can retrieve more than one list element at a
time. This is subgroup of a list is called a slice.
Specify the list elements you want separated by
commas:
@biglist=qw(a b c d e f g h);
@vowels=@biglist[0,4];
• You can subgroup or slice lists many times using
this method
End of a list, method #1
• You can find how many elements there are in a list
by using the special symbol $# followed by the list
name. For example:
@list1=qw(a b c d e);
print $#list1;
will display the value 4 (counting starts at 0, and
there are five elements, so the last is @list1[4].
• Can you use this method to assign the size of the
array to a variable, like this:
$size=$#list1;
End of a list, method #2
• A sneakier way to determine the length of a list is
to use a scalar conversion:
$size = @list1;
• $size is a scalar and cannot contain the contents of
list @list1. It holds the number of elements in
@list1. (This works because of Perl contexts,
which you will see later.)
Exercise
• Create an array that holds the numbers from
1 to 10. Create a slice of that list with a
different name that holds all the odd
numbers, and another slice of that sublist
that holds all the primes. Write a program
that displays the last slice, one element at a
time. Display the size of all three lists.
The foreach loop and arrays
Accessing elements
• You already know you can access a list element
using its index number, such as
@list[4];
Often you will have to move along an entire array,
working with each element in the list. You can do
this with a for loop:
for ($x=1; $x<@list; $x++)
{ print @list[$x];}
• The @list in the for condition evaluates to a
scalar, the length of the list
The foreach loop
• Perl has an easier setup for moving across a list, and
that’s the foreach loop. The foreach loop creates an
index variable that is equal to each element in the list,
left to right. Here’s the same code as the previous
slide:
foreach $x (@list)
{ print @list[$x];}
• The variable $x is set to each element in @list in turn,
so you don’t have to set up the for loop. The foreach
loop starts at index 1, so the 0th
element is not counted.
The for and foreach loops
• Perl uses the foreach and for loops interchangably:
you can use either in most situations
• However, you will find that most Perl
programmers use foreach when working with
arrays, and for in other situations. This makes it
easier to keep track of the loops and uses.
Exercise
• Modify the last program you wrote to use a
foreach or for loop to display the list
elements in each of the three lists.
The pop and push operators
Arrays and stacks
• Perl has many ways of doing things. Another way to
move through a list is to use the pop and push
operators.
• You can think of an array as a stack, like a stack of
paper. When you add something to the top of the
stack, everything else gets moved down one place.
This is called a push operation. When you take
something off the top of the stack, everything moves
up one place. This is called a pop operation. (These
terms will be familiar to anyone who knows RPN
notation.)
Using pop
• Perl supports the pop and push operators to work
through a list. The list is stacked, one on top of the
other, from left to right, so the last entry is on top.
Here’s an example:
@list=(“A”, “B”, “C”, “D”, “E”);
$pick=pop @list;
print $pick; #prints “E”
$pick=pop @list;
print $pick; #prints “D”
• You can work through the stack to the bottom using
the pop operator
Using push
• The push operator will add things onto the top of the
stack:
@list=(“A”, “B”, “C”, “D”, “E”);
pop @list; # “E”
pop @list; # “D”
$x=pop @list; #holds “C”
push @list, $x; #puts “C” back
The trick to using pop and push is imagining a
pointer in the list that moves with each operation.
Once something has been popped off the stack it is
not in the stack anymore unless pushed.
More push
• You can push anything back onto a stack. For
example:
@list=(“A”, “B”, “C”, “D”, “E”);
pop @list; # “E”
pop @list; # “D”
push @list, “F”; #puts “F” back
• The stack will now have (“A”, “B”, “C”, “F”)
• You can add more than one item to the stack:
push @list, “F”, “G”, “H”;
Pushing arrays
• So far we’ve only pushed scalars, but you
can push arrays onto a stack as well:
push @list @array;
• When you push a list onto a stack, the new
list is a concatenation of the two
• You can push lists of lists, as well. To do
this, the list of lists gets flattened into a
single list, then added to the stack.
The shift and unshift operators
• The push and pop operators work on the top
element of a stack. The shift and unshift operators
do the same task at the bottom of the stack.
• The unshift operator adds an object to the bottom
of the stack, and the shift operator removes an
object from the bottom of the stack.
• You can use pop/push and shift/unshift at the same
time. This is one way of manipulating arrays and
list elements, but not all programmers use this
method.
Context
Context
• Context has to do with the way Perl uses variables
and values. The two important types of contexts
in Perl are scalar context and list context.
• When assigning variables, the expression on the
left of the equals sign determines what context the
right side is evaluated in. The differences between
scalar context and list context indicate whether
values are treated as lists or scalars, as you will
see.
Scalar and list contexts
• Scalar context arises when using scalars:
$num1=$num2;
since the left is a scalar, the right is treated as a
scalar. If the left is a list, the right is treated as a list:
@list1=@list2;
• There are times when mixed contexts arise:
$num1=@list1;
which has a scalar to the left so it evaluates the right
as a scalar and returns the number of items in the list.
Forcing scalar
• There are times when you want to force a list to be
treated as a scalar. The print function actually
expects a list as an argument, so the command:
print @list1;
prints the contents of the list @list1.
• You can force the print command to accept a
scalar context like this:
print scalar(@list1);
which will convert the list to a scalar and display
the number of elements in the list.
Reordering array elements
Reordering array elements
• You will often need to work with array elements
in a different order than they appear in a list. You
could code variable storage for each element, but
there are better ways in Perl. To organize a list in
some order, there is the sort function.
• The syntax of the sort function is:
sort list;
where list is the name of the list to be sorted.
• By default an ASCII sort is produced. The original
list is not modified.
Using sort
• You can use sort to order a list, and assign the
result to another variable:
@let1=qw(A F D w U d F l X H);
@let2=sort(@let1);
• It is important to remember that the ASCII sort
will result in lower case letters appearing after
upper case letters. Also, numbers will sort in a
different order than you expect. For example, 12
will be after 102. To handle these issues, different
sorting orders are needed.
Exercise
• Create an array with a list of names in it,
and sort them in ASCII order, displaying
the result.
Changing the sort order
• To change the sort order from ASCII, you need to
add an argument to the sort function. The
argument is a block of code which contains
instructions on the new sort order.
• Inside the block of code, two variables ($a and $b)
are used to indicate two elements in the list. The
logic returns one of three values when comparing
$a and $b:-1 if $a is less than $b, 1 if $a is greater
than $b, and 0 if they are equal.
The “spaceship” operator
• To help with changing the sorting order of
numbers, Perl has the spaceship operator “<=>”
which performs the comparison of $a and $b. The
spaceship operator returns –1 if the left is less than
the right, 1 if the left is greater than the right, and
0 if they are equal. Here’s how to use it:
@newlist=sort {$a <=> $b} @list;
• This command will sort list based on the argument
values. This works with numbers, not strings.
Exercise
• To see how the spaceship operator and the
sort function work, create a list of random
numbers (you can generate it or simply
enter one on the keyboard). Then, use the
sort function with the spaceship operator to
sort the numbers in ascending order, and
display the result.
Converting scalars and arrays
Scalars to arrays
• Perl allows conversion between scalars and arrays in
several ways
• You can convert a scalar to an array using the split
function, which requires a pattern to be specified that
breaks the scalar into an array based on that pattern.
The pattern is surrounded by slashes. For example,
to break the scalar “This is a script” into an array
based on spaces, you would issue the command:
@n1=split(/ /,“This is a script”);
Results of split
• The result of the command:
@n1=split(/ /,“This is a script”);
would be an array called @n1 that has the list
(“This”, “is”, “a”, “script”) as the contents.
• If you don’t specify a pattern to break up the scalar,
whitespace is used by default. If you do not specify
a variable name as an argument to split, the default
variable $_ is used.
• A special pattern is null (//) which breaks the scalar
up character-by-character
Using split
• The split function allows you to break up arrays
into smaller parts when combined with foreach:
@cpu=(“Intel, Pentium”, “AMD, Athlon”,
“Transmeta, Crusoe”);
foreach $x (@cpu)
{ ($company, $chip)=split(/,/, $x);}
• This uses the foreach to extract the three list
elements in @cpu, and then split breaks those list
elements into two parts based on the commas.
These are then assigned to the scalars $company
and $chip respectively.
Exercise
• Create an array which holds a list of record
albums, consisting of album title, the year it
was released (guess if you don’t know) and
artist (in that order). Add at least five
albums to the list. Then, use the split
function and a for or foreach loop to display
the artist followed by the title for each list
element.
Module 5
Lists and arrays, Part 2
Converting scalars and arrays
Arrays to scalars
• The split function lets you convert scalars to
arrays. To convert arrays to scalars, use the
join function. The join function expects a
string and a list as arguments, and joins the
list together using the string, returning a
string scalar.
• The syntax is:
join string (list);
Example of join
• To join a list of letters into a single string,
separated by spaces, you would issue the
command:
$string=join(‘ ’, (‘A’, ‘B’,
‘C’, ‘D’, ‘E’));
The result would be $string set to “A B C D E”.
(The parentheses around the arguments are
optional as with most Perl functions.)
• The joining string can be a null or any valid ASCII
character
Exercise
• Create an array with a list of numbers from
1 to 10, then convert the array into a scalar
using commands to separate the numbers.
Use a hyphen to separate the numbers. Use
the join function to perform the conversion.
Split and join together
• You can use the split and join functions together
to manipulate strings. For example:
$str=“Hello”;
$temp=join(‘ ’, split(//,$str);
print $temp;
• This will result in the output “H e l l o”. The split
breaks the scalar into an array based on individual
characters, and the join adds the parts back
together into a list with spaces between.
The reverse function
• The reverse function is used with strings to
reverse the characters in the string. It
returns a reversed string. For example:
$str1=“A B C D E”;
$str2=reverse($str1);
print $str2;
will display the string “E D C B A”.
Exercise
• Write a program that prompts the user for a
string, and then display it back, reversed, to
the user with hyphens between each of the
letters in the string. If they entered “This is
a test” you would display “t-s-e-t- -a- -s-i- -
s-i-h-T”.
Hashes
Hashes
• Scalars are single data entities. Arrays are ordered
collections of data entities. A hash is an unordered
collection of data which may contain many sets of
data, but there is no “first” or “last”, no “top” or
“bottom” item in a hash.
• Hash data is always in pairs
• Hashes are treated much like lists, but the hash
variable is used. Hash variables all start with %.
Creating a hash
• A hash is created in the same way as a list:
%hash1=(“Intel”, “Pentium”, “AMD”,
“Athlon”, “Transmeta”, “Crusoe”);
• To help readability, the hash is usually laid out in
its declaration:
%hash1=(“Intel”, “Pentium”,
“AMD”, “Athlon”,
“Transmeta”,“Crusoe”);
• This shows the pairing of hash data more clearly
The => operator
• To make relationships in hashes clearer, Perl allows
the use of the => operator, which Perl interprets to
mean a double-quoted string and a comma. This:
%hash1=(“Intel”, “Pentium”,
“AMD”, “Athlon”,
“Transmeta”,“Crusoe”);
becomes this:
%hash1=(Intel => Pentium,
AMD => Athlon,
Transmeta => Crusoe);
Hash keys
• When you have a pair like this:
Intel => Pentium
the left side of the pair is called the hash key, and
the right side is the value. The hash key is used to
look up the pair.
• All hash keys must be unique. If you have more
than one hash key the same, the newer entries will
overwrite the older ones in the list. Values do not
have to be unique. Having unique hash keys leads
to the use of hashes as a quick-lookup structure.
Locating hash entries
• To locate a specific entry in a hash, we use the
hash keys. The key is enclosed in curly braces:
%hash1=(Intel => Pentium,
AMD => Athlon,
Transmeta => Crusoe);
print $hash1{AMD};
• This will print the value corresponding to the
AMD hash key, which is “Athlon”. Note the use
of a scalar $hash1.
Exercise
• Create a hash with a set of names and phone
numbers. Populate the hash with five sets
of data. Ask the user which name they
would like to look up the phone number for,
and use that as the key to display the
number.
Showing all entries with keys
• Instead of using a for or foreach loop to show all
entries in a hash, the keys function can be used:
keys(%hash)
• This will show a list of all the keys. To show keys
and values, a for loop can be used:
for (keys %hash) {
print “$_ is the value for
$hash{$_}n”;}
• This uses the default variable show the hash key
($hash{$_}) and the value assigned to it as $_.
Reversing hashes
• You can use the reverse function on a hash to reverse
the pairs, converting the value to a hash key and the
hash key to a value. This allows you to look up with
either value:
reverse %hash;
• Reversing a hash swaps the hash key and value, not
the order of the pairs in the hash
• Be careful when using reverse. Since all hash keys
have to be unique, there is the chance of losing some
data if the values are the same prior to reversing.
Exercise
• Set up a hash that contains a few city names and
their corresponding zip codes (guess if you don’t
know them). Then, display the names of the cities
and ask the user to enter one. Display the
corresponding zip code. Reverse the hash and
display the zip codes, asking the user to choose
one. Display the city for that zip code. You can
display the lists of cities and zip codes any way
you want, or display both at the same time with
the keys function.
Modifying hash contents
Adding entries to a hash
• You can add entries to a hash by specifying the
key and value:
%hash1=(Intel => Pentium,
AMD => Athlon,
Transmeta => Crusoe);
$hash{HP} = “PARISC”;
• This adds the entry HP as the key and “PARISC”
as the value to the hash. Again, we use a scalar for
the function.
Changing values
• You can change values in a hash by reassigning them:
%hash1=(Intel => Pentium,
AMD => Athlon,
Transmeta => Crusoe);
$hash1{AMD} = “Thunderbird”;
• This will change the value associated with the key
AMD. To change the key, you can add a new entry
and delete the old one, or reverse the hash and change
the value, then reverse back (which may cause
problems).
Deleting hash entries
• To remove an entry from a hash, the delete
function is used with the hash key:
%hash1=(Intel => Pentium,
AMD => Athlon,
Transmeta => Crusoe);
delete ${Transmeta};
• This will delete the hash pair starting with the key
“Transmeta”.
Converting hashes to arrays
• You can convert hashes to arrays and vice-versa
easily, since they both have the same list structure.
To convert, simply reassign using the proper
variable name:
%hash=(“A”, “B”, “C”, “D”);
@array=%hash;
%hash2=@array;
• The first line creates a hash, the second converts
the hash format to an array format, and the third
line creates a new hash with the array’s contents.
Blanking a hash
• To remove all data from a hash (hash keys and
values), you could use the delete function in a
loop, but an easier way is to simply redefine the
hash. Suppose you want to blank:
%hash=(“A”, “B”, “C”, “D”);
To blank this hash, just redefine it:
%hash=();
and the old values are deleted and a hash with no
contents is left.
Sorting a hash
• You can use the sort function to sort a hash. This
is easily done by using keys to extract the hash
keys and then use sort on those keys:
foreach ( sort keys %hash1)
{ print “$_ $hash1{$_}n”;}
• In this case, we’ve just printed the sorted hash, but
you could store the results in a new has variable in
sorted order
Testing for hash keys
• You can test to see if a hash key exists, preventing
problems with other statements. Most people
would assume this would work:
if ( $hash{$key})…
but it doesn’t.
• To check whether a hash key exists, use the exists
function instead:
if (exists $hash{$key}) …
Exercise
• Modify the last program you wrote to:
– display the current hash, and ask the user if they want
to add a new entry. If they do, prompt for the city and
zip and add it to the hash
– display the hash again and ask if the user wants to
delete an entry. If they do, ask for the hash key to
delete, and remove that entry
– perform the same lookups for city and zip as before, but
check to make sure the keys exist. If not, display an
error message.
The grep function
The grep function
• The grep function searches for patterns in an array.
The syntax for Perl’s grep is:
grep pattern, list;
• For example, to find all the elements with the
pattern “day” in the array:
@days=qw(Monday, Tuesday, Friday);
you would issue the command:
@result=grep /day/, @days;
which will populate the array @result with the
matching elements.
How grep works
• The Perl grep works by proceeding through an
array, one element at a time, and assigning the
element to the default variable $_. The pattern to
be found is then compared against $_.
• If the pattern is found in $_, the expression is true
and the element is returned by grep.
• If the pattern is not found, the element is not
returned by grep.
Hash intersections
Intersections
• A common task with Perl is finding the
intersections of two hashes or arrays. In other
words, find out which elements are in common
with two sets of data. We can also find the
difference, by finding everything not in the
intersection set.
• The find an intersection, the grep function is very
useful
Performing the intersection
• Finding an intersection between @array1 and @array2 is
surprisingly simple:
%temp=();
foreach (@array1)
{ $temp{$_}=1;}
@intersect=grep $temp{$_}, @array2;
• This code starts by setting up an empty hash. The foreach
stores each element in @array1 in $_, one at a time, and
fills the temp list with them and sets the values to 1 (so it is
true). The last line examines @array2 one element at a time
and sets it to $_, which is grepped in %temp. If there, it is
an intersection element and added to @intersect.
Finding the difference
• Finding a difference is similar to an intersection
but using negation:
%temp=();
foreach (@array1)
{ $temp{$_}=1;}
@intersect=grep (! $temp{$_}, @array2);
Exercise
• Create two arrays of numbers, both with
some identical and some different values.
Display the intersection and difference of
the two arrays. Display the two derived
arrays in sorted order.
Module 6
File and directories
File handles
Filehandles
• In order to work with files, you need to use a
filehandle. A filehandle is a variable that acts as a
reference between your Perl program and the
operating system’s file structure.
• Filehandles contain information about the file, the
way the file was opened (read-only, etc), where
you are in the file, and some other attributes.
• Every file manipulation in Perl is done through
filehandles
Naming filehandles
• Filehandle variables do not have a special
character in front of them like scalars, lists, arrays,
or hashes. For that reason, the convention is to
use uppercase for filehandle variables to avoid
confusion with Perl keywords.
• Filehandle names can be any combination of
characters you want, but descriptive names are
often easiest to work with and keep track of
Opening a file
• To open a file to be read by Perl, you need to use
the open function with a filehandle. The syntax
for open is:
open handle, filename;
where handle is the filehandle and filename is the
file to be opened, which may include a path.
• An example of using an open function is:
open (BIGFILE, “file1.dat”);
• If you do not specify a directory path, the current
directory is assumed
Checking an open
• Normally you will embed an open function inside an if
statement to make sure the file was opened properly.
Otherwise, commands later in the program would
cause errors. Here’s a typical setup:
if (open(BIGFILE, “datafile.dat”))
{ statements to run }
else
{ print “Cannot open the file!n”;
exit 0;}
Using pathnames
• If you use a pathname in the file open command, it
should conform to the format of directory paths in
your operating system. For example:
open(BIGFILE, “D:datadata.fil”);
will work for Windows but will cause problems for
UNIX and Linux. This format will cause Perl
problems because the  has to be escaped. To prevent
problems, Perl allows you to use UNIX-like slashes
for Windows paths, which is correctly interpreted by
the Windows Perl interpreter:
open(BIGFILE, “D:/data/data.fil”);
Problems with paths
• You must use double backslashes for Windows paths
because of escaping of the backslash if you are not
using forward slashes, but this can cause even more
problems. For example,
open(BFILE, “D:datadata.fil”);
should be written as:
open(BFILE, “D:datadata.fil”);
to escape the backslashes properly.
• You can use both absolute and relative pathnames, as
well as Windows’ UNC names (such as machine
sharename)
Closing a filehandle
• After you have opened a file and done something
with it, you should always close the file. Closing
the file lets the operating system know the file is
not in use anymore and the filehandle is freed.
• To close a filehandle, use the close function with
the handle name:
open(BIGFILE, “data.txt”;
statements…
close BIGFILE;
Reusing a filehandle
• If you have one file open with a specific
filehandle, and then use the same filehandle in
another open command, the first file is
automatically closed and the filehandle is opened
with the new file.
• This can be used to eliminate the opening and
closing of file statements in a program, as long as
the files are used sequentially
Reading files
Reading from a filehandle
• There are a couple of ways to read from an open
filehandle. The most common is to use the file
input operator, which is a pair of angle brackets
around the filehandle name (just like <STDIN> to
read from the keyboard). For example:
open(BIGFILE, “data.txt”)
$line=<BIGFILE>;
• This will read a line from the file data.txt (referred
to by the filehandle and not the name) and store it
in $line
Using the file input operator
• The line
$line=<MFILE>;
will read a whole line of input from the MFILE
filehandle. If there is nothing to read, the value
“undef” (for undefined) is returned.
• You can use loops to read through an entire file.
To test whether the value undef has been detected,
use the “defined” keyword:
while (defined($line=<MFILE>)) …
A shortcut for reading lines
• Perl allows the code on the previous slide to be
shortened. Instead of writing:
while (defined($line=<MFILE>))
{print $line;}
you can write:
while(<MFILE>)
{print $_;}
• This works because the shortform stores the line in
the default variable $_. The shortform also checks
for end-of-file for you.
Exercise
• Write a program that reads in a file (pick
any file from the directory) and display the
contents of that file, line by line. Make sure
the end of file is handled properly, and
remember to close the filehandle after you
are finished. Prompt the user for the
filename to be read.
Reading into a list
• So far, we read file contents into a scalar, one line
at a time. You could assign the lines read from a
file to a list just as easily:
open (MFILE, “data.txt”);
@list=<MFILE>;
close <MFILE>;
• When using a list or array, the entire file is read in.
Each line in the file is assigned as one element in
the list. (So the first line is @list[0], and so on.)
Using lists
• If you need to read a lot of data from a file, it is
often easiest to use a list or array to hold the
contents, instead of assigning a variable for each
line, then processing the contents of the line
somehow.
• Since the array or list is just a copy of the file’s
contents, any changes made to the array will not
harm the original file
Exercise
• Write a program that prompts the user for a filename,
then reads that file in and displays the contents
backwards, line by line, and character-by-character
on each line. You can do this with scalars, but an
array is much easier to work with. If the original file
is:
abcdef
ghijkl
the output will be:
lkjihg
fedcba.
The die statement
The open or die syntax
• Perl has a command called “die” which is often used
with file commands. When the die command is
encountered, the program stops executing and shows a
message such as:
Died at fileopen.txt line 165
• To use the die command with an open function, you can
use this format instead of an if statement:
open(BIGFILE, “data.txt”) || die;
• This is read as “open or die”: if the open is successful,
execution continues; otherwise, the die statement
terminates the program
Adding messages to die
• To help decipher program exits from the die
command, you can use strings to be shown upon
exit. For example:
die “File won’t open”;
will display the message:
File won’t open at foo.txt line 52
when the die statement causes termination of the
program.
• You can use these messages to embed error codes
or strings in likely locations for die statements
The $! variable
• When an error is recorded by Perl, it stores the
error number in a special variable called $!. When
examined numerically, $! shows a number, but
when examined as a string it shows an error
message from the operating system. This can be
used as part of the die string:
die “Can’t open: $! n”;
• This statement will display the message “Can’t
open” followed by the error string from the
operating system when the die is triggered
Warnings
• Instead of bombing out of a program using die,
you may simply want to issue a warning to the
user. You can do this with the warn command:
warn “message”;
• The warn command will display the error
message, but the program keeps running. You can
use the error codes with warn:
warn “message: $!”;
Exercise
• Modify the last program you wrote to
incorporate the die statement to handle file
open errors. You can use a custom message
if you want.
Writing data to a file
Opening a file for writing
• Before you can write data to a file, it has to be
opened specifically for writing. You use the open
function to open a file for writing, but then use a
redirection operator in the filename component:
open(MYFILE, “>bigfile.txt”);
open(MYFILE, “>>bigfile.txt”);
• The redirection operators are the same used by
UNIX. “>” overwrites any contents already in the
file, while “>>” appends to the end of the file.
Creating new files
• If the file you instruct open to open for writing
does not exist. The file is created for you in the
current directory unless a path has been specified.
• If the file does exist, it will be overwritten unless
you use the append operator
• Most operating systems treat case in filenames as
important, but some do not. Check with your
operating system to see if mixed case filenames
are significant, or whether everything is converted
to uppercase.
Writing data
• Writing data to a file is done with the print or printf
command, but with the filehandle included. The
syntax is:
print filehandle data;
• There is no comma between the filehandle and the
data!
• For example, to write a single variable $num1 to the
file once it is opened for writing, use the command:
print MFILE $num1;
assuming MFILE is the filehandle.
Checking for writes
• You can use a logic structure to make sure a write
has been performed properly:
if (! print MFILE $num1)
{warn “Can’t write to the file!”;)
}
close (MFILE);
• If the data value $num1 could not be written to the
file, the warning is displayed. We used a warn
here, but you can also use a die statement instead.
Closing after writing
• It is important to issue a close operation after
writing data to a file. This is because most
operating systems don’t write to the file
immediately, but buffer the data. The close
operation tells the operating system to commit the
changes, and mark the file as not in use.
• If you do not issue a close operation, there is a
chance you will lose the data you have tried to
write, and may corrupt the file
Exercise
• Write a program that creates a file called
“data.dat” in the current directory. Prompt
the user for five numbers, and write them,
one at a time, on both the screen and into
the file. Close the file, then open it again
for reading only, and display the contents
on the screen. Handle error conditions that
may occur.
Working with multiple files
Multiple files
• You can have many files open at once. The limit
to the number of files that can be open (for reading
or writing) is usually set by your operating system.
There is no intrinsic limit imposed by Perl.
• Often you will want to read one file, line by line,
and process the output saving it into another file.
This requires two files to be open at once. Keep
track of the filehandles and the process will be
simple.
Exercise
• Create a file that has a series of ten strings
in it, all accepted from the user through the
keyboard. Then, open that file in read
mode, and reverse the order of the
characters in each line, saving the reversed
line to a new file. Display the completed
reversed file when done.
Binary files
Binary vs. text
• Binary files are files that have to be translated
literally, such as a picture file, a sound file, or a
binary file. Text files are any files that contain
records that end in end-of-line characters.
• Some operating systems distinguish between
binary and text files. Unix and Linux do not, but
Windows does. Perl can’t tell the difference
between binary and text files (it has a Unix
heritage).
Handling text files
• When Perl writes data to a file, it does so in text
mode. When the newline n character is
encountered in a string to be written to a file, Perl
converts it to the appropriate characters for the
native operating system:
UNIX/Linux: ASCII 10 (LF)
Windows: ASCII 13/10 (CR/LF)
Macintosh: ASCII 13 (CR)
Handling binary data
• When writing binary data to a file you don’t want Perl
converting anything, so you have to use the binmode
command with the filehandle to tell Perl this is to be
written literally:
open(BFILE, “>file1.dat”);
binmode(BFILE);
• You only need to specify binmode for a filehandle
once, until you close the file
• On some operating systems (UNIX/Linux and
Macintosh) binmode is ignored as there is no distinction
between binary and text files
File tests
File tests
• Perl allows the UNIX file tests to be performed.
This is usually done in a condition like this:
if (-r FILE) {..}
• The condition has one valid option followed by
the filehandle to be tested. Alternatively, you can
use a filename or full path and filename instead of
a filehandle.
Valid tests
• These tests are all UNIX tests available to Perl:
-B true if a binary file
-dtrue if directory
-e true if file exists
-f true if regular file
-M returns age in days since last modification
-r true if readable
-s returns size of file in bytes
-T true if text file
-w true if writable
-z true if file exists but is empty
Using tests
• You can use tests to verify files when
opening or writing. If you are prompting the
user for a filename, you can check to make
sure the file exists or has the correct type of
data. You can also use test to make sure
you are not overwriting an existing file.
Exercise
• Modify the last program you wrote to allow
the user to enter both the filename to read
and the filename to write, and check to
make sure that the file to read exists, and
the file to write to doesn’t (so you don’t
overwrite a file). Display messages if the
tests fail.
File and directory manipulation
Renaming files
• To rename a file, you use the rename command
with the old filename and the new filename
separated by a comma:
rename “a.dat”, “b.dat”;
• You use the filenames and not the filehandles,
since the file cannot be open when you rename it.
You can use die and warn to trap failures of the
rename command, as you have seen earlier.
Deleting files
• To delete a file, use the unlink command
with the filename(s) in a list:
unlink file1.dat;
• As with rename, you can’t use a filehandle
because you can’t delete an open file.
Again, you can trap false returns from the
operating system with die and warn.
Directories
• Almost all operating systems use a hierarchical
structure to maintain files in directories. Being able to
read the directory structure is important. Perl lets you
do this through directory handles.
• A directory handle is used to read the contents of a
directory. You can open a directory handle using the
opendir function:
opendir handle directory;
where handle is the directory handle you want to open,
and directory is the name of the directory to be read.
Directory listings
• Once a directory has been opened with a
dirhandle, you can read the directory contents with
the readdir function:
opendir TEMPDIR, “/temp” || die;
readdir TEMPDIR;
• After you are finished with a dirhandle, you
should close the handle with closedir:
closedir TEMPDIR;
Storing directory contents in an
array
• You will often want to read the directory contents
and store the list for future use. You can assign
the contents to an array just as you did with file
contents:
opendir (MDIR, “/temp”) || die;
@filelist=readdir MDIR;
closedir MDIR;
• You could then manipulate the contents of
@filelist, which will have one directory line per
element with most operating systems
Changing directories
• To change directories, you can use the chdir
command. Changes in directory can be specified
absolutely or relatively. For example:
chdir ../book;
will move up one directory level and down into
the directory book.
• If you do not specify a directory argument for
chdir, it will change to your home directory (if one
is defined by your operating system)
Creating directories
• To create a directory, use the mkdir command with both
the new directory name and the permissions are
arguments:
mkdir newdir, perms;
• For example:
mkdir temp, 0755 || die;
will create the directory temp under the current
directory with the UNIX permissions 755
• The directory permissions use the same convention as
your operating system. Not all operating systems will
use permissions.
Deleting directories
• To delete directories, use the rmdir function
with the pathname to the directory to be
removed:
rmdir temp;
• If the directory cannot be removed, a false
is returned by the operating system and can
be handled with die or warn.
• You can only delete empty directories!
Module 7
Functions (Subroutines)
Subroutines
Functions and Perl
• Almost every high-level programming language
supports functions and Perl is no exception. Simply
put, a function is a block of code that can be called
by name to perform some task, then return to the
calling program. Functions can usually accept
parameters and return values.
• You’ve seen a lot of functions so far: print, printf,
reverse, sort, open and so on are all built-in functions
• Perl calls user-defined functions “subroutines” or
“subs”
Creating a subroutine
• To create a Perl subroutine, you use the keyword
sub followed by the subroutine name and the code
for that subroutine in curly braces:
sub sub_name
{ statements…
}
• Subroutine names do not have any specific
character first, and naming rules are the same as
other variables in Perl
Running a subroutine
• To run a subroutine in Perl, you can use one of two
syntaxes:
subname();
or
&subname();
• The ampersand is optional in almost all cases, as is
most commonly left off. If you are passing parameters
to the subroutine, they would be enclosed in the
parentheses.
• Subroutines can call other subroutines, or themselves
recursively
Returning values
• Many subroutines will return a value when the
subroutine code has been executed. This value is
usually assigned to a variable or used in a statement
like print. For example:
sub twoplustwo
{ 2+2;}
$num1=8+twoplustwo;
• In this case, $num1 will have the value of 12. In this
case there is no return statement in the subroutine as
with other languages. The last expression in the code
is returned by the sub.
Using return
• You can use a return statement if you want to specify
what to return from a subroutine or want to have the
option of terminating the sub early:
sub oddeven
{ return(0) if ($num%2 == 0);
statements…}
• In this case, $num will have been defined outside the
subroutine. If the mod 2 of that number is 0, the
returned number will be zero and the sub terminates at
this point. Otherwise, the program continues
executing the statements.
Returning variables
• You can return any variable from a subroutine,
including hashes and arrays:
sub somesub
{ statements…
return @array;
statements…}
@rev=reverse(somesub(@array));
• In this code, an array is returned from the sub and
reversed in the body of the Perl script, then stored
in a new array variable
Exercise
• Write two subroutines. Both use a variable
defined in the body of the program. The
first sub multiplies the variable by ten,
while the second sub divides the number by
two. Prompt the user for the number, and
then call both subs and display the results
from the Perl script (not the subs).
Arguments
Arguments
• The code we wrote so far depends on a variable
being defined outside a subroutine, and that’s not
good because this reduces code reuse. Ideally,
subroutines should be stand-alone and portable.
• To pass data to a subroutine as part of the
subroutine call is to use arguments. You’ve seen
arguments passed to functions with several built-in
functions, such as sort. To pass arguments to a
subroutine, you specify them as part of the
subroutine call.
Specifying arguments
• Arguments for a subroutine can be specified in two
ways, either with or without parentheses (most Perl
functions don’t care about parentheses as you have
seen already in this course):
sub(args…);
sub args…;
• You can only leave off the parentheses if the
subroutine has been defined earlier in the script that
the subroutine call. This is a common error for
programmers who define subroutines at the end of
their code.
Multiple arguments
• The args list can be composed of many arguments,
all separated by commas:
sub1 arg1, arg2, arg3;
• There is no limit to the number of arguments that
can be passed to a subroutine, and they can be of
any valid type or size. You can mix argument
types in the list passed to the subroutine:
sub1 $x1, @array1, $x2, @array2;
The @_ variable
• Inside the subroutine, Perl uses the argument variable
@_ to hold all the arguments. You can use @_ in your
subroutines:
sub showarg
{ print join(“ ”, @_);}
showarg “This”,“is”,“a”,“test” ;
• This program passes the four strings to showarg,
which uses the join to paste them together with spaces
between each one. The print displays the result.
• The @_ variable is not related to the $_ variable
Accessing each argument
• Inside a subroutine, each argument is
accessed as a part of the @_ array using
subscripts, as with any array. The first
argument will be element @_[0], the second
array element @_[1], and so on.
• You can use loops to access each argument
using the array element name
Using names for arguments
• Inside a subroutine, using @_[x] can be awkward and
counter-intuitive. Perl allows you to name arguments
being passed into the subroutine, just as with other
language function arguments, but you have to do it by
assigning the @_ array elements at the top of the
subroutine code:
sub foo1{
($var1, $var2)=@_;
statements…}
• This will assign the first argument to $var1, and the
second to $var2, and so on if more are named
Exercise
• Write a subroutine that expects three
numbers passed as arguments, and multiply
the three together, returning the result from
the subroutine. Call the subroutines from
inside a Perl script that asks the user for all
three numbers. Use names for each
argument inside the subroutine itself.
Passing arrays and hashes as
arguments
Passing an array or hash
• In the last section you passed scalars to a
subroutine. Passing an array or hash to a
subroutine is done in the same way:
sub revarray
{ return reverse(@_);}
@rev=revarray @array;
• This will pass the array @array to the subroutine
revarray, reverse it, and store the result passed
back from the subroutine into @rev. The entire
array is read in to the subroutine as @_.
Passing multiple arrays or
hashes, Part 1
• You might think you could pass more than one array
or hash in the same manner:
sub printarrays
{ (@array1, @array2)=@_;
print @array1, @array2;}
however this will not work as @_ will hold the two
arrays together and there is no way to break them into
two separate arrays in the subroutine. There is no
indication where one array ends and the next starts.
In this case, @array1 would have the entire @_
contents and @array2 is empty.
Passing multiple arrays or
hashes, Part 2
• There is no easy way to solve this problem of passing
arrays directly. However, they can be passed by
reference (which is seen in Module 8).
• You can pass one array and a mix of scalars without
problem as long as the array or hash is the last
assigned element. The scalars will be properly
assigned from the @_ array and everything else goes
in the array:
sub passarrays
{ ($x1, $x2, $x3, @array1)=@_;
statements…;}
Exercise
• Write a program that contains a subroutine
that accepts one letter and an array of
letters. Prompt the user in the main part of
the Perl script for the letter, then for five
strings to be sent as an array. Inside the
script, combine the five elements in the
array into a scalar string, using the letter as
a joining character, and pass the result back
to the script. Display the resulting string.
Subroutine prototypes
Subroutine prototypes
• A subroutine prototype allows you to specify at
the top of your code the name of a subroutine and
the number of arguments it expects. Then, later in
the code after you have used the subroutine by
name in statements, you can define the subroutine
properly. This allows you to use the shorter forms
of calling subroutines, and allows for code reuse
more often.
• A prototype’s role is to tell the interpreter what
type of arguments are to be used by a subroutine
Defining a prototype
• To define a subroutine prototype, use the same
syntax as when defining the subroutine itself. You
do not have to specify variable names, but you can
indicate the presence of variables by using the first
symbol (such as $ or @) of the variable:
sub mysub1 ($ $ $);
• This defines a subroutine called mysub1 which
will expect three scalars. The actual code for
mysub1 is defined later in the program.
Exercise
• Modify the last program you wrote so the
body of the subroutine definition is at the
end of the script. Add a subroutine
prototype to the top of the script and try
rerunning the program.
Subroutines and scope
Scope
• If you have programmed in any other language that
supports functions, you have seen scope. Scope refers
to the block of code where a variable has meaning.
• If you define variables inside a Perl script they are
available to any subroutines inside that script.
Variables created inside a subroutine are available
outside the subroutine, as well (which is not how most
programming languages behave). This is an important
difference in language! In this case, the variables have
scope in the entire program.
Keeping it private with “my”
• To help code reuse and portability, you want to be able
to define variables inside a subroutine that have no
meaning outside the subroutine. This prevents
conflicts with other script variables when you move
the subroutine to new programs. To define a local
variable, you use the my operator:
my $var1;
my @array1;
• Any variable defined with the word my is considered
private to the block of code it is defined in. That is
where the variable has scope.
Private variables
• Any variable defined with the keyword my to make it
private is released as soon as the code block in which it
was defined is terminated
• You can have private and global variables with the
same name, and they are treated differently by the
compiler:
$num1=6;
sub something
{ my $num1; # different variable
statements…}
• Both $num1s are treated as different in this case
Strict
Using strict
• Perl has a keyword called strict. When used in a
program, it tells the interpreter to use much more
care when evaluating statements and to display
warnings and error messages for everything it finds
questionable (usually Perl will let you get away with
quite a bit before complaining about something).
Using strict is a good way to enhance your
programming abilities. To do this, simply put:
use strict;
at the top of your code.
Perl debugger
The debugger
• Part of the Perl interpreter is a debugger that you
can use to examine the execution of your Perl
scripts. The debugger allows step-by-step
execution of scripts, examination of variable
values, and the use of breakpoint.
• The debugger is built into every Perl interpreter
and is activated with the –d option when launching
the interpreter:
perl –d myprog.txt
Debugger output
• When you first launch a program with the debugger
option you will see version information for the perl
interpreter, then a help prompt:
Enter h or ‘h h’ for help.
then the first line of the script. You will also see a
message showing which filename the statement is in,
and what the line number was.
• Finally, the debugger prompt
DB<1>
is shown, indicating the debugger is waiting for your
first debug command.
The statements
• When the debugger shows you a statement, it is in
the cache ready to be executed but has not yet
been executed.
• Each statement read by the debugger can be
examined and manipulated prior to it being run.
This allows for some changes or examination of
the environment before each statement is
executed, which is ideal for debugging the script.
• Any valid Perl command can be used at the
debugger prompt
Debugger help
• You can get help from within the debugger at any
time using the h command, usually followed by
the command you want information about. For
example, for help on the b (breakpoint) command,
type:
h b
• The command ‘h h’ shows a summary of the
available commands and their syntax
• To page the output from the help system, put a | in
front of the command (such as |h h)
Listing the program
• To list the next ten lines in your Perl script, use the l
command. Every time you issue an l command, the
next ten lines will be shown.
• Listing the lines does not affect the line that is being
executed: it simply shows you the next ten lines of the
script. The next line to be executed is shown in the
listing like this:
===>
• You can specify which lines to show by using a range:
l 10-15 shows lines 10 through 15 inclusively in the
script.
Stepping through statements
• To run each line, one at a time, in the debugger,
use the n (next) command. Each line is shown on
the screen before it is executed.
• To see the value of any variable, use the print
command at the prompt:
print $var1
and the current value will be shown without
affecting the program
• You can keep using the n command to step
through each line in the program until termination
Stepping into subroutines
• When a subroutine call is encountered by the
debugger in the script, it executes the subroutine
as a single call and doesn’t show the lines in that
subroutine. To jump into the subroutine and move
through it one line at a time, use the s (step)
command.
• When you issue the step command the debugger
shows each line, one at a time, executed inside the
subroutine and all valid debugger commands can
be used inside the subroutine
Breakpoints
• You can use the n command step through a
program line by line, or let the debugger run all
the lines until some condition is met. This is a
breakpoint and is set with a b command.
• You can set a breakpoint at any line number by
specifying the line number. To set a breakpoint at
line 10, you would use the command:
b 10
• The c (continue) command lets you continue
executing after a breakpoint has been triggered
Using breakpoints
• You can set a breakpoint on any line in a script
except those that have:
– Just a curly brace or closing parentheses
– A blank line
– A comment
• Usually, breakpoints are used after a loop,
subroutine return, or complex command so you
can verify the actions taken. You can set a
breakpoint anywhere except those listed above.
Showing and removing
breakpoints
• To show all the breakpoints that are set in your
script, use the L command
• To remove a breakpoint, use the d command
followed by the line number (or the subroutine
number, if a breakpoint is set to a subroutine). For
example:
d 37
deletes the breakpoint set at line 37.
The reset command
• You can reset the debugger to clear all
breakpoints and variables, and restart the
execution of the script from the top with the
R command
• When reset is executed, any defined
variables lose their value, and the first line
of the script is the to-be-executed line
GUI debuggers
• The built-in debugger is acceptable for simple
tracing and debugging, but is not suitable for very
complex debugging tasks. Also, it is not
graphical.
• There are many GUI-based debuggers for Perl
available on the market, some bundled with Perl
distributions. The ActiveState Perl distribution
has a Windows debugger in the package, for
example, and there are several available for UNIX
and Linux.
Module 8
References
References
In this last module of the course we will look at
references. If you have programmed in C, C++, or
other high level languages, you may be familiar
with pointers, which are the same as Perl’s
references. If you have not seen these capabilities
before, the learning curve is a little steep. To help
show the abilities of references we have used very
simple exercises throughout. This should show
the basics of the subject without overwhelming
you.
References
References
• Many high-level languages like C and C++ have
the concept of a pointer, a variable that points to
the memory address of another variable. Perl has
pointers too, but they are called references.
• References do not hold a variable value, but hold
the memory address of another variable. For
example, if you have a variable called $num1, you
could have a reference called $refnum1 which
holds the memory address when $num1 has its
data stored.
What is a reference
• Every variable in Perl has a value assigned in memory.
Any value assigned to that variable is contained in
that memory. For example, the statement:
$num1=10;
will have a memory location assigned with the name
$num1, and a value of 10 is stored in that memory
location.
• A reference is another variable, and has an assigned
memory location, but holds the address of the $num1
memory location instead of a value.
Why use references
• Why bother using a reference to a memory
location with a value in it? There are many
reasons when you get into complex coding, but the
simplest reason is it allows you to change the
variable the reference points to (and hence the
value it points to in that variable’s memory
location). This is very handy in some programs, as
you will see.
• References are especially handy when dealing
with arrays and lists
Creating a reference
• References are created exactly the same way as
other variables, and have no special naming
convention. To assign a value to the reference, you
use the backslash:
$refnum1=$num1;
• This will create a reference variable called
$refnum1which will hold the memory address of
the variable $num1. Creating a reference to a
variable doesn’t affect the variable in any way.
Dereferencing
• To use the value a reference is pointing to, you
have to tell the interpreter that you don’t want to
know the memory address it holds, but the value
inside the memory address it points to. This is done
with the dereferencing operator. For example:
print $refnum1;
will print the memory address $refnum1 holds, but
print $$refnum1;
will print the value in the memory address of
$num1 (if that’s what it points to).
Using references to change
values
• You can use dereferencing to change the value of a
variable the reference points to. This is done in the
same way as looking up the value:
$$refnum1=15;
• This command will change the value in the memory
location $refnum1 points to and set the value of 15
there. You have to use two $ signs here: if you had
written
$refnum1=15;
you would be setting the value 15 into the memory
location of $refnum1, not the variable it points to.
Exercise
• Write a program that create a variable with
a user-supplied value. Create a reference to
that variable. Display the memory address
of the reference and the value stored in the
dereferenced variable.
Using reference values
• When a reference has been given a value (a memory
location of a variable), the reference can be used
with other variables and references. For example:
$num1=10;
$refnum1=$num1;
$refnum2=$refnum1;
print $$refnum2;
will have $refnum1 point to $num1. $refnum2 then
is set to the same value, so the last line shows the
dereferenced value of $refnum2, or 10.
References to references
• You can set up a reference to a reference, although
you won’t need this type of ability until you get into
complex coding:
$num1=10;
$refnum1=$num1;
$refnum2=$refnum1;
the last line sets $refnum2 to the value of $refnum1
(the memory location of $num1), and not to $num1
directly. To dereference $refnum2 here and see the
value of $num1, use:
print $$$refnum2;
Exercise
• Write a program that sets up five scalars
filled with numbers supplied by the user.
Then, set up a reference variable that points
to the first scalar. Display the dereferenced
values of that variable, as well as the
memory location it uses. Change the
reference to each of the other four variables
in turn and repeat the display process.
References to arrays
References to arrays
• You can set up a reference to an array in the same
way as a reference to a scalar. Since the reference
holds a memory address, it is a scalar itself and
defined with a $:
@array1=(“1”, “2”, “3”);
$refarray1=@array1;
• The variable $refarray1 will have the memory
address of the first element of the array @array1.
It does not point to the entire array, just the start of
the memory for @array1.
Dereferencing array references
• To dereference array references, you can reference any
element in the array pointed to with the usual element
subscript:
$$refarray1[2];
This shows the value of the third element in whatever
$refarray points to.
• If you want to see the whole array, use:
@$refarray1;
• You can see a range of elements, too:
@$refarray1[0-3];
shows the first four element in the array.
Exercise
• Write a program that prompts the user for
five strings, and save them as elements in
an array. Then, set a reference to that array.
Use a loop to show each of the elements in
that array using the reference, one element
at a time.
References to hashes
References to hashes
• References to hashes are set up the same way as
arrays:
$refhash1=%hash1;
• You access single elements in the hash through the
hash key, and get the value associated with that
key back:
$$refhash1{key};
• To see the whole hash the reference points to, use:
%$refhash1;
Exercise
• Create a hash and a reference to that hash.
You can either prompt the user for hash
keys and values, or simply hardcode them
to save time. Use a loop to display all the
values associated with each hash key in the
hash. You may have to refer back to
Module 5 for the hash functions.
References and subroutine
Passing references to subroutines
• One of the strengths of references is the ability to
pass references to arrays and hashes to
subroutines. This allows more than one array or
hash to be passed properly to a subroutine. Since
the code:
sub twoarrays
{ (@array1, @array2)=@_;…}
does not work, as both arrays are joined into one
array @_, references provide a way to pass scalars
which reference more than one array.
Passing arrays
• To pass two arrays to a subroutine, you could do this:
@array1=(…);
@array2=(…);
$refarray1=@array1;
$refarray2=@array2;
passarray($refarray1, $refarray2);
sub passarray
{ statements…}
and both arrays can be used inside the passarray
subroutine by dereferencing the references.
Exercise
• Create two arrays, one holding vowels and
the other holding consonants. Pass both
arrays into a subroutine using references.
Inside the subroutine, display all the
elements of each array.

PERL - complete_Training_Modules_Ref.ppt

  • 1.
  • 2.
  • 3.
    What is Perl? •Perl is a general-purpose programming language, and can be used for practically any programming task any other high-level language can be used for. However, Perl is usually thought of as a “glue” language, so called because it binds things together (such as tying databases to Web pages, converting files from one format to another, and so on). • Perl is very flexible and is currently available on over two dozen operating system platforms
  • 4.
    Perl • The namePerl comes from “Practical Extraction and Report Language”. Perl has many features borrowed from other programming languages. • The Perl system uses an interpreter, called “perl”. Usually Perl and perl are considered to be the same thing for practical purposes.
  • 5.
  • 6.
    Versions of Perl •The current versions of Perl are all in the 5.X and 6.X series (6.X was released in 2001). If you have an older version of Perl (such as Perl 4.X), you should upgrade it as many changes were made between releases. • Perl 4.X was a very buggy release of Perl and should not be used. Also, many Perl programs designed for 5.X will not work with 4.X.
  • 7.
    Maybe Perl isalready installed • Many operating systems (Linux and UNIX notably, but also Windows NT Resource Kit) come with Perl installed. You can easily check whether Perl is loaded on your system by opening a console or terminal window and issuing the command: perl –v If you get a version number, Perl is installed. If you get an error message about command not found (or something similar), Perl is not installed.
  • 8.
    Where to getPerl • Perl is available free of charge from many public sites.There are several releases of Perl available for different operating systems, so make sure you get a current release. • For Linux or UNIX systems, visit perl.com for the latest releases • For Windows systems, you can compile the Perl source code yourself (a hassle) or download a preconfigured Windows release at activestate.com • For Macintosh, visit macperl.com for MacPerl
  • 9.
    Perl documentation • Everyrelease of Perl comes with documentation in a set of files. Most releases have over 1,700 pages of documentation included in reference books, user guides, FAQs, and so on. • On most operating systems, a utility called perldoc is installed as part of the Perl system. The perldoc utility can search for and format Perl documentation for you. To use perldoc to look up the basic syntax for perl, open a terminal or console and issue the command: perldoc perl
  • 10.
    More on perldoc •The Perl documentation is divided into parts by purpose: – perlfunc (Perl functions) – perlfaq (Perl FAQs) – perlop (Perl operators) • To search for a particular keyword, use the –tf options. For example to look up the print keyword: perldoc –tf print • To search the FAQs use –q as an option: perldoc –q free
  • 11.
    A first Perlprogram
  • 12.
    What you need •When you have installed Perl on your system, all you need to use the language is a text editor that can save ASCII files. All Perl scripts are written and saved in ASCII characters. • On some operating systems that do not have a Perl GUI front end, you will need to use a console or terminal window to interact with Perl. Some GUI- based Perl front ends are available for Linux, UNIX, Macintosh and Windows.
  • 13.
    Comments in Perl •All comments in Perl are written starting with a # sign. Anything after the # sign through to the end of the line is ignored by the interpreter. • Comments can be placed anywhere on the line, but commands cannot follow a comment on the same line • Multiline comments should have a # symbol as the first character on every line
  • 14.
    The #! directive •The sole exception to # indicating a comment is on the first line of a Perl program (or “script”). All Perl programs can begin with the line: #!/usr/bin/perl • The #! is a hold-over from UNIX that instructs the operating system to use the /usr/bin/perl program to run whatever is in this file • The path may be different for your system, and many environments such as Windows do not need this line. However, it will not cause errors.
  • 15.
    Semicolons • All validPerl command lines end in semicolons. Without a semicolon, Perl continues to read onto the next line and doesn’t assume a carriage-return is the end of a statement. • You can break Perl commands over multiple lines because of this, as long as a semicolon is the end character in the complete statement. • Perl uses semicolons in the same way as C/C++ and Java
  • 16.
    Whitespace • Whitespace isignored by the Perl intepreter. You can use whitespace (spaces and tabs) anywhere in your programs to make them more readable. • You should use whitespace to help format your scripts to show loops, logic layout, and continuation of statements, as you will see later in this course
  • 17.
    The print command •The print function tells Perl to display whatever follows, such as a string, variable name, and so on. You’ll see how to build complex print statements later. • The print statement allows the C or Java escape characters to be used for line feeds, backspace, tabs, and so on. For example, the command: print “Hellon”; will print “Hello” followed by a newline.
  • 18.
    A Hello Worldscript • We can write a simple Perl script for the traditional Hello World application: #!/usr/bin/perl print “Hello World!n”; • These two lines can be save in a file as ASCII and then run by perl by issuing the command: perl filename
  • 19.
  • 20.
    Scalars • Scalars arethe Perl term for basic units, including strings and numbers of different forms, as well as constants (which are often called “literals”) • There are several types of data supported by Perl, and you will see most of them in this and the next module
  • 21.
    Numeric Scalar Variables •Perl uses the dollar sign to indicate scalar variables, followed by the name of the variable. For example: $date is a variable called “date”. The dollar sign is a type identifier that tells Perl this is scalar. Arrays use a different identifier, as you will see later. • Variable names are case sensitive, so $Date and $date are different variables
  • 22.
    Strings • String typesin Perl are like those in other programming language. Strings are treated literally when enclosed in quotation marks (either single or double). Escape sequences can be used with Perl strings. These are the most common: – n newline – r carriage return – t tab – b backspace
  • 23.
    Special escape sequences •Some escape sequences for strings have special meaning to Perl: – l change next character to lower case – u change next character to upper case – ’ literal single quotation mark – ” literal double quotation mark – backslash
  • 24.
    The q andqq operators • Perl allows you to use these structures: – q( ) – qq( ) Instead of single and double quotes, respectively. So, qq(This is a test) is the same as “This is a test” • These can be handy when embedding marks that would otherwise need escaping: qq(He said “Help!” then “Now!”)
  • 25.
    Single and doublequotes • Double quotation marks allow expansion of variables within them. Single quotes do not. • For example: “This is from $name1”; is not the same as ‘This is from $name1’; as the second will literally display ‘$name1” which the first will substituted the value in the variable name1.
  • 26.
    Declaring variables • Unlikemany programming languages, variables do not need to be declared prior to use with Perl. When the variable is assigned an initial value, Perl can figure out the data type. • If you try to use an uninitialized variable, Perl will use the value zero for a numeric, or Null for a string. Avoid uninitialized variables as much as possible, as results can be unpredictable.
  • 27.
    Assigning values • Variablesare assigned values using the equal sign: $string1=“This is a test”; $var1=6; $var2=3.14159; • You can assign operation results, as you would expect: $var3=$var2 + $var1;
  • 28.
    The $_ variable •The $_ variable is used by Perl as a default variable. You can use it in place of variable names in your scripts: $_=“This is a test”; print; • This will print the default variable $_ and display the string. Use the default operator carefully as it can easily be confusing, but some operators and functions work best with default variables, as you will see later in this course.
  • 29.
  • 30.
    Standard operators • Perlsupports the standard mathematical operators +, -, *, /, % (modulus) and ** (exponent) • Operator order of precedence applies according to standard rules; parentheses group operations • Operators can be combined in statements: $num1=$num2 * ((3 + 8)*$num3/2); • Multiple assignments can be made on one line: $num1=$num2=$num3=7;
  • 31.
    Exercise • Write aprogram that simulates rolling five dice. Assign values between 1 and 6 to five different variables and display all five on the screen as well as the sum of the five numbers. Later you’ll see how to use random numbers, but for now just assign the values. • If you want to display a string and a variable together in a print statement, separate them with periods: print “The sum is ” . $sum; You’ll see this in the next few slides.
  • 32.
    Positive and negative •Numbers are assumed to be positive unless you specify a negative sign in front: $num1=6; # positive $num2=-6; # negative $num3=-(-6); # positive • You can convert positive to negative any time using the minus sign in front of the variable: $num4=-$num4;
  • 33.
    Increment and decrement •Like C/C++ and Java, Perl supports autoincrement and autodecrement operators, which increase or decrease a value by one: $var1++; is the same as $var1=$var1+1; and $var2--; is the same as $var2=$var2-1;
  • 34.
    Shortform assignment • Aswith autoincrement and autodecrement, Perl supports shortform assignments like this: $var1+=5; which is the same as $var1=$var1 + 5; • This can be performed for all four basic mathematical operators.
  • 35.
    Operators and strings •Strings can be used with the “.” operator for concatenation: $str1=“Hello ”; $str2=“World!”; $str3=$str1 . $str2; print $str3; would print “Hello World!” • You can also concatenate on output in most cases by specifying the string variables together: print $str1 . $str2;
  • 36.
    The x operator •The repetition operator, x, is used with strings to indicate a number of repeats for a string. For example, the code: $str1= “x” x 20; will string 20 “x”s together and assign them to $str1. • You can use the repetition operator with existing strings: $str1=“x”; $str2=$str1 x 20;
  • 37.
    Other operators • Perlsupports several other operators: – int: returns the integer portion – cos: cosine – sin: sine – rand: random number between 0 and argument – length: length of argument – lc: converts to lowercase – uc: converts to uppercase
  • 38.
    Exercise • Rewrite thelast program to randomly assign a number to each of the five dice. To use rand, you need to specify the upper limit: rand(5) will generate a number between zero and 5. Remember all random numbers have a lower limit of zero. Have the program roll five dice and display results between 1 and 6, as well as the sum of the dice.
  • 39.
    Converting strings tonumbers • Perl is flexible when using string types as numbers, as long as the conversion makes sense. For example, this works: $str1=“6”; $num1=10-$str1; print $num1; will display the value 4. Perl can convert the string to a number if the string looks like a number. This applies to decimal strings as well.
  • 40.
    Converting numbers tostrings • Perl can also convert numbers to strings when the conversion makes sense: $num1=3; $str1=“I did it” . $num1 . “ times”; print $str1; will display the message “I did it 3 times.” • If the conversion doesn’t make sense to Perl, it will use a zero instead when you try to call the number.
  • 41.
    Code blocks • Perlstatements can be grouped together into blocks, each block surrounded by braces (like with Java) • Code blocks can be nested many deep • Each code block is treated as a unit by Perl, although execution is still always top to bottom unless moderated by control structures • Usually blocks will be associated with other statements, such as if conditions
  • 42.
    Exercise • Modify thelast program to display strings explaining that you are going to throw the five dice, generate five numbers, show them one at a time like this: The first dice was X. The second dice was Y. and so on, and display the sum on a separate line.
  • 43.
  • 44.
    Comparison operators • Perlsupports the standard comparison operators: > greater than < less than >= greater than or equal to <= less than or equal to == exactly equal to != not equal to
  • 45.
    True and false •In Perl, any condition that evaluate to false is assigned a value of zero. Anything that is non-zero is true. This applies to conditions in statements (such as the if you’ll see in a moment) as well as for numeric evaluation: 0 false 3 true (non-zero) 5-5 false (evaluates to zero) 0.00 false (it’s zero with precision) “” Null string is false “ ” String with a space or anything not null is true
  • 46.
  • 47.
    The if statement •Perl’s if statement is similar to those of other high- level languages: if (condition) { do if true } else { do if false } • The blocks of code can contain many statements. The else and its statements are optional. • Only one block of code is executed, depending on whether the condition is true or false
  • 48.
    Example of if •A simple if statement is: if ( $num1 > 5 ) {print “It is greater than 5.”;} else {print “It is less than or equal to 5.”;} • If there is only one statement in a block, you can leave the curly braces off as long as a semicolon is used, however many programmer use curly braces to make the code more readable
  • 49.
    Exercise • Write aprogram that rolls six dice, all with values between 1 and 6. Add up the result of the die. If the total is greater than 20, display a message that the user wins the game. If the total is not greater than 20, they lose. • Modify the program to determine if the total die roll is odd or even and display the result
  • 50.
    Nested if-elses • Theif statement can be nested within other if statements, either inside the blocks or as part of the else: if (cond1) { if (cond2) {statements}} else { if (cond3) { statements} else {statements} }
  • 51.
    Exercise • Write aprogram that generates three random numbers between 1 and 10. Display a message telling the user whether the sum of the three numbers is greater than, less than, or equal to 15. Also, tell the user whether the sum is even or odd.
  • 52.
  • 53.
    Input from thekeyboard • The easiest way to get input into a Perl program is to read from the keyboard. To do this, use the <STDIN> structure (the standard input, or STDIN, is the keyboard by default). • You use <STDIN> to read a value and store it in a variable like this: $var1=<STDIN>;
  • 54.
    The chomp operator •When reading input from the keyboard, the entire string entered by the user, including the RETURN is saved in the assigned variable • If you want to eliminate the newline character at the end of the input, use the chomp operator to remove it: $str1=<STDIN>; chomp $str1; print “You said ” . $str1 . “.”; without the chomp operator, the print statement would have moved down a line at $str1
  • 55.
    Exercise • Write aprogram that randomly chooses a number between 1 and 100. Let the user enter a guess, and tell them whether they got the number correct, or whether the random number is higher or lower than the guessed number, and by how much. Only do this guess once.
  • 56.
    String relationships • Thereare relational operators for strings (actually, for non-numbers): eq equal to ne not equal to gt greater than lt less than ge greater than or equal to le less than or equal to • Comparisons are left-to-right, using ASCII values
  • 57.
    Example of stringcomparisons $str1=“Hello”; $str2=<STDIN>; chomp $str2; if ($str1 eq $str2) {print “You guessed the string! n”;} else {print “Wrong guess!n”;}
  • 58.
    Exercise • Write aprogram that creates three different variables, all with names of animals assigned to them (one animal per variable). Display the three animals for the user. Have a random number generator pick one of the three animals, and ask the user to guess which animal was chosen. Let the user know whether they guessed correctly or not.
  • 59.
  • 60.
    Boolean operators • Perlsupports to Boolean AND and OR operators in the same way as other high-level languages: && AND || OR • These operators are often used for conditions: if (($num1 < 10) && ($num1 > 5)) • Perl also allows the use of the words “and” and “or”: if ($num1 < 10 and $num1 > 5)
  • 61.
    The NOT operator •Perl allows the negation NOT operator to be specified either as “!” or as the word “not”: if (($x < 5) && !($x > 0)) if ($x < 5 and not $x > 0) • Boolean conditions are always evaluated left to right. Keep in mind some complex combinations may not make sense. Check the logic carefully with compound statements.
  • 62.
    Exercise • Write aprogram that asks the user for a number between 1 and 100. Check to see if the number is even or greater than 10. If it is, display a message to that effect. Also check to see if the number is between 15 and 25 and display a message showing the result.
  • 63.
  • 64.
    The shortform if •A shortform if statement is popular in Perl, and is applicable if there is only one statement that would reside inside the code block. Instead of writing: if (condition) {statement;} you can write: statement if (condition); • This may look confusing, and many programmers do not use it, but it is legal
  • 65.
    Example of shortformif • This is an example of the shortform if statement: $flag = 1 if ($num1 > 10); • This is the same as writing if ($num1 > 10) {$flag = 1;} • Shortform ifs can only be used when a single statement is to be executed
  • 66.
    The elsif construct •Instead of using if-else structures to nest ifs, you can also use the shortform elsif. This removes the need for a set of curly braces for the else. Instead of writing: else { if (cond) {statements…}} you can write: elsif (cond) {statements…} • The use of elsif simplifies the number of braces, but some find it awkward to read easily
  • 67.
    Exercise • Write aprogram that asks the user for the outside temperature in degrees Fahrenheit. Display the equivalent in degrees Celsius. The conversion formula is: F=32+9C/5 where F is degrees Fahrenheit and C is degrees Celsius. Then, if the temperature is going to be below 40F tell the user to take a coat. If the temperature is above 80F tell them to avoid sunburn. If it’s in between, tell them it will be a great day!
  • 68.
  • 69.
    Perl loops • Aswith all other high-level languages, Perl supports loops. The for and while loops are similar to those in languages like C/C++. • Loops allow you to run a block of code as many times as you want, as long as some condition evaluates to true. Loops always have some condition attached to them.
  • 70.
  • 71.
    The for loop •The for loop in Perl is similar to that in C/C++ and Java. It consists of three components: for (initial;condition;increment) where initial is the code to run prior to starting the loop, condition is tested prior to each loop and must be true for the loop to continue, and increment is performed after every loop. • The three parts of the for loop must exist (although they can be empty) and are separated by semicolons
  • 72.
    The parts ofthe for loop • The initial statements in a for loop are always executed, whether the condition is true or not. The initial statements are executed prior to the condition being examined, and are never executed again (unless inside a larger loop) • The continuation condition is evaluated prior to the code block execution, and revaluated each loop. At the end of the execution of the code block and prior to the condition being tested, the increment conditions are executed.
  • 73.
    Example of afor loop for ($x=1; $x<=10; $x++) { print $x . “n”;} • This program will print the numbers from one to ten in a column and then exit. The initial condition sets the value of $x to 1, each loop increments it by one, and the condition is true until x in incremented to 11.
  • 74.
    Exercise Write a programthat throws six dice, generating random numbers between one and six. Display each number as it is thrown, and after six throws, show the sum. Use a for loop to throw the dice six times, using the same variable for each throw.
  • 75.
  • 76.
    The while loop •The while loop uses a condition to test whether to loop or not. When the condition is true, the loop executes. When the condition is false, the loop terminates. The syntax is: while (condition) {statements…} The condition can be anything that evaluates to true or false.
  • 77.
    Example of awhile loop $num1=6; while ($num1 < 12) { print $num1; $num1++; } • This loop will print the values of $num1 starting at 6 and going through to 11. The loop fails when $num1 is incremented to 12.
  • 78.
    Exercise • Modify thelast program you wrote to use a while loop instead of a for loop.
  • 79.
    The last andnext statements
  • 80.
    The last statement •The last statement can be used to terminate a loop, regardless of the condition’s value. The last statement is similar to break in other programming languages. The last statement is usually used with some condition: while ($x < 10) { if ($x == 6) {last;} statements…} • In this case, when $x is exactly 6, the last statement terminates the loop and execution continues past the closing curly brace.
  • 81.
    The next statement •The next statement causes execution of a loop to restart. It is similar to the continue statement in some languages. Any statements below the next statement are not executed. The next statement is usually used with a conditional: while ($x < 10) { if ($x == 6) {next;} statements…} • When the next is encountered when $x is 6, the loop is restarted again, and the statements below the if are not executed on this pass
  • 82.
    Exercise • Write aprogram that prompts the user for numbers, adding the numbers the user enters together as they are entered. If the user enters the number 13, terminate the loop at that point and display the total so far. If the user enters the numbers 9 or 18, ignore the numbers in the running total, but continue asking for other numbers.
  • 83.
  • 84.
    Labels • Blocks ofcode and the for and while statements can all be labeled, and identified by that label. To use a label, place it before the statement block followed by a colon: BLOCK1: {statements…} • The name of the label can be any valid ASCII characters. The convention is to use uppercase for labels so there is no conflict with existing keywords.
  • 85.
    Labels and loops •When you are labeling a loop, use the same convention as a code block: OUTERLOOP: for ( …;…;…) BIGCOND: while (cond) • Labels allow you to specify which loop is affected using last and next: if (cond) { last OUTERLOOP;} will terminate the loop called OUTERLOOP, even if the statement is nested levels deep inside OUTERLOOP
  • 86.
  • 87.
    The exit statement •The exit statement is used to terminate a Perl script at any time. Whenever the exit statement is encountered, the script is terminated. You can send a return status code back to the calling program with exit, if you want, by appending the return code after the exit statement: if ($val1 == 0) { exit 0;} • This will exit the program with a return code of 0 is $val1 is zero.
  • 88.
    Exercise • One ofthe traditional exercises to show loops is finding prime numbers. Write a program that displays all the primes between 1 and 1,000. Show each prime as it is found. Also show how many primes you found at the end of the program.
  • 89.
    Some Perl functionsand operators
  • 90.
    Perl functions andoperators • Perl is a very flexible language. As you will see, there are several ways to accomplish the same tasks in Perl, sometimes quite easily if you know all the functions or operators. • There are several useful functions that work with scalars and strings, and we can look at these in the next few slides • A Perl reference book is a handy resource for looking up functions and operators
  • 91.
    The index andrindex functions
  • 92.
    The index function •The index function is used to find one string inside another. For example, if you have the string “A stitch in time” and want to find out if the string “itch” occurs inside it, you could use the index function. • The syntax of index is: index string, search_string; where string is the string to be examined for search_string. Of course, you can use variables for either string component.
  • 93.
    Perl functions andparentheses • Perl functions can be written with or without parentheses in most cases, so the statements: index string, search_string; and index(string, search_string); are identical. It is up to you whether you use parentheses.
  • 94.
    Example of index •If index finds the substring, it returns the position of the start of the substring in the string, counting from zero. If not found, -1 is returned. • To find “itch” in “A stitch in time”, you would issue the command: index “A stitch in time”, “itch”; or you could use variables: $str1 = “A stitch in time”; $str2 = “itch”; $foo=index $str1, $str2; which will return $foo a value of 4.
  • 95.
    Modifications of index •You can specify a starting position for the substring search, allowing you to skip a known substring if you want. To specify a starting position, give the number after the substring: index $str1, $str2, 6; this will start looking for $str2 starting at the 6th position of $str1. • This could also be written: index($str1, $str2, 6);
  • 96.
    The rindex function •The rindex function is the same as index, except it starts at the right of a string and works towards to left. As with index, it will return the position of the first match, or –1 if no match is found: $str1 = “A stitch in time”; $str2 = “itch”; $foo=rindex $str1, $str2; Again, $foo would have the value of 4. • You can specify a starting position with rindex in the same way as index
  • 97.
    Exercise • Prompt theuser for a long string, followed by a shorter one. Use both index and rindex to locate the short string inside the longer one and compare the results.
  • 98.
  • 99.
    The printf function •The printf function is a more talented version of print, and is similar to the printf in other languages like C and C++ • The printf function is a “formatted print” and allows better control of the output from a print statement. The syntax of the printf statement is the same as that in C/C++: printf format, list; where format is the format specifier and list is what is to be formatted.
  • 100.
    The printf formatspecifiers • The format specifiers for printf have the general format of: %-w.dl where % is the identifier used for a specifier, - is an optional minus sign used for justification, w is the width of the field, the period is an optional part followed by the number of decimals, and l is the field type. The field type must be specified. Examples are: %20s %-6.2f
  • 101.
    Field types • Themost common field types for the printf statement are: c character s string d integer number (no fraction) f floating number • There are some other field types, but they are rarely used in programming.
  • 102.
    Examples • To displaythe variable $num1 with 6 digits total, two to the right of the decimal, you would use a printf like this: printf “%6.2f”, $num1; or printf(“%6.2f”, $num1); • To print leading zeros if the number does not have enough digits, add a zero in front of the width: printf “%06.2f”, $num1;
  • 103.
    String examples • Stringscan be displayed with format specifiers, too: printf “%10s”, “Tim”; This will right-justify “Tim” into a field 10 characters wide. • To left-justify a string, use a minus sign in front of the specifier: printf “%-10s”, “Tim”;
  • 104.
    Multiple variables • Ifyou are displaying more than one value, you need a format specifier for each. They are read from left to right: printf “%6d %5.4f %3d”,$x1,$x2,$x3; • The format specifiers are matched up with the variables in order. Multiple format specifiers can be inside the quotations marks, as can any string: printf “Total is %8.4f”, $total;
  • 105.
    Exercise • Modify theprime number program you wrote earlier to search primes between 2 and 1000. List the primes in a column, stacked one on top of each other so the column of digits are correct, like this: 001 011 111 etc…
  • 106.
    Module 4 Lists andarrays, Part 1
  • 107.
    Scalars, lists, arraysand hashes • The last few modules have dealt with scalars: variables with a single value. There are times you want to work with a collection of data, all the same type of different types. • With Perl, data that is not a scalar is one of three different types: list, hash, or array. We’ll look at all three types in this module.
  • 108.
  • 109.
    Lists • Lists aregroups of scalars and usually reflect multiple values of the same type of data. There are several ways to represent lists, but the most common is to put the list in parentheses, separated by commas: (“Tim”, 1234567, 16.65) • This list has a string value and two numbers, one an integer and one a float (although you don’t care about the numeric types). Each part of this list is a “list element” so there are three list elements here.
  • 110.
    List contents • Perlallows lists to have as many elements as you would like in them: there is no size limitation. Lists can therefore be very large. • You can use variables inside lists, too: (“Tim”, $var1, $num1, 65) • Lists can refer to other lists, holding one list inside another list as a variable
  • 111.
    Creating list variables •List variables are similar to scalar variables except the $ used as the first letter of a scalar variable name is replaced by an @ sign for array variables. Lists are a type of array. The array is the variable that had a list assigned to it. • The same rules apply for array variable naming as with scalar variables, although $foo and @foo are two different variables as far as Perl is concerned because the first letters are different
  • 112.
    Assigning lists • Youcan assign a list to a variable the same way as any other value. This creates an array variable containing the list: @list1=(“Tim”, 45, 56.0); • When one list contains another list, you use the same syntax: @list2=(@list1, “sample”, 5.3); • You can combine lists and variables: @list2=(@list1, “sample”, $num1);
  • 113.
    The qw operator •When you are adding several scalars to a list and don’t want to bother with quotation marks, you can use the qw operator like this: qw(red green 45 $x); • The qw operator has a whitespace separated list in parentheses, and it converts everything to literals (with no expansion of variables, so $x is a literal). The line above is equivalent to: (‘red’, ‘green’, ‘45’, ‘$x’);
  • 114.
    Getting elements froman array • You can retrieve list (array) elements by using their element index value. All list elements are numbered, starting from zero, left to right. To access an array element, use the element number in square brackets: @test=(0,1,2,3,4,5); print @test[3]; • This will display the fourth element (because of zero-origin) which will be 3
  • 115.
    A slice ofa list • You can retrieve more than one list element at a time. This is subgroup of a list is called a slice. Specify the list elements you want separated by commas: @biglist=qw(a b c d e f g h); @vowels=@biglist[0,4]; • You can subgroup or slice lists many times using this method
  • 116.
    End of alist, method #1 • You can find how many elements there are in a list by using the special symbol $# followed by the list name. For example: @list1=qw(a b c d e); print $#list1; will display the value 4 (counting starts at 0, and there are five elements, so the last is @list1[4]. • Can you use this method to assign the size of the array to a variable, like this: $size=$#list1;
  • 117.
    End of alist, method #2 • A sneakier way to determine the length of a list is to use a scalar conversion: $size = @list1; • $size is a scalar and cannot contain the contents of list @list1. It holds the number of elements in @list1. (This works because of Perl contexts, which you will see later.)
  • 118.
    Exercise • Create anarray that holds the numbers from 1 to 10. Create a slice of that list with a different name that holds all the odd numbers, and another slice of that sublist that holds all the primes. Write a program that displays the last slice, one element at a time. Display the size of all three lists.
  • 119.
    The foreach loopand arrays
  • 120.
    Accessing elements • Youalready know you can access a list element using its index number, such as @list[4]; Often you will have to move along an entire array, working with each element in the list. You can do this with a for loop: for ($x=1; $x<@list; $x++) { print @list[$x];} • The @list in the for condition evaluates to a scalar, the length of the list
  • 121.
    The foreach loop •Perl has an easier setup for moving across a list, and that’s the foreach loop. The foreach loop creates an index variable that is equal to each element in the list, left to right. Here’s the same code as the previous slide: foreach $x (@list) { print @list[$x];} • The variable $x is set to each element in @list in turn, so you don’t have to set up the for loop. The foreach loop starts at index 1, so the 0th element is not counted.
  • 122.
    The for andforeach loops • Perl uses the foreach and for loops interchangably: you can use either in most situations • However, you will find that most Perl programmers use foreach when working with arrays, and for in other situations. This makes it easier to keep track of the loops and uses.
  • 123.
    Exercise • Modify thelast program you wrote to use a foreach or for loop to display the list elements in each of the three lists.
  • 124.
    The pop andpush operators
  • 125.
    Arrays and stacks •Perl has many ways of doing things. Another way to move through a list is to use the pop and push operators. • You can think of an array as a stack, like a stack of paper. When you add something to the top of the stack, everything else gets moved down one place. This is called a push operation. When you take something off the top of the stack, everything moves up one place. This is called a pop operation. (These terms will be familiar to anyone who knows RPN notation.)
  • 126.
    Using pop • Perlsupports the pop and push operators to work through a list. The list is stacked, one on top of the other, from left to right, so the last entry is on top. Here’s an example: @list=(“A”, “B”, “C”, “D”, “E”); $pick=pop @list; print $pick; #prints “E” $pick=pop @list; print $pick; #prints “D” • You can work through the stack to the bottom using the pop operator
  • 127.
    Using push • Thepush operator will add things onto the top of the stack: @list=(“A”, “B”, “C”, “D”, “E”); pop @list; # “E” pop @list; # “D” $x=pop @list; #holds “C” push @list, $x; #puts “C” back The trick to using pop and push is imagining a pointer in the list that moves with each operation. Once something has been popped off the stack it is not in the stack anymore unless pushed.
  • 128.
    More push • Youcan push anything back onto a stack. For example: @list=(“A”, “B”, “C”, “D”, “E”); pop @list; # “E” pop @list; # “D” push @list, “F”; #puts “F” back • The stack will now have (“A”, “B”, “C”, “F”) • You can add more than one item to the stack: push @list, “F”, “G”, “H”;
  • 129.
    Pushing arrays • Sofar we’ve only pushed scalars, but you can push arrays onto a stack as well: push @list @array; • When you push a list onto a stack, the new list is a concatenation of the two • You can push lists of lists, as well. To do this, the list of lists gets flattened into a single list, then added to the stack.
  • 130.
    The shift andunshift operators • The push and pop operators work on the top element of a stack. The shift and unshift operators do the same task at the bottom of the stack. • The unshift operator adds an object to the bottom of the stack, and the shift operator removes an object from the bottom of the stack. • You can use pop/push and shift/unshift at the same time. This is one way of manipulating arrays and list elements, but not all programmers use this method.
  • 131.
  • 132.
    Context • Context hasto do with the way Perl uses variables and values. The two important types of contexts in Perl are scalar context and list context. • When assigning variables, the expression on the left of the equals sign determines what context the right side is evaluated in. The differences between scalar context and list context indicate whether values are treated as lists or scalars, as you will see.
  • 133.
    Scalar and listcontexts • Scalar context arises when using scalars: $num1=$num2; since the left is a scalar, the right is treated as a scalar. If the left is a list, the right is treated as a list: @list1=@list2; • There are times when mixed contexts arise: $num1=@list1; which has a scalar to the left so it evaluates the right as a scalar and returns the number of items in the list.
  • 134.
    Forcing scalar • Thereare times when you want to force a list to be treated as a scalar. The print function actually expects a list as an argument, so the command: print @list1; prints the contents of the list @list1. • You can force the print command to accept a scalar context like this: print scalar(@list1); which will convert the list to a scalar and display the number of elements in the list.
  • 135.
  • 136.
    Reordering array elements •You will often need to work with array elements in a different order than they appear in a list. You could code variable storage for each element, but there are better ways in Perl. To organize a list in some order, there is the sort function. • The syntax of the sort function is: sort list; where list is the name of the list to be sorted. • By default an ASCII sort is produced. The original list is not modified.
  • 137.
    Using sort • Youcan use sort to order a list, and assign the result to another variable: @let1=qw(A F D w U d F l X H); @let2=sort(@let1); • It is important to remember that the ASCII sort will result in lower case letters appearing after upper case letters. Also, numbers will sort in a different order than you expect. For example, 12 will be after 102. To handle these issues, different sorting orders are needed.
  • 138.
    Exercise • Create anarray with a list of names in it, and sort them in ASCII order, displaying the result.
  • 139.
    Changing the sortorder • To change the sort order from ASCII, you need to add an argument to the sort function. The argument is a block of code which contains instructions on the new sort order. • Inside the block of code, two variables ($a and $b) are used to indicate two elements in the list. The logic returns one of three values when comparing $a and $b:-1 if $a is less than $b, 1 if $a is greater than $b, and 0 if they are equal.
  • 140.
    The “spaceship” operator •To help with changing the sorting order of numbers, Perl has the spaceship operator “<=>” which performs the comparison of $a and $b. The spaceship operator returns –1 if the left is less than the right, 1 if the left is greater than the right, and 0 if they are equal. Here’s how to use it: @newlist=sort {$a <=> $b} @list; • This command will sort list based on the argument values. This works with numbers, not strings.
  • 141.
    Exercise • To seehow the spaceship operator and the sort function work, create a list of random numbers (you can generate it or simply enter one on the keyboard). Then, use the sort function with the spaceship operator to sort the numbers in ascending order, and display the result.
  • 142.
  • 143.
    Scalars to arrays •Perl allows conversion between scalars and arrays in several ways • You can convert a scalar to an array using the split function, which requires a pattern to be specified that breaks the scalar into an array based on that pattern. The pattern is surrounded by slashes. For example, to break the scalar “This is a script” into an array based on spaces, you would issue the command: @n1=split(/ /,“This is a script”);
  • 144.
    Results of split •The result of the command: @n1=split(/ /,“This is a script”); would be an array called @n1 that has the list (“This”, “is”, “a”, “script”) as the contents. • If you don’t specify a pattern to break up the scalar, whitespace is used by default. If you do not specify a variable name as an argument to split, the default variable $_ is used. • A special pattern is null (//) which breaks the scalar up character-by-character
  • 145.
    Using split • Thesplit function allows you to break up arrays into smaller parts when combined with foreach: @cpu=(“Intel, Pentium”, “AMD, Athlon”, “Transmeta, Crusoe”); foreach $x (@cpu) { ($company, $chip)=split(/,/, $x);} • This uses the foreach to extract the three list elements in @cpu, and then split breaks those list elements into two parts based on the commas. These are then assigned to the scalars $company and $chip respectively.
  • 146.
    Exercise • Create anarray which holds a list of record albums, consisting of album title, the year it was released (guess if you don’t know) and artist (in that order). Add at least five albums to the list. Then, use the split function and a for or foreach loop to display the artist followed by the title for each list element.
  • 147.
    Module 5 Lists andarrays, Part 2
  • 148.
  • 149.
    Arrays to scalars •The split function lets you convert scalars to arrays. To convert arrays to scalars, use the join function. The join function expects a string and a list as arguments, and joins the list together using the string, returning a string scalar. • The syntax is: join string (list);
  • 150.
    Example of join •To join a list of letters into a single string, separated by spaces, you would issue the command: $string=join(‘ ’, (‘A’, ‘B’, ‘C’, ‘D’, ‘E’)); The result would be $string set to “A B C D E”. (The parentheses around the arguments are optional as with most Perl functions.) • The joining string can be a null or any valid ASCII character
  • 151.
    Exercise • Create anarray with a list of numbers from 1 to 10, then convert the array into a scalar using commands to separate the numbers. Use a hyphen to separate the numbers. Use the join function to perform the conversion.
  • 152.
    Split and jointogether • You can use the split and join functions together to manipulate strings. For example: $str=“Hello”; $temp=join(‘ ’, split(//,$str); print $temp; • This will result in the output “H e l l o”. The split breaks the scalar into an array based on individual characters, and the join adds the parts back together into a list with spaces between.
  • 153.
    The reverse function •The reverse function is used with strings to reverse the characters in the string. It returns a reversed string. For example: $str1=“A B C D E”; $str2=reverse($str1); print $str2; will display the string “E D C B A”.
  • 154.
    Exercise • Write aprogram that prompts the user for a string, and then display it back, reversed, to the user with hyphens between each of the letters in the string. If they entered “This is a test” you would display “t-s-e-t- -a- -s-i- - s-i-h-T”.
  • 155.
  • 156.
    Hashes • Scalars aresingle data entities. Arrays are ordered collections of data entities. A hash is an unordered collection of data which may contain many sets of data, but there is no “first” or “last”, no “top” or “bottom” item in a hash. • Hash data is always in pairs • Hashes are treated much like lists, but the hash variable is used. Hash variables all start with %.
  • 157.
    Creating a hash •A hash is created in the same way as a list: %hash1=(“Intel”, “Pentium”, “AMD”, “Athlon”, “Transmeta”, “Crusoe”); • To help readability, the hash is usually laid out in its declaration: %hash1=(“Intel”, “Pentium”, “AMD”, “Athlon”, “Transmeta”,“Crusoe”); • This shows the pairing of hash data more clearly
  • 158.
    The => operator •To make relationships in hashes clearer, Perl allows the use of the => operator, which Perl interprets to mean a double-quoted string and a comma. This: %hash1=(“Intel”, “Pentium”, “AMD”, “Athlon”, “Transmeta”,“Crusoe”); becomes this: %hash1=(Intel => Pentium, AMD => Athlon, Transmeta => Crusoe);
  • 159.
    Hash keys • Whenyou have a pair like this: Intel => Pentium the left side of the pair is called the hash key, and the right side is the value. The hash key is used to look up the pair. • All hash keys must be unique. If you have more than one hash key the same, the newer entries will overwrite the older ones in the list. Values do not have to be unique. Having unique hash keys leads to the use of hashes as a quick-lookup structure.
  • 160.
    Locating hash entries •To locate a specific entry in a hash, we use the hash keys. The key is enclosed in curly braces: %hash1=(Intel => Pentium, AMD => Athlon, Transmeta => Crusoe); print $hash1{AMD}; • This will print the value corresponding to the AMD hash key, which is “Athlon”. Note the use of a scalar $hash1.
  • 161.
    Exercise • Create ahash with a set of names and phone numbers. Populate the hash with five sets of data. Ask the user which name they would like to look up the phone number for, and use that as the key to display the number.
  • 162.
    Showing all entrieswith keys • Instead of using a for or foreach loop to show all entries in a hash, the keys function can be used: keys(%hash) • This will show a list of all the keys. To show keys and values, a for loop can be used: for (keys %hash) { print “$_ is the value for $hash{$_}n”;} • This uses the default variable show the hash key ($hash{$_}) and the value assigned to it as $_.
  • 163.
    Reversing hashes • Youcan use the reverse function on a hash to reverse the pairs, converting the value to a hash key and the hash key to a value. This allows you to look up with either value: reverse %hash; • Reversing a hash swaps the hash key and value, not the order of the pairs in the hash • Be careful when using reverse. Since all hash keys have to be unique, there is the chance of losing some data if the values are the same prior to reversing.
  • 164.
    Exercise • Set upa hash that contains a few city names and their corresponding zip codes (guess if you don’t know them). Then, display the names of the cities and ask the user to enter one. Display the corresponding zip code. Reverse the hash and display the zip codes, asking the user to choose one. Display the city for that zip code. You can display the lists of cities and zip codes any way you want, or display both at the same time with the keys function.
  • 165.
  • 166.
    Adding entries toa hash • You can add entries to a hash by specifying the key and value: %hash1=(Intel => Pentium, AMD => Athlon, Transmeta => Crusoe); $hash{HP} = “PARISC”; • This adds the entry HP as the key and “PARISC” as the value to the hash. Again, we use a scalar for the function.
  • 167.
    Changing values • Youcan change values in a hash by reassigning them: %hash1=(Intel => Pentium, AMD => Athlon, Transmeta => Crusoe); $hash1{AMD} = “Thunderbird”; • This will change the value associated with the key AMD. To change the key, you can add a new entry and delete the old one, or reverse the hash and change the value, then reverse back (which may cause problems).
  • 168.
    Deleting hash entries •To remove an entry from a hash, the delete function is used with the hash key: %hash1=(Intel => Pentium, AMD => Athlon, Transmeta => Crusoe); delete ${Transmeta}; • This will delete the hash pair starting with the key “Transmeta”.
  • 169.
    Converting hashes toarrays • You can convert hashes to arrays and vice-versa easily, since they both have the same list structure. To convert, simply reassign using the proper variable name: %hash=(“A”, “B”, “C”, “D”); @array=%hash; %hash2=@array; • The first line creates a hash, the second converts the hash format to an array format, and the third line creates a new hash with the array’s contents.
  • 170.
    Blanking a hash •To remove all data from a hash (hash keys and values), you could use the delete function in a loop, but an easier way is to simply redefine the hash. Suppose you want to blank: %hash=(“A”, “B”, “C”, “D”); To blank this hash, just redefine it: %hash=(); and the old values are deleted and a hash with no contents is left.
  • 171.
    Sorting a hash •You can use the sort function to sort a hash. This is easily done by using keys to extract the hash keys and then use sort on those keys: foreach ( sort keys %hash1) { print “$_ $hash1{$_}n”;} • In this case, we’ve just printed the sorted hash, but you could store the results in a new has variable in sorted order
  • 172.
    Testing for hashkeys • You can test to see if a hash key exists, preventing problems with other statements. Most people would assume this would work: if ( $hash{$key})… but it doesn’t. • To check whether a hash key exists, use the exists function instead: if (exists $hash{$key}) …
  • 173.
    Exercise • Modify thelast program you wrote to: – display the current hash, and ask the user if they want to add a new entry. If they do, prompt for the city and zip and add it to the hash – display the hash again and ask if the user wants to delete an entry. If they do, ask for the hash key to delete, and remove that entry – perform the same lookups for city and zip as before, but check to make sure the keys exist. If not, display an error message.
  • 174.
  • 175.
    The grep function •The grep function searches for patterns in an array. The syntax for Perl’s grep is: grep pattern, list; • For example, to find all the elements with the pattern “day” in the array: @days=qw(Monday, Tuesday, Friday); you would issue the command: @result=grep /day/, @days; which will populate the array @result with the matching elements.
  • 176.
    How grep works •The Perl grep works by proceeding through an array, one element at a time, and assigning the element to the default variable $_. The pattern to be found is then compared against $_. • If the pattern is found in $_, the expression is true and the element is returned by grep. • If the pattern is not found, the element is not returned by grep.
  • 177.
  • 178.
    Intersections • A commontask with Perl is finding the intersections of two hashes or arrays. In other words, find out which elements are in common with two sets of data. We can also find the difference, by finding everything not in the intersection set. • The find an intersection, the grep function is very useful
  • 179.
    Performing the intersection •Finding an intersection between @array1 and @array2 is surprisingly simple: %temp=(); foreach (@array1) { $temp{$_}=1;} @intersect=grep $temp{$_}, @array2; • This code starts by setting up an empty hash. The foreach stores each element in @array1 in $_, one at a time, and fills the temp list with them and sets the values to 1 (so it is true). The last line examines @array2 one element at a time and sets it to $_, which is grepped in %temp. If there, it is an intersection element and added to @intersect.
  • 180.
    Finding the difference •Finding a difference is similar to an intersection but using negation: %temp=(); foreach (@array1) { $temp{$_}=1;} @intersect=grep (! $temp{$_}, @array2);
  • 181.
    Exercise • Create twoarrays of numbers, both with some identical and some different values. Display the intersection and difference of the two arrays. Display the two derived arrays in sorted order.
  • 182.
    Module 6 File anddirectories
  • 183.
  • 184.
    Filehandles • In orderto work with files, you need to use a filehandle. A filehandle is a variable that acts as a reference between your Perl program and the operating system’s file structure. • Filehandles contain information about the file, the way the file was opened (read-only, etc), where you are in the file, and some other attributes. • Every file manipulation in Perl is done through filehandles
  • 185.
    Naming filehandles • Filehandlevariables do not have a special character in front of them like scalars, lists, arrays, or hashes. For that reason, the convention is to use uppercase for filehandle variables to avoid confusion with Perl keywords. • Filehandle names can be any combination of characters you want, but descriptive names are often easiest to work with and keep track of
  • 186.
    Opening a file •To open a file to be read by Perl, you need to use the open function with a filehandle. The syntax for open is: open handle, filename; where handle is the filehandle and filename is the file to be opened, which may include a path. • An example of using an open function is: open (BIGFILE, “file1.dat”); • If you do not specify a directory path, the current directory is assumed
  • 187.
    Checking an open •Normally you will embed an open function inside an if statement to make sure the file was opened properly. Otherwise, commands later in the program would cause errors. Here’s a typical setup: if (open(BIGFILE, “datafile.dat”)) { statements to run } else { print “Cannot open the file!n”; exit 0;}
  • 188.
    Using pathnames • Ifyou use a pathname in the file open command, it should conform to the format of directory paths in your operating system. For example: open(BIGFILE, “D:datadata.fil”); will work for Windows but will cause problems for UNIX and Linux. This format will cause Perl problems because the has to be escaped. To prevent problems, Perl allows you to use UNIX-like slashes for Windows paths, which is correctly interpreted by the Windows Perl interpreter: open(BIGFILE, “D:/data/data.fil”);
  • 189.
    Problems with paths •You must use double backslashes for Windows paths because of escaping of the backslash if you are not using forward slashes, but this can cause even more problems. For example, open(BFILE, “D:datadata.fil”); should be written as: open(BFILE, “D:datadata.fil”); to escape the backslashes properly. • You can use both absolute and relative pathnames, as well as Windows’ UNC names (such as machine sharename)
  • 190.
    Closing a filehandle •After you have opened a file and done something with it, you should always close the file. Closing the file lets the operating system know the file is not in use anymore and the filehandle is freed. • To close a filehandle, use the close function with the handle name: open(BIGFILE, “data.txt”; statements… close BIGFILE;
  • 191.
    Reusing a filehandle •If you have one file open with a specific filehandle, and then use the same filehandle in another open command, the first file is automatically closed and the filehandle is opened with the new file. • This can be used to eliminate the opening and closing of file statements in a program, as long as the files are used sequentially
  • 192.
  • 193.
    Reading from afilehandle • There are a couple of ways to read from an open filehandle. The most common is to use the file input operator, which is a pair of angle brackets around the filehandle name (just like <STDIN> to read from the keyboard). For example: open(BIGFILE, “data.txt”) $line=<BIGFILE>; • This will read a line from the file data.txt (referred to by the filehandle and not the name) and store it in $line
  • 194.
    Using the fileinput operator • The line $line=<MFILE>; will read a whole line of input from the MFILE filehandle. If there is nothing to read, the value “undef” (for undefined) is returned. • You can use loops to read through an entire file. To test whether the value undef has been detected, use the “defined” keyword: while (defined($line=<MFILE>)) …
  • 195.
    A shortcut forreading lines • Perl allows the code on the previous slide to be shortened. Instead of writing: while (defined($line=<MFILE>)) {print $line;} you can write: while(<MFILE>) {print $_;} • This works because the shortform stores the line in the default variable $_. The shortform also checks for end-of-file for you.
  • 196.
    Exercise • Write aprogram that reads in a file (pick any file from the directory) and display the contents of that file, line by line. Make sure the end of file is handled properly, and remember to close the filehandle after you are finished. Prompt the user for the filename to be read.
  • 197.
    Reading into alist • So far, we read file contents into a scalar, one line at a time. You could assign the lines read from a file to a list just as easily: open (MFILE, “data.txt”); @list=<MFILE>; close <MFILE>; • When using a list or array, the entire file is read in. Each line in the file is assigned as one element in the list. (So the first line is @list[0], and so on.)
  • 198.
    Using lists • Ifyou need to read a lot of data from a file, it is often easiest to use a list or array to hold the contents, instead of assigning a variable for each line, then processing the contents of the line somehow. • Since the array or list is just a copy of the file’s contents, any changes made to the array will not harm the original file
  • 199.
    Exercise • Write aprogram that prompts the user for a filename, then reads that file in and displays the contents backwards, line by line, and character-by-character on each line. You can do this with scalars, but an array is much easier to work with. If the original file is: abcdef ghijkl the output will be: lkjihg fedcba.
  • 200.
  • 201.
    The open ordie syntax • Perl has a command called “die” which is often used with file commands. When the die command is encountered, the program stops executing and shows a message such as: Died at fileopen.txt line 165 • To use the die command with an open function, you can use this format instead of an if statement: open(BIGFILE, “data.txt”) || die; • This is read as “open or die”: if the open is successful, execution continues; otherwise, the die statement terminates the program
  • 202.
    Adding messages todie • To help decipher program exits from the die command, you can use strings to be shown upon exit. For example: die “File won’t open”; will display the message: File won’t open at foo.txt line 52 when the die statement causes termination of the program. • You can use these messages to embed error codes or strings in likely locations for die statements
  • 203.
    The $! variable •When an error is recorded by Perl, it stores the error number in a special variable called $!. When examined numerically, $! shows a number, but when examined as a string it shows an error message from the operating system. This can be used as part of the die string: die “Can’t open: $! n”; • This statement will display the message “Can’t open” followed by the error string from the operating system when the die is triggered
  • 204.
    Warnings • Instead ofbombing out of a program using die, you may simply want to issue a warning to the user. You can do this with the warn command: warn “message”; • The warn command will display the error message, but the program keeps running. You can use the error codes with warn: warn “message: $!”;
  • 205.
    Exercise • Modify thelast program you wrote to incorporate the die statement to handle file open errors. You can use a custom message if you want.
  • 206.
  • 207.
    Opening a filefor writing • Before you can write data to a file, it has to be opened specifically for writing. You use the open function to open a file for writing, but then use a redirection operator in the filename component: open(MYFILE, “>bigfile.txt”); open(MYFILE, “>>bigfile.txt”); • The redirection operators are the same used by UNIX. “>” overwrites any contents already in the file, while “>>” appends to the end of the file.
  • 208.
    Creating new files •If the file you instruct open to open for writing does not exist. The file is created for you in the current directory unless a path has been specified. • If the file does exist, it will be overwritten unless you use the append operator • Most operating systems treat case in filenames as important, but some do not. Check with your operating system to see if mixed case filenames are significant, or whether everything is converted to uppercase.
  • 209.
    Writing data • Writingdata to a file is done with the print or printf command, but with the filehandle included. The syntax is: print filehandle data; • There is no comma between the filehandle and the data! • For example, to write a single variable $num1 to the file once it is opened for writing, use the command: print MFILE $num1; assuming MFILE is the filehandle.
  • 210.
    Checking for writes •You can use a logic structure to make sure a write has been performed properly: if (! print MFILE $num1) {warn “Can’t write to the file!”;) } close (MFILE); • If the data value $num1 could not be written to the file, the warning is displayed. We used a warn here, but you can also use a die statement instead.
  • 211.
    Closing after writing •It is important to issue a close operation after writing data to a file. This is because most operating systems don’t write to the file immediately, but buffer the data. The close operation tells the operating system to commit the changes, and mark the file as not in use. • If you do not issue a close operation, there is a chance you will lose the data you have tried to write, and may corrupt the file
  • 212.
    Exercise • Write aprogram that creates a file called “data.dat” in the current directory. Prompt the user for five numbers, and write them, one at a time, on both the screen and into the file. Close the file, then open it again for reading only, and display the contents on the screen. Handle error conditions that may occur.
  • 213.
  • 214.
    Multiple files • Youcan have many files open at once. The limit to the number of files that can be open (for reading or writing) is usually set by your operating system. There is no intrinsic limit imposed by Perl. • Often you will want to read one file, line by line, and process the output saving it into another file. This requires two files to be open at once. Keep track of the filehandles and the process will be simple.
  • 215.
    Exercise • Create afile that has a series of ten strings in it, all accepted from the user through the keyboard. Then, open that file in read mode, and reverse the order of the characters in each line, saving the reversed line to a new file. Display the completed reversed file when done.
  • 216.
  • 217.
    Binary vs. text •Binary files are files that have to be translated literally, such as a picture file, a sound file, or a binary file. Text files are any files that contain records that end in end-of-line characters. • Some operating systems distinguish between binary and text files. Unix and Linux do not, but Windows does. Perl can’t tell the difference between binary and text files (it has a Unix heritage).
  • 218.
    Handling text files •When Perl writes data to a file, it does so in text mode. When the newline n character is encountered in a string to be written to a file, Perl converts it to the appropriate characters for the native operating system: UNIX/Linux: ASCII 10 (LF) Windows: ASCII 13/10 (CR/LF) Macintosh: ASCII 13 (CR)
  • 219.
    Handling binary data •When writing binary data to a file you don’t want Perl converting anything, so you have to use the binmode command with the filehandle to tell Perl this is to be written literally: open(BFILE, “>file1.dat”); binmode(BFILE); • You only need to specify binmode for a filehandle once, until you close the file • On some operating systems (UNIX/Linux and Macintosh) binmode is ignored as there is no distinction between binary and text files
  • 220.
  • 221.
    File tests • Perlallows the UNIX file tests to be performed. This is usually done in a condition like this: if (-r FILE) {..} • The condition has one valid option followed by the filehandle to be tested. Alternatively, you can use a filename or full path and filename instead of a filehandle.
  • 222.
    Valid tests • Thesetests are all UNIX tests available to Perl: -B true if a binary file -dtrue if directory -e true if file exists -f true if regular file -M returns age in days since last modification -r true if readable -s returns size of file in bytes -T true if text file -w true if writable -z true if file exists but is empty
  • 223.
    Using tests • Youcan use tests to verify files when opening or writing. If you are prompting the user for a filename, you can check to make sure the file exists or has the correct type of data. You can also use test to make sure you are not overwriting an existing file.
  • 224.
    Exercise • Modify thelast program you wrote to allow the user to enter both the filename to read and the filename to write, and check to make sure that the file to read exists, and the file to write to doesn’t (so you don’t overwrite a file). Display messages if the tests fail.
  • 225.
    File and directorymanipulation
  • 226.
    Renaming files • Torename a file, you use the rename command with the old filename and the new filename separated by a comma: rename “a.dat”, “b.dat”; • You use the filenames and not the filehandles, since the file cannot be open when you rename it. You can use die and warn to trap failures of the rename command, as you have seen earlier.
  • 227.
    Deleting files • Todelete a file, use the unlink command with the filename(s) in a list: unlink file1.dat; • As with rename, you can’t use a filehandle because you can’t delete an open file. Again, you can trap false returns from the operating system with die and warn.
  • 228.
    Directories • Almost alloperating systems use a hierarchical structure to maintain files in directories. Being able to read the directory structure is important. Perl lets you do this through directory handles. • A directory handle is used to read the contents of a directory. You can open a directory handle using the opendir function: opendir handle directory; where handle is the directory handle you want to open, and directory is the name of the directory to be read.
  • 229.
    Directory listings • Oncea directory has been opened with a dirhandle, you can read the directory contents with the readdir function: opendir TEMPDIR, “/temp” || die; readdir TEMPDIR; • After you are finished with a dirhandle, you should close the handle with closedir: closedir TEMPDIR;
  • 230.
    Storing directory contentsin an array • You will often want to read the directory contents and store the list for future use. You can assign the contents to an array just as you did with file contents: opendir (MDIR, “/temp”) || die; @filelist=readdir MDIR; closedir MDIR; • You could then manipulate the contents of @filelist, which will have one directory line per element with most operating systems
  • 231.
    Changing directories • Tochange directories, you can use the chdir command. Changes in directory can be specified absolutely or relatively. For example: chdir ../book; will move up one directory level and down into the directory book. • If you do not specify a directory argument for chdir, it will change to your home directory (if one is defined by your operating system)
  • 232.
    Creating directories • Tocreate a directory, use the mkdir command with both the new directory name and the permissions are arguments: mkdir newdir, perms; • For example: mkdir temp, 0755 || die; will create the directory temp under the current directory with the UNIX permissions 755 • The directory permissions use the same convention as your operating system. Not all operating systems will use permissions.
  • 233.
    Deleting directories • Todelete directories, use the rmdir function with the pathname to the directory to be removed: rmdir temp; • If the directory cannot be removed, a false is returned by the operating system and can be handled with die or warn. • You can only delete empty directories!
  • 234.
  • 235.
  • 236.
    Functions and Perl •Almost every high-level programming language supports functions and Perl is no exception. Simply put, a function is a block of code that can be called by name to perform some task, then return to the calling program. Functions can usually accept parameters and return values. • You’ve seen a lot of functions so far: print, printf, reverse, sort, open and so on are all built-in functions • Perl calls user-defined functions “subroutines” or “subs”
  • 237.
    Creating a subroutine •To create a Perl subroutine, you use the keyword sub followed by the subroutine name and the code for that subroutine in curly braces: sub sub_name { statements… } • Subroutine names do not have any specific character first, and naming rules are the same as other variables in Perl
  • 238.
    Running a subroutine •To run a subroutine in Perl, you can use one of two syntaxes: subname(); or &subname(); • The ampersand is optional in almost all cases, as is most commonly left off. If you are passing parameters to the subroutine, they would be enclosed in the parentheses. • Subroutines can call other subroutines, or themselves recursively
  • 239.
    Returning values • Manysubroutines will return a value when the subroutine code has been executed. This value is usually assigned to a variable or used in a statement like print. For example: sub twoplustwo { 2+2;} $num1=8+twoplustwo; • In this case, $num1 will have the value of 12. In this case there is no return statement in the subroutine as with other languages. The last expression in the code is returned by the sub.
  • 240.
    Using return • Youcan use a return statement if you want to specify what to return from a subroutine or want to have the option of terminating the sub early: sub oddeven { return(0) if ($num%2 == 0); statements…} • In this case, $num will have been defined outside the subroutine. If the mod 2 of that number is 0, the returned number will be zero and the sub terminates at this point. Otherwise, the program continues executing the statements.
  • 241.
    Returning variables • Youcan return any variable from a subroutine, including hashes and arrays: sub somesub { statements… return @array; statements…} @rev=reverse(somesub(@array)); • In this code, an array is returned from the sub and reversed in the body of the Perl script, then stored in a new array variable
  • 242.
    Exercise • Write twosubroutines. Both use a variable defined in the body of the program. The first sub multiplies the variable by ten, while the second sub divides the number by two. Prompt the user for the number, and then call both subs and display the results from the Perl script (not the subs).
  • 243.
  • 244.
    Arguments • The codewe wrote so far depends on a variable being defined outside a subroutine, and that’s not good because this reduces code reuse. Ideally, subroutines should be stand-alone and portable. • To pass data to a subroutine as part of the subroutine call is to use arguments. You’ve seen arguments passed to functions with several built-in functions, such as sort. To pass arguments to a subroutine, you specify them as part of the subroutine call.
  • 245.
    Specifying arguments • Argumentsfor a subroutine can be specified in two ways, either with or without parentheses (most Perl functions don’t care about parentheses as you have seen already in this course): sub(args…); sub args…; • You can only leave off the parentheses if the subroutine has been defined earlier in the script that the subroutine call. This is a common error for programmers who define subroutines at the end of their code.
  • 246.
    Multiple arguments • Theargs list can be composed of many arguments, all separated by commas: sub1 arg1, arg2, arg3; • There is no limit to the number of arguments that can be passed to a subroutine, and they can be of any valid type or size. You can mix argument types in the list passed to the subroutine: sub1 $x1, @array1, $x2, @array2;
  • 247.
    The @_ variable •Inside the subroutine, Perl uses the argument variable @_ to hold all the arguments. You can use @_ in your subroutines: sub showarg { print join(“ ”, @_);} showarg “This”,“is”,“a”,“test” ; • This program passes the four strings to showarg, which uses the join to paste them together with spaces between each one. The print displays the result. • The @_ variable is not related to the $_ variable
  • 248.
    Accessing each argument •Inside a subroutine, each argument is accessed as a part of the @_ array using subscripts, as with any array. The first argument will be element @_[0], the second array element @_[1], and so on. • You can use loops to access each argument using the array element name
  • 249.
    Using names forarguments • Inside a subroutine, using @_[x] can be awkward and counter-intuitive. Perl allows you to name arguments being passed into the subroutine, just as with other language function arguments, but you have to do it by assigning the @_ array elements at the top of the subroutine code: sub foo1{ ($var1, $var2)=@_; statements…} • This will assign the first argument to $var1, and the second to $var2, and so on if more are named
  • 250.
    Exercise • Write asubroutine that expects three numbers passed as arguments, and multiply the three together, returning the result from the subroutine. Call the subroutines from inside a Perl script that asks the user for all three numbers. Use names for each argument inside the subroutine itself.
  • 251.
    Passing arrays andhashes as arguments
  • 252.
    Passing an arrayor hash • In the last section you passed scalars to a subroutine. Passing an array or hash to a subroutine is done in the same way: sub revarray { return reverse(@_);} @rev=revarray @array; • This will pass the array @array to the subroutine revarray, reverse it, and store the result passed back from the subroutine into @rev. The entire array is read in to the subroutine as @_.
  • 253.
    Passing multiple arraysor hashes, Part 1 • You might think you could pass more than one array or hash in the same manner: sub printarrays { (@array1, @array2)=@_; print @array1, @array2;} however this will not work as @_ will hold the two arrays together and there is no way to break them into two separate arrays in the subroutine. There is no indication where one array ends and the next starts. In this case, @array1 would have the entire @_ contents and @array2 is empty.
  • 254.
    Passing multiple arraysor hashes, Part 2 • There is no easy way to solve this problem of passing arrays directly. However, they can be passed by reference (which is seen in Module 8). • You can pass one array and a mix of scalars without problem as long as the array or hash is the last assigned element. The scalars will be properly assigned from the @_ array and everything else goes in the array: sub passarrays { ($x1, $x2, $x3, @array1)=@_; statements…;}
  • 255.
    Exercise • Write aprogram that contains a subroutine that accepts one letter and an array of letters. Prompt the user in the main part of the Perl script for the letter, then for five strings to be sent as an array. Inside the script, combine the five elements in the array into a scalar string, using the letter as a joining character, and pass the result back to the script. Display the resulting string.
  • 256.
  • 257.
    Subroutine prototypes • Asubroutine prototype allows you to specify at the top of your code the name of a subroutine and the number of arguments it expects. Then, later in the code after you have used the subroutine by name in statements, you can define the subroutine properly. This allows you to use the shorter forms of calling subroutines, and allows for code reuse more often. • A prototype’s role is to tell the interpreter what type of arguments are to be used by a subroutine
  • 258.
    Defining a prototype •To define a subroutine prototype, use the same syntax as when defining the subroutine itself. You do not have to specify variable names, but you can indicate the presence of variables by using the first symbol (such as $ or @) of the variable: sub mysub1 ($ $ $); • This defines a subroutine called mysub1 which will expect three scalars. The actual code for mysub1 is defined later in the program.
  • 259.
    Exercise • Modify thelast program you wrote so the body of the subroutine definition is at the end of the script. Add a subroutine prototype to the top of the script and try rerunning the program.
  • 260.
  • 261.
    Scope • If youhave programmed in any other language that supports functions, you have seen scope. Scope refers to the block of code where a variable has meaning. • If you define variables inside a Perl script they are available to any subroutines inside that script. Variables created inside a subroutine are available outside the subroutine, as well (which is not how most programming languages behave). This is an important difference in language! In this case, the variables have scope in the entire program.
  • 262.
    Keeping it privatewith “my” • To help code reuse and portability, you want to be able to define variables inside a subroutine that have no meaning outside the subroutine. This prevents conflicts with other script variables when you move the subroutine to new programs. To define a local variable, you use the my operator: my $var1; my @array1; • Any variable defined with the word my is considered private to the block of code it is defined in. That is where the variable has scope.
  • 263.
    Private variables • Anyvariable defined with the keyword my to make it private is released as soon as the code block in which it was defined is terminated • You can have private and global variables with the same name, and they are treated differently by the compiler: $num1=6; sub something { my $num1; # different variable statements…} • Both $num1s are treated as different in this case
  • 264.
  • 265.
    Using strict • Perlhas a keyword called strict. When used in a program, it tells the interpreter to use much more care when evaluating statements and to display warnings and error messages for everything it finds questionable (usually Perl will let you get away with quite a bit before complaining about something). Using strict is a good way to enhance your programming abilities. To do this, simply put: use strict; at the top of your code.
  • 266.
  • 267.
    The debugger • Partof the Perl interpreter is a debugger that you can use to examine the execution of your Perl scripts. The debugger allows step-by-step execution of scripts, examination of variable values, and the use of breakpoint. • The debugger is built into every Perl interpreter and is activated with the –d option when launching the interpreter: perl –d myprog.txt
  • 268.
    Debugger output • Whenyou first launch a program with the debugger option you will see version information for the perl interpreter, then a help prompt: Enter h or ‘h h’ for help. then the first line of the script. You will also see a message showing which filename the statement is in, and what the line number was. • Finally, the debugger prompt DB<1> is shown, indicating the debugger is waiting for your first debug command.
  • 269.
    The statements • Whenthe debugger shows you a statement, it is in the cache ready to be executed but has not yet been executed. • Each statement read by the debugger can be examined and manipulated prior to it being run. This allows for some changes or examination of the environment before each statement is executed, which is ideal for debugging the script. • Any valid Perl command can be used at the debugger prompt
  • 270.
    Debugger help • Youcan get help from within the debugger at any time using the h command, usually followed by the command you want information about. For example, for help on the b (breakpoint) command, type: h b • The command ‘h h’ shows a summary of the available commands and their syntax • To page the output from the help system, put a | in front of the command (such as |h h)
  • 271.
    Listing the program •To list the next ten lines in your Perl script, use the l command. Every time you issue an l command, the next ten lines will be shown. • Listing the lines does not affect the line that is being executed: it simply shows you the next ten lines of the script. The next line to be executed is shown in the listing like this: ===> • You can specify which lines to show by using a range: l 10-15 shows lines 10 through 15 inclusively in the script.
  • 272.
    Stepping through statements •To run each line, one at a time, in the debugger, use the n (next) command. Each line is shown on the screen before it is executed. • To see the value of any variable, use the print command at the prompt: print $var1 and the current value will be shown without affecting the program • You can keep using the n command to step through each line in the program until termination
  • 273.
    Stepping into subroutines •When a subroutine call is encountered by the debugger in the script, it executes the subroutine as a single call and doesn’t show the lines in that subroutine. To jump into the subroutine and move through it one line at a time, use the s (step) command. • When you issue the step command the debugger shows each line, one at a time, executed inside the subroutine and all valid debugger commands can be used inside the subroutine
  • 274.
    Breakpoints • You canuse the n command step through a program line by line, or let the debugger run all the lines until some condition is met. This is a breakpoint and is set with a b command. • You can set a breakpoint at any line number by specifying the line number. To set a breakpoint at line 10, you would use the command: b 10 • The c (continue) command lets you continue executing after a breakpoint has been triggered
  • 275.
    Using breakpoints • Youcan set a breakpoint on any line in a script except those that have: – Just a curly brace or closing parentheses – A blank line – A comment • Usually, breakpoints are used after a loop, subroutine return, or complex command so you can verify the actions taken. You can set a breakpoint anywhere except those listed above.
  • 276.
    Showing and removing breakpoints •To show all the breakpoints that are set in your script, use the L command • To remove a breakpoint, use the d command followed by the line number (or the subroutine number, if a breakpoint is set to a subroutine). For example: d 37 deletes the breakpoint set at line 37.
  • 277.
    The reset command •You can reset the debugger to clear all breakpoints and variables, and restart the execution of the script from the top with the R command • When reset is executed, any defined variables lose their value, and the first line of the script is the to-be-executed line
  • 278.
    GUI debuggers • Thebuilt-in debugger is acceptable for simple tracing and debugging, but is not suitable for very complex debugging tasks. Also, it is not graphical. • There are many GUI-based debuggers for Perl available on the market, some bundled with Perl distributions. The ActiveState Perl distribution has a Windows debugger in the package, for example, and there are several available for UNIX and Linux.
  • 279.
  • 280.
    References In this lastmodule of the course we will look at references. If you have programmed in C, C++, or other high level languages, you may be familiar with pointers, which are the same as Perl’s references. If you have not seen these capabilities before, the learning curve is a little steep. To help show the abilities of references we have used very simple exercises throughout. This should show the basics of the subject without overwhelming you.
  • 281.
  • 282.
    References • Many high-levellanguages like C and C++ have the concept of a pointer, a variable that points to the memory address of another variable. Perl has pointers too, but they are called references. • References do not hold a variable value, but hold the memory address of another variable. For example, if you have a variable called $num1, you could have a reference called $refnum1 which holds the memory address when $num1 has its data stored.
  • 283.
    What is areference • Every variable in Perl has a value assigned in memory. Any value assigned to that variable is contained in that memory. For example, the statement: $num1=10; will have a memory location assigned with the name $num1, and a value of 10 is stored in that memory location. • A reference is another variable, and has an assigned memory location, but holds the address of the $num1 memory location instead of a value.
  • 284.
    Why use references •Why bother using a reference to a memory location with a value in it? There are many reasons when you get into complex coding, but the simplest reason is it allows you to change the variable the reference points to (and hence the value it points to in that variable’s memory location). This is very handy in some programs, as you will see. • References are especially handy when dealing with arrays and lists
  • 285.
    Creating a reference •References are created exactly the same way as other variables, and have no special naming convention. To assign a value to the reference, you use the backslash: $refnum1=$num1; • This will create a reference variable called $refnum1which will hold the memory address of the variable $num1. Creating a reference to a variable doesn’t affect the variable in any way.
  • 286.
    Dereferencing • To usethe value a reference is pointing to, you have to tell the interpreter that you don’t want to know the memory address it holds, but the value inside the memory address it points to. This is done with the dereferencing operator. For example: print $refnum1; will print the memory address $refnum1 holds, but print $$refnum1; will print the value in the memory address of $num1 (if that’s what it points to).
  • 287.
    Using references tochange values • You can use dereferencing to change the value of a variable the reference points to. This is done in the same way as looking up the value: $$refnum1=15; • This command will change the value in the memory location $refnum1 points to and set the value of 15 there. You have to use two $ signs here: if you had written $refnum1=15; you would be setting the value 15 into the memory location of $refnum1, not the variable it points to.
  • 288.
    Exercise • Write aprogram that create a variable with a user-supplied value. Create a reference to that variable. Display the memory address of the reference and the value stored in the dereferenced variable.
  • 289.
    Using reference values •When a reference has been given a value (a memory location of a variable), the reference can be used with other variables and references. For example: $num1=10; $refnum1=$num1; $refnum2=$refnum1; print $$refnum2; will have $refnum1 point to $num1. $refnum2 then is set to the same value, so the last line shows the dereferenced value of $refnum2, or 10.
  • 290.
    References to references •You can set up a reference to a reference, although you won’t need this type of ability until you get into complex coding: $num1=10; $refnum1=$num1; $refnum2=$refnum1; the last line sets $refnum2 to the value of $refnum1 (the memory location of $num1), and not to $num1 directly. To dereference $refnum2 here and see the value of $num1, use: print $$$refnum2;
  • 291.
    Exercise • Write aprogram that sets up five scalars filled with numbers supplied by the user. Then, set up a reference variable that points to the first scalar. Display the dereferenced values of that variable, as well as the memory location it uses. Change the reference to each of the other four variables in turn and repeat the display process.
  • 292.
  • 293.
    References to arrays •You can set up a reference to an array in the same way as a reference to a scalar. Since the reference holds a memory address, it is a scalar itself and defined with a $: @array1=(“1”, “2”, “3”); $refarray1=@array1; • The variable $refarray1 will have the memory address of the first element of the array @array1. It does not point to the entire array, just the start of the memory for @array1.
  • 294.
    Dereferencing array references •To dereference array references, you can reference any element in the array pointed to with the usual element subscript: $$refarray1[2]; This shows the value of the third element in whatever $refarray points to. • If you want to see the whole array, use: @$refarray1; • You can see a range of elements, too: @$refarray1[0-3]; shows the first four element in the array.
  • 295.
    Exercise • Write aprogram that prompts the user for five strings, and save them as elements in an array. Then, set a reference to that array. Use a loop to show each of the elements in that array using the reference, one element at a time.
  • 296.
  • 297.
    References to hashes •References to hashes are set up the same way as arrays: $refhash1=%hash1; • You access single elements in the hash through the hash key, and get the value associated with that key back: $$refhash1{key}; • To see the whole hash the reference points to, use: %$refhash1;
  • 298.
    Exercise • Create ahash and a reference to that hash. You can either prompt the user for hash keys and values, or simply hardcode them to save time. Use a loop to display all the values associated with each hash key in the hash. You may have to refer back to Module 5 for the hash functions.
  • 299.
  • 300.
    Passing references tosubroutines • One of the strengths of references is the ability to pass references to arrays and hashes to subroutines. This allows more than one array or hash to be passed properly to a subroutine. Since the code: sub twoarrays { (@array1, @array2)=@_;…} does not work, as both arrays are joined into one array @_, references provide a way to pass scalars which reference more than one array.
  • 301.
    Passing arrays • Topass two arrays to a subroutine, you could do this: @array1=(…); @array2=(…); $refarray1=@array1; $refarray2=@array2; passarray($refarray1, $refarray2); sub passarray { statements…} and both arrays can be used inside the passarray subroutine by dereferencing the references.
  • 302.
    Exercise • Create twoarrays, one holding vowels and the other holding consonants. Pass both arrays into a subroutine using references. Inside the subroutine, display all the elements of each array.