Cs8493 Notes
Cs8493 Notes
net
OS is a program that acts as an intermediary between a user of a computer and the computer
hardware
t
Make the computer system convenient to use
ne
Use the computer hardware in an efficient manner
.
o Hardware – provides basic computing resources
pz
CPU, memory, I/O devices
o Operating system
ee Controls and coordinates use of hardware among various applications and
users
o Application programs – define the ways in which the system resources are used to
solve the computing problems of the users
ad
Word processors, compilers, web browsers, database systems, video
games
o Users
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
Depends on the point of view
ad
Users want convenience, ease of use and good performance
But shared computer such as mainframe or minicomputer must keep all users happy
.p
Users of dedicate systems such as workstations have dedicated resources but frequently
use shared resources from servers
Handheld computers are resource poor, optimized for usability and battery life
w
Some computers have little or no user interface, such as embedded computers in devices
and automobiles
w
OS is a resource allocator
o Decides between conflicting requests for efficient and fair resource use
OS is a control program
―The one program running at all times on the computer‖ is the kernel.
t
ne
o an application program.
COMPUTER STARTUP
.
o Typically stored in ROM or EPROM, generally known as firmware
pz
o Initializes all aspects of system
COMPUTER-SYSTEM OPERATION
ee
One or more CPUs, device controllers connect through common bus providing access to
shared memory
ad
.p
w
w
w
Device controller informs CPU that it has finished its operation by causing an
interrupt
Interrupt transfers control to the interrupt service routine generally, through the
interrupt vector, which contains the addresses of all the service routines
t
A trap or exception is a software-generated interrupt caused either by an error or a
ne
user request
INTERRUPT HANDLING
.
pz
The OS preserves the state of the CPU by storing registers and the program counter
o polling
ee The interrupt controller polls (send a signal out to) each device to
determine which one made the request
ad
Separate segments of code determine what action should be taken for each type of
interrupt
INTERRUPT TIMELINE
.p
w
w
w
www.padeepz.net
I/O STRUCTURE www.padeepz.net
t
o At most one I/O request is outstanding at a time
ne
no simultaneous I/O processing
o After I/O starts, control returns to user program without waiting for I/O
.
completion
pz
o Harder to program, more efficient
o System call – request to the OS to allow user to wait for I/O completion (polling
periodically to check busy/done)
ee
o Device-status table contains entry for each I/O device indicating its type,
address, and state
STORAGE HIERARCHY
ad
The basic unit of computer storage is the bit. A bit can contain one of two values, 0 and 1. All
other storage in a computer is based on collections of bits. Given enough bits, it is amazing how
many things a computer can represent: numbers, letters, images, movies, sounds, documents, and
programs, to name a few. A byte is 8 bits, and on most computers it is the smallest convenient
chunk of storage. For example, most computers don’t have an instruction to move a bit but do
.p
have one to move a byte. A less common term is word, which is a given computer architecture’s
native unit of data. A word is made up of one or more bytes. For example, a computer that has
64-bit registers and 64-bit memory addressing typically has 64-bit (8-byte) words. A computer
w
executes many operations in its native word size rather than a byte at a time.
Computer storage, along with most computer throughput, is generally measured and manipulated
in bytes and collections of bytes.
w
www.padeepz.net
Computer manufacturers often round off these numbers and say that a megabytewww.padeepz.net
is 1 million
bytes and a gigabyte is 1 billion bytes. Networking measurements are an exception to this
general rule; they are given in bits (because networks move data a bit at a time).
STORAGE STRUCTURE
Main memory – only large storage media that the CPU can access directly
o Random access
t
o Typically volatile
ne
Secondary storage – extension of main memory that provides large nonvolatile storage
capacity
Hard disks – rigid metal or glass platters covered with magnetic recording material
.
o Disk surface is logically divided into tracks, which are subdivided into sectors
pz
o The disk controller determines the logical interaction between the device and the
computer
o Volatility
.p
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
ad
.p
CACHING
w
Important principle
o in hardware,
w
o operating system,
o software
o Efficiency
t
DIRECT MEMORY ACCESS STRUCTURE
ne
Typically used for I/O devices that generate data in blocks, or generate data fast
Device controller transfers blocks of data from buffer storage directly to main
memory without CPU intervention
.
Only one interrupt is generated per block, rather than the one interrupt per byte
pz
HOW A MODERN COMPUTER SYSTEM WORKS
ee
ad
.p
w
w
w
TYPES OF SYSTEMS
o Advantages include:
1. Increased throughput
2. Economy of scale
t
o Two types:
ne
Asymmetric Multiprocessing – each processor is assigned a specific task
.
pz
ee
ad
.p
Multicore
w
www.padeepz.net
www.padeepz.net
t
ne
CLUSTERED SYSTEMS
.
pz
ee
ad
Symmetric clustering has multiple nodes running applications, monitoring each other
www.padeepz.net
www.padeepz.net
t
. ne
pz
MEMORY LAYOUT FOR MULTIPROGRAMMED SYSTEMS
ee
ad
.p
w
w
w
www.padeepz.net
o Software interrupt (exception or trap): www.padeepz.net
t
. ne
pz
Dual-mode operation allows OS to protect itself and other system components
Provides ability to distinguish when system is running user code or kernel code
ad
Some instructions designated as privileged, only executable in kernel mode
System call changes mode to kernel, return from call resets it to user
regain control, or
w
PROCESS MANAGEMENT
o Initialization data
Typically system has many processes, some user, some operating system running
concurrently on one or more CPUs
ACTIVITIES
t
ne
Creating and deleting both user and system processes
.
Providing mechanisms for process communication
pz
Providing mechanisms for deadlock handling
MEMORY MANAGEMENT
ee
To execute a program all (or part) of the instructions must be in memory
All (or part) of the data that is needed by the program must be in memory.
o Keeping track of which parts of memory are currently being used and by whom
.p
o Deciding which processes (or parts thereof) and data to move into and out of
memory
STORAGE MANAGEMENT
File-System management
www.padeepz.net
o Files usually organized into directories www.padeepz.net
o OS activities include
t
Mapping files onto secondary storage
ne
Backup files onto stable (non-volatile) storage media
.
pz
data that does not fit in main memory, or
ee
Entire speed of computer operation hinges on disk subsystem and its algorithms
OS activities
ad
Free-space management
Storage allocation
Disk scheduling
.p
Multitasking environments must be careful to use most recent value, no matter where it is
stored in the storage hierarchy
w
Multiprocessor environment must provide cache coherency in hardware such that all
CPUs have the most recent value in their cache
w
I/O SUBSYSTEM
t
ne
o Huge range, including denial-of-service, worms, viruses, identity theft, theft of
service
Systems generally first distinguish among users, to determine who can do what
.
pz
COMPUTING ENVIRONMENTS
TRADITIONAL
ee
But blurred as most systems interconnect with others (i.e., the Internet)
Networking becoming ubiquitous – even home systems use firewalls to protect home
computers from Internet attacks
.p
MOBILE
Distributed computing
t
Network Operating System provides features between systems across network
ne
Communication scheme allows systems to exchange messages
.
Client-Server Computing
pz
Dumb terminals supplanted by smart PCs
Broadcast request for service and respond to requests for service via
discovery protocol
o Examples include Napster and Gnutella, Voice over IP (VoIP) such as Skype
www.padeepz.net
www.padeepz.net
t
ne
Virtualization
.
VMM - virtual machine manager
pz
Creates and runs virtual machines
ee
Applications run within these guest OSes
Example: Parallels for OS X running Win and/or Linux and their apps
Example. VMware ESX: installed on hardware, runs when hardware boots, provides
services to apps, runs guest OSes
.p
Use cases
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
Operating systems made available in source-code format rather than just binary
closed-source
Counter to the copy protection and Digital Rights Management (DRM) movement
ad
Started by Free Software Foundation (FSF), which has ―copyleft‖ GNU Public
License (GPL)
Examples include GNU/Linux and BSD UNIX (including core of Mac OS X), and
.p
many more
Can use VMM like VMware Player (Free on Windows), Virtualbox (open source and
free on many platforms - http://www.virtualbox.com)
w
System Calls
System Programs
System Boot
t
(helpful functions) to programs and users
ne
User services:
o User interface
.
o Program execution - Loading a program into memory and running it, end
pz
execution, either normally or abnormally (indicating error)
o I/O operations - A running program may require I/O, which may involve a file
or an I/O device
ee
o User services (Cont.):
May occur in the CPU and memory hardware, in I/O devices, in user
w
program
For each type of error, OS should take the appropriate action to ensure
w
System services:
o For ensuring the efficient operation of the system itself via resource sharing
www.padeepz.net
www.padeepz.net
Many types of resources - CPU cycles, main memory, file storage, I/O
devices.
o Accounting - To keep track of which users use how much and what kinds of
computer resources
t
ne
Protection involves ensuring that all access to system resources is
controlled
.
pz
ee
ad
.p
t
. ne
pz
ee
ad
Typically, a number associated with each system call
.p
The system call interface invokes the intended system call in OS kernel and returns status
of the system call and any return values
w
The caller need know nothing about how the system call is implemented
o Just needs to obey API and understand what OS will do as a result call
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
Three general methods used to pass parameters to the OS in system calls
ee
o Simplest: in registers
o Parameters placed, or pushed, onto the stack by the program and popped off the
.p
o Block and stack methods do not limit the number or length of parameters being
passed
w
w
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
TYPES OF SYSTEM CALLS
Process control
ee
o create process, terminate process
o end, abort
o load, execute
ad
o get process attributes, set process attributes
File management
Device management
www.padeepz.net
o request device, release device www.padeepz.net
Information maintenance
t
o get time or date, set time or date
ne
o get system data, set system data
Communications
.
o create, delete communication connection
pz
o send, receive messages if message passing model to host name or process name
www.padeepz.net
www.padeepz.net
t
ne.
pz
ee
ad
.p
w
w
w
www.padeepz.net
SYSTEM PROGRAMS www.padeepz.net
Most users’ view of the operation system is defined by system programs, not the actual
system calls
t
o File manipulation
ne
rm, ls, cp, mv, etc in Unix
.
pz
o Program loading and execution
o Communications
o Background services
ee
o Application programs
o File management - Create, delete, copy, rename, print, dump, list, and generally
.p
Status information
o Some ask the system for info - date, time, amount of available memory, disk
w
o Typically, these programs format and print the output to the terminal or other
output devices
w
File modification
www.padeepz.net
www.padeepz.net
o Special commands to search contents of files or perform transformations of the
text
Program loading and execution- Absolute loaders, relocatable loaders, linkage editors,
and overlay-loaders, debugging systems for higher-level and machine language
t
Communications - Provide the mechanism for creating virtual connections among
ne
processes, users, and computer systems
o Allow users to send messages to one another’s screens, browse web pages, send
electronic-mail messages, log in remotely, transfer files from one machine to
another
.
Background Services
pz
o Launch at boot time
o Provide facilities like disk checking, process scheduling, error logging, printing
Application programs
o Run by users
Design and Implementation of OS not ―solvable‖, but some approaches have proven
successful
w
w
o User goals – operating system should be convenient to use, easy to learn, reliable,
safe, and fast
www.padeepz.net
www.padeepz.net
o System goals – operating system should be easy to design, implement, and
maintain, as well as flexible, reliable, error-free, and efficient
t
The separation of policy from mechanism is a very important principle, it allows
ne
maximum flexibility if policy decisions are to be changed later (example – timer)
Much variation
.
o Early OSes in assembly language
pz
o Then system programming languages like Algol, PL/1
o Now C, C++
ee
Actually usually a mix of languages
o Main body in C
ad
o Systems programs in C, C++, scripting languages like PERL, Python, shell scripts
o But slower
.p
o Each modules is responsible for one (or several) aspect of the desired
functionality
o Advantages:
o Disadvantages:
t
OPERATING SYSTEM STRUCTURES
ne
In general, various ways are used to structure OSes
.
o Combine multiple approaches to address performance, security, usability needs
pz
Simple structure – MS-DOS
ee
Layered OSes
Microkernel OSes
SIMPLE STRUCTURE
ad
MS-DOS was created to provide the most functionality in the least space
o But its interfaces and levels of functionality are not well separated
www.padeepz.net
www.padeepz.net
t
. ne
pz
Traditional UNIX has limited structuring
ee
UNIX consists of 2 separable parts:
o Systems programs
o Kernel
ad
UNIX Kernel
o Kernel provides
w
Rather monolithic
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
LAYERED APPROACH
ee
One way to make OS modular – layered approach
.p
Layers are selected such that each uses functions (operations) and services of only lower-
level layers
Advantages:
w
Disadvantages:
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
OPERATING SYSTEM DEBUGGING
ee
Debugging is finding and fixing errors, or bugs
Failure of an application can generate core dump file capturing memory of the process
ad
Operating system failure can generate crash dump file containing kernel memory
Kernighan’s Law: ―Debugging is twice as hard as writing the code in the first place.
w
Therefore, if you write the code as cleverly as possible, you are, by definition, not smart
enough to debug it.‖
w
PERFORMANCE TUNNING
OS SYSGEN
www.padeepz.net
Тhe system must be configured for each specific computer site www.padeepz.net
o Etc
t
Used to build system-specific compiled kernel or system-tuned
ne
Can general more efficient code than one general kernel
SYSTEM BOOT
How OS is loaded?
.
pz
When power is initialized on system, execution starts at a predefined memory location
Bootstrap loader
ee
o small piece of code
www.padeepz.net
www.padeepz.net
UNIT II
t
o Time-shared systems – ―user programs” or ―tasks”
ne
We will use the terms job and process almost interchangeably
.
pz
o Program becomes process when executable file loaded into memory
Execution of program started via GUI, command line entry of its name, etc
ee
o Consider multiple users executing the same program
program counter
processor registers
.p
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
DIAGRAM OF PROCESS STATE
ad
.p
w
w
www.padeepz.net
o terminated: The process has finished execution www.padeepz.net
t
Process state – running, waiting, etc
ne
Program counter – location of instruction to next execute
.
pz
Memory-management information – memory allocated to the process
Accounting information – CPU used, clock time elapsed since start, time limits
I/O status information – I/O devices allocated to process, list of open files
ee
ad
.p
w
w
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
THREADS
ee
So far, process has a single thread of execution
t
. ne
pz
PROCESS SCHEDULING
Goal of multiprogramming:
ee
Goal of time sharing:
o Ready queue – set of all processes residing in main memory, ready and waiting
w
to execute
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
ad
REPRESENTATION OF PROCESS SCHEDULING
.p
w
w
w
Queuing diagram
www.padeepz.net
o a common representation of process scheduling www.padeepz.net
SCHEDULERS
Scheduler – component that decides how processes are selected from these queues for
scheduling purposes
t
ne
On this slide - ―LTS‖ (LTS is not a common notation)
In a batch system, more processes are submitted then can be executed in memory
.
LTS selects which processes should be brought into the ready queue
pz
LTS is invoked infrequently
ee
LTS controls the degree of multiprogramming
Spends more time doing I/O than computations, many short CPU bursts
CPU-bound process
.p
Spends more time doing computations; few very long CPU bursts
www.padeepz.net
Hence cannot use costly selection logic www.padeepz.net
t
Key idea:
ne
o Reduce the degree of multiprogramming by swapping
o Swapping removes a process from memory, stores on disk, brings back in from
disk to continue execution
.
pz
ee
ad
CONTEXT SWITCH
.p
Context switch
w
OPERATIONS ON PROCESSES
t
process creation,
ne
process termination,
PROCESS CREATION
.
pz
Children can, in turn, create other processes
ee
Resource sharing options (of process creation)
Execution options
.p
UNIX examples
w
www.padeepz.net
o exec() system call used after a fork() to replace the process’www.padeepz.net
memory space with a
new program
t
ne
PROCESS TERMINATION
Process executes last statement and then asks the operating system to delete it using the
exit() system call.
.
pz
o Returns status data from child to parent (via wait())
Parent may terminate the execution of children processes using the abort() system call.
ee
Some reasons for doing so:
o Some OSes don’t allow child to exists if its parent has terminated
The parent process may wait for termination of a child process by using the wait()system
call.
w
o The call returns status information and the pid of the terminated process
o pid = wait(&status);
w
o Cooperating process can affect (or be affected) by such results of another process
t
ne
Advantages of process cooperation
o Information sharing
o Computation speed-up
.
o Modularity
pz
o Convenience
INTERPROCESS COMMUNICATION
o Shared memory
ad
o Message passing
.p
w
w
w
www.padeepz.net
The communication is under the control of the users processes, www.padeepz.net
not the OS.
Major issue is to provide mechanism that will allow the user processes to synchronize
their actions when they access shared memory.
t
o We will use it to exemplify IPC
ne
o Very important!
Producer process
.
pz
o incrementally
Consumer process
o as it becomes available
Challenge:
Shared data
w
#define BUFFER_SIZE 10
typedef struct {
w
...
} item;
item buffer[BUFFER_SIZE];
int in = 0;
www.padeepz.net
int out = 0; www.padeepz.net
item next_produced;
while (true) {
next_produced = ProduceItem();
t
while (((in + 1) % BUFFER_SIZE) == out)
ne
; /* do nothing, no space in buffer */
.
/* enough space in buffer */
pz
buffer[in] = next_produced; //put item into buffer
in = (in + 1) % BUFFER_SIZE;
}
ee
MESSAGE PASSING
o send(message)
o receive(message)
o Implementation issues:
o How many links can there be between every pair of communicating processes?
o Direct or indirect
o Synchronous or asynchronous
t
o Automatic or explicit buffering
ne
DIRECT COMMUNICATION
.
o receive(Q, message) – receive a message from process Q
pz
o Properties of a direct communication link
Messages are directed and received from mailboxes (also referred to as ports)
Operations
o destroy a mailbox
www.padeepz.net
o Primitives are defined as: www.padeepz.net
SYNCHRONIZATION
t
o Blocking, or
ne
o Non-blocking
.
o Blocking receive -- the receiver is blocked until a message is available
pz
Non-blocking is considered asynchronous
A valid message, or
Null message
ad
Different combinations possible
message next_produced;
w
while (true) {
ProduceItem(&next_produced);
w
send(next_produced);
w
BUFFERING
1. Zero capacity
www.padeepz.net
– no messages are queued on a link. www.padeepz.net
2.Bounded capacity
3.Unbounded capacity
t
ne
– infinite length
- Sender never waits
.
pz
Each process has critical section segment of code
o Process may be changing common variables, updating table, writing file, etc
eesection
Each process
ad
o must ask permission to enter critical section in entry section,
t
Assume that each process executes at a nonzero speed
ne
No assumption concerning relative speed of the n processes
1. Mutual Exclusion
.
otherwise what critical section?
pz
2. Progress
l
ee otherwise very wasteful.
Formal: If no process is executing in its critical section and there exist some
processes that wish to enter their critical section, then the selection of the
processes that will enter the critical section next cannot be postponed indefinitely
ad
3. Bounded Waiting
l Formal: A bound must exist on the number of times that other processes are
allowed to enter their critical sections after a process has made a request to enter
its critical section and before that request is granted
w
w
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
Previous solutions are complicated and generally inaccessible to application
ad
programmers
www.padeepz.net
www.padeepz.net
t
. ne
pz
SEMAPHORE
n
ee
Synchronization tool that provides more sophisticated ways (than Mutex locks) for
process to synchronize their activities.
ad
n Semaphore S – integer variable
wait(S) {
while (S <= 0)
w
S--;
www.padeepz.net
signal(S) { www.padeepz.net
S++;
t
l Initialized to the number of resources available
ne
n Binary semaphore – integer value can range only between 0 and 1
.
n Consider P1 and P2 that require S1 to happen before S2
pz
Create a semaphore ―synch‖ initialized to 0
P1:
S1;
ee
signal(synch);
P2:
ad
wait(synch);
S2;
n Data Structures
www.padeepz.net
o But, applications may spend lots of time in critical sections www.padeepz.net
typedef struct{
t
int value;
ne
struct process *list; // waiting queue
} semaphore;
Two operations:
.
o block – place the process invoking the operation on the appropriate waiting queue
pz
o wakeup – remove one of processes in the waiting queue and place it in the ready
queue
wait(semaphore *S) {
ee
S->value--;
if (S->value < 0) {
add this process to S->list;
ad
block(); //suspends self, sleeps, avoids CPU cycles
}
.p
signal(semaphore *S) {
S->value++;
w
if (S->value <= 0) {
remove a process P from S->list;
w
wakeup(P);
}
w
www.padeepz.net
DEADLOCK www.padeepz.net
Synchronization problems
Deadlock – two or more processes are waiting indefinitely for an event that can be
caused by only one of the waiting processes
t
P0 P1
ne
wait(S); wait(Q);
wait(Q); wait(S);
... ...
.
signal(S); signal(Q);
pz
signal(Q); signal(S);
ad
Processed pushed first into it might not get a chance to execute
Priority Inversion
o Bounded-Buffer Problem
w
o Dining-Philosophers Problem
w
MONITORS
monitor monitor-name
w
function P1 (…) { … }
…
www.padeepz.net
function Pn (…) {……} www.padeepz.net
initialization_code (…) { … }
t
o A monitor type – is an ADT that provides a convenient and effective mechanism
ne
for process synchronization
o Encapsulation
.
Local variables accessed only via local functions
pz
Local functions access only local vars and params
www.padeepz.net
www.padeepz.net
o Monitors not powerful enough to model some synchr. schemes
condition x, y;
t
o x.wait()
ne
A process that invokes the operation is suspended (sleeps)
.
pz
releases the lock to shared data, atomically with sleep
o x.signal()
www.padeepz.net
Issues with monitors: assume www.padeepz.net
t
Because they are within a monitor
ne
Options include
o Signal and wait – P waits until Q either leaves the monitor or it waits for another
condition
.
o Signal and continue – Q waits until P either leaves the monitor or it waits for
pz
another condition
CPU SCHEDULING
ee
o Maximum CPU utilization obtained with multiprogramming
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
ad
SCHEDULING LEVELS
High-Level Scheduling
.p
Selects jobs allowed to compete for CPU and other system resources.
Intermediate-Level Scheduling
w
w
Short-term scheduler
t
Switches from waiting to ready
ne
Terminates
.
All other scheduling is called preemptive
pz
Process can be interrupted and must release the CPU
Dispatcher
o a module that gives control of the CPU to the process selected by the
short-term scheduler; this involves:
ad
switching context
that program
Dispatch latency
o Time it takes for the dispatcher to stop one process and start another
w
running
www.padeepz.net
www.padeepz.net
Turnaround time – amount of time to execute a particular process
Waiting time – amount of time a process has been waiting in the ready queue
o Time from when a request was submitted until the first response is
t
produced
ne
Max CPU utilization
Max throughput
.
Min waiting time
pz
Min response time
P2
24
P3 3
ad
Suppose that the processes arrive in the order: P1 , P2 , P3
The Gantt Chart for the schedule is:
P1 P2 P3
.p
0 24 27 30
FCFS
w
P2 , P3 , P1
w
FCFS is nonpreemptive
P2 P3 P1
t
ne
0 3 6 30
Convoy effect – when several short processes wait for long a process to get off the CPU
Assume
.
o 1 long CPU-bound process
pz
o Many short I/O-bound processes
Execution:
eeo The long one occupies CPU
The short ones wait for it: no I/O is done at this stage
SJF
Associate with each process the length of its next CPU burst
w
o SJF uses these lengths to schedule the process with the shortest time
A new process Pnew arrives while the current one Pcur is still executing
t
This is also called shortest-remaining-time-first scheduling
ne
Advantage:
Challenge of SJF:
.
o Hinges on knowing the length of the next CPU burst
pz
But how can we know it?
o In a short-term scheduling
Use estimation
P1 0.0 6
w
P2 2.0 8
P3 4.0 7
w
P4 5.0 3
P4 P1 P3 P2
0 3 9 16 24
www.padeepz.net
Average waiting time = (3 + 16 + 9 + 0) / 4 = 7 www.padeepz.net
PRIORITY
The CPU is allocated to the process with the highest priority (smallest integer highest
priority)
o Preemptive
t
ne
o Nonpreemptive
o SJF is priority scheduling where priority is the inverse of predicted next CPU
burst time
.
pz
Solution Aging – as time progresses increase the priority of the process
ee
ad
.p
w
w
ROUND ROBIN
w
o Time quantum q
If
t
o the time quantum is q
ne
then
.
o in chunks of ≤ q time units at once
pz
o Each process waits ≤ (n-1)q time units
ee
ad
.p
P1 24
P2 3
w
P3 3
w
P1 P2 P3 P1 P1 P1 P1 P1
0 4 7 10 14 18 22 26 30
www.padeepz.net
DEADLOCK www.padeepz.net
Deadlock – two or more processes are waiting indefinitely for an event that can be
caused by only one of the waiting processes
P0 P1
wait(S); wait(Q);
t
ne
wait(Q); wait(S);
... ...
signal(S); signal(Q);
.
signal(Q); signal(S);
pz
System consists of resources
o Type: CPU
ad
2 instances - CPU1, CPU2
o Type: Printer
o Request resource
w
If cannot be granted immediately, the process must wait until it can get it
o Use resource
o Release resource
Because a process would want to get not just ―any‖ lock among a group of
locks, but a specific lock that guards a specific shared data type
t
Mutual exclusion: only one process at a time can use a resource
ne
Hold and wait: a process holding at least one resource is waiting to acquire additional
resources held by other processes
No preemption: a resource can be released only voluntarily by the process holding it,
after that process has completed its task
.
Circular wait: there exists a set {P0, P1, …, Pn} of waiting processes such that
pz
o P0 is waiting for a resource that is held by P1, and so on:
o P0 P1 P2 … Pn–1 Pn P0
ee
Notice: ―Circular wait‖ implies ―Hold and Wait‖
o Because, computationally, ―Hold and wait‖ can be tested much more efficiently
than ―Circular wait‖
ad
o Some algorithms we consider only need to check H&W
Trivial example
www.padeepz.net
UNIT III www.padeepz.net
MAIN MEMORY
Program must be brought (from disk) into memory and placed within a process for it
to be run
Main memory and registers are only storage CPU can access directly
t
Memory unit only sees a stream of addresses + read requests, or address + data and
write requests
ne
Register access in one CPU clock (or less)
.
Cache sits between main memory and CPU registers
pz
Protection of memory required to ensure correct operation
A pair of base and limit registers define the logical address space
CPU must check every memory access generated in user mode to be sure it is
ee
between base and limit for that user
ad
.p
w
w
w
www.padeepz.net
www.padeepz.net
HARDWARE PROTECTION
t
. ne
pz
ee
ADDRESS BINDING
Programs on disk, ready to be brought into memory to execute form an input queue
i.e. 74014
Address binding of instructions and data to memory addresses can happen at three
different stages
www.padeepz.net
www.padeepz.net
o Load time: Must generate relocatable code if memory location is not known at
compile time
o Execution time: Binding delayed until run time if the process can be moved
during its execution from one memory segment to another
Need hardware support for address maps (e.g., base and limit registers)
MULTISTEP PROCESSING
t
. ne
pz
ee
ad
.p
w
The concept of a logical address space that is bound to a separate physical address space
w
Logical and physical addresses are the same in compile-time and load-time address-
binding schemes; logical (virtual) and physical addresses differ in execution-time
address-binding scheme
Logical address space is the set of all logical addresses generated by a program
www.padeepz.net
www.padeepz.net
Physical address space is the set of all physical addresses generated by a program
MMU
To start, consider simple scheme where the value in the relocation register is added to
t
every address generated by a user process at the time it is sent to memory
ne
o Base register now called relocation register
The user program deals with logical addresses; it never sees the real physical addresses
.
o Execution-time binding occurs when reference is made to location in memory
pz
o Logical address bound to physical addresses
DYNAMIC ALLOCATION
ee
Routine is not loaded until it is called
DYNAMIC LINKING
www.padeepz.net
www.padeepz.net
Static linking – system libraries and program code combined by the loader into the
binary program image
Small piece of code, stub, used to locate the appropriate memory-resident library routine
Stub replaces itself with the address of the routine, and executes the routine
t
Operating system checks if routine is in processes’ memory address
ne
o If not in address space, add to address space
.
Consider applicability to patching system libraries
pz
o Versioning may be needed
SWAPPING
ee
A process can be swapped temporarily out of memory to a backing store, and then
brought back into memory for continued execution
Backing store – fast disk large enough to accommodate copies of all memory images for
ad
all users; must provide direct access to these memory images
Roll out, roll in – swapping variant used for priority-based scheduling algorithms; lower-
priority process is swapped out so higher-priority process can be loaded and executed
.p
Major part of swap time is transfer time; total transfer time is directly proportional to the
amount of memory swapped
System maintains a ready queue of ready-to-run processes which have memory images
w
on disk
Does the swapped out process need to swap back in to same physical addresses?
w
Modified versions of swapping are found on many systems (i.e., UNIX, Linux, and
Windows)
t
. ne
pz
ee
CONTIGUOUS ALLOCATION
o Resident operating system, usually held in low memory with interrupt vector
Relocation registers used to protect user processes from each other, and from changing
w
o Limit register contains range of logical addresses – each logical address must be
less than the limit register
o Can then allow actions such as kernel code being transient and kernel changing
size
www.padeepz.net
www.padeepz.net
t
. ne
pz
FRAGMENTATION
External Fragmentation – total memory space exists to satisfy a request, but it is not
contiguous
ee
Internal Fragmentation – allocated memory may be slightly larger than requested
memory; this size difference is memory internal to a partition, but not being used
First fit analysis reveals that given N blocks allocated, 0.5 N blocks lost to fragmentation
o Shuffle memory contents to place all free memory together in one large block
.p
o I/O problem
w
SEGMENTATION
w
main program
www.padeepz.net
procedure www.padeepz.net
function
method
object
t
common block
ne
stack
symbol table
arrays
.
pz
ee
ad
.p
w
<segment-number, offset>,
w
Segment table – maps two-dimensional physical addresses; each table entry has:
o base – contains the starting physical address where the segments reside in
memory
w
Protection
read/write/execute privileges
t
Protection bits associated with segments; code sharing occurs at segment level
ne
Since segments vary in length, memory allocation is a dynamic storage-allocation
problem
.
pz
ee
ad
.p
w
w
PAGING
To run a program of size N pages, need to find N free frames and load program
t
Still have Internal fragmentation
ne
Address generated by CPU is divided into:
o Page number (p) – used as an index into a page table which contains base
address of each page in physical memory
.
o Page offset (d) – combined with base address to define the physical memory
pz
address that is sent to the memory unit
ee
page number page offset
p d
ad
m -n n
PAGING STRUCTURE
.p
w
w
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
Calculating internal fragmentation
ad
o Page size = 2,048 bytes
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
Page table is kept in main memory
o One for the page table and one for the data / instruction
.p
The two memory access problem can be solved by the use of a special fast-lookup
hardware cache called associative memory or translation look-aside buffers (TLBs)
w
Some TLBs store address-space identifiers (ASIDs) in each TLB entry – uniquely
identifies each process to provide address-space protection for that process
w
On a TLB miss, value is loaded into the TLB for faster access next time
ASSOCIATIVE MEMORY
Page # Frame #
t
. ne
pz
PAGING WITH TLB
ee
ad
.p
w
w
w
Hit ratio =
o Hit ratio – percentage of times that a page number is found in the associative
registers; ratio related to number of associative registers
www.padeepz.net
www.padeepz.net
Consider = 80%, = 20ns for TLB search, 100ns for memory access
EAT = (1 + ) + (2 + )(1 – )
=2+–
Consider = 80%, = 20ns for TLB search, 100ns for memory access
t
o EAT = 0.80 x 100 + 0.20 x 200 = 120ns
ne
Consider more realistic hit ratio -> = 99%, = 20ns for TLB search, 100ns for
memory access
.
MEMORY PROTECTION
pz
Memory protection implemented by associating protection bit with each frame to indicate
if read-only or read-write access is allowed
o ―valid‖ indicates that the associated page is in the process’ logical address space,
and is thus a legal page
ad
o ―invalid‖ indicates that the page is not in the process’ logical address space
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
STRUCTURE OF THE PAGE TABLE
ad
Memory structures for paging can get huge using straight-forward methods
o If each entry is 4 bytes -> 4 MB of physical address space / memory for page
table alone
w
Hierarchical Paging
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
A logical address (on 32-bit machine with 1K page size) is divided into:
ad
o a page number consisting of 22 bits
o Since the page table is paged, the page number is further divided into:
.p
where p1 is an index into the outer page table, and p2 is the displacement within the page
of the inner page table
www.padeepz.net
ADDRESS TRANSLATION SCHEME www.padeepz.net
t
. ne
pz
64 BIT ARCHITECTURE
ee
Even two-level paging scheme not sufficient
o But in the following example the 2nd outer page table is still 234 bytes in size
w
www.padeepz.net
www.padeepz.net
t
ne
INTEL 32 BIT ARCHITECTURE
.
pz
ee
ad
.p
w
w
VIRTUAL MEMORY
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
VIRTUAL ADDRESS SPACE
Usually design logical address space for stack to start at Max logical address and grow
ad
―down‖ while heap grows ―up‖
Enables sparse address spaces with holes left for growth, dynamically linked libraries,
w
etc
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
SHARED LIBRARY USING VIRTUAL MEMORY
ad
.p
w
w
w
DEMAND PAGING
www.padeepz.net
Could bring entire process into memory at load time www.padeepz.net
o Faster response
t
o More users
ne
Similar to paging system with swapping (diagram on right)
.
pz
o not-in-memory bring to memory
Lazy swapper – never swaps a page into memory unless page will be needed
With swapping, pager guesses which pages will be used before swapping out again
o Need to detect and load the page into memory from storage
t
Without programmer needing to change code
ne
VALID OR INVALID BIT
.
Initially valid–invalid bit is set to i on all entries
pz
Example of a page table snapshot:
ee
ad
.p
w
During MMU address translation, if valid–invalid bit in page table entry is i page fault
w
If there is a reference to a page, first reference to that page will trap to operating system:
w
PAGE FAULT
t
page fault
ne
l And for every other process pages on first access
7. Actually, a given instruction could access multiple pages -> multiple page faults
.
pz
l Consider fetch and decode of instruction which adds 2 numbers from memory and
stores result back to memory
INSTRUCTION RESTART
block move
auto increment/decrement location
www.padeepz.net
www.padeepz.net
PERFORMANCE
t
2. Save the user registers and process state
ne
3. Determine that the interrupt was a page fault
4. Check that the page reference was legal and determine the location of the page on the
disk
.
5. Issue a read from the disk to a free frame:
pz
1. Wait in a queue for this device until the read request is serviced
8. Save the registers and process state for the other user
ad
9. Determine that the interrupt was from the disk
10. Correct the page table and other tables to show page is now in memory
.p
12. Restore the user registers, process state, and new page table, and then resume the
interrupted instruction
w
PAGE REPLACEMENT
Use modify (dirty) bit to reduce overhead of page transfers – only modified pages
are written to disk
w
NEED
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
BASIC PAGE REPLACEMENT
3. Bring the desired page into the (newly) free frame; update the page and frame tables
4. Continue the process by restarting the instruction that caused the trap
w
w
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
ALGORITHMS
ad
Frame-allocation algorithm determines
Page-replacement algorithm
w
o Repeated access to the same page does not cause a page fault
7,0,1,2,0,3,0,4,2,3,0,3,0,3,2,1,2,0,1,7,0,1
FIFO
www.padeepz.net
www.padeepz.net
t
ne
OPTIMAL ALGORITHM
.
pz
LRU
ee
ad
.p
o Read page into free frame and select victim to evict and add to free pool
w
w
o When backing store otherwise idle, write pages there and set to non-dirty
Possibly, keep free frame contents intact and note what is in them
o If referenced again before reused, no need to load contents again from disk
o 2 pages to handle to
t
Maximum of course is total frames in the system
ne
Two major allocation schemes
o fixed allocation
o priority allocation
.
pz
Many variations
THRASHING
If a process does not have ―enough‖ pages, the page-fault rate is very high
ee
o Page fault to get page
multiprogramming
w
www.padeepz.net
www.padeepz.net
t
ne.
pz
ee
ad
.p
w
w
w
www.padeepz.net
UNIT IV www.padeepz.net
FILE
Types:
o Data
t
numeric
ne
character
binary
o Program
.
Contents defined by file’s creator
pz
o Many types
ee
FILE ATTRIBUTES
Time, date, and user identification – data for protection, security, and usage
w
monitoring
Information about files are kept in the directory structure, which is maintained on the
disk
w
w
OPERATIONS
Create
Delete
Truncate
Open(Fi) – search the directory structure on disk for entry Fi, and move the content
t
of entry to memory
ne
Close (Fi) – move the content of entry Fi in memory to directory structure on disk
.
o File pointer: pointer to last read/write location, per process that has the file
pz
open
o Shared lock similar to reader lock – several processes can acquire concurrently
.p
Mandatory or advisory:
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
ad
STRUCTURE
o Lines
o Fixed length
w
o Variable length
Complex Structures
w
o Formatted document
Can simulate last two with first method by inserting appropriate control characters
Who decides:
o Operating system
o Program
t
ne
ACCESS METHODS
Sequential Access
read next
.
pz
write next
reset
read n
ad
write n
position to n
read next
.p
write next
rewrite n
w
www.padeepz.net
Can be built on top of base methods www.padeepz.net
t
IBM indexed sequential-access method (ISAM)
ne
o Small master index, points to disk blocks of secondary index
.
VMS operating system provides index and relative files as another example
pz
ee
ad
.p
w
w
DIRECTORY STRUCTURE
w
DISK STRUCTURE
Disk or partition can be used raw – without a file system, or formatted with a file system
Each volume containing file system also tracks that file system’s info in device directory
t
or volume table of contents
ne
As well as general-purpose file systems there are many special-purpose file systems,
frequently all within the same operating system or computer
.
pz
ee
ad
.p
TYPES
But systems frequently have may file systems, some general- and some special- purpose
o objfs – interface into kernel memory to get kernel symbols for debugging
w
Create a file
Delete a file
List a directory
t
Rename a file
ne
Traverse the file system
DIRECTORY ORGANIZATION
.
pz
Naming – convenient to users
ee
Grouping – logical grouping of files by properties, (e.g., all Java programs, all games, …)
Naming problem
Grouping problem
w
Path name
w
Efficient searching
No grouping capability
www.padeepz.net
www.padeepz.net
t
ne
TREE STRUCTURED DIRECTORIES
.
pz
ee
ad
.p
Efficient searching
w
Grouping Capability
w
o cd /spell/mail/prog
w
o type list
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
Two different names (aliasing)
o Solutions:
ad
o Backpointers, so we can delete all pointers
Variable size records a problem
o Entry-hold-count solution
www.padeepz.net
www.padeepz.net
t
. ne
pz
FILE SYSTEM MOUNTING
ee
ad
.p
w
w
MOUNT POINT
www.padeepz.net
www.padeepz.net
t
. ne
pz
FILE SHARING
ee
Sharing of files on multi-user systems is desirable
If multi-user system
Client-server model allows clients to mount remote file systems from servers
o Standard operating system file calls are translated into remote calls
t
o For example corruption of directory structures or other non-user data, called
ne
metadata
Remote file systems add new failure modes, due to network failure, server failure
Recovery from failure can involve state information about status of each remote request
.
Stateless protocols such as NFS v3 include all information in each request, allowing easy
pz
recovery but less security
PROTECTION
o by whom
Types of access
ad
o Read
o Write
.p
o Execute
o Append
o Delete
w
o List
w
RWX
RWX
www.padeepz.net
c) public access 1 001 www.padeepz.net
Ask manager to create a group (unique name), say G, and add some users to the group.
t
ne
FILE SYSTEM STRUCTURE
.
File structure
pz
o Logical storage unit
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
FILE SYSTEM LAYERS
ad
Device drivers manage I/O devices at the I/O control layer
o Given commands like ―read drive1, cylinder 72, track 2, sector 10, into memory
location 1060‖ outputs low-level hardware specific commands to hardware
.p
controller
Basic file system given command like ―retrieve block 123‖ translates to device driver
File organization module understands files, logical address, and physical blocks
w
o Translates file name into file number, file handle, location by maintaining file
control blocks (inodes in UNIX)
www.padeepz.net
o Directory management www.padeepz.net
o Protection
Layering useful for reducing complexity and redundancy, but adds overhead and can
decrease performanceTranslates file name into file number, file handle, location by
maintaining file control blocks (inodes in UNIX)
t
designer
ne
FILE SYSTEM IMPLEMENTATION
We have system calls at the API level, but how do we implement their functions?
.
Boot control block contains info needed by system to boot OS from that volume
pz
o Needed if volume contains OS, usually first block of volume
Volume control block (superblock, master file table) contains volume details
ee
o Total # of blocks, # of free blocks, block size, free block pointers or array
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
VIRTUAL FILE SYSTEMS
ad
Virtual File Systems (VFS) on Unix provide an object-oriented way of implementing
file systems
VFS allows the same system call interface (the API) to be used for different types of file
systems
.p
o Implementation can be one of many file systems types, or network file system
w
www.padeepz.net
VFS IMPLEMENTATION www.padeepz.net
t
Function table has addresses of routines to implement that function on that
ne
object
For example:
.
• int close(. . .)—Close an already-open file
pz
• ssize t read(. . .)—Read from a file
o Simple to program
ad
o Time-consuming to execute
o Collisions – situations where two file names hash to the same location
• An allocation method refers to how disk blocks are allocated for files:
o Simple – only starting location (block #) and length (number of blocks) are
required
www.padeepz.net
www.padeepz.net
o Problems include finding space for file, knowing file size, external fragmentation,
need for compaction off-line (downtime) or on-line
t
. ne
pz
•
ee
Linked allocation – each file a linked list of blocks
o No external fragmentation
ad
o Each block contains pointer to next block
www.padeepz.net
www.padeepz.net
t
ne.
pz
ee
FILE ALLOCATION TABLE
ad
.p
w
w
w
www.padeepz.net
• Indexed allocation www.padeepz.net
• Each file has its own index block(s) of pointers to its data blocks
• Logical view
t
. ne
pz
• index table
ee
ad
.p
w
w
w
• Random access
• Dynamic access without external fragmentation, but have overhead of index block
www.padeepz.net
• www.padeepz.net
Mapping from logical to physical in a file of maximum size of 256K bytes and block
size of 512 bytes. We need only 1 block for index table
t
. ne
pz
ee
Block number calculation
(number of bits per word) *(number of 0-value words) +offset of first 1 bit
• No waste of space
w
www.padeepz.net
www.padeepz.net
t
. ne
pz
• Grouping
o Modify linked list to store address of next n-1 free blocks in first free block, plus
•
ee
a pointer to next block that contains free-block-pointers (like this one)
Counting
Free space list then has entries containing addresses and counts
• Space Maps
.p
o Used in ZFS
Full data structures like bit maps couldn’t fit in memory -> thousands of
I/Os
w
www.padeepz.net
www.padeepz.net
o Metaslab activity -> load space map into memory in balanced-tree structure,
indexed by offset
t
ne
o Types of data kept in file’s directory entry
.
PAGE CACHE
pz
• A page cache caches pages rather than disk blocks using virtual memory techniques
and addresses
•
ee
Routine I/O through the file system uses the buffer (disk) cache
www.padeepz.net
www.padeepz.net
IO SYSTEMS
o Storage
o Transmission
t
o Human-interface
ne
Common concepts – signals from I/O devices interface with computer
.
pz
PCI bus common in PCs and servers, PCI Express (PCIe)
Sometimes integrated
www.padeepz.net
www.padeepz.net
t
. ne
pz
ee
I/O instructions control devices
ad
Devices usually have registers where device driver places commands, addresses, and data
to write, or read data from registers after command execution
o Memory-mapped I/O
www.padeepz.net
www.padeepz.net
t
ne.
pz
ee
ad
.p
w
w
w
www.padeepz.net
www.padeepz.net
UNIT V
Linodes run Linux. Linux is an operating system, just like Windows and Mac OS X. As an
operating system, Linux manages your Linode’s hardware and provides services your other
software needs to run.
t
Linux is a very hands-on operating system. If running Windows is like driving an automatic,
ne
then running Linux is like driving a stick. It can take some work, but once you know your way
around Linux, you’ll be using the command line and installing packages like a pro. This article
aims to ease you into the world of Linux.
.
explanations for basic concepts. There are a few how-to sections as well, which are intended to
get you on your feet with your Linode. At times we’ll link off to a different guide that has more
pz
details on a particular topic.
Everything on a Linux system is case-sensitive. That means that photo.jpg, photo.JPG, and
Photo.jpg are all different files. Usernames and passwords are also case-sensitive.
ee
Linux, like Mac OS X, is based on the Unix operating system. A research team at AT&T’s Bell
Labs developed Unix in the late 1960s and early 1970s with a focus on creating an operating
system that would be accessible and secure for multiple users.
Corporations started licensing Unix in the 1980s and 1990s. By the late 1980s, there was interest
ad
in building a free operating system that would be similar to Unix, but that could be tinkered with
and redistributed. In 1991, Linus Torvalds released the Linux kernel as free, open-source
software. Open source means that the code is fully visible, and can be modified and redistributed.
Strictly speaking, Linux is the kernel, not the entire operating system. The kernel provides an
interface between your Linode’s hardware and the input/output requests from applications. The
.p
rest of the operating system usually includes many GNU libraries, utilities, and other software,
from the Free Software Foundation. The operating system as a whole is known as GNU/Linux.
w
Your Linode is a type of server. What’s a server? A server is a type of computer that provides
services over a network, or connected group of computers. When people think about servers,
w
Since a server is a type of computer, there are a lot of similarities between a Linode and your
home computer. Some important similarities include:
The physical machine: Your Linode is hosted on a physical machine. It’s sitting in one of
our data centers.
www.padeepz.net
The operating system: As we mentioned in the introduction, Linodes www.padeepz.net
use the Linux
operating system. It’s just another type of operating system like Windows or Mac OS X.
Applications: Just like you can install applications on your home computer or
smartphone, you can install applications on your Linode. These applications help your
Linode do things like host a website. WordPress is a popular website application, for
example. Applications are also known as software and programs.
Files and directories: In the end, whether it’s an application or a photo, everything on
your Linode is a file. You can create new files, edit and delete old ones, and navigate
through directories just like you would on your home computer. In Linux, folders are
t
called directories.
Internet access: Your Linode is connected to the Internet. That’s how you connect to it to
ne
get everything set up, and how your users connect to it to view your website or download
your app.
.
SYSTEM ADMINISTRATION
pz
Linux System Administration Basics
This presents a collection of common issues and useful tips for Linux system administration.
Whether you’re new to system administration or have been maintaining systems for some time,
ee
we hope these tips are helpful regardless of your background or choice in Linux distributions.
Basic Configuration
These tips cover some of the basic steps and issues encountered during the beginning of system
ad
configuration. We provide a general getting started guide for your convenience if you’re new to
Linode and basic Linux system administration. Additionally, you may find some of our
Introduction to Linux Concepts guide useful.
Please follow our instructions for setting your hostname. Issue the following commands to make
sure it is set properly:
w
1 hostname
2 hostname -f
The first command should show your short hostname, and the second should show your fully
w
When setting the timezone of your server, it may be best to set it to the timezone of the bulk of
your users. If you’re unsure which timezone would be best, consider using universal coordinated
time or UTC (i.e. Greenwich Mean Time).
By default, Linode base installs are set to Eastern Standard Time. The following process will set
the timezone manually, though many operating systems provide a more elegant method for
changing timezones. To change the time zone manually, you must find the proper zone file in
/usr/share/zoneinfo/ and link that file to /etc/localtime. See the example below for
www.padeepz.net
www.padeepz.net
common possibilities. Please note that all contents following the double hashes (eg. ##) are
comments and should not be copied into your terminal.
t
7
ln -sf /usr/share/zoneinfo/US/Eastern /etc/localtime ## for American Eastern
ne
(including DST)
To change the time zone in Debian and Ubuntu systems, issue the following command and
answer the questions as prompted by the utility:
.
1 dpkg-reconfigure tzdata
pz
In Arch Linux, set the timezone in the /etc/rc.conf file by configuring the TIMEZONE= setting
in the ―Localization‖ section. This line will resemble the following:
/etc/rc.conf
ee
TIMEZONE=‖America/New_York‖
Note that the string specified in TIMEZONE refers to the ―zoneinfo‖ file located in or below the
/usr/share/zoneinfo/ directory.
ad
Use the /etc/hosts File
The /etc/hosts file provides a list of IP addresses with corresponding hostnames. This allows
you to specify hostnames for an IP address once on the local machine, and then have multiple
applications connect to external resources via their hostnames. The system of host files predates
.p
DNS, and hosts files are always checked before DNS is queried. As a result, /etc/hosts can be
useful for maintaining small ―internal‖ networks, for development purposes, and for managing
clusters.
w
Some applications require that the machine properly identify itself in the /etc/hosts file. As a
result, we recommend configuring the /etc/hosts file shortly after deployment. Here is an
example file:
w
/etc/hosts
You can specify a number of hostnames on each line separated by spaces. Every line must begin
with one and only one IP address. In this case, replace 12.34.56.78 with your machine’s IP
address. Let us consider a few additional /etc/hosts entries:
/etc/hosts
The second entry tells the system to look to 192.168.1.1 for the domain stick.example.com.
These kinds of host entries are useful for using ―private‖ or ―back channel‖ networks to access
other servers in a cluster without needing to access the public network.
Network Diagnostics
t
The following tips address the basic usage and functionality of a number of tools that you can
ne
use to assess and diagnose network problems. If you suspect connectivity issues, including
output of the relevant commands in your support ticket can help our staff diagnose your issue.
This is particularly helpful in cases where networking issues are intermittent.
.
The ping command tests the connection between the local machine and a remote address or
pz
machine. The following command ―pings‖ google.com and 74.125.67.100:
These commands send a bit of data (i.e. an ICMP packet) to the remote host, and wait for a
ee
response. If the system is able to make a connection, for every packet it will report on the ―round
trip time.‖ Here is the output of four pings of google.com:
In this case yx-in-f100.1e100.net is the reverse DNS for this IP address. The time field
specifies in milliseconds that the round trip takes for an individual packet. When you’ve gathered
the amount of information you need, send Control+C to interrupt the process. At this juncture,
you’ll be presented with some statistics. This will resemble:
w
Packet Loss, or the discrepancy between the number of packets sent and the number of packets
that return successfully.
Round Trip Time statistics on the final line report important information about all the ping
responses. For this ping we see that the fastest packet round trip took 33.89 milliseconds. The
longest packet took 53.28 milliseconds. The average round trip took 40.175 milliseconds. A
single standard deviation unit for these four packets is 7.67 milliseconds.
www.padeepz.net
Use the ping tool to contact a server and ensure that you are able to make awww.padeepz.net
connection.
Furthermore, ping is useful as an informal diagnostic tool to measure point-to-point network
latency, and as a network connection testing tool.
The traceroute command expands on the functionality of the ping command. traceroute
provides a report on the path that the packets take to get from the local machine to the remote
machine. Route information is useful when troubleshooting a networking issue: if there is packet
t
loss in one of the first few ―hops‖ the problem is often related to the user’s local area network
ne
(LAN) or Internet service provider (ISP). By contrast, if there is packet loss near the end of the
route, the problem may be caused by an issue with the server’s connection.
.
pz
1 207.192.75.2 (207.192.75.2) 0.414 ms 0.428 ms 0.509 ms 2 vlan804.tbr2.mmu.nac.net
(209.123.10.13) 0.287 ms 0.324 ms 0.397 ms 3 0.e1-1.tbr2.tl9.nac.net (209.123.10.78) 1.331 ms
1.402 ms 1.477 ms 4 core1-0-2-0.lga.net.google.com (198.32.160.130) 1.514 ms 1.497 ms 1.519
ms 5 209.85.255.68 (209.85.255.68) 1.702 ms 72.14.238.232 (72.14.238.232) 1.731 ms 21.031
ms 6 209.85.251.233 (209.85.251.233) 26.111 ms 216.239.46.14 (216.239.46.14) 23.582 ms
ee
23.468 ms 7 216.239.43.80 (216.239.43.80) 123.668 ms 209.85.249.19 (209.85.249.19) 47.228
ms 47.250 ms 8 209.85.241.211 (209.85.241.211) 76.733 ms 216.239.43.80 (216.239.43.80)
73.582 ms 73.570 ms 9 209.85.250.144 (209.85.250.144) 86.025 ms 86.151 ms 86.136 ms 10
64.233.174.131 (64.233.174.131) 80.877 ms 216.239.48.34 (216.239.48.34) 76.212 ms
64.233.174.131 (64.233.174.131) 80.884 ms 11 216.239.48.32 (216.239.48.32) 81.267 ms
81.198 ms 81.186 ms 12 216.239.48.137 (216.239.48.137) 77.478 ms pw-in-f100.1e100.net
ad
(74.125.53.100) 79.009 ms 216.239.48.137 (216.239.48.137) 77.437 ms
Often the hostnames and IP addresses on either side of a failed jump are useful in determining
who operates the machine where the routing error occurs. Failed jumps are designated by line
with three asterisks (e.g. * * *).
.p
ISP’s network. Recording traceroute information is particularly useful if you are experiencing
an intermittent issue.
w
The ―mtr‖ command, like the traceroute tool, provides information about the route that Internet
traffic takes between the local system and a remote host. However, mtr provides additional
w
information about the round trip time for the packet. In a way, you can think of mtr as a
combination of traceroute and ping.
t
Used without the --report flag, mtr tracks the speed of the connection in real time until you
exit the program. Additionally, be aware that mtr will pause for a few moments before
ne
generating output. For more information regarding mtr consider our guide to diagnosing network
issues with mtr.
System Diagnostics
.
If you’re having an issue with your Linode that is neither related to networking, nor another
pz
easily diagnosable application issue, it is worthwhile to rule out ―hardware‖ and operating
system level issues. Use the following tools to better diagnose and resolve these kinds of issues.
If you determine that you have a problem with memory usage, please reference our document
regarding resolving memory usage issues. Use the following tools and approaches to determine
ee
the specific cause of your troubles.
If you need to see how much memory your system is using at the current moment issue the
ad
following command:
1 free -m
On a moderately utilized Linode 1GB, this command will generate output that resembles the
following:
.p
Mem: 1002 956 46 0 171 357 -/+ buffers/cache: 427 575 Swap: 127 39 88
w
This output takes a little bit of careful reading to interpret correctly. Out of a total 1002
megabytes of memory (RAM), the system is using 956 megabytes, and has 46 megabytes free.
w
However, the system also has 427 megabytes of ―stale‖ data buffered and stored in cache. The
operating system will ―drop‖ the caches when and if it needs the space, but retains the cache if
there is no other need for the space. It is totally normal for a Linux system to leave old data in
RAM until the space is needed, and you should not be alarmed if only a small amount of
w
In the above example, there are 575 megabytes of memory that are actually free. This means 575
megabytes are available to your system when you start an additional process or a running
application needs more memory.
If you think you’re having an I/O issue then run the following command:
1 vmstat 1 20
This runs a vmstat every second, twenty times. We find this gives a pretty good sample of the
current state of the system. The output generated resembles the following:
t
ne
procs -----------memory---------- ---swap-- -----io---- -system-- ----cpu--
1 --
2 r b swpd free buff cache si so bi bo in cs us sy id
3 wa
4 0 0 4 32652 47888 110824 0 0 0 2 15 15 0 0 100 0
5 0 0 4 32644 47888 110896 0 0 0 4 106 123 0 0 100 0
6 0 0 4 32644 47888 110912 0 0 0 0 70 112 0 0 100 0
.
7 0 0 4 32644 47888 110912 0 0 0 0 92 121 0 0 100 0
8 0 0 4 32644 47888 110912 0 0 0 36 97 136 0 0 100 0
pz
9 0 0 4 32644 47888 110912 0 0 0 0 96 119 0 0 100 0
10 0 0 4 32892 47888 110912 0 0 0 4 96 125 0 0 100 0
11 0 0 4 32892 47888 110912 0 0 0 0 70 105 0 0 100 0
12 0 0 4 32892 47888 110912 0 0 0 0 97 119 0 0 100 0
13 0 0 4 32892 47888 110912 0 0 0 32 95 135 0 0 100 0
14 0 0 ee 4 33016 47888 110912 0 0 0 0 75 107 0 0 100 0
15 0 0 4 33512 47888 110912 0 0 0 24 113 134 0 0 100 0
16 0 0 4 33512 47888 110912 0 0 0 0 175 244 0 0 100 0
17 0 0 4 33512 47888 110912 0 0 0 0 92 148 0 0 100 0
18 0 0 4 33512 47888 110912 0 0 0 0 114 162 0 0 100 0
19 0 0 4 33512 47888 110912 0 0 0 36 100 157 0 0 100 0
20 0 0 4 33388 47888 110912 0 0 0 0 116 166 0 0 100 0
21 0 0 4 33388 47888 110912 0 0 0 0 97 157 0 0 100 0
ad
22 0 0 4 33388 47888 110912 0 0 0 0 89 144 0 0 100 0
0 0 4 33380 47888 110912 0 0 0 0 181 185 0 0 99 0
The memory and swap columns provide the same kind of information provided by the ―free -m‖
command, albeit in a slightly more difficult to comprehend format. The most salient information
.p
produced by this command is the wa column, which is the final column in most implementations.
This field displays the amount of time the CPU spends waiting for IO operations to complete.
If this number is consistently and considerably higher than 0, you might consider taking
w
measures to address your IO usage. However, if the vmstat output resembles the above, you can
be sure in the knowledge that you’re not experiencing an IO-related issues.
If you are experiencing an intermittent issue, you will need to run vmstat when you experience
w
the issue in order to properly diagnose or rule out an IO issue. vmsat output can sometimes help
support diagnose problems.
w
If you want a more organized and real-time view of the current state of your system, we
recommend a tool called htop. This is not installed by default on most systems. To install htop,
issue one of the following commands, depending on which distribution you use:
1 htop
You can quit at any time by pressing the F10 or Q keys. There are a couple of htop behaviors that
may not be initially intuitive. Take note of the following:
t
The memory utilization graph displays used memory, buffered memory, and cached memory.
ne
The numbers displayed at the end of this graph reflect the total amount of memory available
and the total amount memory on the system as reported by the kernel.
The default configuration of htop presents all application threads as independent processes,
which is non-intuitive. You can disable this by selecting the “setup” option with F2, then
“Display Options,” and then toggling the “Hide userland threads” option.
You can toggle a “Tree” view with the F5 key that usefully displays the processes in a hierarchy
.
and shows which processes were spawned by which other processes. This is helpful in
pz
diagnosing a problem when you’re having trouble figuring out what processes are what.
If you’re new to administering systems and the Linux world, you might consider our ―Tools &
Reference‖ section and articles including: ―installing and using WinSCP‖ using rsync to
synchronize files and ―using SSH and the terminal.‖
ad
As always, if you are giving other users access to upload files to your server, it would be wise to
consider the security implications of all additional access that you grant to third parties seriously.
If you’re used to using an FTP client, OpenSSH (which is included and active with all of the
Linode provided installation templates) allows you to use an FTP-like interface over the SSH
protocol. Known as ―SFTP,‖ many clients support this protocol, including: ―WinSCP‖ for
w
Windows, ―Cyberduck‖ for Mac OS X, and ―Filezilla‖ for Linux, OS X, and Windows desktops.
If you are accustomed to FTP, SFTP is great option. Do note that by default, whatever access a
w
user has to a file system at the command line, they will also have over SFTP. Consider file
permissions very carefully.
Conversely, you can use Unix utilities including scp and rsync to securely transfer files to your
w
The command, scp, is followed by the name of the file on the local file system to be transferred.
Next is the username and hostname of the remote machine, separated by an ―at‖ sign (e.g. @).
Following the hostname, there is a colon (e.g. :) and the path on the remote server where the file
should be uploaded to. Taken another way, this command would be:
www.padeepz.net
scp [/path/to/local/file] [remote-username]@[remote- www.padeepz.net
1
hostname]:[/path/to/remote/file]
This command is available by default on OS X and Linux machines. You can use it to copy files
to a Linode, as well as between remote servers. If you have SSH keys deployed, you can use the
scp command without entering a password for every transfer.
The syntax of scp follows the form scp [source] [destination]. You can copy files from a
remote host to the local machine by reversing the order of the paths in the above example.
t
ne
How to Protect Files on a Remote Server
Because Linode servers are network accessible and often have a number of distinct users,
maintaining the security of files is often an important concern. We recommend you familiarize
yourself with our basic security guide. Furthermore, our documentation of access control with
user accounts and permissions may provide additional insight.
.
pz
Additionally, we suggest the following best practices for maintaining security:
Only give users the permission to do what they need to. This includes application specific users.
Only run services on public interfaces that you are actively using. One common source of
security vulnerabilities are in daemons that are left running and unused. This includes database
ee
servers, HTTP development servers, and FTP servers.
Use SSH connections whenever possible to secure and encrypt the transfer of sensitive
information.
1 ln -s /home/squire/config-git/etc-hosts /etc/hosts
w
This creates a link of the file etc-hosts at the location of the system’s /etc/hosts file. More
generically. this command would read:
w
1 ln -s [/path/to/target/file] [/path/to/location/of/sym/link]
The final term, the location of the link, is optional. If you opt to omit the link destination, a link
will be created in the current directory with the same name as the file you’re linking to.
When specifying the location of the link, ensure that path does not have a final trailing slash.
You can create a sym link that targets a directory, but sym links cannot terminate with slashes.
You may remove a symbolic link without affecting the target file.
You can use relative or absolute paths when creating a link.
www.padeepz.net
How to Manage and Manipulate Files on a Linux System www.padeepz.net
If you’re new to using Linux and manipulating files on the terminal interface we encourage you
to consider our using the terminal document. This tip provides an overview of basic file
management operations.
cp /home/squire/todo.txt /home/squire/archive/todo.01.txt
t
ne
This copies todo.txt to an archive folder, and adds a number to the file name. If you want to
recursively copy all of the files and subdirectories in a directory to another directory, use the -R
option. This command looks like:
1 cp -R /home/squire/archive/ /srv/backup/squire.01/
.
If you need to move a file or directory, use the following command:
pz
1 mv /home/squire/archive/ /srv/backup/squire.02/
1 rm scratch.txt
This will delete the scratch.txt file from the current directory.
ad
For more information about file system navigation and manipulation, please consider our
documentation of file system navigation in the using the terminal document.
Package Management
.p
Contemporary Linux systems use package management tools to facilitate the installation and
maintenance of all software on your system. For more in-depth coverage of this topic, please
reference our package management guide.
w
While package management provides a number of powerful features, it is easy to obviate the
benefits of package management. If you install software manually without package management
tools, it becomes very difficult to keep your system up to date and to manage complex
w
dependencies. For these reasons, we recommend installing all software through package
management tools unless other means are absolutely necessary. The following tips outline a
couple of basic package management tasks.
w
Because packages are so easy to install, and often pull in a number of dependencies, it can be
easy to lose track of what software is installed on your system. The following commands provide
a list of installed packages on your system.
The following example presents the first few lines of the output of this command on a production
Debian Lenny system.
t
4 2.2
5 ii apache2-utils 2.2.9-10+lenny4 utility programs for
ne
6 webservers
7 ii apache2.2-common 2.2.9-10+lenny4 Apache HTTP Server
8 common files
9 ii apt 0.7.20.2+lenny1 Advanced front-end for dpkg
ii apt-utils 0.7.20.2+lenny1 APT utility programs
ii bash 3.2-4 The GNU Bourne Again SHell
.
pz
For CentOS and Fedora systems, issue the following command:
The following example presents a few relevant lines of the output of this command:
ee
1 MAKEDEV.i386 3.23-1.2 installed
2 SysVinit.i386 2.86-15.el5 installed
CentOS and Fedora systems provide the name of the package (e.g. SysVinit), the architecture it
was compiled for (e.g. i386), and the version of the build installed on the system (e.g. 2.86-
ad
15.el5).
1 pacman -Q
.p
This command provides a total list of all packages installed on the system. Arch also allows you
to filter these results to display only packages that were explicitly installed (with the -Qe option)
or that were installed as dependencies (with the -Qd option). The above command is thus the
w
1 pacman -Qe
2 pacman -Qd
w
1 perl-www-mechanize 1.60-
2 perl-yaml 0.70-1
3 pkgconfig 0.23-1
4 procmail 3.22-2
5 python 2.6.4-1
6 rsync 3.0.6-1
www.padeepz.net
1 emerge -evp --deep world www.padeepz.net
t
[ebuild R ] sys-libs/zlib-1.2.3-r1 0 Kb
ne
Because there are often a large number of packages installed on any given system, the output of
these commands is often quite large. As a result, it is often useful to use tools like grep and less
to make these results more useful. For example, the command :
.
pz
will return a list of all packages with the word python in their name or description. Similarly, the
following command:
1 dpkg -l | less
ee
will return the same list as the plain ―dpkg -l ; however, the results will appear in the less
pager, which allows you to search and scroll more easily.
You can append | grep "[string]" to these commands to filter package list results, or | less
to display the results in a pager, regardless of distribution.
ad
How to Discover Package Names and Information
Sometimes the name of a package doesn’t correspond to the name that you may associate with a
given piece of software. As a result, most package management tools make provide an interface
to search the package database. These search tools may be helpful if you’re looking for a specific
.p
This will search the local package database for a given term and generate a list with brief
w
2
vim-python - Vi IMproved - enhanced vi editor (transitional package)
3
vtk-examples - C++, Tcl and Python example programs/scripts for VTK
4
zope-plone3 - content management system based on zope and cmf
5
zorp - An advanced protocol analyzing firewall
6
groovy - Agile dynamic language for the Java Virtual Machine
7
python-django - A high-level Python Web framework
8
python-pygresql-dbg - PostgreSQL module for Python (debug extension)
9
python-samba - Python bindings that allow access to various aspects of
10
Samba
www.padeepz.net
Note that apt-cache search queries the full records for all of the packages,www.padeepz.net
and not simply the
titles and the descriptions displayed here, hence the inclusion of vim-nox and groovy which both
mention python in their descriptions. To see the full record on a package issue the following
command:
This provides information regarding the maintainer, the dependencies, the size, the homepage of
the upstream project, and a description of the software. This command can be used to provide
t
additional information about a package from the command line.
ne
For CentOS and Fedora systems, issue the following command:
This generates a list of all packages available in the package database that match the given term.
.
See the following excerpt for an example of the output of yum search wget:
pz
Loaded plugins: fastestmirror
1
Loading mirror speeds from cached hostfile
2
* addons: centos.secsup.org
3
* base: centos.secsup.org
4
* extras: centos.secsup.org
ee
5
* updates: styx.biochem.wfubmc.edu
6
================================ Matched: wget
7
=================================
8
wget.i386 : A utility for retrieving files using the HTTP or FTP protocols.
You can use the package management tools to discover more information about a specific
ad
package. Use the following command to get a full record from the package database:
This output presents more in-depth information concerning the package, its dependencies,
.p
This will perform a search of the local copy of the package database. Here is an excerpt of results
for a search for ―python:
w
1 extra/twisted 8.2.0-1
2 Asynchronous networking framework written in Python.
w
3 community/emacs-python-mode 5.1.0-1
4 Python mode for Emacs
The terms ―extra‖ and ―community‖ refer to which repository the software is located in. This
level of specificity is unnecessary when specifying packages to install or display more
information about. To request more information about a specific package issue a command in the
following form:
t
The first command only searches the database for package names. The second command
searches through the database for package names and descriptions. These commands will allow
ne
you to search your local package tree (i.e. portage) for the specific package name or term. The
output of either command is similar to the excerpt presented bellow.
1 Searching...
2 [ Results for search key : wget ]
.
3 [ Applications found : 4 ]
4
pz
5 * app-emacs/emacs-wget
6 Latest version available: 0.5.0
7 Latest version installed: [ Not Installed ]
8 Size of files: 36 kB
9 Homepage: http://pop-club.hp.infoseek.co.jp/emacs/emacs-wget/
10 Description:
ee Wget interface for Emacs
11 License: GPL-2
Because the output provided by the emerge --search command is rather verbose, there is no
―show more information‖ tool, unlike other distributions’ tools. The emerge --search
command accepts input in the form of a regular expression if you need to narrow results even
further.
ad
Since there are often a large number of results for package searches, these commands output a
great quantity of text. As a result it is often useful to use tools like grep and less to make these
results easier to scroll. For example, the command :
.p
will return the subset of the list of packages which matched for the search term ―python,‖ and
that mention xml in their name or short description. Similarly, the following command:
w
will return the same list as the plain apt-cache search python but the results will appear in
the less pager. This allows you to search and scroll more conveniently.
You can append | grep "[string]" to any of these commands to filter package search results,
w
Text Manipulation
Among Linux and UNIX-like systems, nearly all system configuration information is stored and
manipulated in plain text form. These tips provide some basic information regarding the
manipulation of text files on your system.
www.padeepz.net
How to Search for a String in Files with grep www.padeepz.net
The grep tool allows you to search a stream of text, such as a file or the output of a command,
for a term or pattern matching a regular expression.
t
This will search the mail spool for subject lines (i.e. begins with the word ―Subject:‖), beginning
ne
with any number of characters, containing the word ―help‖ in upper case, and followed by any
number of additional characters. grep would then print these results on the terminal.
grep provides a number of additional options that, if specified, force the program to output the
context for each match (e.g. with -C 2 for two lines of context). With -n, grep outputs the line
number of the match. With -H, grep prints the file name for each match, which is useful when
.
you ―grep‖ a group of files or ―grep‖ recursively through a file system (e.g. with -r). Consider
pz
the output of grep --help for more options.
To grep a group of files, you can specify the file with a wildcard, as in the following example:
In this example, we assume that the /home/squire/data directory contains a large number of
files that have a UNIX time stamp in their file name. The above command will filter the output
to only display those tiles that have the four digits ―1257‖ in their file name. Note, in these cases
w
grep only filters the output of ls and does not look into file contents. For more information
regarding grep consider the full documentation of the grep command.
While the grep tool is quite powerful for filtering text on the basis of regular expressions, if you
need to edit a file or otherwise manipulate the text you may use the sed tool. sed, or the Stream
w
EDitor, allows you search for a regular expression pattern and replace it with another string.
sed is extremely powerful, and we recommend that you back up your files and test your sed
commands thoroughly before running them, particularly if you’re new to using sed. Here is a
very simple sed one-liner, intended to illustrate its syntax.
www.padeepz.net
This replaces occurrences of the word ―good‖ occurring at the beginning ofwww.padeepz.net
a line (as noted by
the ^) with the string ―BAD‖ in the file morning-star.txt. The option -i tells sed to perform
the replacements ―in place.‖ sed can make backups of the files it edits if you specify a suffix
after the -i option, as in -iBAK. In the above command this option would save the original file as
morning-star.txt.BAK before making changes.
1 's/[regex]/[replacement]/'
t
ne
To match literal slashes (e.g. /), you must escape them with a backslash (e.g. \). As a result, to
match a / character you would use \/ in the sed expression. If you are searching for a string that
has a number of slashes, you can replace the slashes which another character. For instance:
1 's|r/e/g/e/x|regex|'
.
This would strip the slashes from the string r/e/g/e/x so that this string would be regex after
pz
running the sed command on the file that contains the string.
The following example, from our migrating a server to your Linode document, searches and
replaces one IP address with another. In this case 98.76.54.32 is replaced with 12.34.56.78:
ee
1 sed -i 's/98\.76\.54\.32/12\.34\.56\.78/'
In the above example, period characters are escaped as \.. In regular expressions the full-stop
(period) character matches to any character.
ad
Once again, sed is a very powerful and useful tool; however, if you are unfamiliar with it, we
strongly recommend testing your search and replace patterns before making any edit of
consequence. For more information about sed consider the full documentation of text
manipulation with sed.
.p
In many Linode Library documents, you may be instructed to edit the contents of a file. To do
this, you need to use a text editor. Most of the distribution templates that Linode provides come
w
with an implementation of the vi/vim text editor and the nano text editor. These are small,
lightweight, and very powerful text editors that allow you manipulate the text of a file from the
terminal environment.
w
There are other options for text editors, notably emacs and ―zile.‖ Feel free to install these
programs using your operating system’s package manager. Make sure you search your package
database so that you can install a version compiled without GUI components (i.e. X11).
w
To open a file, simply issue a command beginning with the name of the editor you wish to run
followed by the name of the file you wish to edit. Here are a number of example commands that
open the /etc/hosts file:
1 nano /etc/hosts
2 vi /etc/hosts
3 emacs /etc/hosts
4 zile /etc/hosts
www.padeepz.net
When you’ve made edits to a file, you can save and exit the editor to returnwww.padeepz.net
to the prompt. This
procedure varies between different editors. In emacs and zile, the key sequence is the same:
depress control and type x and s to save. This operation is typically notated ―C-x C-s‖ and then
―C-x C-c‖ to close the editor. In nano, press Control-O (notated \^O) and confirm the file name
to write the file, and type \^X to exit from the program.
Since vi and vim are modal editors, their operation is a bit more complex. After opening a file in
vi, you can enter ―insert‖ mode by pressing the ―i‖ key; this will let you edit text in the
conventional manner. To save the file, you must exit into ―normal‖ mode by pressing the escape
t
key (Control-[ also sends escape), and then type :wq to write the file and quit the program.
ne
This provides only the most basic outline of how to use these text editors, and there are
numerous external resources which will provide a more thorough introduction for more advanced
use of this software.
.
pz
Linodes do not come with any particular web server installed by default. You have the choice
and power to install and configure your web server as you see fit. This allows you to deploy a
configuration in a way that makes sense for your application and desired use case. The Linode
Library contains a number of documents regarding the installation and maintenance of various
web servers.
ee
The following tips cover a number of basic web serving tasks and functions, as well as some
guidance for users new to the world of web servers.
In order to provide these resources to connected users, your Linode needs to be running a web
server. There are multiple different HTTP servers and countless configurations to provide
w
support for various web development frameworks. The three most popular general use web
servers are the Apache HTTP server, Lighttpd server (―Lighty‖), and nginx server (―Engine X‖).
Each server has its strengths and weaknesses, and your choice depends largely on your
w
Once you’ve chosen a web server, you need to decide what (if any) scripting support you need to
install. Scripting support allows you to run dynamic content with your web server and program
w
server side scripts in languages such as Python, PHP, Ruby, and Perl.
If you need a full web application stack, we encourage you to consider one of our more full-
featured LAMP stack guides. If you need support for a specific web development framework,
consult our tutorials for installing and using specific web development frameworks.
The Apache HTTP Server is considered by many to be the de facto standard web server. It is the
most widely deployed open source web server, its configuration interface has been stable for
many years, and its modular architecture allows it to function in many different types of
t
deployments. Apache forms the foundation of the LAMP stack, and contains superb support for
integrating dynamic server-side applications into the web server.
ne
By contrast, web servers like Lighttpd and nginx are highly optimized for serving static content
in an efficient manner. If you have a deployment where server resources are limited and are
facing a great deal of demand, consider one of these servers. They are very functional and run
very well with minimal systems resources. Lighttpd and nginx can be more complex to set up
.
than Apache, and can be difficult to configure with regards to integration with dynamic content
pz
interpreters. Furthermore, as these servers are more directed at niche use cases, there are more
situations and applications which remain undocumented.
Finally the Cherokee web server provides a general purpose web server with an easy to configure
interface. Cherokee might be a good option for some basic deployments.
ee
Remember that the choice of web servers is often contextually determined. Specific choices
depend on factors like: the type of content you want to serve, the demand for that content, and
your comfort with that software as an administrator.
By default, error logs are located in the /var/log/apache2/error.log file. You can track or ―tail‖
this log with the following command:
w
1 tail -F /var/log/apache2/error.log
In the default virtual host configurations suggested in our Apache installation and LAMP guides,
we suggest the following error logging setup:
w
Where bucknell.net represents the name of your virtual host, and the location of relevant files.
These configuration directives make Apache create two log files that contain logging information
specific to that virtual host. This allows you to easily troubleshoot errors on specific virtual
hosts. To track or tail the error log, issue the following command:
www.padeepz.net
1 tail -F /srv/www/example.com/logs/error.log www.padeepz.net
This will allow you to see new error messages as they appear. Often problems can be diagnosed
by using specific parts of an error message from an Apache log as a term in Web search (e.g.
Google.) Common errors to look for include:
t
Dynamic code execution or interpretation errors.
ne
DNS Servers and Domain Names
The Domain Name System, or DNS, is the service that the Internet uses to associate the hard to
remember and manage IP addresses with more human-usable domain names. These tips address
.
several specific DNS related tasks. To learn more about DNS consider our overview of the
domain name system. If you are familiar with DNS and just need to figure out how to set up your
pz
DNS server, consider our documentation of the Linode DNS manager.
CNAME DNS records make it possible to redirect requests for one hostname or domain to
ee
another hostname or domain. This is useful in situations where you want to direct requests for
one domain to another, but don’t want to set up the web-server to handle requests.
CNAMEs are only valid when pointing from one domain to another. If you need to redirect a full
URL, you will need to set up a web server and configure redirection and/or virtual hosting on the
ad
server level. CNAMEs will allow you to redirect subdomains, such as team.example.com, to
other subdomains or domains, such as jack.example.org. CNAMEs must point a valid a
domain that has a valid A Record, or to another CNAME.
Although limited in their capabilities, CNAMEs can be quite useful in some situations. In
particular, if you need to change the hostname of a machine, CNAMEs are quite useful. To learn
.p
how to set up CNAME records with the Linode Manager, consult our documentation of the
Linode DNS Manager.
When reading domain names, we commonly refer to parts before the main or first-level domain
as ―sub-domains.‖ For example, in the domain team.example.com, team is a sub-domain for the
w
If you want to create and host a sub-domain, consider the following process:
w
in the DNS zone for the domain. This is easily accomplished when using the Linode DNS
Manager. As always, you may host the DNS for your domain with any provider you choose.
In order for your server to respond to requests for this domain, you must set up a server to
respond to these requests. For web servers like Apache this requires configuring a new virtual
www.padeepz.net
www.padeepz.net
host. For XMPP Servers you must configure an additional host to receive the requests for this
host. For more information, consult the documentation for the specific server you wish to deploy.
Once configured, subdomains function identically to first-level domains on your server in almost
all respects. If you need to, you can set up HTTP redirection for the new sub domain.
t
further demystify email management.
ne
Choosing an Email Solution
There are two major components of the email stack that are typically required for basic email
functionality. The most important part of the tool chain is the SMTP server or ―Mail Transfer
Agent.‖ The MTA, as it is often called, sends mail from one server to another. The second
.
crucial part of an email system is a server that permits users to access and download that mail
pz
from the server to their own machine. Typically these server use a protocol such as POP3 or
IMAP to provide remote access to the mailbox.
There are additional components in the email server tool chain. These components may or may
not be optional depending on the requirements of your deployment. They include filtering and
ee
delivery tools like procmail, anti-virus filters like ClamAV, mailing list managers like MailMan,
and spam filters like SpamAssassin. These components function independently of which MTA
and remote mailbox accessing server you chose to deploy.
The most prevalent SMTP servers or MTAs in the UNIX-like world are Postfix, Exim, and
ad
Sendmail. Sendmail has the longest history and many systems administrators have extensive
experience with it. Postfix is robust and modern, and is compatible with many different
deployment types. Exim is the default MTA in Debian systems, and many consider it to be easier
to use for basic tasks. For remote mailbox access, servers like Courier and Dovecot are widely
deployed to provide remote access to mailboxes.
.p
If you are in need of an integrated and easy to install email solution we encourage you to
consider the Citadel groupware server. Citadel provides an integrated ―turnkey‖ solution that
includes an SMTP server, remote mailbox access, real time collaboration tools including XMPP,
and a shared calendar interface. Along similar lines, we also provide documentation for the
w
If, by contrast, you want a more simple and modular email stack, we urge you to consider one of
w
Finally, it’s possible to outsource email service to a third party provider, such as Google Apps or
FastMail.fm. This allows you to send and receive mail from your domain, without hosting email
w
services on your Linode. Consult our documentation for setting up Google Apps for your
domain.
In many cases, administrators have no need for a complete email stack like those documented in
our email guides. However, applications running on that server still need to be able to send mail
for notifications and other routine purposes.
www.padeepz.net
The configuration of applications to send notifications and alerts is beyondwww.padeepz.net
the scope of this tip,
most applications rely on a simple ―sendmail‖ interface. Nevertheless, the modern MTAs Postfix
provides a sendmail-compatible interfaces located at /usr/sbin/sendmail.
You can install postfix on Debian and Ubuntu systems with the following command:
On CentOS and Fedora systems you can install postfix by issuing the following command:
t
ne
1 yum install postfix
Once Postfix is installed, your applications should be able to access the sendmail interface,
located at /usr/sbin/sendmail. Most applications running on your Linode should be able to
send mail normally with this configuration.
.
If you simply want to use your server to send email through an external SMTP server, you may
pz
want to consider a more simple tool like msmtp. Since msmtp is packaged in most distributions
you can install using the command appropriate to your distribution:
Use the command type msmtp, to find the location of msmtp on your system. Typically the
program is located at /usr/bin/msmtp. You can specify authentication credentials with
command line arguments or by declaring SMTP credentials in a configuration file. Here is an
example .msmtprc file.
ad
.msmtprc example
account default host smtp.example.com from squire@example.com auth on user squire password
s3cr37 tls on tls_certcheck off port 587
.p
The .msmptrc file needs to be set to mode 600, and owned by the user account that will be
sending mail. If the configuration file is located at /srv/smtp/msmtprc, you can call mstmp
with the following command:
w
1 /usr/bin/msmtp --file=/srv/smtp/msmtprc
w
VIRTUALIZATION
Virtualization projects are the focus of many IT professionals who are trying to consolidate
w
servers or data centers, decrease costs and launch successful ―green‖ conservation initiatives.
Virtualizing IT resources can be thought of as squeezing an enterprise’s computer processing
power, memory, network bandwidth and storage capacity onto the smallest number of hardware
platforms possible and then apportioning those resources to operating systems and applications
on a time-sharing basis. This approach aims to make the most efficient possible use of IT
resources. It differs from historical computing and networking models, which have typically
involved inextricably binding a given software
www.padeepz.net
www.padeepz.net
application or service to a specific operating system (OS), which, in turn, has been developed to
run on a particular hardware platform. By contrast, virtualization decouples these components,
making them available from a common resource pool. In this respect, virtualization prevents IT
departments from having to worry about the particular
hardware or software platforms installed as they deploy additional services. The decoupling and
optimization of these components is possible whether you are virtualizing servers, desktops,
applications, storage devices or networks. To virtualize some or all of a computing
infrastructure’s resources, IT departments require special virtualization software, firmware or a
third- party service that makes use of virtualization software or firmware. This
t
software/firmware component, called the hypervisor or the virtualizattion layer, performs the
mapping between virtual and physical resouces. It is what enables the various resources to be
ne
decoupled,then aggregated and dispensed, irrespective of the underlying hardware and, in some
cases, the software OS. In effect, the hypervisor takes over hardware management from the OS.
In addition to the hypervisor virtualization technology, the organization overseeing the
virtualization project requires a virtualization management tool – which might be procured from
the same or a different supplier – to set up and manage virtual devices and policies.
.
Why Virtualize?
pz
One key reason why IT organizations are considering virtualization of some or all of their
computing infrastructures is that the technology helps them to derive the biggest bang out of
their computing buck.
SETTING UP XEN
ee
Xen is a type 1, bare-metal virtual machine monitor (or hypervisor), which provides the ability to
run one or more operating system instances on the same physical machine. Xen, like other types
of virtualization, is useful for many use cases such as server consolidation and isolation of
production and development environments (Eg.: corporate and personal environments on the
same system).
ad
As of Ubuntu 11.10 (Oneiric), the default kernel included in Ubuntu can be used directly with
the Xen hypervisor as the management (or control) domain (dom0 or "Domain0" in Xen
terminology).
.p
Our example uses LVM for virtual disks and network bridging for virtual network cards. It also
assumes Xen 4.1 (the version available in 12.04). It assumes a familiarity with general
virtualization issues, as well as with the specific Xen terminology. Please see the Xen wiki (see
http://wiki.xen.org/wiki/Xen_Overview) for more information.
w
During the install of Ubuntu for the Partitioning method choose "Guided - use the entire disk and
setup LVM". Then, when prompted to enter "Amount of volume group to use for guided
w
partitioning" enter a value large enough for the Xen dom0 system, leaving the rest for virtual
disks. Enter a value smaller than the size of your installation drive. For example 100 GB should
be large enough for a minimal Xen dom0 system. Keep in mind that in our model stay inside that
guest (dom0) all installation media for guest OSs and other useful files, so that guest must have
enough space on it.
www.padeepz.net
www.padeepz.net
After Installation of Ubuntu
Install GUI
sudo apt-get update
sudo apt-get install ubuntu-desktop
To skip the login screen completely, boot into the console and then start the GUI
t
sudo gedit /etc/default/grub
ne
Change line GRUB_CMDLINE_LINUX_DEFAULT="quiet splash" to
GRUB_CMDLINE_LINUX_DEFAULT="text".
.
sudo update-grub
pz
Reboot and you should come up directly in tty1.
We understand that this is the best approach to have remote access to "Ubuntu Server 12.04
LTS" into a cross-plataform environment. The "XRDP" is an implementation of the "Remote
ad
Desktop" standards from Microsoft and works on the same way as for Windows. Allows remote
desktop access via native Windows client machines (or "RDESKTOP" on Ubuntu), does not
require loading the "Ubuntu Server 12.04 LTS" GUI (Graphical User Interface) on boot and
allows multiple simultaneous sessions. To use "RDESKTOP" on Ubuntu 12.04 LTS with
"TSCLIENT" see my post on http://superuser.com/questions/420291/ubuntu-12-04-how-to-get-
.p
tsclient-back.
With "XRDP" you can easily use Microsoft RDP to connect to Ubuntu without any
configuration. All you need to do is install the "xrdp" package, then open Remote Desktop
w
Next, open Windows Remote Desktop Connection (RDP) and type Ubuntu Server hostname or
w
IP address.
As you may already know, SSH is a secure communication protocol that lets you remotely
access networked computers. It is known as a replacement for Telnet which is very unsecure.
While Telnet sends traffic in plain text, SSH on the other hand uses a secure protocol to
communicate.
www.padeepz.net
Run the commands below to install SSH Server. www.padeepz.net
ssh <remote_user>@<ip_or_name>
t
ne
XCP (Xen Cloud Platform) is the open source version similar to Citrix Xen Server that uses the
Xen Hypervisor. XCP uses XAPI or XenAPI to manage Xen hosts. XCP is based on CentOS 5.5.
Project Kronos is an initiative to port the XAPI tool stack to Debian and Ubuntu. It is a
management stack implemented in OCaml that configures and controls Xen hosts, attached
storage, networking and virtual machine life cycle. It exposes a HTTP API and provides a
.
command line interface (xe) for resource management.
pz
XenCenter is Windows desktop application by Citrix that is distributed with XenServer for
managing servers running XenServer (the equivalent of linux is OpenXenManager). It uses
XAPI for talking to Xen resource pools. Since we are setting up XAPI, we can use XenCenter to
manage the server.
ee
Why use XCP-XAPI on Debian/Ubuntu when XCP appliance exists?
o sudo sed -i
's/GRUB_CMDLINE_LINUX=.*\+/GRUB_CMDLINE_LINUX="apparmor=0"/'
/etc/default/grub
w
www.padeepz.net
Update Grub with the config changes we just made www.padeepz.net
o sudo update-grub
o sudo reboot
t
o cat /proc/xen/capabilities should display "control_d"
ne
o Note: To stop or start xcp-xapi
sudo /etc/init.d/xcp-xapi stop (or start)
.
o Install XCP-XAPI
sudo apt-get install xcp-xapi
pz
-> Choose "bridge" when prompted for network backend
Fix for "qemu" which emulates the console does not have the keymaps in the correct
location
w
Network configuration
This section describes how to set up Linux bridging in Xen. It assumes eth0 is
both your primary interface to dom0 and the interface you want your VMs to
w
t
Create a bond called xenbr0. The file should look like this for a
static network configuration:
ne
# This file describes the network
interfaces available on your system
# and how to activate them. For more
information, see interfaces(5).
# The loopback network interface
.
auto lo
pz
iface lo inet loopback
# Xen network interface for "dom0"
auto xenbr0
iface xenbr0 inet static
ee
# IP address
address 192.168.1.111
# Subnet mask
netmask 255.255.255.0
# Default Gateway
gateway 192.168.1.1
ad
# DNS Server
dns-nameservers 192.168.1.1
bridge_ports eth0
iface eth0 inet manual
.p
# The primary network interface
# auto eth0
# iface eth0 inet dhcp
w
Eg.:
sudo chmod ugo+rwx
/dev/vmnet0
www.padeepz.net
www.padeepz.net
sudo chown eduardo
/dev/vmnet0
sudo chown :eduardo
/dev/vmnet0
All set! Ready to reboot and let xcp-xapi toolstack take over
t
ne
sudo reboot
sudo xe vm-list
This should list the control domain
.
"
pz
uuid (RO) : dbcf74d2-ee50-edd5-d44d-
b81fc8ba1777
name-label (RW): Control domain on host:
ubuntu-xenserver-1
power-state (RO): running
"
ee -> If your output looks similar, "xapi" is running on the server, if
you get ―Connection refused‖ then xapi is not setup correctly!
you simply provide the hostname or IP address of the NFS server and the
path to a directory that will be used to contain the storage repository (if
this resource is on another machine, it is not our case). The NFS server
must be configured to export the specified path to all servers in the pool.
w
StorageRepository /dev/ubuntus1204
Eg2.: sudo lvcreate -l 100%FREE -n
StorageRepository /dev/ubuntus1204
t
label=StorageRepository
"
ne
uuid ( RO): 37bc5263-c9fc-8876-d24c-
d5927f1bbed2
name-label ( RW): StorageRepository
name-description ( RW):
host ( RO): ubuntus1204
type ( RO): ext
.
content-type ( RO):
"
pz
Configure a ISO Repository for Use With XAPI
An ISO Repository contains ISOs (disk images) with operational systems to perform the installations.
ee
Then the following example makes a storage repository called ISOs
www.padeepz.net
www.padeepz.net
VMWARE ON LINUX
What do you need to get the most out of VMware Workstation 5? Take the following list of
requirements as a starting point. Like physical computers, the virtual machines running under
VMware Workstation generally perform better if they have faster processors and more memory.
t
PC Hardware
ne
Standard x86-compatible or x86-64-compatible personal computer
.
pz
Intel�: Celeron�, Pentium� II, Pentium III, Pentium 4, Pentium M (including computers
with Centrino™ mobile technology), Xeon™ (including "Prestonia"), EM64T
AMD™: Athlon™, Athlon MP, Athlon XP, Athlon 64, Duron™, Opteron™, Turion™ 64
AMD Opteron, AMD Athlon 64, AMD Turion 64, AMD Sempron, Intel EM64T; support for
64-bit guest operating systems is available only on the following specific versions of these
processors:
.p
Memory
w
You must have enough memory to run the host operating system, plus the memory required for
each guest operating system and for applications on the host and guest. See your guest operating
system and application documentation for their memory requirements.
www.padeepz.net
Display www.padeepz.net
Disk Drives
Guest operating systems can reside on physical disk partitions or in virtual disk files.
Hard Disk
t
IDE and SCSI hard drives supported, up to 950GB capacity
ne
At least 1GB free disk space recommended for each guest operating system and the
application software used with it; if you use a default setup, the actual disk space needs are
approximately the same as those for installing and running the guest operating system and
.
applications on a physical computer.
pz
For Installation — 80MB (Linux) or 250MB (Windows) free disk space required for basic
installation. You can delete the installer afterwards to reclaim disk space.
Non-Ethernet networks supported using built-in network address translation (NAT) or using a
.p
combination of host-only networking plus routing software on the host operating system
VMware Workstation is available for both Windows and Linux host operating systems.
t
Windows Server 2003 x64 Edition SP1
ne
Windows Server 2003 x64 Edition R2
Internet Explorer 4.0 or higher is required for the Windows online help system.
.
Linux Host Operating Systems (32-Bit)
pz
Supported distributions and kernels are listed below. VMware Workstation may not run on
systems that do not meet these requirements.
Note: As newer Linux kernels and distributions are released, VMware modifies and tests its
ee
products for stability and reliability on those host platforms. We make every effort to add support
for new kernels and distributions in a timely manner, but until a kernel or distribution is added to
the list below, its use with our products is not supported. Look for newer prebuilt modules in the
download area of our Web site. Go to www.vmware.com/download/.
ad
Mandriva Corporate Server 4
Mandriva Linux 2007 (experimental support)
Mandriva Linux 2006
Mandrake Linux 10.1
Mandrake Linux 9.0 — stock 2.4.19
.p
upgrade 2.4.9-31
Red Hat Linux 7.1 — stock 2.4.2-2, upgrade 2.4.3-12
Red Hat Linux 7.0 — stock 2.2.16-22, upgrade 2.2.17-14
t
SUSE Linux 8.2 — stock 2.4.20
ne
Ubuntu Linux 6.10 (experimental support)
Ubuntu Linux 6.06
Ubuntu Linux 5.10
Ubuntu Linux 5.04
.
Platforms not listed above are not supported. A Web browser is required for the Help system.
pz
Linux Host Operating Systems (64-Bit)
Supported distributions and kernels are listed below. VMware Workstation may not run on
systems that do not meet these requirements.
ee
Note: As newer Linux kernels and distributions are released, VMware modifies and tests its
products for stability and reliability on those host platforms. We make every effort to add support
for new kernels and distributions in a timely manner, but until a kernel or distribution is added to
the list below, its use with our products is not supported. Look for newer prebuilt modules in the
ad
download area of our Web site. Go to www.vmware.com/download/.
Platforms not listed above are not supported. A Web browser is required for the Help system.
t
. ne
pz
ee
ad
.p
w
w
w
www.padeepz.net