Chapter 1 (Notes) - Number Representation
Chapter 1 (Notes) - Number Representation
(0 1 2 3 4 5 6 7 8 9)10
A number base indicates how many digits are available within a numerical
system. Denary is known as base 10 because there are ten choices of digits
between 0 and 9.
For binary numbers there are only two possible digits available:
(0 or 1)2
The binary system is also known as base 2.
Each number has a place value which could be put into columns. Each column
is a power of ten in the base 10 system:
Or think of it as:
(6 x 1000) + (4 x 100) + (3 x 10) + (2 x 1) = 6432
By looking at the place values, we can calculate the equivalent denary number.
That is: (1 x 23) + (0 x 22) + (0 x21) + (1x20) = 8+0+0+1
27 = 128
26 = 64
25 = 32
24 = 16
23 = 8
22 = 4
21 = 2
20 = 1
Play
Put the remainders in reverse order to get the final number: 1010011.
64 32 16 8 4 2 1
1 0 1 0 0 1 1
Put the remainders in reverse order to get the final number: 1111010.
0 1 1 1 1 0 1 0
There are two methods for converting a denary (base 10) number to
binary (base 2). This is method two.
Take off the biggest 2n value you can
Remove the 2n numbers from the main number and mark up the equivalent
2n column with a 1. Work through the remainders until you reach zero. When you
reach zero, stop and complete the final columns with 0s.
Play
1 0 1 0 1 0 0
Binary combinations
These tables show how many binary combinations are available for each bit size.
One bit
Three bit
Fortunately, large binary numbers can be made much more compact—and hence easier to
work with—if represented in base-16, the so-called hexadecimal number system. You may
wonder: Binary numbers would also be more compact if represented in base-10—why not
just convert them to decimal? The answer, as you will soon see, is that converting between
binary and hexadecimal is exceedingly easy—much easier than converting between binary
and decimal.
Just as with decimal notation or binary notation, we again write a number as a string of symbols,
but now each symbol is one of the 16 possible hexadecimal digits (0 through F). To interpret a
hexadecimal number, we multiply each digit by the power of 16 associated with that digit’s
position.
For example, consider the hexadecimal number 1A9B. Indicating the values associated with the
positions of the symbols, this number is illustrated as:
1 A 9 B
The one main disadvantage of binary numbers is that the binary string equivalent of a large
decimal base -10 number, can be quite long. When working with large digital systems, such as
computers, it is common to find binary numbers consisting of 8, 16 and even 32 digits which
makes it difficult to both read and write without producing errors especially when working with
lots of 16 or 32-bit binary numbers. One common way of overcoming this problem is to arrange
the binary numbers into groups or sets of four bits (4-bits). These groups of 4-bits use another
type of numbering system also commonly used in computer and digital systems called
Hexadecimal Numbers.
Examples:
1
(i) 2 816 = 2 8H = 2 × 16 + 8 × 160 = 40
= 32 + 8 = 40
3 2 1 0
(iii) BC1216 = BC12H = 11×16 +12×16 +1×16 +2×16 = 48146
0101 0111
0X23+1X22+0X21+1X20 0X23+1X22+1X21+1X20
0X8+1X4+0X2+1X1 0X8+1X4+1X2+1X1
0+4+0+1=5 0+4+2+1=7
5 7
So Hexadecimal number is 57
Problems 1.2.6
Convert hexadecimal 2BF9 to its binary equivalent.
Convert binary 110011100001 to its hexadecimal equivalent. (Below is working area)
So 1A9B16 = 681110
For example, to convert the decimal number 746 to hexadecimal, we proceed as follows:
` Remainder
16 | 746
| 46 10 = A
|2 14 = E
|0 2
Step 1: Convert 65d to binary. Ignore the sign for now. Use the absolute value. The
absolute value of -65 is 65.
10111110
+ 1
--------------
10111111 <--- Two's complement
10111111b is -65 in binary. We know this it true because if we add 01000001 (+65) to
10111111b (-65) and ignore the carry bit, the sum is 0, which is what we obtain if we
add +65 + (-65) = 0.
01000001 +65
+ 10111111 -65
---------------------
100000000b 0 denary
^
Ignore the carry bit for now. What matters is that original number of bits (D7-D0) are all 0.
We will examine signed binary values in more detail later. For now, understand the
difference between one’s complement and two’s complement and practice converting
between them.
If all bits in a byte are inverted by changing each 1 to 0 and each 0 to 1, we have
formed the one’s complement of the number.
And that is all there is to it! One’s complement is useful for forming the two’s
complement of a number.
The two’s complement is a method for representing positive and negative integer values
in binary. The useful part of two’s complement is that it automatically includes the sign
bit. Rule: To form the two’s complement, add 1 to the one’s complement.
13
by using a numeration with more symbols, it is both easier to work with (saving paper
and screen space) and makes it possible to understand some of the vast streams of
data inside a computer merely by looking at the Hex output. This is why programs such
as DEBUG, use only Hexadecimal to display the actual Binary bytes of a Memory Dump
rather than a huge number of ones and zeros!
The second aspect is closely related: Whenever it is necessary to convert the Hex
representation back into the actual Binary bits, the process is simple enough to be done
in your own mind.
For example, FAD7 hex is 1111 1010 1101 0111 (F=1111, A=1010, D=1101, 7=0111)
in Binary. The reason one might wish to do this is in order to work with "logical" (AND,
OR or XOR) or "bit-oriented" instructions (Bit tests, etc.) which may make it easier (at
times) for a programmer to comprehend.
For example, if you wanted to logically AND the hex number FAD7 with D37E, you
might have a difficult time without first changing these numbers into Binary. If you jot
them out in Binary on scratch paper, the task
will be much easier:
BCD was used in many early decimal computers, and is implemented in the instruction
set of machines such as the IBM System/360 series and its descendants
and Digital's VAX. Although BCD per se is not as widely used as in the past and is no
longer implemented in computers' instruction sets[dubious – discuss], decimal fixed-
point and floating-point formats are still important and continue to be used in financial,
commercial, and industrial computing
As most computers deal with data in 8-bit bytes, it is possible to use one of the following
methods to encode a BCD number:
Unpacked: each numeral is encoded into one byte, with four bits representing the
numeral and the remaining bits having no significance.
Packed: two numerals are encoded into a single byte, with one numeral in the least
significant nibble (bits 0 through 3) and the other numeral in the most significant
nibble
Note: All the zeros are essential otherwise you can’t read it back.
But do not get confused, binary coded decimal is not the same as hexadecimal.
Whereas a 4-bit hexadecimal number is valid up to F16 representing binary 11112,
(decimal 15), binary coded decimal numbers stop at 9 binary 10012
There are a number of applications where BCD can be used. The obvious type of
application is where denary digits are to be displayed, for instance on the screen of a
calculator or in a digital time display.
IBM and BCD IBM used the terms binary-coded decimal and BCD for six-bit alphameric
codes that represented numbers, upper-case letters and special characters. Some
variation of BCD was used in most early IBM computers, including the IBM 1620, IBM
1400 series, and non-Decimal Architecture members of the IBM 700/7000 series. With
the introduction of System/360, IBM replaced BCD with 8-bit EBCDIC
ASCII code:
If text is to be stored in a computer it is necessary to have a coding scheme that
provides a unique binary code for each distinct individual component item of the text.
Such a code is referred to as a character code.
The scheme which has been used for the longest time is the ASCII (American Standard
Code for Information Interchange) coding scheme. This is an internationally agreed
standard. There are some variations on ASCII coding schemes but the major one is the
7-bit code. It is customary to present the codes in a table for which a number of different
designs have been used.
The full table shows the 27 (128) different codes available for a 7-bit code. You should
not try to remember any of the ind ividua l codes but the re are certain aspects of the
coding scheme which you need to understand.
It is worth emphasising here that these codes for numbers are exclusively for use in the
context of stored, displayed or printed text. All of the other coding schemes for numbers
are for internal use in a computer system and would not be used in a text.
There are some special features that make the coding scheme easy to use in certain
circumstances. The first is that the codes for numbers and for letters are in sequence in
each case so that, for example, if 1 is added to the code for seven the code for eight is
produced.
Unicode
Despite still being widely used, the ASCII codes are far from adequate for many
purposes.
It works by providing a unique number for every character, this creates a consistent
encoding, representation, and handling of text.
Basically Unicode is like a Universal Alphabet that covers the majority of different
languages across the world, it transforms characters into numbers.
Unicode has its own special terminology. For example, a character code is referred to
as a 'code point'. In any documentation there is a special way of identifying a code
point. An example is U+0041 which is the code point corresponding to the alphabetic
character A.
The 0041 are hexadecimal characters representing two bytes. The interesting point is
that in a text where the coding has been identified as Unicode it is only necessary to
use a one-byte representation for the 128 codes corresponding to ASCII. To ensure
such a code cannot be misinterpreted, the codes where more than one byte is needed
have restrictions applied.
A = 65
λ = 923
So Unicode says things like, ―Allright, this character exists, we assigned it an official
name and a codepoint, here are its lowercase or uppercase equivalents (if any), and
here’s a picture of what it could look like. Font designers, it’s up to you to draw this
in your font if you want to.‖
What can we do? One idea would be to always use, say, 3 bytes per character.
That would be nice for string traversal, because the 3rd codepoint in a string would
always start at the 9th byte. But it would be inefficient when it comes to storage
space and bandwidth.
UTF-8 :
UTF-8 gives you four templates to choose from: a one-byte template, a two-byte
template, a three-byte template, and a four-byte template.
0xxxxxxx
110xxxxx 10xxxxxx
1110xxxx 10xxxxxx 10xxxxxx
11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
Each of those templates has some headers which are always the same (shown
here in red) and some slots where your codepoint data can go (shown here as
―x‖s).
The four-byte template gives us 21 bits for our data, which would let us represent
2,097,151 different values. There are only about 128,000 codepoints right now, so
UTF-8 can easily encode any Unicode codepoint for the foreseeable future.
Practice Questions
Convert to one’s complement:
1. 1010
2. 11110000
3. 10111100 11000000
4. 10100001
1. 1010
2. 11110000
3. 10000000
4. 011111111
1. -192d
2. -16d
3. -1d
4. -0d
One’s complement:
1. 0101
2. 00001111
3. 01000011 00111111
4. 01011110
Two’s complement:
1. 0110 (1010 –> 0101 + 1 = 0110)
2. 00010000
3. 10000000 (Result is the same as the original value.)
4. 10000001
Negative decimal to negative binary:
1. 01000000b (192d = 11000000b –> 00111111 + 1 = 01000000b)
2. 11110000b (16d = 00010000b –> 11101111 + 1 = 11110000b)
3. 11111111b (1d = 00000001b –> 11111110 + 1 = 11111111b) Tricky? Before
converting from binary to decimal, we must know ahead of time if the binary value
is signed or not because a signed binary value will not convert properly using the
place value chart we have seen so far. If seen by itself, 11111111b = 255d, not -1d.
As a rule, assume that a binary value, such as 11111111b, is a positive integer
unless context specifies otherwise. Since we are dealing with negative binary
values in this problem set, then 11111111b is -1d, not 255d.
4. 0. There is no such thing as negative zero (-0). Nothing is always nothing and does
not have a sign. We can convert anyway: 0d = 00000000 –> 11111111 + 1 = 1
00000000b. We still arrive at 0 in binary for the eight relevant bits. Ignore the ninth
carry bit.