Lecture-36 Assembler Directives

Download as pdf or txt
Download as pdf or txt
You are on page 1of 10

Lecture-36

ASSEMBLER DIRECTIVES
To assemble a program automatically the assembler needs
information in the form of assembler directives that controls the
assembly. For example, the assembler must be told at what address
to start assembling the program. These assembler directives are
command placed in the program by the designer that provides
information to the assembler. They do not become part of the final
program as they are not the part of the instruction set of the
microprocessor nor did they translate into executable code.
Therefore, they are also known as pseudo-instruction on false
instructions.
Each assembler has its own unique pseudo instructions or
assembler directives. These instructions differ from assembler to
assembler but most of the assembler contains an equivalent set of
pseudo instructions written in assembly language format.

1. ORG: The origin (ORG) instruction tells the assembler the


address of the memory location for the next instruction or data byte
should be assembled. ORG is entered at the beginning of a program.
When different parts of a programme (e.g. subroutines) are to be
placed in different areas of memory, an ORG pseudo instruction is
used before each part of the program to specify the starting location
for assembly of that part of the program. The origin instruction has
the following form,
ORG expression
where expression evaluates to a 16-bit address, i.e., ORG is
followed by an address. If no origin pseudo instruction appears before
the first instruction in the program, assembly will begin, by default, at
memory location 0000H.
For example, ORG 0100H tells the assembler to start assembling the
immediately following program at 0100H in memory.

2. END: When an assembler scans the program to be assembled it


must know, where the program ends. It cannot depend on a HLT
instruction for this because some programmes don’t contain a halt
instruction as the last instruction and other don’t contain a halt at all.
An application program used, e.g., in process monitoring on control
might run continuously and, therefore, not contain a halt instruction.
Thus, an end assembly, END directive must be the last instruction.
The directive has a form.
END
The END statement explicitly indicates the end of the program
to the assembler. If no END statement is given, then the assembler
just keeps on running through all the memory.
The ORG and END assembler directives, in effect, frame the
program to be assembled.
ORG 0000H
[Assembly language instructions]
END
When there is more than one ORG assembler directive, then
the assembly of group of instruction start at the location specified by
the origin assemble directive that proceeds. But there will be only one
END instruction to tell the assembler the physical end of the program.
For example,
ORG 0000H
[Assembly language instructions]
{This block of instructions is assembled starting at location 0000H}
ORG 0100 H
[Assembly language instructions]
{This block of instructions is assembled starting at location 0100H}
END

3. EQU: Symbolic names, which appear in assembly language


programs as labels, instructions mnemonics and operands are
translated to binary values by the assembler. As discussed in hand-
assembly the labels are assigned the current value of the
assembler’s location counter when encountered in the first pass of
the assembly. Instruction mnemonics have predefined values that the
assembler obtains from a table that is part of the assembler.
A symbolic operand can be a register name, an address or a
data constant. Register names have predefined values. All addresses
correspond to labels in the program and their values are defined.
Data constants, on the other hand, are defined by the designer using
an equate assembler directive. The equate instruction EQU defines
symbols used in the program. Equate assembler directives usually
appear as a group at the beginning of a program and have the form.
name EQU expression.
‘name’ stands for the symbolic name. The assembler evaluates the
expression and equates the symbolic name to it by placing the name
in its symbol table along with the value of the expression. Therefore,
whenever the name appears in the program, it is replaced by the
value the expression in the equate pseudo instruction. For example,
COUNT EQU 0100 H
Note the symbolic name is not followed by a colon and is not a label
even though it appears in the label field. The symbolic name in one
equate statement cannot be used in another nor can it be used as the
label of another instruction. That is, the name in an equate directive
cannot be redefined. If its value is changed, the equate assembler
directive must be changed and the program reassembled.

4. SET: SET is similar to EQU assembler directive. This directive


also assigns a value to the name associated it. However, the same
symbol can be redefined by another SET statement later in the
program. Thus, more than one SET instructions can have the same
name the SET assembler directive has the form.
name SET expression.
5. DS: Another pseudo instruction, the define storage, reserves or
allocates read/write memory locations for storage of temporary data.
The first of the locations allocated can be referred to by an optional
symbolic label. The define storage instruction has the form
opt. label: DS expression.
A number of bytes of memory equal to the value of the expression
are reserved. However, no assumptions can be made about the initial
values of the data in these reserves locations, i.e, the assembler
does not initialize the contents of these locations in anyway.
If has a symbolic name is used with the DS pseudo instructions,
it has the value of the address of the first reserved location. Some of
the assemblers use DFS for define storage. For example, to establish
two 1-byte storage registers in RWM memory with the names TEMP1
& TEMP2, the instruction are written.
TEMP1: DS 1
TEMP2: DS 2
During the first pass, the assembler assigns the values of its location
counter to TEMP1 and TEMP2 respectively and thus an address is
associated with each label. Instructions in the program can read or
write these locations using memory reference instructions such as
STA TEMP1 or LDA TEMP2.
A memory buffer is a collection of consecutive memory
locations and also used to store data temporarily. For example,
BUFFER: DS 50D
It tells the assembler to reserve 50 memory locations for storage
when it assembles the program. The address of the first location is
BUFFER (BUFFER to BUFFER + 50 -1). Such a buffer is usually
written and read sequentially using register indirect addressing.

6. DB: When a table of fixed data values is required, memory must


also be allocated. However, unlike the DS, each memory locations
must have a defined value that is assembled into it. The pseudo
instructions for this is define, DB and the general form is
opt. name: DB list
‘list’ refers either to one or more arithmetic or logic expressions that
evaluate to 8-bit data quantities or to strings of character enclosed in
quotes that the assembler replaces with their equivalent ASCII
representations. Assembled bytes of data are stored in successive
memory locations until the list is exhausted. Some of the assemblers
use DFB for this assembler directive.
For example,
DB 07AH
It stores 7AH in memory location right after the preceding instruction.
Another example is
DB ‘J’, ‘O’, ‘H’, ‘N’
It stores 4AH, 4FH, 48H & 4EH in the four successive memory locations
to represent the string of ASCII characters.

7. DW: Define word DW instruction is similar to define byte pseudo


instruction.
opt. name: DW list
The only difference between the DB & DW is that expression in this
define word list is evaluated to 16-bit quantity and stored as 2-bytes. It
is stored with the lower order byte in the lower of the two memory
locations and the higher order byte in the next higher location. This is
consistent with the convention for storing 16- bit quantities in 8085A
systems. Some of the assemblers use DFW for this assembler
directive.
Macros: Sometimes it is required that same set of instructions are to
be repeated again & again. One way to simplify the problem is the
use of subroutine. This increases the execution time due to overhead.
The other way is the use of macros. The assemblers which have the
capability to process macro instructions are called macro assemblers.
The assemblers are designed such that the programmer need to
write set of instruction once and then refer it many times as desired.
A macro instruction is a single instruction that the macro
assemble replaces with a group of instruction whenever it appears in
an assembly language program. The macro instruction and the
instruction that replace it are defined by the system design only once
in the program. Macros are useful when a small group of instruction
must be repeated several times in a program, with only minor or no
changes in each repetition.
The use of macro in ALP entails three groups:
1) The macro definition
2) The macro reference
3) The macro expansion.
The macro definition defines the group of instructions
equivalent to macro. Macro reference is the use of the macro
instruction as an instruction in the program. A macro expansion is the
replacement of the macro instruction defined by its equivalent. The
first two steps are carried out by the system designer and the third by
the macro assembler.
The macro definition has the following format:
LABEL CODE (Mnemonic) OPERAND
Name MACRO List
[Macro body]
ENDM
‘Name’ stands for the name of the macro that appears in the label
field of the macro definition. A list of dummy parameters may be
specified as List and, if so, these parameters also appear in the
macro body. The macro body is the sequence of assembly language
instructions that replace the macro reference into program when
assembled. The macro definition produces no object code
(hexadecimal number); it simply indicates to the assembler what
instructions are represented by the macro name.

Example-1:
Consider the use of a macro involving a large amount of indirect
addressing. An indirect addressing input capability is provided by the
two instructions:
LHLD addr
MOV r, M
This sequence can be written as a macro named LDIND; with a
macro definition of
LDIND MACRO REG, ADDR
LHLD ADDR
MOV REG, M
ENDM

To have the macro body appear at any given point in the program, it
requires a macro reference. This format is identical to that of an
assembly language instruction.
Label Code (Mnemonic) Operand
optional label name parameters list
‘Name’ is the label by which the macro is referenced or called. The
following macro instructions load register (C) indirectly through the
address PRT
LDIND C, PTR
When a program containing macro is input to a macro assembler the
assembler carries out a test substitution, the macro expansion,
substituting for each macro reference the macro body specified in the
macro definition. And for each dummy parameter list in the macro
body, the appropriate parameter from the parameter list of the macro
reference macro assembler encounter the macro instruction
LDIND C, PTR
It replaces it with the instructions
LHLD PTR
MOV C, M
Example-2:
The following macro rotation the contents of the accumulator to the
left through carry, N times. This is done with a loop that is terminated
when a register is counted down to zero. The numbers of rotation, N.
and the register to be used as the counter are the parameters in the
macro definition:
RALN MACRO N, REG
MVI REG, N
LOOP: RAL
DCR REG
JNZ LOOP
ENDM
If this macro appears in a program, a problem results as it will be
referred twice. When the macro is expanded, the label LOOP will
appear twice in the program, resulting in a multiply define symbol
error when the program is assembled. This problem is avoided by
use of the LOCAL directive; which is placed in the macro definition.
The LOCAL directive has the form:
LABEL CODE OPERAND

--- LOCAL label names


The specified label names are defined to have meaning only within
the current macro expansion. Each time the macro is referenced and
expanded; the assembler assigns each local symbol a unique symbol
in the form ‘??nnnn’. The assembler assigns ‘??0001’ to the first
symbol, ‘??0002’ to the second symbol and so on. The most recent
symbol name generated always indicates the total number of symbols
created for all macro expansions. These symbols are never
duplicated by the assembler.

You might also like