FLOATING POINT SYSTEMS, INC. # AP-120B ARRAY TRANSFORM PROCESSOR FLOATING POINT SYSTEMS, INC. ## SOFTWARE DEVELOPMENT PACKAGE MANUALS © Floating Point Systems 1976 All Rights Reserved Printed in the United States of America # SOFTWARE DEVELOPMENT PACKAGE MANUALS FPS-7292 #### This document contains four (4) manuals: 1. HOW TO PROGRAM THE AP-120B Manual - FPS-7303 March 1976 2. AP-120B APAL - Array Processor Assembly Language Manual - FPS-7275-01 Revised February 26, 1976 3. AP-120B APLINK - Array Processor Linking Loader Manual - FPS-7276-01 Revised Feburary 26, 1976 4. AP-120B DEBUG - Array Processor Debugger Manual - FPS-7277-01 Revised February 26, 1976 #### TABLE OF CONTENTS | HOW TO PROGRAM THE | AP-120B | | | |----------------------------|-------------------------------|-----------------|------------| | Section 1 -<br>Section 2 - | Meet the AP again | | 1-1<br>2-1 | | Section 3 - | Caveat Programmer | | 3-1 | | AP-120B APAL - ARRA | AY PROCESSOR ASSEMBLY LANGUAG | E MANUA | AL. | | Section 1 - | Introduction | APAL | 1_1 | | Section 2 - | | APAL | | | | Source Program Statements | APAL | | | | Operating Procedures | APAL | | | | Error Messages | APAL | | | Appendix A | | APAL | | | Appendix B | | $\mathtt{APAL}$ | B-1 | | Appendix C | | APAL | C-1 | | AP-120B APLINK - AR | RAY PROCESSOR LINKING LOADER | MANUAI | ٠ | | Section 1 - | Introduction | ADI TNIZ | 1 1 | | | (Presently Omitted) | APLINK | T-T | | Section 3 - | | APLINK | 2 1 | | Section 4 - | | APLINK | | | Appendix A | <u>~</u> | APLINK | | | Appendix B | | APLINK | | | | | | | | AP-120B APDEBUG - A | RRAY PROCESSOR DEBUGGER MANU | AL | | | Appendix A | A | PDEBUG | A-1 | #### Documentation Update Abstract: These pages reflect program changes in Software Update #1 and replace corresponding pages in the Software Development Package Manuals. Changed or newly added material is identified by a bar along the outside margin of the page. Manuals Affected: APAL #7275-01 Fe February 26, 1976 Pages: 4-1 4-1a 4-2 APLINK #7276-01 February 26, 1976 Pages: i 3-1 3-1a 3 - 4 3-5 3-5a 3-6 3-7 A-1 APDEBUG #7277-01 February 26, 1976 Pages: A-1 HOW TO PROGRAM THE AP-120B FPS-7303 FPS-7303 © FLOATING POINT SYSTEMS, INC., 1976 ALL RIGHTS RESERVED PUBLISHED IN THE UNITED STATES OF AMERICA MARCH 1976 #### TABLE OF CONTENTS | SECTION 1 | MEET | THE APAGAIN | 1-1 | |-----------|--------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------| | | 1.2<br>1.3<br>1.4 | Introduction Basic Overview Referencing Memory S-Pad Mnemonics Other Pseudo-Ops | 1-1<br>1-1<br>1-3<br>1-3<br>1-4 | | SECTION 2 | LOOPS | 3 | | | | 2.2<br>2.3<br>2.4<br>2.5<br>2.6<br>2.7<br>2.8<br>2.9<br>2.10 | A Poor Loop Determining Length of Loop Writing a Real Memory-Limited Loop Writing Intros A Dot Product Program Notation Dropping Out One Early Interaction Between Columns Changing DPA Non-Memory-Limited Loops A 1-Cycle Loop | 2-1<br>2-3<br>2-3<br>2-4<br>2-6<br>2-8<br>2-1<br>2-1<br>2-1<br>2-1<br>2-1 | | SECTION 3 | CAVE | AT PROGRAMMER (Let the Programmer Beware) | 3-1 | | | 3.1 | Calling Another Sub-routine<br>Other Things to Watch Out For | 3-1<br>3-1 | ### SECTION 1 MEET THE AP....AGAIN #### 1.1 INTRODUCTION The purpose of this manual is to illustrate the way to use the AP most efficiently, i.e., to write good loops. It assumes that the reader has already read the Processor Handbook (especially Section 3) and the Software Development Package Manual (APAL, Sections 2 and 3), and has at least a passing acquaintance with the AP instruction set. The first section presents a short review of the basic elements of the Array Processor from the programmer's point of view. The second section covers methods and techniques of writing loops. The third section consists of a set of common pitfalls to avoid. The review in this section of the basic AP instructions is not meant to be all-inclusive. It is intended to briefly cover the most-often-used things. Further details can be found in the other manuals. This manual assumes the use of the AP's 333ns interleaved memory. #### 1.2 BASIC OVERVIEW 1.2.1 Arithmetic. Both the Floating Adder and Floating Multiplier need explicit instructions (e.g., FADD and FMUL, respectively) to push their respective answers out of the pipelines. Given these "pushers", the Floating Adder result (FA) will be available 2 cycles after the original instruction, and the Floating Multiplier result (FM) will be available 3 cycles after the original instruction: | 0. | FADD DPX, DPY | "add | | 0. | FMUL DPX, DPY | "multiply | |----|---------------|--------|--------|----|------------------------------------------------|---------------| | 1. | FADD | "push | | 1. | FMUL | ''push | | 2. | DPX(1) < FA | "store | answer | 2. | FMUL | "push | | | | | | 3. | DPY(1) <fm< td=""><td>"store answer</td></fm<> | "store answer | The empty FADD and FMUL "pushers" can also be real Adder or Multiplier operations, thus producing new answers each cycle. If the "pushers" do not directly follow the original instructions, FA will come out 1 cycle after the first FADD pusher, and FM will come out 1 cycle after the second FMUL pusher. Both FA and FM will remain available for succeeding cycles until a new FA or FM is pushed out. The arguments for Adder and Multiplier instructions consist of one from column A and one from column B, (in that order): | COLU | MN A (A1 | or M1) | | COLUMN | B (A2 | or M2) | |------|-----------|--------|-----|--------|-----------|--------| | | FM | | | • | FA | | | | TM<br>DPX | 17 | | | MD<br>DPX | | | | DPY | | 1_1 | | DPY | | The Adder has additional arguments of ZERO and NC (no change), which can be used in either or both columns. - 1.2.2 Main Data Memory. Reading from memory requires one of the following instructions: SETMA, INCMA, DECMA, or LDMA. In practice, it is usually done by the SETMA instruction. The result, MD, comes out three cycles later and is also available for succeeding cycles until a new MD comes out. No "pushers" are needed. Writing into memory requires one of the above instructions plus MI<source, where source is FA, FM, or DB. This goes on the same line as SETMA, and gets done in that cycle. Memory can be referenced every 2 cycles, for either a read or write. - 1.2.3 Table Memory. Table memory is usually referenced by the SETTMA or LDTMA instruction. Two cycles later, TM is available and remains so until 2 cycles after the next instruction affecting TMA. Such instructions can occur in every cycle, producing a new TM every cycle. - 1.2.4 Data Pad. DPX and DPY each contain 32 registers, 8 of which are accessible from any given DPA. That is, one can reference DPX from DPX(DPA-4) to DPX(DPA+3), and similarly for DPY. The Data Pad Bus is usually used to store data from memory or from one Data Pad register into another, or to utilize a value, e.g., in conjunction with a load operation: DPX(1) < DB; DB = DPY(-2) (This can be shortened to DPX(1)<DPY(-2).) DPX<D3: DB=MD (Or DPX<MD) LDDPA; DB=3 (This sets DPA=3) Storing into Data Pad from FA or FM does not use the Data Pad Bus. This is important, as it leaves DB free for other uses. 1.2.5 S-Pad. S-Pad registers are usually used as address pointers or counters, and thus to pass parameters to a program. An S-Pad operation must accompany a SETMA (or SETDPA, SETTMA, etc.) instruction. An S-Pad operation must also precede a conditional branch (BGT, BNE, etc.) by one cycle. That is, conditional branches are based on the S-Pad Function (SPFN) of the S-Pad operation in the previous cycle. The fastest way to get an integer into S-Pad is to use the LDSPI instruction: LDSPI COUNT; DB=5 This puts 5 into an S-Pad register called COUNT. The value is assumed to be octal unless a decimal point is added. DB=15. (note point) is equivalent to DB=17 (octal), or to DB=ØFX (hex). Hexadecimal numbers must start with a numeric digit and end with "X". Although the Floating Adder operation FSUB A1,A2 will do A1-A2, the S-Pad operation SUB subtracts in the opposite direction, i.e., SUB PIECE, TOTAL will do: (contents of S-Pad TOTAL) minus (contents of S-Pad PIECE). #### 1.3 REFERENCING MEMORY In order to read something out of memory, or write into it, the location in memory where this will occur must be provided. The SETMA instruction gets this necessary information from the S-Pad Function (SPFN) of the same cycle. Therefore, one needs to construct an S-Pad operation which will result in a pointer to the appropriate memory location. Generally this takes the form of adding increments to pointers. For example, if there was a 4-element vector in memory locations 100, 102, 104, 106, one would need an S-Pad register (say, APTR) containing the base address (100), and another S-Pad register (AINC) containing the increment between elements (2). Then, if one wanted to read the element in location 102, the appropriate instruction would be ADD AINC, APTR; SETMA. Now APTR would contain 102. If one wrote another ADD AINC, APTR; SETMA the contents of memory location 104 would be read. Consider the following instruction: MOV APTR, APTR. This doesn't seem to accomplish much, but in the light of the above discussion, it can be seen that its SPFN could be useful for a SETMA. This is how one would get the first element of a vector. All of the above is correspondingly true for writing into memory. #### 1.4 S-PAD MNEMONICS S-Pad names such as APTR, AINC, N are really only temporary names for the 16 S-Pad registers. A statement such as DEC N will not mean anything to the assembler unless the program has equated the mnemonic "N" with a specific S-Pad register, such as S-Pad O. This is done by the following assembler pseudo-op: N \$EQU O. All S-Pad names used in a program must be declared in this manner before using them in an instruction. Thus, programs generally begin with lists like: APTR \$EQU 0 AINC \$EQU 1 BPTR \$EQU 2 BINC \$EQU 3 N \$EQU 4 These S-Pad numbers should not be confused with the contents of the S-Pads. ADD BINC, BPTR would not add 3 to 2 (using the above list), but would add the contents of S-Pad 3 to the contents of S-pad 2. There can be more than one name for an S-Pad register. If you had 2 different vectors, A and B, and wished to use the mnemonics AINC and BINC for their increments, you could use the same S-Pad register if the increment for both is the same in all cases, by declaring: AINC \$EQU 1 BINC \$EQU 1 #### 1.5 OTHER PSEUDO-OPS Besides the \$EQU pseudo-op, the typical program includes \$TITLE and \$ENTRY pseudo-ops at the very beginning, and an \$END at the very end. A basic program with one loop would have the following form: | | | S-Pad | mnem | onic | \$TITLE<br>\$ENTRY<br>\$EQU | name<br>name<br>Ø | •<br>• | | | |-------|-------|-------|------|------|-----------------------------|-------------------|--------|-------|----| | | | | • | | • | 1 | | | 1 | | | | | • | | • . | 2 | | | | | | | | • | | • | | | | | | : | | | • | | | • | | | | | name: | (code | | | | to loor | | itiali | zatio | ns | | loop: | (code | ) | | | | | | | | | | 11 | | | | | | | | • | | | 11 | | | | | | | | | \$END See the software manual for explanations of these pseudo-ops. #### SECTION 2 LOOPS #### 2.1 A POOR LOOP The loop is where the potential of the AP comes into full bloom. For example, one way (lengthy but workable) to write a dot product program is as follows: Given: Vectors A and B in Main Data memory, with elements of each vector in equally spaced locations in memory (e.g. even-numbered locations). Produce: $\sum_{m=1}^{N} A_m \cdot B_m$ Parameters passed in S-Pad: | S-Pad Name | Contains: | |--------------|---------------------------------------------------| | APTR<br>BPTR | base address of vector A base address of vector B | | XINC | increment (number of locations from one | | 4. | element to the next) (same for both vectors) | | N | number of elements in each vector | | CPTR | address of answer | DOTPROD: SUB XINC, APTR (\*see below) SUB XINC, BPTR (\*see below) FADD ZERO, ZERO "initialize FA=0 FADD LOOP: ADD XINC, APTR; SETMA "get mth element of vector A NOP " from memory NOP DPX<MD "MD=Am, store into DPX ADD XINC, BPTR; SETMA "get mth element of vector B NOP NOP FMUL DPX, MD "MD= $B_m$ , do $A_m$ . $B_m$ FMUL FMUL FADD FM, FA "add product to sum of products FADD - DEC N "decrement counter BGT LOOP "branch back if not done yet "(i.e. if N>0) DONE: MOV CPTR, CPTR; SETMA; MI<FA "otherwise, store answer \*This is so that the first time through the loop, ADD XINC, APTR and ADD XINC, BPTR will not move the pointer to the second element, passing up the first altogether. To begin with, this program can certainly be shortened by combining instructions and overlapping memory fetches. Thus: DOTPROD: FADD ZERO, ZERO; SUB XINC, APTR FADD; SUB XINC, BPTR LOOP: ADD XINC, APTR; SETMA "get Am NOP ADD XINC, BPTR; SETMA "get Bm DPX<MD "store Am in DPX NOP FMUL DPX, MD "do $A_m.B_m$ FMUL FMUL FADD FM, FA; DEC N "add prod to sum of products " and decrement counter FADD; BGT LOOP "test if done. If not, branch to LOOP DONE: MOV CPTR, CPTR; SETMA; MI < FA "if so, store answer Note the extra FMUL's and FADD's, described as "pushers." These push the answers through the pipelines, so that FM and FA will contain what they are intended to contain. This is pointed out because the beginning AP programmer is likely to forget to put "pushers" in his code. Now the loop of the evolving dot product program is 10 cycles long. This means that each new pair of elements costs 10 more cycles. Although better than the initial example, which had a 14-cycle loop, this can actually be cut down to a mere 4 cycles! #### 2.2 DETERMINING LENGTH OF LOOP One might suppose that the length of a program loop depends on what one is trying to do. This is true, but not in the way one would think. The AP programmer decides ahead of time how many cycles his loop should contain, and then fits everything into that framework. How does he pick the magic number? Most commonly, loops are memory-limited. Recall that one can reference memory (to read or to store) every 2 cycles. If one has 2 memory references to do (e.g., "get A" and "get B"), then the loop will be at least 4 cycles long (2 per memory reference). And, unless one has more than 4 different FMUL's, 4 different FADD's, or 4 different S-Pad operations to do, the loop should be, at most, 4 cycles. A lot can be done in 4 cycles when one can do a Floating Multiplier operation, a Floating Adder operation, an S-Pad operation, a branch, a memory reference, a Data Pad Bus transfer, etc., in each cycle. #### 2.3 WRITING A REAL MEMORY-LIMITED LOOP Before continuing with the transformation of the dot product program, another example will be utilized. Given: Vectors A and B in Main Data memory, length=N elements Produce: Vector C (in memory), where $C_m = A_m^2 + B_m$ for m=1 to N | Parameters: | S-Pad Name | Contains | |-------------|------------------------------|-----------------------------------------------------------------------------------------------| | | APTR<br>BPTR<br>CPTR<br>XINC | base address of A base address of B base address of C increment (same for all vectors in this | | | N | example) number of elements | Note that there should be 3 memory references in the loop: "get A", "get B", and "store C". (Unlike the dot product which accumulated a running sum in the Adder, this program needs to store an answer after each set of computations. For the dot product, storing was not a repeated process, and hence not included in its loop). Three memory references, one every other cycle, means the loop would be 6 cycles long. It would start like this: - 1) --- (nothing here, but count a cycle) - 2) ADD XINC, APTR; SETMA "get A - 3) --- - 4) ADD XINC, BPTR; SETMA "get B - 5) DPX < MD "store A in DPX - 6) FMUL DPX, MD ''do A\*A (The reason for starting on the second line will be explained later.) Now it has run out of cycles, but there is still more to do, so it starts back up at the first cycle, which is where the end will branch to, when it gets around to testing if it's done. | | | | | | 1-A | |-------|----|-----------------------------------------------------------------------------------------------------------------------------------------|-------------|---------|---------------------------------------------------------------------------------------------------------------| | LOOP: | 1) | | | ı | FMUL "B is available here, but "not needed yet | | | 2) | ADD 2 | KINC, APTR; | SETMA | FMUL | | | 3) | | | , | FADD FM, MD "add B to A2 | | | 4) | ADD 3 | KINC, BPTR; | SETMA / | FADD | | | 5) | DPX <n< td=""><td><b>I</b>D</td><td>, , ,</td><td>DEC N "answer is available here "but can't reference mem." "yet to store it</td></n<> | <b>I</b> D | , , , | DEC N "answer is available here "but can't reference mem." "yet to store it | | | 6) | FMUL | DPX,MD | | ADD XINC, CPTR; SETMA; MI <fa; "store="" "test="" and="" answer="" bgt="" done<="" if="" loop="" td=""></fa;> | This is the entire loop. In its proper form, taking out lines and adding semicolons, it looks like this: LOOP: FMUL ADD XINC, APTR; SETMA; FMUL FADD FM, MD ADD XINC, BPTR: SETMA: FADD DPX<MD; DEC N FMUL DPX, MD; ADD XINC, CPTR; SETMA; MI < FA; BGT LOOP #### 2.4 WRITING INTROS Notice, however, that if the program goes right into this loop, after initial overhead such as SUB XINC, APTR SUB XINC, BPTR SUB XINC, CPTR it picks up the first element of A and B as it's supposed to, but it also stores something into C before it's ready to, and decrements the counter too early. It goes through both columns at the same time. What is desired, however, is that computations in the second column continue from the first column. The only way it can do this is to continue from what the first column did in the previous time through the loop. And the first time, there was no previous time. Hence the need for additional microcode before getting into the loop. Exactly what needs to go before the loop? In order for the second column of the loop to be doing what it's supposed to when the program gets to it, the first column must precede it. Essentially, one rewrites the first column as an "intro" to the loop. Thus: PROGRAM: MOV APTR, APTR; SETMA "get first element "of A SUB XINC, CPTR "to offset ADD in loop MOV BPTR, BPTR; SETMA "get first element "of B DPX<MD "store A(1) in DPX FMUL DPX, MD "de A(1) $^2$ LOOP: FMUL ADD XINC, APTR; SETMA; FMUL "get A(m+1) FADD FM, MD "do $A(m)^2 + B(m)$ ADD XINC, BPTR; SETMA; FADD "get B(m+1) DPX<MD; DEC N "store A(m+1) FMUL DPX, MD; ADD XINC, CPTR; SETMA; MI<FA; BGT\_LOOP "do $A(m+1)^2$ , store "C(m), test if done DONE: RETURN To clear up a loose end regarding the structure of memory-limited loops, one might notice that since the branch must be in the last cycle, the DEC N instruction must be in the second-to-last cycle. DEC is an S-Pad operation and cannot be in the same cycle as another S-Pad operation, such as ADD XINC, XPTR. A memory-limited loop has SETMA's (requiring S-Pad operations) on every other line. Since the DEC N operation will go on an odd-numbered line of the loop, the SETMA's must go on even-numbered lines. This is why the first thing to do, ADD XINC, APTR; SETMA (see section 2.3), was put on line 2. #### 2.5 A DOT PRODUCT PROGRAM It is now possible to write the 4-cycle dot product. Using the technique outlined above, the loop should be constructed as follows: | $\overline{1)}$ | | · | , | | |-----------------|-----|-------------|-------|--------| | 2) | ADD | XINC, APTR; | SETMA | "get A | | 3) | | | · . | | | 4) | ADD | XINC, BPTR; | SETMA | "get B | then | 1) | | | 1 | DPX <md< th=""><th>"store A</th></md<> | "store A | |----|-----|-------------|---------|----------------------------------------|----------| | 2) | ADD | XINC, APTR; | SETMA , | | | | 3) | | | 1. | FMUL DPX, MD | ''do A·B | | 4) | ADD | XINC, BPTR; | SETMA / | FMUL | | then | | | | | , ~ <i>y</i> | / <del>'</del> | | |----|-----|-------------|-------|---------------------------------------------|-----------------|-------------------------------| | 1) | | | | DPX <md< td=""><td>FMUL</td><td></td></md<> | FMUL | | | 2) | ADD | XINC, APTR; | SETMA | 1 | FADD FM, FA | "add A B to sum of "products" | | 3) | | | | FMUL DP | X,MD, FADD; DEC | N "decrement counter | | 4) | ADD | XINC, BPTR; | SETMA | FMUL | BGT LOOP | "test if done | | | | | | <b>,</b> | - | | The intro to this 3-column loop will consist of the first column alone, then the first and second column together. Other overhead, such as initializing FA to 0, can be mixed in with the intro. To generalize, an N-column loop would require an intro consisting of column 1 followed by columns 1 and 2 together, followed by columns 1, 2, and 3 together... followed by columns 1, 2,..,N-1 together. \$TITLE DOTPROD \$ENTRY DOTPROD APTR \$EQU 0 BPTR \$EQU 1 CPTR \$EQU 2 XINC \$EQU 3 N SEQU 4 DOTPROD: MOV APTR, APTR; SETMA; FADD ZERO, ZERO "get A(1) and "initialize FA=0 MOV BPTR, BPTR; SETMA; FADD "get B(1) DPX<MD "store A(1) ADD XINC, APTR; SETMA "get A(2) FMUL DPX, MD "do A(1)\*B(1) ADD XINC, BPTR; SETMA; FMUL "get B(2) LOOP: DPX<MD; FMUL "store A(m+1) "'A(m)B(m) to sum FMUL DPX, MD; FADD; DEC N "do A(m+1)B(m+1)" "decrement counter" ADD XINC, BPTR; SETMA; FMUL; BGT LOOP 'get B(m+2), test if "done H: C "get $\Lambda(m+2)$ , add DONE: MOV CPTR, CPTR; SETMA; MI < FA; RETURN ADD XINC, APTR; SETMA; FADD FM, FA "if so, store answer CEND Now each new pair of elements will only cost 4 more cycles, because every 4 cycles a new pair are being fetched; every 4 cycles another product is added to the sum. The longer overhead is no disadvantage as it is only done once, and even if the program was called with N containing 1, making the streamlined loop unnecessary, it takes no longer than the unstreamlined program. Note that there are 2 SETMA's in a row at the beginning and again at the end of the program. This will not cause any problems except to make memory spin, which is the memory's way of putting in the NOP's the programmer leaves out. The timing is still the same, and this way there are 2 less locations of Program Source used up. It might be mentioned that if one were getting Vectors A and B out of Data Pad instead of memory, the dot product could be written with a 1-cycle loop! This will be demonstrated later. #### 2.6 NOTATION A few words about notation are in order. The "---" used when writing loops in column form simply denotes a blank spot, indicating a cycle goes by while awaiting the results of a memory fetch or while looking for a more propitious spot to use the results of the Adder or Multiplier, etc. Normally, something else will eventually go on the same line, in a different column. Example: This takes vector A, multiplies it by a constant in DPX, and stores it in vector B. | 1) | | | | FMUL DPX,MD | | ··········· | | | |----|-----------------|-------|----|-----------------|--------|-------------|-----|------| | 2) | ADD XINC, APTR; | SETMA | 1. | FMUL | | | | | | 3) | | | 1 | FMUL; DEC N | | | | | | 4) | | | | ADD XINC, BPTR; | SETMA; | MI < FM; | BGT | LOOP | Since the length of the loop was already decided by the number of SETMA's, these blank spots cause no harm to the speed. It is the number of cycles in the loop, not the number of columns, which determines speed. Extra columns simply mean longer intros, which the program only goes through once anyway unless it's part of a nested loop. In loops with several Adder or Multiplier operations, it often happens that one such instruction will be a "pusher" for another in another column. | 1) | (code) | FMUL DPY, MD | (code) | |-----------|--------------------------------------------------|--------------|----------------------------------------| | 2) | " FMUL DPX, DPY | FMUL | 11 | | 3) | " FMUL | FMUL | 3 T | | 3) | " FMUL | DPY < FM | 11 | | 5) | '' DPX(1) <fm< td=""><td></td><td>• •</td></fm<> | | • • | | <u>6)</u> | " | | ************************************** | In column 2, lines 2 and 3 are illegal, as those lines already contain FMUL's (which will do the pushing for column 2 as well as column 1). However it may be advantageous to the programmer to note to himself somehow that FMUL's do belong there, in case things in the first column get moved around for some reason. This is the purpose of such notation as (fmul) or (fadd). Thus: | 1) | (code) | FMUL DPY, MD | (code) | |----|------------------------------------------------|----------------------------------|--------| | 2) | " FMUL DPX, DPY | (fmul) | 11 (1) | | 3) | " FMUL | (fmul) | 11 | | 4) | " FMUL | DPY <fm< td=""><td>11</td></fm<> | 11 | | 5) | " DPX(1) <fm< td=""><td></td><td>11</td></fm<> | | 11 | | 6) | tt - | | 11 | Now, if pieces of the first column were moved down a couple of lines for some reason, | 1) | (code) | FMUL DPY, MD | (code) DPX(1) < FM | |----|-----------------|--------------|--------------------| | 2) | 11 | (fmul) | ** | | 3) | | (fmul) | tt | | 4) | " FMUL DPX, DPY | DPY < FM | , 11 | | 5) | '' FMUL | | . 11 | | 6) | '' FMUL | | · · · · · · | the programmer would be reminded to put real FMUL's back on those lines. When writing loops with a small number of cycles, these reminders can also help one keep track of the columns, as in: | | - | | *. | | | |-----------------|-------|------------------------------------------------------------------------|-------------|--------------------|--| | *** | 7 | DPY <md< td=""><td>FMUL</td><td>FADD FM, FA; DEC N</td><td></td></md<> | FMUL | FADD FM, FA; DEC N | | | ADD XINC, APTR; | SETMA | FMUL DPY, | , MD (fmul) | FADD; BGT LOOP | | This gets a vector from memory, squares each element and adds the squares together (sort of a dot product between vector A and itself). The seemingly empty columns, which disappear when the loop is written in proper form (see below), are necessary in order to write the intro properly. If one left out the second column, for example, his intro would start with: MOV APTR, APTR; SETMA DPY < MD ADD XINC, APTR; SETMA; FMUL DPY, MD Clearly, the first MD will not be the first element fetched. By the time it gets down to FADD FM, FA in the loop, something which doesn't belong will be added in. This is what the intro and loop should look like: MOV APTR, APTR; SETMA FADD ZERÓ, ZERÓ "initialize FA=0 ADD XINC, APTR; SETMA; FADD DPY<MD ADD XINC, APTR; SETMA; FMUL DPY, MD DPY<MD; FMUL ADD XINC, APTR; SETMA; FMUL DPY, MD LOOP: DPY<MD; FMUL; FADD FM, FA; DEC N ADD XINC, APTR; SETMA; FMUL DPY, MD; FADD; BGT LOOP (answer) < FA #### 2.7 DROPPING OUT ONE EARLY ``` 1) (code) (code) 2) ADD XINC, APTR; SETMA 11 11 11 3) (code) 11 4) ADD XINC, BPTR; SETMA (code) 5) 11 6) ADD XINC, CPTR; SETMA 11 111 7) DEC N (code) ADD XINC, DPTR; SETMA; 8) MI < DPX; BGT LOOP ``` Here, there are 2 memory reads in the first column, one read in the second column, and a store in the last column. When writing the intro, the pointers should be taken care of as follows: If the memory reference in the second column of the loop was a store instead of a read, the problem would become more complicated. By the time the counter went down to zero and the last result was stored at DPTR, an extra C would have been stored, possibly over a valuable piece of data, such as the beginning of vector D. Or if instead of ADD XINC, CPTR; SETMA; MI<DPY in the second column, we had DPY<FA (where FA is cumulative, as in the dot product) and later stored DPY into CPTR after getting out of the loop, an extra FA would have been computed and DPY would contain an incorrect answer. In this case, it would be wise to drop out of the loop one time early. One would put an extra DEC N somewhere in the intro, so that the loop would be done N-l times. Then, after the loop, write just the last column (not including DEC and the branch, of course), which is all that remains to be done from the loop anyway. Example: This does a dot product of vectors A and B, and also outputs the square of each updated sum into vector D. | | | | | | 4 | |------|-------------|-------|--------------------------------------------------------------------------------------|-----|-----------------------------------------------| | | | | FMUL DPX, MD | | · | | ADD | XINC, APTR; | SETMA | (fmul) | 1. | FMUL DPY, DPY | | | | | FMUL | • • | (fmul) | | ADD | XINC, BPTR; | SETMA | FADD FM, FA | 1 | FMUL | | DPX< | < MD | , | FADD | | DEC N | | | , | | DPY <fa< td=""><td>,</td><td>ADD XINC, DPTR; SETMA; MI<fm;< td=""></fm;<></td></fa<> | , | ADD XINC, DPTR; SETMA; MI <fm;< td=""></fm;<> | | | | | | | BGT LOOP | When it is going through the loop for the last time and storing the very last thing in D (column 3), it is also simultaneously doing extra executions of columns 1 and 2. Normally, that doesn't matter, but in this case, something extra is being added to the cumulative sum of the dot product (column 2), which was completed the previous time through the loop. By dropping out of the loop before its last time around, this error is avoided: | MOV APTR, APTR; | ; SETMA | | | | |------------------------------------------------------------------------|----------|--------------------------------------------------------------------------------------|-----------------|-------------------------------| | DEC N | | "to cause drop | ping out early | | | MOV BPTR, BPTR; | ; SETMA | · · · · · · · · · · · · · · · · · · · | | | | DPX <md td="" ·<=""><td></td><td></td><td></td><td>•</td></md> | | | | • | | SUB XINC, DPTR | | "to nullify th | e first ADD XIN | C, DPTR | | | | FMUL DPX,MD | | | | ADD XINC, APTR; | ; SETMA; | FMUL | | | | | • | FMUL | | | | ADD XINC, BPTR; | SETMA: | FADD FM, FA | | | | DPX <md;< td=""><td></td><td>FADD</td><td>•</td><td></td></md;<> | | FADD | • | | | , | • | DPY <fa< td=""><td></td><td></td></fa<> | | | | LOOP: | | FMUL DPX, MD | | | | ADD XINC, APTR; | : SETMA | | FMUL DPY, DPY | | | , | • | FMUL | | | | ADD XINC, BPTR; | SETMA: | FADD FM, FA; | ${ t FMUL}$ | | | DPX <md;< td=""><td>•</td><td>FADD:</td><td>DEC N</td><td></td></md;<> | • | FADD: | DEC N | | | , | | DPY <fa;< td=""><td>ADD XINC, DPTR;</td><td>SETMA: MI<fm:< td=""></fm:<></td></fa;<> | ADD XINC, DPTR; | SETMA: MI <fm:< td=""></fm:<> | | | | , | BGT LOOP | ,, | | OUT: | | | FMUL DPY, DPY | | | | | MOV CPTR, CPTR; | • | FMUL | | | | ,,,,,,,,,,,,,,,,,, | FMUL | | | | | | ADD XINC, DPTR; | SETMA: MI <fm:< td=""></fm:<> | | | • | | RETURN | | | | | | | | Notice that the (fmul) in column 2 became a real FMUL in the intro. OUT starts just the last column. The next line stores the completed dot product. One might wish to come out one early even if one doesn't strictly need to, if the loop is long and there are only a couple of lines in the last column: | 1) | (code) | | (code) | |-----|--------|-----------------|-------------------| | 2) | '' S | ETMA | Tr. | | 3) | ** | | 11 | | 4) | tt 🕟 | | " SETMA; MI < DPX | | 5) | 11 | | | | 6) | '' S | $\mathtt{ETMA}$ | • | | 7) | 11 | | | | 8) | '' S. | ETMA | | | 9) | | | | | 10) | '' S | ETMA | • | | 11) | . 11 . | | DEC N | | 12) | | ETMA | BGT LOOP | In this case, coming out of the loop one time early and adding on the last 4 lines afterward would save going through 8 cycles for nothing. #### 2.8 INTERACTION BETWEEN COLUMNS In order to fit things into complicated loops without creating op-code conflicts, the AP programmer takes advantage of results (e.g. MD, FA) which are the same for one or more cycles after it is first available. Sometimes he will purposely delay the pushing of an answer through a pipeline by leaving out "pushers". But he must be careful of the way the columns interact with each other within the loop. | 1) | FMUL DPX, DPY | • | |----|---------------|--------------------------| | 2) | • | FMUL DPY(3), DPX(2) | | 3) | • | FMUL | | 4) | FMUL | • | | 5) | FMUL | DPY(1) <fm< td=""></fm<> | | 6) | DPX(1) < FM | <u> </u> | The FMUL's in column 2 will act as "pushers" for the FMUL DPX,DPY in column 1, whose answer will come out on line 4 instead of line 6 as desired and will disappear forever when replaced by a new FM on line 5. Notice the FMUL on line 4 in column 1 acts as a pusher for column 2, which was planned for. #### Another example: | 1) | (code) | | DPX <md< th=""><th>(code)</th></md<> | (code) | |----|-----------------|-------|--------------------------------------|-----------------------| | 2) | ADD XINC, APTR; | SETMA | (code) | DPX <fa< td=""></fa<> | | 3) | (code) | | FADD FM, DPX | (code) | | 4) | ff | | (code) | | The DPX of column 2 line 3 will not be the same as what was stored into it in column 2, line 1. It will be FA from column 3, line 2. #### 2.9 CHANGING DPA Because one can access things in Data Pad much faster than things in memory, it makes sense to store things from memory into Data Pad if they will be used again. For example, if one is going to use an N-element vector for several different computations, one could store it in DPX()), DPX(1),...,DPX(N-1). Because the Data Pad indices can only be accessed from -4 to +3 with a static DPA, it becomes useful to leave the index alone and change DPA. Storing vector A in DPX is basically the repeated operation of DPX<MD; INCDPA. If DPA is initially set to zero, then the first element will be stored into DPX(0). INCDPA will increase DPA for the <a href="mailto:next">next</a> instruction. Thus: DPX<MD; INCDPA "refers to DPX(0) DPX<MD "refers to DPX(1) The ways to set DPA to zero: CLR# (S-Pad name); SETDPA "uses up S-Pad field or DB=ZERO; LDDPA "uses up Adder field This loop will read a vector from memory into Data Pad X: | | | | _ | | | |-----------------|-------|----------------------------------------------------------------|---------|-------|--| | | | DPX <md;< th=""><th>INCDPA;</th><th>DEC N</th><th></th></md;<> | INCDPA; | DEC N | | | ADD XINC, APTR; | SETMA | BGT LOOF | ) | 100 | | #### With intro: MOV APTR, APTR; SETMA CLR# APTR; SETDPA ADD XINC, APTR; SETMA LOOP: DPX <MD; INCDPA; DEC N ADD XINC, APTR; SETMA; BGT LOOP #### 2.10 NON-MEMORY-LIMITED LOOPS A non-memory-limited loop is a loop in which 2 times the number of memory references is less than the number of same-op-code-field operations required. For example, if there are 5 Floating Adder operations to be done (FADD, FSUB, FSUBR, etc.) but only 2 memory references (a fetch and a store), the 5 Adder operations cannot fit into 4 cycles. Incidentally, "pushers" don't count in figuring out how many cycles are needed. In a 5-cycle loop with 5 different Adder operations, the Adder instructions become each other's pushers.) Recall that in memory-limited loops, the first instruction in column 1 usually starts on line 2, to avoid S-Pad conflicts on the next-to-last line. (See last paragraph of Section 2.4). This is not necessary in non-memory-limited loops. The following loop will test whether each element of a vector in DPY is within the range between a maximum limit and minimum limit. If so, the element is added to a cumulative sum. The maximum limit is conveniently located in MD, and the minimum limit in FM, by the grace of whatever program uses this loop. Neither FM nor MD change during this loop's execution. | <u> </u> | | | |---------------|-----------------------|------------------------------------| | FSUB DPY, MD | BFGT BIGGER | (fadd) | | FSUB FM, DPY | BFGT SMALL | DPX <fa; dec="" n<="" td=""></fa;> | | (fadd) INCDPA | FADD $DPY(-1)$ , $DP$ | X BGT LOOP | Note that the BFGT instruction tests FA of the previous cycle. #### 2.11 A 1-CYCLE LOOP For the 1-cycle dot product, it is assumed that the vectors are already in Data Pad, starting at DPX(0) and DPY(0) (where DPA=0). Obviously, vectors longer than 32 elements cannot be handled this way (or can only be handled in segments of 32 or less). This is what the loop really looks like: | FMUL DPX, DPY; | INCDPA | (fmul) | (fmul) | FADD | FM, FA; | DEC | N (fadd) | BGT LOOP | |----------------|--------|--------|--------|------|---------|-----|------------|----------| | | | , | | | | | 1 | | | | | | | | | | | | | | | | | | | | | | The FMUL and FADD instructions become their own "pushers". #### \$TITLE DOTPROD \$ENTRY DOTPROD N \$EQU O "number of elements in each vector CPTR SEQU 1 "where to store answer DOTPROD: CLR# N; SETDPA "DPA=O FMUL DPX, DPY; "do A(1)\*B(1) INCDPA; "DPA to 1 DEC N "set drop out early FMUL DPX, DPY; "do A(2)\*B(2)INCDPA "DPA to 2 FMUL DPX, DPY; "do A(3)\*B(3)INCDPA; "DPA to 3 FADD ZERO, ZERO "init. FA=0 "do A(4)\*B(4)FMUL DPX, DPY; INCDPA; "DPA to 4 FADD FM, ZERO; ''A(1)B(1) in Adder DEC N "decrement counter LOOP: FMUL DPX, DPY; "do A(m)\*B(m)INCDPA: "DPA to DPA+1 FADD FM, FA; "add A(m-3)B(m-3) to sum DEC N; "decrement counter BGT LOOP "test if done OUT: DPX<FA; FADD "store cumulative FA FADD DPX, FA "add it to other cumulative FA FADD MOV CPTR, CPTR; SETMA; MI<FA; "store answer SEND This particular sort of loop has a problem with the Floating Adder, in that a cumulative FA needs at least 2 cycles to accumulate each new addition. Hence, the 1-cycle loop is actually operating with 2 mutually exclusive cumulative FA's, interwoven with each other: FADD FM, FA FADD FM, FA FADD FM, FA FADD FM, FA FADD FM, FA At the end of all this, they (the two strings of sums) need to be added to each other. (See OUT, the label after LOOP). This also illustrates the practice of dropping out of the loop one time early. If it didn't drop out early, the last (unneeded) FADD FM, FA of the loop would push out one of the 2 cumulative FA's. By the next cycle it would be gone forever. By dropping out early, DPX<FA can be done before it's too late. This line of reasoning can eventually lead one to the idea that the last column of the loop (see beginning of Section 2.11) is unnecessary, since there is no way for the Adder result to come out in time for the next FADD FM, FA. The FADD FM, FA of each of the two strings of cumulative FA's will push out the other string. So the loop need only be of the form: FMUL DPX, DPY; INCDPA (fmul) (fmul) DEC N FADD FM, FA; BGT LOOP This is one column less than before, which means that there will be one column's worth (in this case, one line) less to put in the intro. It will also not be necessary to come out of the loop one time early, as there is no extra FADD FM, FA to push away something needed. It is still necessary to add the 2 cumulative FA's together at the end. "DPA=0 "A(1)\*B(1) " DPA to 1 ''A(2)\*B(2) " DPA to 2 ''A(3)\*B(3) ''A(m)\*B(m) DPA to DPA+1 test if done decrement counter "store first cumulative FA "add it to other cumulative FA " DPA to 3 " initialize cum. FA=0 initialize other cum. FA=0 add A(m-3)B(m-3) to cum. FA STITLE DOTPROD SENTRY DOTPROD N \$EQU 0 CPTR \$EQU 1 DOTPROD: CLR# N; SETDPA FMUL DPX, DPY; INCDPA; FADD ZERO, ZERO FMUL DPX, DPY; INCDPA: FADD ZERO, ZERO FMUL DPX, DPY; INCDPA; DEC N LOOP: FMUL DPX, DPY; INCDPA; DEC N: FADD FM, FA; BGT LOOP OUT: DPX<FA; FADD FADD DPX, FA FADD MOV CPTR, CPTR; SETMA; MI<FA; "store answer RETURN 3END ### SECTION 3 CAVEAT PROGRAMMER (Let the Programmer Beware) #### 3.1 CALLING ANOTHER SUB-ROUTINE The JSR instruction allows one program to utilize another program, for example the divide sub-routine (DIV). In order to do this, one must declare DIV external (\$EXT DIV) so that the assembler and linker will know what to do with the otherwise undefined symbol. One must also save everything he will need when program execution gets back to his main program. Depending upon what was used in the called sub-routine, some things may remain untouched. Commonly one should not count on being able to leave things in the Adder or Multiplier. Parts of Data Pad may also be changed, or DPA may change. S-Pad will probably not remain inviolate. (Remember, it's the S-Pad register number, not name, which is important.) These things need to be checked before doing a JSR. #### 3.2 OTHER THINGS TO WATCH OUT FOR The rest of this section consists of various short examples, cautions, and reminders. DPX < MD; DPY < DPX(1)</pre> Illegal. Data Bus is assigned twice. (The above is really DPX<DB; DB=MD; DPY<DB; DB=DPX(1).)</pre> DPX<MD; DPY<MD is legal. (DB=MD; DPX<DB; DPY<DB)</pre> DPX<FA; DPY<FM</pre> Legal. FA and FM don't use the Data Bus. DPX<FA; DPY<FM; DPX(1)<MD</pre> Illegal. Data Pad X is being written into twice (different indices). Within each cycle, there should be no more than one of each of the following: write into DPX write into DPY read from DPX read from DPY The exception is when reading out of a Data Pad more than once but using the same index: FADD DPX, FA; FMUL DPX, FA; DPY(1) < DPX is legal. FADD DPY, DPY; FMUL DPY, DPY is legal. FADD DPX, FA: FMUL DPX(1), FA is not legal. FADD DPX, DPY; DPY<MD The old value of DPY, before MD replaces it, is used in the sum. DB=4; LDSPI XINC; LDDPA; DPX(2)<FM Both DPA and the contents of XINC will become 4, but the previous DPA is used in referencing DPX(2). SUB# XINC,APTR; BGT OUT Illegal. The # uses the condition field (branch). THE \$END # AP-120B APAL ARRAY PROCESSOR ASSEMBLY LANGUAGE MANUAL 7275-01 FPS-7275-01 C FLOATING POINT SYSTEMS, INC. 1976 ALL RIGHTS RESERVED PRINTED IN THE UNITED STATES OF AMERICA REVISION 01, FEBRUARY 26, 1976 #### TABLE OF CONTENTS | Section 1 - INTRODUCTION | 1-1 | |------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------| | Section 2 - BASIC SYNTAX 2.1 Character Set 2.2 Symbol Names 2.3 Numeric Constants 2.4 Expressions | 2-1<br>2-1<br>2-1<br>2-1<br>2-2 | | Section 3 - SOURCE PROGRAM STATEMENTS 3.1 Comment Statements 3.2 Instruction Statements 3.3 Pseudo Operations Statements 3.4 Order of Program Statements 3.5 A Sample Program | 3-1<br>3-1<br>3-1<br>3-3<br>3-5<br>3-6 | | Section 4 - OPERATING PROCEDURES 4.1 Using APAL 4.2 Listing Format 4.3 A Sample Assembly listing | 4-1<br>4-1<br>4-2<br>4-4 | | Section 5 - ERROR MESSAGES | 5-1 | | Appendix A | A-1 | | Appendix B B.1 S-Pad Op-code Group B.2 Memory Address Op-code Group B.3 Table Memory Address Op-code Group B.4 Data Pad Address Op-code Group B.5 Branch Op-code Group B.6 Floating Adder Op-code Group B.7 Floating Point Multiply Op-code Group B.8 Data Pad X Op-code Group B.9 Data Pad Y Op-code Group B.10 Memory Input Op-code Group B.11 Data Pad Bus Op-code Group B.12 Special Operation Op-code Group B.13 I/O Op-code Group | B-1<br>B-2<br>B-3<br>B-3<br>B-4<br>B-4<br>B-6<br>B-6<br>B-7<br>B-7<br>B-8<br>B-8<br>B-11 | | Appendix C | | | <ul><li>C.1 Table Memory Symbols</li><li>C.2 Elementary Function Tables</li></ul> | C-1<br>C-2 | | Appendix D | | | D.1 Example Output form APAL | D-1 | #### SECTION 1 INTRODUCTION APAL (Array Processor Assembly Language) is a cross-assembler written in Fortran IV which provides a two-pass assembly of symbolic coding for the AP-120B. assembly of symbolic coding for the AP-120B. APAL is a conventional assembly language, and as such, should pose no difficulties to programmers familiar with using assembly language on other computers. On typical 16-bit mini-computers, APAL requires approximately 24K of available memory to operate as supplied. INTENTIONALLY BLANK #### SECTION 2 BASIC SYNTAX #### 2.1 CHARACTER SET APAL recognizes the following characters: Alphabetic A thru Z Numeric Ø thru 9 Special + - \* / . \$ space tab = < ( ); ,: " # &! #### 2.2 SYMBOL NAMES Symbol names may be of any length, however only the first six characters of a name are significant. The first character of a name must be an alphabetic character, the subsequent characters may be either letters of numbers. Examples: LOOP A6 STARTHERE Symbols are given a value in any of three ways: - 1. Being defined by a \$EQU pseudo-op. - 2. Being used as a label. - 3. Being declared \$EXTernal. #### 2.3 TABLE MEMORY SYMBOLS A symbol with a value preset to the address of each of the constants in Table Memory ROM is predefined in APAL. These symbols all start with a "!" to avoid conflict with any user defined symbol. They may be used in expressions in the same manner as ordinary symbols. A complete list of these symbols is in Appendix C. For example, to fetch PI from Table Memory, and add it to a number in DPX(2) LDTMA; DB=!PI "fetch PI from TM NOP "Wait (or do something else) FADD TM, DPX(2) "Add PI to DPX(2) #### 2.4 NUMBERIC CONSTANTS Numbers may be written in four radices: octal, decimal, binary, or hex. In each radix, a number may be either signed or unsigned. Unsigned numbers may range from Ø to 65535. Signed numbers may range from -32768 to +32767. The radix of a number is established by a radix identifying character which is written immediately after the number. Octal numbers are denoted by a "K" immediately following the number. Decimal is denoted by a ".", binary by a "B" and Hex by an "X". The first digit of a hex number must be a numeric character. The default radix, if a radix identifier is not used, is octal. Examples: Octal integers: 177777 -4Ø727K Decimal integers: 32767. -1øøø. +1Ø. Binary integers: 1Ø1 101011010B ØABCDX 123FX ØCX #### 2.5 EXPRESSIONS Expressions may be used whenever a numeric value is required. Expressions are made up of operands and operators. 2.5.1 Operands. Operands may be symbol names, numeric constants, or the location counter, denoted by ".". Examples: TBLADR 598X 2.5.2 Operators. Operators denote operations of addition ("+"), subtraction ("-"), multiplication ("\*"), or division ("/") upon a pair of operands. Some sample expressions: TBLADR + 37 . + 9. LOOP + 6 \* A Expressions are evaluated from left to right, modulo 216. # SECTION 3 SOURCE PROGRAM STATEMENTS APAL source statements may be divided into three categories: - 1. Comment statements - 2. Instruction statements - 3. Pseudo-op statements Comment statements allow program documentation, instruction statements contain the actual symbolic machine code, and pseudo-op's provide directives to APAL during the assembly process. APAL statements have a basically free format: spaces and tabs may be used as desired to improve legibility. #### 3.1 COMMENT STATEMENTS Everything on a line after a quote mark (") is treated as a comment by APAL. A line which contains only comments or a line that is completely blank is a comment statement, and is ignored during the assembly process. Carriage return terminates a comment. #### 3.2 INSTRUCTION STATEMENTS An APAL assembly language instruction statement has the general format of: Label: Op-code fields "Comments The label and comments are optional. The assembler processes the op-code fields and generates one 64-bit program word for each instruction statement. 3.2.1 Label Field. A label is a user-defined symbol which is assigned the value of the current location counter and entered into the user symbol table. A label is a symbolic means of refering to a specific location within a program. If present, a label always occurs first in an instruction statement and must be terminated by a colon. For example, if the current location is 76 the instruction statement: LOOP: FADD DPX, DPY "LOOP HERE Assigns the value of 76 to the symbol "LOOP". 3.2.2 Op-code field(s). The Op-code field follows the label field in an instruction statement and contains one or more AP-120B op-code mnemonic. Individual op-codes in an instruction are separated from each other by a semicolon ";". The last op-code in an instruction is not terminated by a semicolon. This tells the assembler when it has reached the end of a complete AP-120B instruction statement. For example, both of the following instruction statements are equivalent: LOOP: FADD DPX, DPY; FMUL TM, MD; BFGT DONE or LOOP: FADD DPX, DPY; FMUL TM, MD; BFGT DONE Each is one instruction statement, which assembles into one 64-bit instruction word. Thus, one instruction statement may be continued over as many lines as desired to achieve a readable program document. Instruction statements are terminated by an op-code field which is not followed by a semicolon: not by the end of a line. Op-codes may be written in any order preferred within any given instruction statement. The assembler will flag with an error message any conflicts between op-codes. Some op-codes require operands as arguments. The operand(s) are separated from the op-code by a space or tab, and from each other by a comma. Some example op-codes: No operands: HALT; RETURN One operand: FABS MD; BFGT LOOP Two operands: FADD DPX, DPY; FMUL TM, MD If an operand is missing or improper, the assembler will give an appropriate error message. A listing of the various AP-120B op-codes is contained in Appendix A. 3.2.3 Comment Field. The remainder of any line following a quote mark (") is treated as a comment by the assembler and ignored. The comment field is terminated by a carriage return. Thus, in the previous example, we could write: LOOP: FADD DPX, DPY; FMUL TM, MD; BFGT DONE "DO AN ADD "AND A MULTIPLY "AND A BRANCH "ALL IN ONE INSTRUCTION As before, an instruction statement is ended by the absence of a semicolon following the last op-code in that instruction. ### 3.3 PSEUDO OPERATIONS STATEMENTS Psuedo-operations are directives to the assembler which control certain aspects of the assembly translation process. Each psuedo-op must appear on a separate line in the source text. All psuedo-op names start with a "3". As with instruction statements, psuedo-op statements may be labeled and have comments. 3.3.1 \$EQU. This operator equates a symbol to an expression. If user defined symbols are used in the expression they must have been previously defined in the program. Examples: A \$EQU 321 LOOP \$EQU LOC + 3 HERE \$EQU . - 3 MASK \$EQU 132\*3+6 Alternatively, the characters " = " may be used in place of "\$EQU": A = 6X = A\*3 When so used, the "=" must be both preceded and followed by a least one space or tab character. 3.3.2 \$LOC. \$LOC sets the current location counter to the value of an expression. If symbols are used in the expression they must have been previously defined earlier in the program. Examples: \$LOC 300 \$LOC . + 6 "LEAVE NEXT SIX UNUSED \$LOC LOOP + 10 Caution: \$LOC should not be set to an absolute address, as in the first example, if the assembly output is to be linked relocatably with other programs. 3.3.3 \$END. \$END causes APAL to terminate the assembly. 3.3.4 \$VAL. This operator defines 6 bits worth of data to fill one program word. The data is specified as four 16-bit integers, which represent the four 16-bit quarters of a program word. The four expressions are separated by commas. Examples: \$VAL -377, 104763, 10., LOOP + 6 \$VAL 0, 0, 2000, 33 3.3.5 \$FP. This operator fills the right-most 38-bits of a program word with a specified floating-point number. The left-hand 26-bits of the word are cleared. Examples: \$FP 6.0023E23 \$FP 2 \$FP E-17 PI: \$FP 3.141592654 "PI IS HERE Note: a floating-point number (say a constant for an algorithm) can be read out of Program Source memory and onto the Data Pad Bus using a "RPSF" op-code. As an example, to load the contents of location "PI" into Data Pad X: RPSF PI: DPX+DB "GET PI INTO DPX 3.3.6 \$TITLE. This pseudo-op names a program. The name need not be unique from other symbols in the program. The \$TITLE pseudo-op must occur first in the program, before any other pseudo-ops or instruction statements. Examples: \$TITLE FFT \$TITLE DIVIDE 3.3.7 \$ENTRY. This pseudo-op declares a symbol to be global; i.e., a symbol which is defined in this program and may be referenced by other separately assembled programs. The identified symbols must be defined in this program either by an "\$EQU" pseudo-op or by being used as a label. \$ENTRY pseudo-ops must occur before any instruction statements in the program. If the symbol is to be an entry point for Host Computer Fortran Calls, then following the symbol name must be the number of S-Pad parameters expected in the CALL. This may be a number from $0-17_8$ , and is separated from the symbol name by a comma. Examples: \$ENTRY A \$ENTRY B,6 "Expect 6 S-Pad parameters \$ENTRY C,0 "Expect 0 S-Pad parameters 3.3.8 \$EXT. This pseudo-op declares global symbols which are referenced by this program, but are defined by another separately assembled program. \$EXT pseudo-ops must occur in the program before any instruction statements. Symbol names are separated by commas. Examples: \$EXT FLOAT, SCALE, FFT \$EXT DIVIDÉ #### 3.4 ORDER OF PROGRAM STATEMENTS There is definite ordering of statement types with a program. The \$TITLE Pseudo-op comes first. Next, if present, any \$ENTRY and \$EXT pseudo-ops. The the program body, i.e. the code, then occurs. Finally comes the \$END pseudo-op. Statement order: \$TITLE pseudo-op \$ENTRY pseudo-op(s)\* \$EXT pseudo-op(s)\* "code, etc."\* \$END pseudo-op \*Need not be present. ``` STITLE DØTPR SENTRY DØTPR. 6 "VECTOR DOT PRODUCT "DØES C(0) = SUM ( A(MI) * B(MJ) ) FØR M = 0 TØ N-1 --- STATISTICS --- "AUTHOR: A.E. CHARLESWORTH, JULY 75 "REVISION 1.3 FEB 76 CHANGED SENTRY "SIZE: 9. LØCATIØNS "SPEED: 2 MEMØRY REFERENCES PER PØINT "SCRATCH: SP: 0,4,5; DPX: 0 (RELATIVE TØ DPA) "S - PAD PARAMETERS: A SEQU 0 "BASE ADDRESS OF A I SEQU I "INCREMENT FOR A B SEQU 2 "BASE ADDRESS OF B J SEQU 3 "INCREMENT FOR B C SEQU 4 "ADDRESS OF C N SEQU 5 "VECTOR LENGTH DØTPR: MØV A,A; SETMA MØV B,B; SETMA "FETCH A(0) "FETCH B(0) MØV B.B; SETMA DPX<MD; "SAVE A(0) INC N ADD IA; SETMA; "FETCH A(1) " CLEAR SUM " KEEP COUNT RIGHT FADD ZERØJZERØ FMUL DPX,MD; LOOP: "DØ A(M)*B(M) " PUSH ADDER FADD: " SEE IF DONE DEC N BEQ DØNE; "BRANCH IF DØNE FMUL; " PUSH MULTIPLIER " FETCH B(M+1) ADD J.B; SETMA DPX<MD; "SAVE A(M+1) FMUL " PUSH MULTIPLIER FADD FM.FA; "ADD (A(M)*B(M)) TØ SUM " FETCH A(M+2) " BRANCH BACK ADD I.A; SETMA; BR LØØP MI<FA; MØV C.C; SETMA; "STØRE ANSWER IN C(O) DØNE: RETURN " RETURN SEND ``` ## SECTION 4 OPERATING PROCEDURES #### 4.1 USING APAL APAL assembles a file of source text containing an AP-120B program into a relocatable object file. Optionally an assembly listing is produced. APAL first requests whether another assembly is to be done: DONE? 1=YES, $\emptyset$ =NO: A response of "1" will cause APAL to exit to the system monitor. A " $\emptyset$ " will signal APAL that another assembly is to be done. APAL then requests the names of the three files to be used for source, object, and listing and errors respectively. The program requests the name of the source file by outputting to the user console: SOURCE FILE= The user responds by entering the desired program file name. APAL then requests the name of the file to receive the relocatable object output by outputting: OBJECT FILE= The user responds by entering the desired object file name. APAL then requests the name of the file to receive the assembly listing by outputting: LISTING AND ERROR FILE= The user replies by entering the name of the desired listing file. Finally, APAL outputs: LISTING? 1=YES, Ø=NO: A response of "1" will yield a full assembly listing, symbol table, and any error messages. A " $\emptyset$ " will suppress the assembly and symbol table listings and put out only any error messages into the listing file. Finally, if a listing was requested, APAL outputs: LISTING RADIX? 1=HEX, $\emptyset$ =OCTAL: A response of "1" will cause the assembly listing to be done in Hexadecimal (base 16). A" $\emptyset$ " will make the assembly listing in Octal (base 8). In each of the above cases, if the sought after file cannot be found or is otherwise unavailable, APAL types "??", and waits for another user response. An example dialogue is given below. The user desires to assemble an AP-120B program in file "FFT.AP" and put the object output into file "FFT.RB". The listing will be put out on the line printer. Of course, the precise details of how files and devices are named will depend on the particular operating system being used. The messages printed by the computer are underlined for clarity; a "\dark" means carriage return: RUN APAL + DONE? 1=YES, Ø=NO: 1 + SOURCE FILE = FFT.AP + OBJECT FILE = FFT.RB + LISTING FILE = LP: + LISTING? 1=YES, Ø=NO: 1 + LISTING RADIX? 1=HEX, Ø=OCTAL: Ø + DONE? 1=YES, Ø=NO: Ø + #### 4.2 LISTING FORMAT Upon commencement of the assembly, APAL outputs: APAL ### PASS 1 "###" is the version number of the assembler being used. Any errors detected during pass one are output next. At the start of pass two APAL outputs: PASS 2 The assembly listing follows. The listing contains the following information for each program statement: | Columns | Contents | |---------|------------------------------------| | 1-6 | The location counter, if relevant. | | 7-8 | Blank | | 9-14 | The assembled data, if relevant. | | 15-16 | Blank | | 17-132 | The source statement | For program instruction statements the assembled data is presented as four numbers, representing bits 0-15, 16-31, 32-47, and 48-63 of each program source word. At the end of pass two, APAL outputs \*\*\*\* ### ERRORS \*\*\*\* Where "###" is the number of errors detected. Finally, APAL outputs: SYMBOL NAME ## Followed by the symbol table: | Columns | Contents | |-----------------------------------|---------------------------------------------------------------| | 1-6<br>7-8<br>9-14<br>15<br>16-18 | Symbol Name Blank Symbol value Blank Symbol type: | | | Blank - local symbol EXT - external symbol ENT - entry symbol | In all of the above occurance where a number (location, data value, etc.) is printed on the listing, the radix is either octal or hex, as specified by the user during the initial dialogue. (INTENTIONALLY BLANK) #### 4.3 A SAMPLE ASSEMBLY LISTING ØBJECT FILE=TEMP LISTING AND ERROR FILE=/TTØ LISTING? 1=YES, 0=NØ: 1 LISTING RADIX: 1=HEX, 0=ØCTAL: 0 APAL V2-1 PASS 1 PASS 2 STITLE DØTPR SENTRY DØTPR. 6 "VECTØR DØT PRØDUCT "DØES C(0) = SUM ( A(MI) \* B(MJ) ) FØR M = 0 TØ N-1 --- STATISTICS --- "AUTHOR: A.E. CHARLESWORTH, JULY 75 "REVISION 1.3 FEB 76 CHANGED SENTRY "SIZE: 9. LØCATIØNS "SPEED: 2 MEMØRY REFERENCES PER PØINT "SCRATCH: SP: 0.4.5; DPX: 0 (RELATIVE TØ DPA) #### "S - PAD PARAMETERS: | | 000000 | | A | SEQU | 0 | | "BASE ADDRESS Ø | FA | | |--------|--------------------------------------|--------|----|-------|------------|-------|-----------------|---------|--------| | | 000001 | | Ì | SEQU | 1 | | "INCREMENT FØR | A | | | | 000002 | | В | SEQU | 2 | | "BASE ADDRESS Ø | F B | | | | 000003 | | J | SEQU | 3 | | "INCREMENT FØR | В | | | | 000004 | | C | SEQU | 4 | | "ADDRESS OF C | | , | | | 000005 | | N | SEQU | 5 | | | "VECTØR | LENGTH | | 000000 | 040000<br>000000<br>000000<br>000060 | DØTPR: | Mi | ØV A, | <b>A</b> J | SETMA | "FETCH | A(0) | | 000001 040210 MØV B.B; SETMA "FETCH B(0) 000000 000000 000060 000002 001124 DPX<MD; "SAVE A(0) 000000 INC N " KEEP CØUNT RIGHT 045004 000000 000003 020101 ADD I.A; SETMA; "FETCH A(1) 155000 FADD ZERØ, ZERØ " CLEAR SUM 000000 APAL 4-4 | 000.04 | 001225<br>100000<br>000400<br>013400 | LJØP: | FMUL DF FADD; DEC N | " PUSH ADDER " SEE IF DONE | |--------|--------------------------------------|-------|--------------------------------------------------|-----------------------------------------------------| | 000005 | 020310<br>000623<br>000000<br>010060 | | BEQ DØNE;<br>FMUL;<br>ADD J.B; SETMA | "BRANCH IF DØNE " PUSH MULTIPLIER " FETCH B(M+1) | | 000006 | 000000<br>000000<br>045004<br>010000 | | DPX <md;<br>FMUL</md;<br> | "SAVE A(M+1) " PUSH MULTIPLIER | | 000007 | 020101<br>111115<br>000000<br>000060 | | FADD FM.FA;<br>ADD I.A; SETMA;<br>BR LØØP | "ADD (A(M)*B(M)) TØ SU " FETCH A(M+2) " BRANCH BACK | | 000010 | 040420<br>000340<br>000000<br>000160 | DØNE: | MI <fa; c,c;="" møv="" setma;<br="">RETURN</fa;> | "STØRE ANSWER IN C(0) " RETURN | SEND \*\*\*\* O ERRORS \*\*\*\* | SYMBOL | VALUE | | |--------|--------|-----| | A | 000000 | | | 1 | 000001 | | | B | 000002 | | | J | 000003 | | | C | 000004 | | | N | 000005 | | | DØTPR | 000000 | ENT | | LØØP | 000004 | | | DØNE | 000010 | | INTENTIONALLY BLANK #### SECTION 5 ERROR MESSAGES APAL error messages are printed in the listing following the offending statement. There are five basic error classes, which are listed below along with the action taken by the assembler: - O Out of range: the offending numeric value was truncated to the proper range. - C Conflicting definitions: the first definition was used. - M Missing (or improper) argument: a value of zero was used. - B Bad syntax: the bad op-code field or pseudo-op was ignored. - W Warning of improper useage. Error diagnostics issued by APAL consist of two lines. The first line consists of the error number. The second line contains the error class and error message. Following are the assembler error messages, along with an explanation as to the possible causes and/or cures. - 1. W LINE BUFFER OVERFLOW An instruction statement was too long (600 characters maximum) for the listing buffer. - 2. C MULTIPLY DEFINED SYMBOL A symbol may be defined only once in a program. - 3. C CONFLICTING OP-CODES Two op-codes were used in an instruction statement which used the same instruction word bit fields. - 4. O S-PAD ADDRESS OUT OF RANGE An S-Pad address was outside the legal range of 0-15. - 5. O BRANCH ADDRESS OUT OF RANGE A branch address was more than 16 locations lower or 15 locations higher than the current location. - 6. C CONFLICTING BRANCH ADDRESSES Only one branch address may be used in any given instruction statement. - 7. M MISSING BRANCH ADDRESS No target address was given for a branch op-code. - 8. C CONFLICTING DATA PAD INDEXES Only one value may be given to each Data Pad Index (XR, XW, YR, YW) per instruction statement. - 9. M BAD OR MISSING EXPRESSION The assembler could not process an expression. - 10. M WRONG FADD ARGUMENT A floating adder op-code had an invalid Al or A2 operand. - 11. M WRONG FMUL AGREEMENT A FMUL op-code had an invalid M1 or M2 operand. - 12. M MISSING FADD OR FMUL ARGUMENT An operand was missing following a FADD or FMUL op-code. - 13. C VALUE FIELD CONFLICT Only one op-code which uses a 16-bit VALUE field operand may be used per instruction statement. - 14. M MISSING DATA PAD INDEX A Data Pad Index was missing from an op-code where it was needed. - 15. M UNDEFINED OP-CODE An op-code name was not a legal AP-120B instruction. - 16. M \$EXT SYMBOL IN EXPRESSION An external symbol may not be used to form an expression. - 17. M UNDEFINED USER SYMBOL A user symbol was referenced which was not defined. - 18. M MISSING ARITHMETIC OPERATOR An arithmetic operator (+ \* /) was missing from an expression. - 19. O INTEGER OVERFLOW An integer constant was too large to fit in 16 bits. - 20. B UNRECOGNIZED STATEMENT A statement line was neither a comment, instruction, or pseudo-op statement. - 21. M IMPROPER \$LOC OR \$EQU VALUE The value of a \$LOC or \$EQU pseudo-op was either an undefined symbol or an improper expression. - 22. M \$EXT SYMBOL NOT ALLOWED An external symbol may not be used as an argument for this op-code. - 23. W MISSING \$END A program must terminate with a \$END pseudo-op. - 24. O DATA PAD INDEX OUT OF RANGE A Data Pad Index must be between -4 and +3 inclusive - 25. B MISSING PARENTHESES The right parenthesis following a Data Pad Index was missing. - 26. M BAD DATA PAD INDEX EXPR A Data Pad Index expression could not be resolved into a numeric value. - 27. B COMMA MISSING Only a comma may be used to separate pseudo-op arguments. - 28. B SYMBOL MISSING IN \$EXT PSEUDO-OP No symbol names were found as arguments for an \$EXT pseudo-op. - 29. B MISSING SEP AFTER D.P. INDEX An illegal character was found following a Data Pad Index. - 30. B MULTIPLE PSEUDO-OPS Only one pseudo-op statement may appear on a line. - 31. M BAD FLOATING POINT NUMBER A floating-point number was unacceptable to the assembler. - 32. W ILLEGAL PSEUDO-OP POSITION If used, a \$TITLE pseudo-op must appear first in a program, followed by any \$EXT or \$ENTRY pseudo-ops. - 33. W \$ENTRY SYMBOL NOT LOCAL An \$ENTRY Symbol must not be \$EXTernal also. - 34. W UNREFERENCED \$EXT SYMBOL A declared external symbol was never used in the program. - 35. W UNDEFINED \$ENTRY SYMBOL An \$Entry symbol was not defined. - 36. C DATA PAD BUS CONFLICT Only one data source may be enabled onto the Data Pad Bus per instruction statement. - 37. M MISSING S-PAD ADDRESS An S-Pad op-code was missing its S-Pad Register Address. - 38. M MISSING PROGRAM SOURCE ADDRESS An op-code requiring a program address, such as a JMP or JSR, was missing its address. - 39. XW/YW CONFLICT If the value field is used in an instruction, an op-code which writes into Data Pad Y (such as DPY(2)<FM)) may be used also only if 1) no write into Data Pad X is done, or 2) the indexes are the same for the writes into both DPX and DPY. Examples: Legal: JSR SQRT; "Uses the value field DPY(2)<FM" A store into DPY Legal: JSR SQRT; "Uses the value field DPX(2)<FA; "Both Data Pad write indexes DPY(2)<FM " are the same Illegal: JSR SQRT: "Uses the value field. DPX(-1)<FA; "the two Data Pad write DPY(2)<FM"indexes are different ## APPENDIX A SPECIAL CHARACTER USAGE | Character | Function | |-------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | . <b>+</b> | integer addition operator | | | integer subtraction operator | | * | integer multiplication operator | | / | integer division operator | | <b>,</b> | decimal point, current location | | Ġ | first character of pseudo-op names | | Ψ<br>SD3.66 | symbol terminator | | space | ▼ · · · · · · · · · · · · · · · · · · · | | tab<br>_ | symbol terminator | | <b>=</b> | \$EQU pseudo-op, DB=op-code | | ( | preceeds a Data Pad index expression | | ) | terminates a Data Pad index expression | | < | used in DPX, DPY, and MI op-codes | | ; | op-code terminator | | • | operand separator | | : | label terminator | | 11 | comment start indicator (carriage return terminates) | | # | S-Pad no-load indicator | | %z | S-Pad bit-reverse indicator | | ! | first character of predefined symbols | | • | and the state of t | (INTENTIONALLY BLANK) # APPENDIX B AP-120B SYMBOLIC OP-CODES The various AP-120B op-codes may be divided into 13 groups. One op-code from <u>each</u> group may be used in any given instruction statement, unless otherwise stated. The following two symbols are used throughout this appendix: Indicated optional operands or mnemonics. The item enclosed in the brackets(e.g., <#>) may or may not be coded, depending upon whether or not the associated option is desired. Indicates a specific substitution is required. Substitute the desired address, name, number or mnemonic for the abbreviation underlined. The following list of abbreviations are used to facilitate the op-code descriptions. They are explained in the section of the op-code group where they first occur: | Abbreviation | Meaning | Section which | on in<br>described | |--------------|-------------------------------|---------------|--------------------| | sh | S-Pad Shift | | B.1 | | # | S-Pad no-load | | B.1 | | sps | S-Pad Source register | | B.1 | | spd | S-Pad Destination register | | B.1 | | & | Bit reverse | * | B.1 | | disp | Branch displacement | | B.5 | | al | Floating Adder argument #1 | • | B.6 | | a2 | Floating Adder argument #2 | | B.6 | | idx | Data Pad index | | B.6 | | ml | Floating Mulitplier argument; | #1 | B.7 | | m2 | Floating Multiplier argument | #2 | B.7 | | dbe | Data Pad Bus enable | | B.8 | | adr | address or value | | B.8 | #### B.1 S-PAD OP-CODE GROUP Purpose: S-Pad integer arithmetic | Double O | perand | Op-codes | Function | |----------|--------|----------|----------| | | | | | The result #### Single Operand Op-codes Function CLR<sh><#> spd Clear spd INCrement spd DEC<sh><#> spd DECrement spd COM<sh><#> spd COMplement sp The result of the above op-codes is SPFN (S-Pad Function). #### <u>Miscellaneous</u> Function LDSPNL spd LoaD Spd from PaNeL bus LDSPE LoaD SPd from data pad bus Exponent spd LDSPI LoaD SPd from data pad bus Integer spd (low 16-bits) LoaD SPd from data pad bus Table look-up bits WRTEXP enable WRiTe of EXPonent only into DPX, DPY or MI WRTHMN enable WRiTe of High MaNtissa only into DPX, DPY or MI WRTLMN enable WRiTe of Low MaNtissa only into DPX, DPY or MI #### ABBREVIATIONS: | Name | Meaning | |------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | sh | S-Pad shift: Choices (omitted) L shift SPFN left once R shift SPFN right once RR shift SPFN right twice | | # | S-Pad no-load: If present, do not load SPFN into spd (S-Pad destination register). If specified, a branch group op-code may not be used in the same instruction statement. | sps S-Pad source register: a name, number or expression specifying a register number between $\emptyset$ and $17_8$ . spd S-Pad destination register: a name, number, or expression specifying a register number between 0 and 178. SPFN is loaded into the S-Pad destination register unless S-Pad no-load (#) is specified. &z Bit reverse: if present, bit reverse the contents of sps before using. The bit reverse is done as specified by bits 13-15 of the Internal Status Register. Op-code Examples: MOV 3,6 SUBL 1,15 ADDL# &PTR, BASE DEC CTR CLR 9. LDSPI 6 ## B.2 MEMORY ADDRESS OP-CODE GROUP Purpose: to initiate Main Data Memory cycles Op-codes Function INCMA INCrement Memory Address DECMA DECrement Memory Address SETMA SET Memory Address from SPFN ## B.3 TABLE MEMORY ADDRESS OP-CODE GROUP Purpose: to initiate Table Memory fetches Op-codes Function INCTMA INCrement Table Memory Address DECTMA DECrement Table Memory Address SETTMA SET Table Memory Address from SPFN ## B.4 DATA PAD ADDRESS OP-CODE GROUP Purpose: to change the DPA (Data Pad Address) register Op-codes Function INCDPA INCrement Data Pad Address DECDPA DECrement Data Pad Address SETDPA SET Data Pad Address from SPFN #### B.5 BRANCH OP-CODE GROUP Purpose: Conditional branches | Op-code | | <u>Function</u> | |--------------------------------------------------------------|--------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | BR BINTRQ BION BIOZ BFPE BFEQ BFNE BFGE BFGT BEQ BNE BGE BGT | disp<br>disp<br>disp<br>disp<br>disp<br>disp<br>disp<br>disp | BRanch unconditionally Branch on INTerrupt ReQuest flag non-zero Branch if I/O data ready flag Non-zero Branch if I/O data ready flag Zero Branch on Floating Point Error Branch on Floating adder EQual to zero Branch on Floating adder Not Equal to zero Branch on Floating adder Greater or Equal to zero Branch on Floating adder Greater than zero Branch on s-pad function EQual to zero Branch on s-pad function Not Equal to zero Branch on s-pad function Greater or Equal to zero Branch on s-pad function Greater or Equal to zero Branch on s-pad function Greater than zero | | RETURN | | RETURN from subroutine | ## ABBREVIATION: Branch displacement: the branch target address, an address between 16 locations behind and 15 locations ahead of the current location. BR LOOP Examples: Double Operand BGT .+3 BFNE A-4 #### B.6 FLOATING ADDER OP-CODE GROUP Purpose: Floating-point adds | Op-codes | <u>Function</u> | |----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | FADD < a1, a2 > FSUB a1, a2 FSUBR a1, a2 FAND a1, a2 FOR a1, a2 FEQV a1, a2 | Floating ADD (al+a2) Floating SUBtract (al-a2) Floating SUBtract Reverse (a2-a1) Floating AND (al and a2) Floating OR (al or a2) Floating EQuiValence (al eqv a2) | | Single Operand<br>Op-codes | Function | | $\begin{array}{ccc} \text{FIX} & \underline{a2} \\ \text{FIXT} & \underline{a2} \\ \text{FSCALE} & \underline{a2} \\ \text{FSCLT} & \underline{a2} \\ \text{FSM2C} & \underline{a2} \\ \text{F2CSM} & \underline{a2} \\ \text{FABS} & \underline{a2} \\ \end{array}$ | FIX a2 to an integer FIX a2 to an integer, (Truncated) Floating SCALE of a2 Floating SCALE of.a2, (Truncated) Format conversion, Signed Magnitude to 2's complement Format conversion, 2's complement to signed magnitude Floating ABSolute value | APAL B-4 #### ADDER OPERANDS: al Floating adder argument #1: | Choices | Meaning | |----------------------------------------------------|--------------------------------------------------------------------------------------------------------------------| | NC<br>FM<br>DPX<(idx)><br>DPY<(idx)><br>TM<br>ZERO | No Change (use previous a1) Floating Multiplier output Data Pad X Data Pad Y Table Memory data floating-point ZERO | a2 Floating adder argument #2: | Choices | Meaning | |--------------------------------|-------------------------------------------------------------------------------------------| | NC FA DPX<(idx)> DPY<(idx)> TM | No Change (use previous a2) Floating Adder output Data Pad X Data Pad Y Table Memory data | | ZERO | floating ZERO | | MDPX<(idx)> | use Mantissa from Data Pad X, and exponent from SPFN | | EDPX (idx) | use Exponent Data Pad X, and mantissa from SPFN | #### ABBREVIATION: idx Data Pad index: A name, expression, or number which lies in a range of -4 to +3. Op-code examples: FADD TM, MD FSUB DPX(3), DPY(-4) FEQV DPX, DPY(C) FAND ZERO, MDPX(2) FSUBR NC, FA FADD Note: Up to four unique Data Pad indices may be specified in one instruction statement. In particular, only one indexing each may be used for reading from Data Pad X and Y, regardless of how many op-codes use the data read from Data Pad. #### B.7 FLOATING POINT MULTIPLY OP-CODE GROUP Purpose: Floating Point multiplies Op-code Function FMUL m1, m2Floating MULtiply ml times m2 #### MULTIPLIER OPERANDS: mlMultiplier-operand #1 > Choices Meaning FM Floating Multiplier output Data Pad X DPX<(idx)>Data Pad Y DPY<(idx)> TMTable Memory Multiplier-operand #2 m2 > Choices Meaning FAFloating Adder output DPX (idx) Data Pad X DPY (idx) Data Pad Y MDMemory Data Examples: FMUL TM, MD FMUL DPX (AR), DPY (BI) FMUL #### B.8 DATA PAD X OP-CODE GROUP Purpose: Storing into Data Pad X Op-code Function DPX<(idx)><FAStore Floating Adder output into Data Pad X Store Floating Multiplier output into Data Pad X Store Data Pad Bus into Data Pad X $DPX<(\overline{idx})><FM$ $DPX<(\overline{idx})> <DB$ Store dbe into Data Pad X $DPX<(\overline{idx})> < dbe$ #### ABBREVIATIONS: Data Pad Bus enable: has the same effect as an explicit dbe Data Pad Bus op-code. (see B.11) > Choices Meaning ZERO Floating zero adr adr $\overline{DPX}$ (idx) > Data Pad X Data Pad Y DPY (idx) > Memory Data MD S-Pad Function SPFN TMTable Memory data only one choice of Data Pad Bus enable may be made per instruction statement. adr An address or numeric value. Any 16-bit integer expression is legal. A Floating Multiplier, Memory Input, Memory Address, Table Memory Address, or Data Pad Address op-code may not be used in an instruction statement where an "adr" is used. Examples: DPX(3)<FM DPX(-2) < SPFNDPX+MD DPX(1) < DPY(-2)DPX(-2) < -123 ## DATA PAD Y OP-CODE GROUP Purpose: Storing into Data Pad Y Cp-Code Function $DPY_{<}(idx)_{><}FA$ Store Floating Adder output into Data Pad Y $DPY < (\overline{idx}) > FM$ Store Floating Multiplier output into Data Pad Y DPY < (idx) > DBStore Data Pad Bus into Data Pad Y $DPY < (\overline{idx}) > < dbe$ Store dbe into Data Pad Y Examples: DPY(-2) < FADPY < MD DPY(2) < TMDPY(1) < 39. ## B.10 MEMORY INPUT OP-CODE GROUP Purpose: Writing into Main Data Memory Op-codes Function MI <FA Move Floating Adder output to the Memory Input reg. MI < FMMove Floating Multiplier output to the Mem. Input reg. MI <DB Move Data Pad Bus to the Memory Input Register MI<dbe Move dbe to the Memory Input Register to effect a memory write, an op-code from the memory address group, or an "LDMA" op-code must also be included in the instruction statement to supply the memory address. Examples: MI <FA; INCMA MI < DPX(3); DECMA MI < MD; SETMA; ADD 3,6 #### B.11 DATA PAD BUS OP-CODE GROUP Purpose: to explicitly enable data onto the Data Pad Bus. #### Op-codes Function Note: as mentioned earlier, only one data source may be enabled onto the Data Pad bus per instruction statement. Examples: DB = 37 DB = DPX(-2) DB = MD DB = SPFN #### B.12 SPECIAL OPERATION OP-CODE GROUP Note: if an op-code from this group is chosen, an S-Pad Group op-code may not be used in the same instruction statement. #### B.12.1 SPECIAL TESTS Purpose: additional conditional branches | Op-codes | Function | |----------|----------| | | | | BFLT | disp | <br>Branch | on | Floating adder Less Than zero | |------|------|------------|----|-------------------------------| | | disp | Branch | on | s-pad function Less Than zero | | BNC | disp | Branch | on | Non-zero Carry bit | | BZC | disp | Branch | on | Zero Carry bit | | BDBN | disp | Branch | if | Data pad Bus Negative | | BDBZ | disp | | | Data pad Bus Zero | | BIFN | | | | Inverse FFT flag Non zero | | | disp | | | Inverse FFT flag Zero | | BFLØ | disp | | | FLag Ø is 1 | | BFL1 | disp | | | FLag l is l | | BFL2 | disp | | | FLag 2 is 1 | | BFL3 | disp | | | FLag 3 is 1 | Note: if one of these tests is used along with a test from the Branch Group, the conditions are "or'd." In this case, only one of the branch op-codes need have the target address as an operand. Examples: BNC ODD BFEQ LOOP; BFLT LOOP "LESS THAN OR EQUAL TO #### B.12.2 SETPSA Purpose: jumps and subroutine jumps Op-codes Function JMP<A> adr Jump to location adr JMPT Jump to location whose address is in TMA JMPP JumP to location whose address is on the Panel bus JumP to SubRoutine at location adr JSR<A> adr JSRT JumP to SubRoutine at address in Tma **JSRP** JumP to SubRoutine at address on Panel bus Examples: LOOP + 3JMP JSR FFT JMPA 300 B.12.3 SETEXIT Purpose: to alter a subroutine return Op-codes Function SETEX <A> adr SET subroutine EXit to adr SETEXT SET subroutine EXit to contents of Tma SETEXP SET subroutine EXit to contents of Panel bus Example: SETEX BAD B.12.4 P.S. Purpose: read/write of Program Source Memory Op-codes Function RPSL A> adr Read Program Source Left half of location adr RPSF ∢> adr Read Program Source Floating-point number from location adr RPSLT Read Program Source Left half at address in Tma RPSFT Read Program Source Floating point number at address in Tma RPSLP Read Prog. Source Left half at address on Panel bus RPSFP Read Prog. Source Floating-point number at address on Panel bus Note: these op-codes read onto the Data Pad Bus Function Op-codes Load Program Source Left half of location adr LPSL <A> adr Load Program Source Right half of location adr LPSR < A> adr Load Program Source Left half pointed at by Tma LPSLT Load Program Source Right half pointed at by Tma LPSRT Load Prog. Src Left half pointed at by Panel bus LPSLP Load Prog. Src Right half pointed at by Panel bus LPSRP these op-codes load from the Data Pad Bus Note: Example: RPSF PΙ #### B.12.5 PS ODD AND EVEN Purpose: reading the host panel switches into Program Source memory; writing Program Source to the panel lites. | <u>Op-codes</u> | Function | |----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | RPSØ <a> adr<br/>RPS1<a> adr<br/>RPS2<a> adr<br/>RPS3<a> adr<br/>RPSØT<br/>RPS1T<br/>RPS2T<br/>RPS3T<br/>WPSØ<a> adr<br/>WPS1<a> adr<br/>WPS2<a> adr<br/>WPS2<a> adr<br/>WPS3<a> adr<br/>WPS9T<br/>WPS9T<br/>WPS1T<br/>WPS2T<br/>WPS3T</a></a></a></a></a></a></a></a></a> | Read Program Source quarter Ø from location adr Read Program Source quarter 1 from location adr Read Program Source quarter 2 from location adr Read Program Source quarter 3 from location adr Read Program Source quarter Ø from address in Tma Read Program Source quarter 1 from address in Tma Read Program Source quarter 2 from address in Tma Read Program Source quarter 2 from address in Tma Read Program Source quarter 3 from address in Tma Write Program Source quarter Ø into location adr Write Program Source quarter 1 into location adr Write Program Source quarter 2 into location adr Write Program Source quarter 3 into location adr Write Program Source quarter Ø into address in Tma Write Program Source quarter Ø into address in Tma Write Program Source quarter 1 into address in Tma Write Program Source quarter 2 into address in Tma Write Program Source quarter 3 into address in Tma | | | | #### B.12.6 HOSTPANEL Purpose: Reading the host panel switches, writing to the host panel lites | PNLLIT Panel bus to LITes DBELIT Data pad Bus Exponent to LITes DBHLIT Data pad Bus High mantissa to LITes DBLLIT Data pad Bus Low mantissa to LITes SWDB SWitches to Data pad Bus SWDBE SWitches to Data pad Bus Exponent SWDBH SWitches to Data pad Bus High mantissa | <u>Op-code</u> | Function | |-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | SWDBL SWitches to Data pad Bus Low mantissa | DBELIT DBHLIT DBLLIT SWDB SWDBE SWDBH | Data pad Bus Exponent to LITes Data pad Bus High mantissa to LITes Data pad Bus Low mantissa to LITes SWitches to Data pad Bus SWitches to Data pad Bus Exponent SWitches to Data pad Bus High mantissa | #### B.12.7 Miscellaneous SPMDA SPin until a Main Data memory cycle Available ## B.13 I/O OP-CODE GROUP $\underline{\underline{\text{Note:}}}$ if an op-code is used from this group, a Floating Adder op-code may $\underline{\text{not}}$ be used in the same instruction statement. ## B.13.1 Load Reg, Read Reg Purpose: reading/writing of various internal registers | Op-codes | Function | |----------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | LDSPD LDMA LDTMA LDDPA LDSP LDAPS LDDA | LoaD S-Pad Destination address register LoaD Memory Address register LoaD Table Memory Address register LoaD Data Pad Address register LoaD S-Pad register pointed at by spd LoaD AP Status register LoaD i/o Device Address | | | | Note: the above op-codes load from the Data Pad Bus | Op-codes | Function | |-------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | RPSA<br>RSPD<br>RMA<br>RTMA<br>RDPA<br>RSPFN<br>RAPS<br>RDA | Read Program Source Address Read S-Pad Destination register Read Memory Address register Read Table Memory Address register Read Data Pad Address register Read S-Pad Function Read AP Status Read i/o Device Address | Note: the above read onto the Panel bus #### B.13.2 INOUT Purpose: Program control/input output of data <u>Op-codes</u> <u>Function</u> OUT OUTput data SPNOUT SPiN until device ready, then OUTput data OUTDA OUTput data, then set DA to spfn SPOTDA SPin until device ready, then OuTput data, then set DA to spfn Note: the above write to the I/O device specified by the Device Address Register (DA) whatever data is enabled onto the Data Pad Bus. <u>Op-codes</u> <u>Function</u> IN INput data SPININ SPIN until device ready, then INput data INDA INput data, then set DA to spfn SPINDA SPin until device ready, then INput data, then set DA to spfn Note: the above enable data onto the Input Bus from the I/O device specified by the Device Address Register (DA). To be used the data must be enabled onto the Data Pad Bus, and from there to a register or memory. An example: IN; DPX(2) < INBS "READ I/O DATA INTO DPX B.13.3 SENSE Purpose: Sensing an I/O device condition Op-codes Function SNSA SeNSe condition A SPINA SPIN on condition A SNSADA Sense condition A, then set DA to spfn SPNADA SPIN on condition A, then set DA to spfn SNSB SeNSe condition B SPINB SPIN on condition B SNSBDA Sense condition B, then set DA to spfn SPNBDA SPIN on condition B, then set DA to spfn ## B.13.4 FLAG Purpose: set/reset of program flags ## B.13.5 CONTROL Purpose: miscellaneous control functions | <u>Op-code</u> | Functions | |--------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------| | HALT<br>IORST<br>INTEN | HALT processor I/O ReSeT INTerrupt ENable | | INTA<br>REFR | INTerrupt Acknowledge memory REFResh synch | | WRTEX<br>WRTMN<br>SPMDAV | enable WRiTe of Exponent only into DPX, DPY or MI enable WRiTe of MaNtissa only into DPX, DPY or MI SPin until a Main Data memory cycle AVailable | INTENTIONALLY BLANK ## TABLE MEMORY SYMBOLS: ## 1. TABLE MEMORY CONSTANTS: | SYMBØL | CØNSTANT<br>REPRESENTED | VALUE IN<br>TABLE MEMØRY | TABLE MEMØRY<br>ADDRESS (ØCTAL) | |-----------------|-------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------| | ! ZERØ | ZERØ | 0.0 | 4371 | | IONE | ØNE | 1 • 0 | 4001 | | ! TWØ | TWØ | 2.0 | 4002 | | ! THREE | THREE | 3 • 0 | 4441 | | I FØ UR | FØUR | 4.0 | 4442 | | IFIVE | FIVE | 5•0 | 4443 | | ISIX | SIX | 6.0 | 4444 | | ISEVEN | SEVEN | 7.0 | 4445 | | !EIGHT | EIGHT | 8.0 | 4446 | | ININE | NINE | 9•0 | 4447 | | ! TEN | TEN | 10.0 | 4450 | | !SIXTN | SIXTEEN | 16.0 | 4451 | | !HALF | HALF | 0.5 | 4427 | | ! THIRD | ONE THIRD | 0.33333333 | 4430 | | - · | ONE FOURTH | 0.25 | 4431 | | I FØRTH | ONE FIFTH | 0.2 | 4432 | | !FIFTH | ONE SIXTH | 0.166666667 | 4433 | | ISIXTH | ØNE SEVENTH | 0-142857143 | 4434 | | ISVNTH | | 0.142637143 | 4435 | | ! EGHTH | ØNE EIGHTH | 0.125 | 4436 | | ININTH | ØNE NINETH | and the second s | 4437 | | ! TENTH | ONE TENTH | 0.1 | 4440 | | ISXNTH | ONE SIXTEENTH | 0.0625 | 4203 | | ! SQRT2 | SQRT(2) | 1.414213562 | | | ISQRT3 | SQRT(3) | 1.732050808 | 4422 | | 1 SQRT5 | SQRT(5) | 2.236067977 | 4423 | | ISQTIO | SQRT(10) | 3.162277660 | 4424 | | IISQT2 | 1.0/SQRT(2) | 0.707106781 | 4206 | | IISQT3 | 1.0/SQRT(3) | 0.577350269 | 4452 | | !ISQT5 | 1-0/SQRT(5) | 0.447213596 | 4453 | | 115010 | 1.0/SQRT(10) | 0.316227766 | 4454 | | CBT2 | CBRT(2) | 1-259921050 | 4417 | | !CBT3 | CBRT(3) | 1-442249570 | 4420 | | !QDRT2 | (2.0)**1/4 | 1.189207115 | 4421 | | LØG2E | LØG2(E) | 1 • 442695041 | 4317 | | 1 LØ G2 | LØG10(2) | 0.301029996 | 4411 | | !LØGE | LØGIO(E) | 0.434294482 | 4337 | | !LN2 | LN(2) | 0-693147181 | 4336 | | ILN3 | LN(3) | 1.098612289 | 4407 | | ILN10 | LN(10) | 2-302585093 | 4410 | | ! E | E | 2.718281828 | 4403 | | IINVE | 1.0/E | 0-367879441 | 4404 | | ! ESQ | E**2 | 7.389056096 | 4405 | | !PI | PI | 3-141592654 | 4402 | | ! TWØPI | 2*PI | 6-283185308 | 4415 | | ! INVPI | 1.0/PI | 0.318309886 | 4412 | | 1P12 | P1/2 | 1:570796327 | 4312 | | !PI4 | PI/4 | 0-785398164 | 4373 | | 1P114<br>1P1180 | PI/180 | 0.017453293 | 4413 | | • | PI**2 | 9-869604404 | 4414 | | IPISQ | SQRT(PI) | 1-772453851 | 4416 | | ! SQTP! | LN(PI) | 1-144729886 | 4406 | | ILNPI | GAMMA | 0.577215663 | 4425 | | I GAMMA | PHI | 1.618033989 | 4426 APAL C-1 | | 1 PH I | Pni | 1010000000 | REAL C-1 | #### 2. ELEMENTARY FUNCTION TABLES: | ELEMENTARY | TABLE MEMORY | |--------------|--------------------------------------------------| | FUNCTION | ADDRESS (ØCTAL) | | DIVIDE | 4000 | | SQUARE RØØT | 4202 | | SIN/CØS | 4306 | | LØ GAR I THM | 4333 | | EXPONENTIAL | 4317 | | ARC TANGENT | 4365 | | | DIVIDE SQUARE RØØT SIN/CØS LØGARITHM EXPØNENTIAL | 3. SIZE OF INSTALLED FFT COSINE TABLE SYMBØL SIZE (TYPICAL) IFFTSZ 2048 EXAMPLE OUTPUT from APAL ``` PASS 2 ``` ``` STITLE SINCOS SENTRY SIN SENTRY CØS "SINE, COSINE FUNCTION --- ABSTRACT --- "COMPUTES THE FUNCTION SIN(X) OR COS(X). WHERE X IS IN DPXX(DPA) --- STATISTICS --- "LANGUAGE: AP-120B ASSEMBLER "EQUIPMENT: AP-120B "STØRAGE: PS - 31 LØCATIØNS MD - NØT AFFECTED TM - 9 DPX - 2 DPY - 2 SP - NØT USED "SPEED: SIN - 4-42 US - AVERAGE (4-00 - 4-83) CØS - 4-75 US- AVERAGE (4-33 - 5-17) "AUTHOR: A.E. CHARLESVØRTH NØV: 1975 "DATE: "REVISION: 2.2 FEB 76 CHANGED SENTRY --- USEAGE --- "SAMPLE CALL: JSR SIN, JSR CØS "ARGUEMENT: X IS IN DPX(DPA) "ANSWER: SIN(X) OR COS(X) IS LEFT IN DPX(DPA) "SCRATCH: DPX(0-2), DPY(0-3) --- ERROR CONDITIONS --- "NØNE --- ALGØRITHM --- *1. IF COS(X) IS DESIRED, COS(X) = SIN(X+PI/2) "2" X IS MULTIPLIED BY 2/PI, AND SPLIT INTØ (I + F WHERE I IS AN INTEGER AND F A POSITIVE ** BETWEEN 0 AND .9999999999... IF I IS ØDD, F<= (1.0 = F) *4• SIN(F) = A + BF**3 + CF**5 + DF**7 + EF**9 THE POLYNOMIAL IS FROM 10 HART & CHANEY #3341 (PRECISION 8-27=5* RANGE O TØ PI/2 IF I/2 IS ØDD, SIN(F) IS NEGATED "NØTE: THE PØLYNØMIAL IS FACTØRED AS: (AF + F**3(B + CF**2)) + F**7(D + EF**2) "THE TABLE IN TABLE MEMORY IS AS FOLLOWS: "SINTBL:0-63661 97724 2/PI 0.79689 67894 6 E -1 C. 0.999999925 FRACTION MASK (1000,37 1.0 ØDD BIT MASK 0-15707 96318 44 E 1 A (PI/2) 0.15148 5129 E-3 E - • 64596 37105 99 B --46737 6661 E-2 D 2.0 NEXT ODD BIT MASK ``` APAL D-2 | | 004306 | "TABLE | LUCATION FM:<br>SINTBL SEQU ISNCS | | |--------|--------------------------------------|-----------------|-------------------------------------------------------------------------------------------|------------------------------------------------------| | | 000000 | "DATA P | AD X:<br>X \$EQU O | | | | 000001 | | TEMPX SEQU 1 | | | | 000000 | "DATA P | AD Y:<br>C SEQU O | | | | 000000 | . 4 | AF SEQU O | | | | 000001 | | F SEQU 1 | | | | 000001 | | F2 SEQU 1 | | | | 000001 | | F3 \$EQU 1 | | | 000000 | 000003<br>103000<br>002000<br>004312 | "CØME H<br>CØS: | ERE FØR CØSINE(X)<br>LDTMA; DB=SINTBL+4 | "FETCH PI/2 (A) | | 000001 | 000003<br>103000<br>002000<br>004306 | | DB=SINTBL; LDTMA | "FETCH 2/PI | | 000002 | 000001<br>142000<br>000400<br>000000 | | FADD TM.DPX(X) | "DØ X*2/PI | | 000003 | 000001<br>100000<br>000000<br>000001 | | FADD; INCTMA | "FETCH C | | 000004 | 000000<br>000124<br>000000<br>016001 | | FMUL TM, FA;<br>INCTMA;<br>BR CØMMØN | "DØ X*2/PI " FETCH FRACTIØN MASK " GØ DØ SIN(X+PI/2) | | | | | ERE FØR SINE(X) | • • • • • • • • • | | 000005 | 000003<br>103000<br>002000<br>004306 | SIN: | LDTMA; DB=SINTBL | "FETCH 2/PI | | 000006 | 000001<br>000000<br>000000 | | INCTMA | "FETCH C | | 000007 | 000000<br>000000<br>000400<br>016401 | | FMUL TM, DPX(X); INCTMA | "DØ X*2/PI<br>" FETCH FRACTIØN MASK | | 000010 | 000000<br>000000<br>017000<br>110001 | Cømmøn: | FMUL; INCTMA; DPY(C) <tm apal="" d-3<="" td=""><td>"FETCH ØDD BIT MASK " SAVE C</td></tm> | "FETCH ØDD BIT MASK " SAVE C | The second secon | 00011 | 000000<br>000000<br>047005<br>010001 | | FMUL; D;<br>INCTMA | . EMPX) < TM; | "SAVE FRACTIO. MA " FETCH A | SK | |--------|--------------------------------------|---------|---------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------|------| | | | "ØH, TH | WE CAN GET | THE FRACTIONS TO TO SEE IF | PØINT AND<br>NAL PART ØF A NUMBE<br>THE INTEGER PART ØF | | | 000012 | 000002<br>112000<br>047505<br>000000 | | FAND FM.DI | YK(TEMPX); | "GET F = FRAC(2X/ " SAVE ØDD BIT M | | | 000013 | 000002<br>112000<br>140504<br>000000 | | FAND FM.DI<br>DPX(X) <i< td=""><td>PX(TEMPX);<br/>FM</td><td>"SEE IF I IS ØDD " SAVE 2X/PI</td><td></td></i<> | PX(TEMPX);<br>FM | "SEE IF I IS ØDD " SAVE 2X/PI | | | | | "HERE W | HØPING THA | HEAD AND STAR<br>AT WE AREN'T<br>FØURTH QUADR | | | | 000014 | 000001<br>021000<br>020540<br>134000 | Ø DD ŧ | FMUL DPY(CDPY(F)< | ),FA; | "DØ C*F<br>" SAVE F | · | | 000015 | 000001<br>155000<br>000050<br>015000 | | FMUL DPY(F | F),DPY(F);<br>Mø,ZERØ | "DØ F**2<br>" PUSH 1-F ØUT | | | 000016 | 000000<br>000456 | 11 | WHICH WE N<br>S NØT AN IN<br>IF I IS ØI<br>THE SECØNI | JØW TEST FØR. JFINITE LØØP DJ. AND WE CØ TIME WE WILL O.O + O.O. W ZERØ PY(F); | ME THIS WAY AGAIN,<br>L BE TESTING THE RE | SULT | | | 000050<br>017000 | | PLAT OD | | BRANCA BACK IT | 1 13 | | 000017 | 000000<br>000000<br>000050<br>011000 | "NØW WE | ARE INTO T | THE PØLYNØMIA<br>PY(F) | L FØR GØØD **DØ CF **F | | | 000020 | 000000<br>000000<br>030050<br>131001 | | FMUL FM.DF<br>DPY(F2)<<br>INCTMA | the state of s | "DØ F**3<br>" SAVE F**2<br>" FETCH E | | | 000021 | 000000<br>000000<br>030050<br>115001 | | FMUL DPY(F<br>DPY(AF) <<br>INCTMA | 72), DPY(F2);<br>FM;<br>APAL D-4 | "DØ F**4 " SAVE AF " FETCH B | | | | | | | | | | | 0000 | 22 000000<br>000000<br>140055<br>017001 | | FMUL TM, DFY(F2); DPX(TEMPX) <fm; ingtma<="" th=""><th>"DØ E * F**2 " SAVE CF2 " FETCH D</th></fm;> | "DØ E * F**2 " SAVE CF2 " FETCH D | |----------|-----------------------------------------|------------------|-----------------------------------------------------------------------------------------------------|------------------------------------------------------| | 0000 | 142000 | | FADD TM.DPX(TEMPX); FMUL; | "DØ B+CF2 | | | 030500<br>130000 | | DPY(F3) <fm< td=""><td>** SAVE F**3</td></fm<> | ** SAVE F**3 | | 0000 | 000001<br>100000<br>047055<br>011000 | | FMUL FM.DPY(F3); DPX(TEMPX) <tm; fadd<="" td=""><td>"DØ F**7<br/>" SAVE D</td></tm;> | "DØ F**7<br>" SAVE D | | 0000 | 25 000001<br>112000<br>000550<br>014001 | | FMUL DPY(F3), FA; FADD FM, DPX(TEMPX); INCTMA | "DØ F**3 * (B+CF2) " DØ D + EF2 " FETCH NEXT ØDD BIT | | 0000 | 26 000001<br>100000<br>000000<br>010000 | | FMUL; FADD | "WAIT | | 0000 | 27 000002<br>142000<br>000400<br>010000 | "HERE W | E SEE IF MØD(I/2,2) IS<br>FMUL FM,FA;<br>FAND TM,DPX(X) | 1 ØR 0<br>"DØ F**7 * (D+EF2)<br>" SEE IF I/2 IS ØDD | | 0000 | 000001<br>113000<br>000040<br>010000 | | FADD FM,DPY(AF); FMUL | "DØ AF + (BF3+CF5) | | 0000: | 100000 | | FMUL; FADD | "WAIT | | | 000000 | | | | | 0000 | 111463<br>000000 | | FADD FM.FA;<br>BFNE NEG | "DØ (AF+BF3+CF5)+(DF7+<br>" SEE IF WE NEED TØ N | | | 000000 | | | | | 00003 | 100000<br>100000<br>000000<br>000000 | DØNE: | FADD | "WAIT | | 00000 | 000000<br>000340<br>100004 | | DPX(X) <fa; return<="" td=""><td>"STØRE ANSWER AND RETU</td></fa;> | "STØRE ANSWER AND RETU | | | 000000 | | | | | 0000 | 100000 | "CØME HI<br>NEG: | ERE IF WE NEED TØ NEGA<br>FADD | TE THE ANSWER | | * •<br>• | 000000 | | APAL D-5 | | | 000036 | 000001<br>051115 | | ZERU "FA;<br>DØNE | "0.0 - ANSWER<br>" GØ FINISH | |--------|------------------|---|-------------------|------------------------------| | | 0,00000 | • | | | | | 000000 | | | | \$END | **** | 0 | ERR | ØRS | **** | |--------|-----|------|-----|------| | | | • | | | | SYMBØL | VAL | .UE | | | | SINTBL | 004 | 306 | | | | X | 000 | 000 | | | | TEMPX | 000 | 1001 | | | | C | 000 | 0000 | | | | AF | 000 | 000 | | | | F | 000 | 0001 | | | | F2 | 000 | 0001 | | | | F3 | 000 | 001 | | | | CØS | | 000 | ENT | | | SIN | 000 | 005 | ENT | | | CØMMØN | 000 | 010 | | | | ØDD | 000 | 014 | | | | DØNE | 000 | 033 | | | | NEG | | 035 | | | (INTENTIONALLY BLANK) # AP-120B APLINK ARRAY PROCESSOR LINKING LOADER MANUAL 7276-02 FPS-7276-01 C FLOATING POINT SYSTEMS, INC. 1976 ALL RIGHTS RESERVED PRINTED IN THE UNITED STATES OF AMERICA REVISION 02, MARCH 30, 1976 # TABLE OF CONTENTS | Section 1 - INTRODUCTION 1.1 Introduction | 1-1<br>1-1 | |---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------| | Section 2 - (PRESENTLY OMITTED) | | | Section 3 - OPERATING PROCEDURE 3.1 Load, "L" 3.2 Symbols, "S" 3.3 Undefined, "U" 3.4 Next Base, "B" 3.5 Reset, "R" 3.6 Force, "F" 3.7 Memory, "M" 3.8 End, "E" 3.9 End with Assembly Code, "A" 3.10 Set Number Radix, "N" 3.11 Exit, "X" 3.12 An Example Loading Session | 3-1<br>3-2<br>3-3<br>3-3<br>3-4<br>3-4<br>3-4<br>3-5<br>3-5<br>3-5<br>3-6 | | Section 4 - ERROR MESSAGES | 4-1 | | Appendix A | A-1 | | Appendix B | B-1 | #### SECTION 1 INTRODUCTION #### 1.1 INTRODUCTION APLINK links separate object modules produced by APAL together into a single load module for execution by the AP-120B hardware or the simulator. The user can separately code and assemble a main line program and the associated subroutines, and later link them together for execution. APLINK serves this purpose be performing the following tasks: - 1. Relocating each object module and assigning absolute addresses. - 2. Linking the modules together by correllating global entry symbols defined in one module with external symbols referenced in another module. - 3. Selectively loading modules from program library, - 4. Optionally producing a load map showing the layout of the load module. APLINK is written in Fortran IV and requires roughly 10K of available memory in which to operate. INTENTIONALLY BLANK # SECTION 3 OPERATING PROCEDURE Program modules are linked interactively via a dialogue between the user and APLINK. The user enters a series of commands which direct the linking process. When execution begins, APLINK outputs: APLINK ### The "###" is the version number of APLINK. The asterisk ("\*") indicates that the program is ready to accept commands. After each user command, an "\*" is typed when that command has been completed, and APAL is ready for a new command. An illegal command will cause a "?" to be output. To load his relocatable programs and prepare them for execution, the user would normally follow the procedure outlined below: - 1. Using the "L" (load) command, load the file or files containing the desired main program, required subroutines, and library subprograms, if any. If a fatal error occurs during this step, the user must reinitialize using the "R" command, and repeat this step. - 2. Using the "U" (undefined) command, check to see if any global symbols are still undefined. If nothing is output from this command, continue to step 3. If any symbols are output, it usually means that there was an error in one or more of the programs loaded, or that the loading sequence was wrong. In these cases, the user should correct the error and restart the loading operation from step 1. - 3. Obtain the memory limits of the loaded program and/or a loader map, by using the "M" (memory) or "S" (symbols) command. - 4. Complete and output the load module by using the "E" (end) or "A" command. Note the values of HIGH and START as well as the possible presence of any remaining undefined symbols. - 5. Return to the operating system with an "X" (exit) command. The individual APLINK commands are described in the following sub-sections, and a complete example loading session is given in section 3.12. The three following abbreviations are used in the following sub-sections: ## Abbreviations ## Meaning filename A user specified input or output file. The "Filename" follows whatever naming conventions exist for the particular host computer operating systems. Carriage Return Indicates characters output by the program. The examples given are <u>illustrative</u> only, as file and I/O device names will vary from system to system. ### 3.1 LOAD, "L" To load a program module, or a program library enter: L↓ filename ↓ where "filename" is the name of the file containing the desired program or library. Example: \* L $\downarrow$ $\overline{F}FT$ . RB $\downarrow$ Loads a program from file FFT. RB. #### 3.2 SYMBOLS, "S" To output the global (external and entry) symbols enter: S ↓ filename ↓ where "filename" is the name of the file (or I/O device) to receive the symbol listing. The output of the loader map is as follows: HIGH = aaaaaa SYMBOL TABLE SYMBOL VALUE sssss nnnnn U where: aaaaaa Highest program address so far loaded. Normally, the next program will be loaded starting at location HIGH+1. SSSSSS symbol name nnnnnn symbol value; if undefined, the last location loaded which referenced this symbol. U If present indicates the symbol is as yet undefined. An example command: \*S ↓ <u>L</u>P:↓ Dumps the loader symbol table onto the line printer. #### 3.3 UNDEFINED, "U" To output to the console any presently undefined global symbols enter: U↓ filename↓ where "filename" is the file to receive the list of undefined symbols. The list format is: SSSSSS nnnnn where "ssssss" is the symbol name and "nnnnnn" is the location of the last program instruction which referenced the symbol. An example command: \*U ↓ TP:↓ Prints the names of any undefined symbols on the teletype. ### 3.4 NEXT BASE, "B" To specify a base address at which to load the next program, enter: B↓ loc↓ where "loc" is the location specified. An example: \*B ↓ 200 ↓ Sets the next location loaded to location 200. #### 3.5 RESET, "R" To reset APLINK, enter: R↓ This reinitializes the program to its initial state. The symbol table is cleared, any previously loaded programs are disregarded, and the next location is set to zero. This command must be given following a fatal error. #### 3.6 FORCE, "F" To force loading of a program module from a library, enter: F↓ name↓ where "name" is the name of the symbol to be forced. This command enters "name" into the symbol table as an external symbol. This will cause the loading of a library program which has "name" as an entry symbol. An example: \*F↓ DOTPRD↓ Forces the loading of any program defining symbol "DOTPRD" from any subsequently loaded library file. #### 3.7 MEMORY, "M" To get the address of the highest program source memory location so far loaded, enter: М↓ The information is printed as follows: HIGH = aaaaaa where "aaaaaa" is the highest address so far loaded, and "bbbbbb" if present, is the load module starting address. #### 3.8 END, "E" To end a load module and output the completed load module for use with APDEBUG, enter: E↓ filename↓ where "filename" is the name of the file to receive the loader output. The output is a "core image" which can be loaded by APDEBUG and executed by either the simulator APSIM, or the hardware. APLINK outputs the following information to the user console: HIGH = aaaaaa where "aaaaaa" is the highest program address loaded. If any symbols were still undefined, APLINK outputs: ### UNDEFINED SYMBOLS where "###" is the number still undefined. A value of $\emptyset$ was used in linking these undefined symbols. \*E↓ SAVE Stores the completed load module into file "SAVE". The "E" (or "A") command causes links between global symbols in the completed load module to be frozen. The load module can be output again (with another "E" or "A") but no further links can be added (with an "L"). To work on another load mode, a reset ("R") command must be given to clear the linker. #### 3.9 END with ASSEMBLY CODE, "A" To end a load module and output the completed load module as host computer assembly code (for use with APEX), enter: $A \downarrow$ filename $\downarrow$ where "filename" is the name of the file to receive the loader output. This output is a short host assembly language subroutine, which is the linkage between host computer Fortran "CALL's" and the AP-120B executive. The AP-120B code from the load module follows the host subroutine as assembly language data statements. Information concerning the highest address loaded into, and any undefined symbols, are output to the user console as described above for the "E" command. #### 3.10 NUMBER RADIX, "N" To set the radix for numeric input/output to and from the user console, enter: N↓ radix↓ where the radix is either 8 (for octal), 10 (for decimal), or 16 (for hexadecimal). The default radix for user I/O is set to either of these choices at installation. # 3.11 EXIT, "X" To exit to the operating system, enter: Χţ Note: X does $\underline{\text{not}}$ cause any output. An "E" or "A" must be used to output a load module. #### 3.12 AN EXAMPLE LOADING SESSION >>RUN APLINK APLINK REV 1 \*L BENCH:RB APLIB \*S /TTØ HIGH=000116 SYMBOL TABLE' SYMBOL VALUE BENCH 000000 DIV 000063 \*E TEMP HIGH=000116 \*X STOP >> The user runs APLINK. He then loads his program, in file BENCH:RB. Since Bench uses the scalar divide subroutine, he links in the library of AP-120B subroutines, APLIB. APLINK extracts from the library any subroutines needed by BENCH, in this case DIV. The user prints out the loader map, and then ends with an "End" command, putting the linked-up code into file TEMP. To debug the program, the user would run APSIM, where he could execute the code put into TEMP. Another example loading session. RUN APLINK APLINK REV 1 \*F POLAR \*L APLIB \*S /TTØ HIGH=000166 SYMBOL TABLE' SYMBOL VALUE POLAR 000000 SQRT 000133 ATN2 000023 ATAN 000035 DIV 000077 \*A TEMP HIGH=000166 \*X STOP ` ` The user runs APLINK. He wants to install subroutine POLAR into his computer operating system, so that he can CALL POLAR from Fortran. The "F" (force) command sets up APLINK so that it will load in POLAR from the library APLIB. The loader map shows that POLAR used subroutines SQRT (square root), ATN2 and ATAN (arc-tangent), and DIV (divide). The "A" command stores the linked-up code as host assembly language in file TEMP. The host assembly code in TEMP is assembled by the host computer assembler and the resulting relocatable binary saved where it can be linked with Fortran programs. INTENTIONALLY BLANK #### SECTION 4 ERROR MESSAGES Any deviation from the prescribed command syntax will cause APLINK to output a "?" to the user console. The illegal command is ignored, and APLINK outputs a "\*" to indicate its readiness to accept a new command. If a specified "FILENAME" cannot be found, or is otherwise unavailable for use the message: #### FILE NOT FOUND!!! is outputted and the command is ignored. The specific error messages outputted by APLINK are the result of loading errors detected during execution of an "L" (load) command. There are two classes of loading errors: - F Fatal. Reinitialization of the loader (the "R" command) is required before loading can continue. - W Warning. An advisory message indicating a nonerror. Any fatal error detected during loading will cause immediate termination of the "L" (load) command following the error message. If the user attempts to execute another "L" command, the program will output the message: #### RESET!!! and ignore the command. After the user reinitializes the loader ("R" command) he must reload any programs loaded up to that point. Following are the error messages, along with notes of explanation for each: - F SYMBOL TABLE OVERFLOW The loader symbol table is full. The only recourse is to recompile APLINK with a longer symbol table size. - F PROGRAM MEMORY OVERFLOW nnnnnn An attempt was made to load past the upper limit of Program Source Memory. The load module is too large to fit in program memory. "nnnnn" is the memory location involved. - F OVERWRITE nnnnnn An attempt was made to overwrite a previously loaded program memory location. The loader does not permit any given program memory location to be loaded more than once. "nnnnn" is the program memory location involved. - F ILLEGAL BLOCK TYPE nnnnnn An illegal relocatable object code block type was encountered. The File specified does not contain legal object code. "nnnnnn" is the illegal block type, as read from the block header in question. #### W MULTIPLE ENTRY An \$ENTRY symbol having the same name as one already defined was encountered during a load. The name and value of the offending symbol is output to the console: SSSSS nnnnnn where "ssssss" is the symbol name and "nnnnnn" the symbol value. The loader proceeds by ignoring the latest definition. #### W MISSING OR IMPROPER ENTRY The user attempted to put out host assembly code (an "A" command) from a load module which either 1) did not have any entry points (defined entry global symbols), or 2) the first entry point loaded did not have an S-Pad parameter count. # APPENDIX A SUMMARY OF APLINK COMMANDS These abbreviations are used: | Symbol | Meaning | |-----------------|---------------------------------------------------------------------------------------------------| | ↓<br>filename | Carriage Return Name of a file, as appropriate for the host operating system being used. | | loc<br>name | A location, in octal or hex as appropriate A symbol name, 6 characters or less. | | Command | Effect | | L↓<br>filename↓ | Load the program in file FILENAME, link with previously loaded programs. | | S↓<br>filename↓ | Output the loader symbol table to file FILENAME. | | U↓<br>filename↓ | Output any undefined symbols to file FILENAME. | | B↓<br>loc↓ | Set APLINK to load the next program at location LOC. | | R↓ | Reset the loader. | | F↓<br>name↓ | Force the loading of a program defining symbol Name from any subsequent program libraries loaded. | | M↓ | Output the highest program memory location used. | | E↓<br>filename↓ | End the loading session. Store the resulting load module into file FILENAME. | | A↓<br>filename↓ | End the loading session. Output host computer assembly code for use with APEX into file FILENAME. | | N↓<br>number↓ | Set the Radix for numeric user console I/O to either 8, 1 $\emptyset$ , or 16. | | Χψ | Exit to the operating system. | INTENTIONALLY BLANK # APPENDIX B RELOCATABLE OBJECT CODE BLOCK TYPES Unlike most relocatable binary, the relocatable object code produced by APAL consists of numbers written as decimal integer characters. Those were output (and readable) by Fortran formatted I/O statements. An advantage is that relocatable library files may be edited with an ordinary text editor. This makes unneccessary the need for a special-purpose Librarian or Library File Editor. The relocatable object code is divided into a series of blocks. The order in which blocks appear, if each type is present, is as follows: (the block type number is in parenthesis). - 1. Title Blocks (3) - 2. Entry Blocks (4) - 3. Code Blocks (0) - 4. External Blocks (5) - 5. End Block (1) An object module contains at least a Title Block and a Start/End Block. The presence of one or more of the other block types will depend upon the particular program. The first line of each block is a block header, which contains four seven-digit numbers: - 1. Block type numbers - 2. Number of items in the block - 3. Initial address, if relevant - 4. Unused In addition, the block header is flagged with "\*\*\*" to aid in identification of blocks. Each block type is described below, in numeric order by block type numbers #### B.1 CODE BLOCK $(\emptyset)$ | LINE | | CONT | ENTS: | | | | |-------|---|------|-------|------------|------------|------------| | Ø | ; | | Ø | count | address | Ø*** | | ĺ | | Bits | 0-15 | Bits 16-31 | Bits 32-47 | Bits 48-63 | | 2 | | | . *** | 11 | 11 | tf | | : | | | • . | . 0 | <b>0</b> | • | | • | | | • | • | | • | | count | | | 11 | | 11 | <b>**</b> | Each code line contains a 64-bit plogram source word. | B.2 END BLOCK (1 | <u>.)</u> | | | | |------------------------------------------|--------------|--------------------|--------|----------------------------| | LINE CON 1 | TENTS | 0 | 0 | 0*** | | B. 3 TITLE BLOCK | <u>(3)</u> | | | | | LINE CON<br>0<br>1 | TENTS | 1 0 | 0 | O***TITLE | | B. 4 ENTRY SYMBOL | BLOCK (4) | | | | | LINE CON' | TENTS | | ;<br>! | | | 0 4<br>1 nam<br>2 " | <b>e</b> | count<br>value | 0 | 0***<br># S-Pad parameters | | count " | | • • | | • | | B. 5 EXTERNAL SYM | BOL BLOCK (5 | <u>)</u> | | | | LINE CON' 0 5 1 name 2 " 3 " : : count " | FENTS<br>e | count<br>link<br>" | 0 | 0*** | | B.6 LIBRARY STAR | r BLOCK (6) | | | | | LINE CONTO | rents | 0 | 0 | 0*** | | B. 7 LIBRARY END 1 | BLOCK (7) | | * | | | LINE CONTO | TENTS | 0 | 0 | 0*** | An example relocatable object module, from the Dot Product program. | • | | | | | | |--------|--------|---------|----------|----------|-------------| | 3• | 1. | 0. | 0 • ***T | ITLE | Title block | | DØTPR | 0• | 1.0 | | 7 | TILLE DIOCK | | 4• | 1 • | 0• | 0.*** | 7_ | Entry block | | DØTPR | 0 • | 6. | | 5 | Entry block | | . 0 •. | 9• | 0 • | 0 • *** | | | | 16384• | 0 • | 0. | 48. | - 1 | , | | 16520. | 0 • | 0 • | 48• | 1 | | | 596• | 0• | 18948 • | 0• | . 1 | | | 8257• | 55808• | 0• | 48• | <u> </u> | Code block | | 661 • | 32768. | 256• | 5888• | | Code block | | 8392. | 403• | 0. | 4144. | | | | 0• | 0• | 18948. | 4096 • | . \ | • | | 8257• | 37453. | 0 • | 48• | | | | 16656. | 224. | 0 • | 112. | | • | | 1. | 0• | 0• | 0.*** | . } | End block | | | | | | | | The Title block contains the title of the program, DOTPR The Entry block has the name of the entry point, DOTPR; its relative address, $\emptyset$ ; and the number of expected S-Pad parameters, 6. The Code block contains the 9 AP-120B program words in DOTPR, each as four 16-bit quarters of a 64-bit program word. The End block tells APLINK that it has reached the end of the program. Example output from APLINK produced by an "E" (End) command from the Dot Product program. APDBUG would load this output into either the simulated AP-120B (APSIM), or the actual hardware, for debugging. | | • | | | | |------------------------|-----------|---------|---------|-------| | 9 } | number of | AP-120B | program | words | | 0<br>0<br>48 | word #1 | | | | | 0 0 | word #2 | | | | | 48<br>596<br>0 | word #3 | • | | | | 18948<br>0<br>8257 | | | | | | -9728<br>0<br>48_ | word #4 | | | | | -32768<br>256 | word #5 | | | - | | 5888<br>8392<br>403 | | | | | | 4144 | word #6 | | | | | 0<br>0<br>18948 | word #7 | | | | | 4096<br>8257<br>-28083 | | | | | | 0<br>48<br>16656 | word #8 | | | | | 224 / | word #9 | | 1 | | | | | ÷ | | | Example host assembly code produced by APLINK from the Dot Product program. | | · · | |---------|---------------| | | • TITL DØTPR | | • | · ENT DØTPR | | | • EXTD • APEX | | | -1 | | DØTPR : | - | | DUTPR : | | | | 6• | | | 9• | | 7 | <b>0</b> • · | | | 16384 | | | 0• | | | 0• | | | 48• | | • | 16520 • | | | 0. | | .* | 0• | | | 48• | | | 596• | | | | | | 0. | | | 18948. | | | 0. | | | 8257• | | | -9728• | | | 0• | | | 48• | | | 661 • | | | -32768 | | | 256• | | | 5888 | | | 8392 | | | 403 | | | 0• | | | 4144. | | | 0. | | • | | | | 0. | | | 18948• | | - | 4096• | | | 8257• | | | -28083 | | | 0• | | | 48• | | | 16656• | | .* | 224. | | * | 0• | | | 112. | | | • END | | | | This is the appropriate host computer assembly code for use with Data General Corporation Fortran IV on Nova or Eclipse computers. A CALL DOTPR in Fortran ends up at location "DOTPR" in the Nova, which does a subroutine jump to APEX, the AP-120B executive. If this is the first CALL of DOTPR, then APEX will load the AP-120B program words for DOTPR from Nova memory into AP-120B Program Memory. The "6" following the JSR@ .APEX is the number of arguements expected in the Fortran CALL. "9" is the number of AP-120B program words in DOTPR. "0" is the re; ative starting address of DOTPR. Following these three parameters is the nine AP-120B program words in DOTPR, 16-bits at a time. For each particular host computer, the exact form of the host computer assembly language is different, but the content is the same. INTENTIONALLY BLANK AP-120B APDEBUG DEBUGGER MANUAL 7277- 02 FPS-7277-01 C FLOATING POINT SYSTEMS, INC. 1976 ALL RIGHTS RESERVED PRINTED IN THE UNITED STATES OF AMERICA REVISION 02, MARCH 31, 1976 # TABLE OF CONTENTS | Appendix | A | | |----------|-----------------------------------|-----| | A.1 | Program Execution Commands | A-1 | | A.2 | Register Examination/Modification | A-2 | | | Commands | | | A.3 | Memory Load/Dump Commands | A-3 | | A.4 | Accessable Functional Units | A-3 | | A.5 | Program Word Fields | A-4 | # APPENDIX A AP-120B SUMMARY OF DEBUG COMMANDS #### Abbreviations used below: | Symbol Symbol | Meaning | |---------------|-------------------------------------------------------| | <u> </u> | Carriage Return | | loc | An integer location number | | count | An integer count | | val | An integer value | | fpn | A floating-point number in form acceptable to FORTRAN | | mem | The name of an AP-120B internal memory | | reg | The name of an AP-120B internal register | Debug types a "\*" when ready for further action. A "?" is typed when a command is not understood. ## A.1 Program Execution Commands | B↓<br>mem↓<br>loc↓ | Breakpoint. Delete the last breakpoint and set a new breakpoint at location LOC of memory MEM. MEM must be PS, MD, or TM. | |--------------------|-------------------------------------------------------------------------------------------------------------------------------------------------| | D↓ | Delete. Delete the current breakpoint | | L. | List. List the current breakpoint | | Q+<br>count | Set the continue counter to (COUNT). | | S↓<br>val↓ | Step. If (VAL) is not zero place the AP-120B in step mode. | | I↓<br>val↓ | Initialize. If VAL is not zero, reset the AP-120B before program execution is resumed next | | R↓<br>loc↓ | Run. Begin program execution at Program Source location LOC | | P↓ | Proceed. Begin instruction execution at<br>the Program Source location pointed to by<br>the AP-120B (PSA) (Program Source Address)<br>Register. | | X↓ | Exit to the operating system | #### A.2 Register Examination/Modification Commands $\mathrm{E} \psi$ Examine register. Print out the contents of reg↓ AP-120B register REG $\mathbf{E} \downarrow$ Examine memory. Pring out the contents of AP-120B memory MEM, location LOC mem ↓ loc ± Re-examine the currently open register or memory location (the last location examined) Examine the next higher sequestial memory location of the memory that is currently open Examine the next lower sequential memory location of the memory that is currently open F ↓ Floating Point Flag, affects the input/output of val ↓ 38-bit wide registers and memory locations. 3 integers (Exponent, High Mantissa, Low Mantissa) VAL=O: VAL≠O: floating-point V + Program Source field value flag, affects input/output val. ↓ of program source memory location. VAL=O: 4 integers (the four 16-bit quarters of PS) VAL≠O: Decode into the 24 instruction word field values. C +Change. Change the contents of the currently open val register or memory location to VAL. The format of VAL depends on the width of the current open locations as follows: 16-bit wide registers: an integer of the current radix. 38-bit wide registers: Val three integers in the current radix VAL↓ thich represent the exponent, high VAL↓ mantissa, and low mantissa F≠Ø: FPN → a floating point number legal to Fortran 64-bit wide registers: VAL↓ four integers in the current radix VAL↓ which are the four quarters of an AP-120B VAL↓ program word VAL<sup>↓</sup> V≠O: FIELD is the name of the instruction VAL + field to be changes, VAL is the new integer value. | N↓<br>VAL↓ | Number radix. Set the radix for integer user I/O to VAL, which must be 8 (for octal), 10 (for decimal), or 16 (for hexadecimal). | |------------|----------------------------------------------------------------------------------------------------------------------------------| | O↓<br>VAL↓ | Offset. Sets the base address to which Program Source Memory addresses are relative (for user I/O). | | Z | Zero. Zero out all AP-120B memories and registers. | # A.3 Memory Load/Dump Commands | Y + | Yank. Load memory MEM starting at location | |-----------|--------------------------------------------| | MEM ↓ | LOC from an external data FILENAME. | | LOC + | MEM can be PS, MD, OR TM. | | filename↓ | | | W↓<br>MEM↓ | Write. | Dump memory MEM starting at location | |------------|---------|--------------------------------------| | START. | (START) | and ending at location (STOP) to | | STOP↓ | externa | 1 data FILENAME. | | file name↓ | MEM can | be PS, MD, or TM. | # A.4. Accessable Functional Units AP-120B Functional Units that may be examined or changed using DEBUG: | Memorie | <u>s:</u> | Contents: | |----------------|---------------------------------------------|-----------------------------------------------| | PS<br>MD<br>TM | Program Source Memory Main Data Memory | 64-bit instruction word 38-bit floating-point | | DPX<br>DPY | Table Memory Data Pad X Data Pad Y | 11 11 11 11 11 | | IODEV<br>SP | I/O Devices | 11 11 | | SRS | S-Pad Registers<br>Subroutine Return Stack* | 16-bit integer | | Registe | ers | Content | <u>ts</u> | |---------|-----------------------------------|---------|----------------| | MA | Memory Address | | integer | | TMA | Table Memory Address | . 11 | | | DPA | Data Pad Address | - 11 | | | PS A | Program Source Address | - 11 | | | SPD | S-Pad Destination Address | 11 | | | STAT | AP-120B Internal Status Register | 11 | | | DA | I/O Divice Address | 11 | | | SWCH | Panel Switch Register | 17 | | | LGTS | Panel Lights Registerr | 31 | | | MDR . | Memory Read Data Buffer* | 38-bit | floating point | | TMR | Table Memory Data Buffer* | 11 - | <b>3</b> 1 | | ΜI | Memory Input Register* | 11 | | | DPBS | Data Pad Bus* | 11 | | | INBS | Input Bus* | 7.7 | • | | PNBS | Panel Bus* | 16-bit | integer | | SPFN | S-Pad Function | | | | FLAG | Program flags* | tt. | | | SRA | Subroutine Return Stack Address* | 11 | | | A1 | Floating Adder Input Reg. #1* | 38-bit | floating point | | A2 | Floating Adder Input Reg. #2* | 11 | | | FA | Floating Adder Output* | 11 | | | Ml | Floating Multiplier Input Reg. #1 | * " | | | M2 | Floating Multiplier Input Reg. #2 | | | | FM | Floating Multiplier Output* | 11 | | | | | | • | <sup>\*</sup>Accesable only when using the AP-120B Simulator # A.5 Program Word Fields Fields within an instruction word that may be examined or changed by name: | • | • | |---------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Name | Program Word Bits | | D SOP SH SPS SPD FADD A1 A2 COND DISP DPX DPY DPBS XR YR XW YW FM M1 M2 M1 MA DPA TMA | 0<br>1-3<br>4-5<br>6-9<br>10-13<br>14-16<br>17-19<br>20-22<br>23-26<br>27-31<br>32-33<br>34-35<br>36-38<br>39-41<br>42-44<br>45-47<br>48-50<br>51<br>52-53<br>54-55<br>56-57<br>58-59<br>60-61<br>62-63 | | SOP1 SPEC STST HPNL SPSA PSEV PSOD PS SEXT FAD1 IO LREG RREG IOUT SNSE FLAG CONT | 6-9<br>6-9<br>10-13<br>10-13<br>10-13<br>10-13<br>10-13<br>10-13<br>17-19<br>17-19<br>20-22<br>20-22<br>20-22<br>20-22<br>20-22<br>20-22<br>20-22 |