## MIT/LCS/TR-248 # ALGORITHMS FOR INTEGRATED CIRCUIT LAYOUT: AN ANALYTIC APPROACH Andrea Suzanne LaPaugh # Algorithms for Integrated Circuit Layout: An Analytic Approach Andrea Suzanne LaPaugh November 1980 © Massachusetts Institute of Technology The research reported here was supported in part by a Xerox Special Opportunity Fellowship and National Science Foundation grant MCS 78-05849. MASSACHUSETTS INSTITUTE OF TECHNOLOGY LABORATORY FOR COMPUTER SCIENCE CAMBRIDGE MASSACHUSETTS 02139 # Algorithms for Integrated Circuit Layout: An Analytic Approach by #### Andrea Suzanne LaPaugh Submitted to the Department of Electrical Engineering and Computer Science on August 29, 1980 in partial fulfillment of the requirements for the Degree of Doctor of Philosophy #### ABSTRACT In this thesis, the problem of designing the layout of integrated circuits is examined. The layout of an integrated circuit specifies the position on the chip of functional components and wires interconnecting the components. We use a general model under which components are represented by rectangles, and wires are represented by lines. This model can be applied to circuit components defined at any level of complexity, from a transistor to a programmable logic array (PLA). We focus on the standard decomposition of the layout problem into a placement problem and a routing problem. We examine problems encountered in layout design from the point of view of complexity theory. The general layout problem under our model is shown to be NP-complete. In addition, two problems encountered in a restricted version of the routing problem -- channel routing -- are shown to be NP-complete. The analysis of heuristic algorithms for NP-complete problems is discussed, and the analysis of one common algorithm is presented. The major result presented in this dissertation is a polynomial time algorithm for a restricted case of the routing problem. Given one rectangular component with terminals on its boundary, and pairs of terminals to be connected, the algorithm will find a two-layer channel routing which minimizes the area of a rectangle circumscribing the component and the wire paths. Each terminal can appear in only one pair of terminals to be connected, and the rectangle used to determine the area must have its boundaries parallel to those of the component. If any of the conditions of the problem are removed, the algorithm is no longer guaranteed to find the optimal solution. Thesis Supervisor: Ronald L. Rivest Title: Associate Professor of Computer Science and Engineering Key words: VLSI (very large scale integrated) circuit layout, component placement (rectangles), channel routing, NP-completeness, algorithm analysis, heuristic algorithms. #### **Acknowledgements** I would like to thank my thesis advisor, Ronald Rivest, for his suggestions and support. My discussions with him — both on general research issues and on technical details — have been invaluable. I would also like to thank my readers, Jonathan Allen and Gary Miller, for contributing their viewpoints to this thesis. In addition, I would like to thank Alan Baratz, for many interesting discussions on VLSI design and layout; Ron Pinter, for carefully reading a draft of this dissertation; and Errol Lloyd for an enjoyable collaboration studying heuristic algorithms for channel assignment. I would like to thank my friends at the Laboratory for Computer Science for their fellowship and support. Finally I would like to thank my husband, Michael Lipkowitz, for his encouragement and patience throughout the writing of this thesis. en en magazia, parmente ha tradición de la comercia. grands and a second state of the second #### **Table of Contents** | Abstract | 2 | |-----------------------------------------------------------------|------------| | Acknowledgements | | | 1 Introduction | 6 | | 2 A Review of Layout Automation Techniques | 10 | | 2.1 Classical Approach to Layout | 10 | | 2.2 Alternate Approaches to Layout Automation | 22 | | 2.3 Summary | 23 | | 3 Models for Integrated Circuit Layout | 24 | | 3.1 The Geometric Model | 24 | | 3.2 A Graph Embedding Model | 29 | | 4 Complexity of Layout Problems | 37 | | 4.1 NP-Completeness and Heuristic Algorithms | 37 | | 4.2 Placement and Routing: NP-complete Formulations | 40 | | 4.2.1 The point model: some known results. | 40 | | 4.2.2 The rectangle model: a new result. | 42 | | 4.3 NP-completeness in Channel Routing | 48 | | 4.3.1 Channel assignment within a street. | 50 | | 4.3.2 Channel assignment with intersections. | 55 | | 4.4 Heuristics for Channel Assignment | 67 | | 5 An Algorithm for Routing Terminals on a Rectangular Component | 78 | | 5.1 Preliminaries | <i>7</i> 8 | | 5.2 Reduction to Maximizing Matchings | 82 | | 5.3 Defining Regions | 86 | | 5.4 Assignment within Designs | | | 5.5 Handling a "failure" | 100 | |--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------| | 5.6 Running Time of the Algorithm | 130 | | 5.7 Summary | 134 | | Appendix: Notation | 135 | | 6 Discussion of the Algorithm | 138 | | 6.1 Removing Assumptions | 138 | | 6.2 A Special Case for Procedure MAX-MATCH | 140 | | | 145 | | 6.3.1 Removing the separate layer - separate direction restriction. | 145 | | And the second was an analysis of the second | 148 | | 6.4 Summary | 156 | | 7 Summary and Directions for Research | 159 | | estimien opis och modern som som som skalt som sentet ett fraktive skritter i det i store i som i som i som i | | | | 165 | | Annotated Bibliography | 167 | | and applying the first of the original his and the state of the original and the plant of the plant of the transfer | -: * | | Biographical note | 175 | and the control of ,我们还是我们的人,我们就是一个人,我们就是我们的人,我们就是我的事情,我们就是我们的事情,不要做到了。" "我们我们我们就是我们的人,我们就是我们的人,我们就是我的事情,我们就是我们的人,我们就是我们的人,我们就是我们的人,我们就是我们的人,我们就是我们的人,我们 and the second of o [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [1] 15 [ Control of the control of the state of the ang kanananan mengantah di kananan mengantah di kepada dan kebanan di kepada dan berasah dan kebanan dan berasa Control of the Contro #### Chapter 1: Introduction The research reported in this thesis is an investigation of algorithms for the layout of integrated circuits. Integrated circuits are formed on silicon chips by creating layers of different substances (e.g. metal, polysilicon) in geometric patterns on the chip through a variety of fabrication techniques. Electronic components are formed by the interaction of regions in the different layers. Wires connecting components are simply regions between two components on a layer. Laying out a circuit consists of determining the patterns for each layer on the chip to create the desired components and interconnections. For example, in nMOS/FET technology; a designer creates a transistor by drawing a region for the polysilicon layer and a region for the diffusion layer which cross when the designs for the two layers are superimposed [Me80]. In general, designing a layout requires knowledge of the interactions between layers for the technology being used and limitations of the fabrication process being used. The goals of the designer are to put as much circuitry as possible in as small an area as possible, and to have it work correctly and as fast as possible. A good example is the layout of a microprocessor, where the amount of information which can be processed, the number of functions which can be performed, and the speed of processing are important. The layout problem as described above contains a huge number of variables and leaves much room for cleverness by the person designing the layout. It does not lend itself well to an algorithmic approach where a well-defined model and set of operations are employed. However, standard layouts can be used for each component needed in a circuit. These components need not be simple electronic components such as transistors or resistors, but may be logic gates or even higher level circuit subsystems. Given these components and the interconnections necessary to realize the desired circuit function, the layout problem consists of allocating a proper-sized region of the chip for each component and determining the pattern of wires forming the interconnections on each layer. This is the layout problem as we will mean it. We have lost the flexibility of tailoring the layout of each component to the particular application, but have greatly simplified the problem. In general, we will wish to place a set of components on a possibly multi-layered planar surface. The components will contain terminals, which are points to which wires can connect. The desired interconnections will be specified by giving disjoint sets of terminals. Each set of terminals, called a net, should be interconnected. The interconnections will be made by wires which define paths between terminals in the layers. There will be constraints which the layout must satisfy, such as a minimum separation between unconnected wiges. These constraints are called design rules. The major motivation for developing algorithms to solve the layout problem is the complexity of the integrated circuits being designed. A chip may now contain tens of thousands of transistors. Hand layout of these integrated circuits, even with the aid of computerized graphics, is very costly, time consuming, and error prone, Standardized components, such as logic gate cells, are already employed in the industry to simplify circuit design and layout, and layout by computer has been implemented [Fe76] [Per77]. The tradeoffs are similar to those in computer programming. Programming in assembly language gives a programmer the freedom to devise clover ways to make a program run faster or require less storage. However, when many large, complicated programs need to be written, the savings in time and the conceptual simplification gained by using a high level language and compiler are worth the loss of flexibility. Just as one might write an often used subroutine in assembly language, one might do the layout for a circuit subsystem to be used in many circuits by hand. However, for most projects, the savings in time and the added faith in the correctness of the design due to the simplified structure make the component approach preferable. Although working automated layout systems do exist, the problem of designing algorithms to do integrated circuit layout is not solved. Many of the existing algorithms place further restrictions on the problem. Typically, they require that all components be the same size in one dimension. Such an algorithm places the components in rows, forming an array. The wiring (called routing) is done in the spaces between rows. Also, although several algorithms have been designed and implemented, little analysis of the algorithms has been performed. Empirical evidence is often cited in the literature. However, papers describing algorithms which may not find optimal solutions rarely present mathematical analyses of the quality of the solutions found. For example, one does not find statements of the form "this algorithm always finds a solution within 50% of the optimal." In this dissertation, we discuss layout algorithms from a complexity theory point of view. We focus on the performance of algorithms, both in terms of the quality of the solutions they produce and the running times they require. We work with subproblems of the layout problem since they are easier to approach and are commonly encountered. An example of such a subproblem is the routing problem when components have been placed in rows. The thesis is organized as follows. In Chapter 2, we review the techniques used by existing automatic layout systems. In Chapter 3, the model we will use for layout problems is presented. We also describe a second model — the graph theoretic model — which has proven very useful in characterizing the area required by various interconnection patterns. In Chapter 4, we discuss the complexity of a number of versions and subproblems of the layout problem. A review of previously known NP-completeness results is given. We prove the NP-completeness of a rectangle placement problem and two problems encountered in channel routing. We analyze a previously known heuristic algorithm for one of the channel routing problems. In Chapter 5, a new algorithm is presented for a special case of the routing problem. In this problem, terminals lie on the boundary of one rectangular component. Pairs of terminals must be interconnected. The algorithm finds a minimum area routing for a channel routing model and has running time $O(t^3)$ , where t is the number of terminals. By developing this algorithm, we have shown that there are non-trivial routing problems which are not NP-hard. Most routing problems are either known to be NP-hard or are so closely related to NP-hard problems as to be considered intractable without an actual proof of NP-hardness. In Chapter 6, we discuss properties of the algorithm. In Chapter 7, we summarize and present open problems and directions for future research. A review of basic definitions and notation used throughout this thesis is presented in the appendix. #### Chapter 2: A Review of Layout Automation Techniques #### 2.1 Classical Approach to Layout The problem of placing components on a surface and making the required interconnections in one or more layers is not new. Research on the layout problem was initially done for printed circuit boards in the 1960's. The layout problem for printed circuit boards is closely related to that for integrated circuits -- components are placed on a board and interconnections are made by conducting strips (wires) printed on two or more layers of the board. Wires on different layers are insulated from each other, but wires on the same layer must not cross unless a connection is intended. Depending on the manufacturing technique, a conducting path may be able to change layers only at fixed positions on the board (called fixed vias) or anywhere on the board (called floating vias). Some researchers have used models for the layout problem which they intend to apply to both printed circuit boards and chips [Han76]. However, as we shall discuss below, the objectives and assumptions for printed circuit boards and integrated circuits are different. Traditionally, layout of circuits has been divided into two phases — the placement phase and the routing phase. Separate algorithms have been designed for each. In the placement phase, components are assigned positions; in the routing phase, the paths which the wires will use are determined. For printed circuit boards, the goal is usually to minimize the total length of wire used. Generally, the board is divided by a rectilinear grid and components can be placed only at certain locations on the grid. Terminals where wires must attach are at fixed positions on each component; these positions match locations on the grid. Automatic layout systems for integrated circuits have borrowed the algorithms and models from printed circuit board research and expanded on them. Most systems use standard cells of uniform size which are arranged in rows and columns, leaving a grid of horizontal and vertical streets in which connections can be made. Examples of standard cell systems can be found in [Fe76], [Per77]. For placement algorithms, components are generally modeled as points and the board or chip as an array of locations on which the components must be placed. For printed circuits, components are usually standard size packages so that size is not a factor. For integrated circuits, a size parameter may be associated with each point and a capacity with each location. For example, in [Sc76] each location represents a row of standard cells and has a capacity which is the length of the row. In the majority of placement problem formulations, the objective is to find a placement which most facilitates the routing to follow. A function of the placement is chosen as an objective function to be optimized. The function is supposed to be an indication of the difficulty of the routing given the placement. Most often the function is an estimate of the total wire length used for routing. Various estimates of the wire length needed to route one net (set of terminals to be interconnected) are used, e.g. the half-perimeter of the smallest rectangle enclosing all terminals of the net; the length of the shortest spanning tree of the net. The estimate must be easy to compute. There are two types of placement algorithms — constructive initial placement and iterative improvement. Most automated layout systems use both, although either can be used alone: iterative improvement can be used with a random initial placement. Constructive initial placement algorithms place components one by one based on their connectivity to components previously placed. In many systems, a designer may choose and place the first components. When the first component is chosen by the algorithm, a special component such as a bonding pad may be chosen, or an arbitrary component may be chosen. Given an initial placement, iterative improvement algorithms move components to improve the objective function. These algorithms are local optimization algorithms. One common technique is to exchange pairs or rearrange larger sets of components and test whether the value of the objective function has been improved. In another method, called Force-Directed Relaxation, a component is moved to a point where the "forces" due to its connections to other components are balanced. A description of various placement algorithms of both types can be found in Chapter 5 of [Des72]. An experimental comparison of several of these algorithms is presented in [Han76]. The routing portion of the layout problem has received much attention in the past ten or fifteen years. Many algorithms have been developed to find "near optimal" solutions to the routing problem when sets of points which must be interconnected are given as input. Most of the algorithms attempt to minimize total wire length. These algorithms almost exclusively use the rectilinear (also known as Manhattan) measure of distance.<sup>1</sup> Routing can be approached a number of different ways. Nets may be connected using Steiner trees, i.e allowing wires to branch at points between terminals; or may be restricted to spanning trees, where no branching wires are allowed. (See Figure 2.1) Connection paths may be allowed to change layers at arbitrary points, at fixed points, or not at all. The tree of connections for each net may be determined before the actual paths are found -- called wire list determination -- or the trees may be determined as paths are laid out. When each path must lie totally on one layer, the layer assignment, i.e. determining which paths will lie on which layer, is often done first. All paths on a layer must be routed so that they do not cross. However, many of the most recent routers for integrated circuits and printed circuits use the restriction that horizontal wire segments and vertical wire segments are on separate layers. (Horizontal and vertical are the perpendicular directions of the rectilinear metric.) Paths are composed of horizontal and vertical segments. Each change of direction is a change of layer. Typically, two layers are used -- one for each direction. In the projection of the layers on one plane, two paths may intersect within perpendicular segments without being electrically connected. When paths can change layers only at fixed points, called vias, the vias may be assigned to particular connections before routing. For printed circuits, the number and location of vias is often restricted due to the fabrication technology. For integrated <sup>1.</sup> Under the rectilinear metric, points $(x_1,y_1)$ and $(x_2,y_2)$ in a cartesian coordinate system are distance $|x_1-x_2|+|y_1-y_2|$ apart. circuits, vias are contact cuts [Me80]. They are usually allowed anywhere, although it is desirable to minimize the number used since they require extra area. Most algorithms to do the actual routing of wire paths fall into one of the following four categories: maze routers, line routers, cell routers, and channel routers. The first algorithms used on printed circuit boards were made routers. Maze routers find one path at at time. They are based on Lee's algorithm [Lee61] for finding the shortest path between two nodes in a graph. In fact, a path can be found between members of two sets of nodes rather than between two specific nodes. The graph used for routing is a grid graph containing forbidden regions. The algorithm is a breadth first search of the usable grid points. (See Figure 2.2.) Multiple layers may be modeled by using one two-dimensional grid for each layer. Each grid point (for unrestricted vias) or each of a special set of grid points (for fixed vias) on one planar grid is adjacent to corresponding grid points on other planar grids. Cellular routers [Hit69] also divide up the routing plane into a regular arrangement of cells. Again, each path is found, one at a time, by a breadth first search over all cells. Each cell represents a region of the routing surface, including all layers. This representation is obtained by projecting all layers on one plane, then partitioning the plane into pieces of uniform size and shape. These pieces are the cells. Each cell is large enough to fit more than one wire width on each layer. The routing algorithm must define the entrance and exit points of each routing path in the set of cells traversed by the path. From a given point on a cell boundary, the reachable points along the cell's boundaries are calculated taking into account wires which have been previously routed and have out off parts of the cell from other parts (Figure 2.3). The determination of the actual paths used within each cell is left for a second algorithm. Cellular routers do not require the large number of grid points which must be represented when using a maze router. Line routers [Hi69] do not partition the plane. These routers work directly with horizontal and vertical line segments. They build up a path between two points out of these segments. A line router Figure 2.1: Steiner tree connections versus spanning tree connections. Figure 2.2: Lee's Algorithm. indicates that the node has a shortest path from the start node of length n All nodes at distance i are labeled, then all the nodes at distance i+1, etc. beginning with i+1. finds paths only in one plane. Paths area determined one at a time. Given two target points to be connected, line segments are extended from the targets until they hit obstacles. Points along these segments from which a line can be extended which will "escape" an obstacle previously hit are found. In this way, "escape lines" are generated until a path between the two target points can be constructed from segments of these escape lines. (See Figure 2.4.) The channel routing technique [Has71] was developed for routings with horizontal and vertical wires on separate layers. The area available for routing is divided into horizontal and vertical streets. Each horizontal or vertical street can contain a fumber of horizontal or vertical path segments, respectively. Paths are first found through the streets without regard to conflicts within each street. After all connections have been globally routed through the streets, the segments within each street are arranged so that no two overlap. The number of parallel lanes or channels is each street is minimized. Short segments perpendicular to the street direction thust be used from each terminal out into the street. For some algorithms, such short segments are also permitted to allow a path segment to change channels. (See Figure 2.5.) The global routing problem is actually a path finding problem on a graph where more that one path may use the same edge. The local routing problem (or channel assignment problem) resembles a packing problem rather than a path-finding problem. These problems will be described in more detail in Chapter 4. The algorithms described above are those most often used by layout automation systems. They do not encompass all algorithms. A survey of routing algorithms can be found in Chapter 6 of [Des72] and in [Hi74]. A study of router performance is described in [Kel77]. None of the algorithms are guaranteed to find an optimal solution unless they are allowed to rip up and reroute — exhaustively searching all possible routings. Furthermore, if the amount of area which can be used is bounded, the <sup>1.</sup> The use of the term channel has become confused in the literature. It is used by some authors to denote a street, by others to denote a lane within a street. We follow the terminology of [Has71]. Figure 2.3: Cellular routing. - entrance/exit point of first path entrance/exit point of second path - - indicates general route of a path - indicate reachable boundaries of a path entering as indicated after first and second paths have been routed 사고 사용 생활기 Figure 2.4: Line routing. algorithms are not guaranteed to route all connections, even though such a routing exists. In practice, failed connections are completed manually. Many systems use a combination of algorithms, beginning with faster algorithms to route most connections and using slower but more successful algorithms to route the final connections (e.g. see [Po79]). Some algorithms are more susceptible to the problem of failing to complete all connections than others. When each point-to-point connection is assigned to a single layer and then routed, the paths found first may encircle a terminal not yet connected to anything. It is impossible to find a path to this terminal without changing layers or rigiping up a previously routed path. (See Figure 2.6.) For routers of this type, the order in which the paths are assigned for a problem can make a great difference in whether the algorithm is successful in routing all the connections. Given this, it is interesting that Abel [Ab72] concludes from his empirical study that, overall, the performance of such routers is not significantly affected by various ordering criteria. (The router used is a maze router with an added heuristic so that paths do not run next to a row of terminals at minimum spacing from the terminals. The paths being avoided would lead to a large number of blocked terminals.) The channel routing technique is less susceptible to the problem of failure to route all connections since the exact position of each segment is determined after all paths are globally assigned. In fact, if area is not limited, the streets can contain an arbitrarily large number of channels and 100% routing can always be obtained. Several characteristics of the models used for past research on the layout problem are very restrictive, especially when considering the layout of very large scale integrated (VLSI) circuits, where up to a million transistors are packed on one chip in a very dense arrangement. The use of cells which have one or both dimensions fixed limits the type of components which can be used. One may imagine having in the same circuit a very large component which is an array of registers and a number of small components realizing a special function. Once the components being used are of varying size, it is a waste of space to place them in an array separated by "streets" running the length and width of the chip. The size of each component and the way the components fit together on the chip are important. Figure 2.5: Channel routing. - terminal which is surrounded. No path can reach this terminal. - spacing indicated here is minimum allowed. When an array of locations is used for cell placement; it is reasonable to use total wire length as a measure of the worth of a particular layout. Different arrangements of components within the array only affect the area of the layout insofar as they affect the interconnection paths taken. The total wire length is an easily computed approximate measure of the amount of routing surface which has been used at any point while routing is being done. This, in turn, is an approximate measure of the congestion of wires in routing areas. Congestion may affect the routability of interconnections still to be made. When the area used for routing between components is fixed, which is usually true for printed circuit boards, too much congestion may result in failure to route some of the interconnections. When the area used for routing can be expanded by moving the components (while keeping their relative positions fixed), as is more likely for integrated circuit design, too much congestion may result in a larger overall layout size. When components are not of relatively uniform size, their placement has great effect on total layout size. Wire length is no longer a good approximation to layout size. This is illustrated in Figure 2.7. The placement of components and routing of wires interact in a much more complex manner to determine the total size of a layout. The most recently developed layout systems no longer treat component size as a parameter which is of secondary importance. Components are modeled as rectangles — giving them shape as well as size — rather than as points with a size parameter attached. When rectangles must be placed on a plane, the way they fit together influences the area used by the placement and the shape of the spaces left for routing. Preas and Gwyn [Pr78] have retained a constructive initial placement based on connectivity but place rectangles in a plane rather than points on a grid. Their iterative improvement phase tries to minimize the area of the circuit by selecting as the candidate for placement modification a component one of whose dimensions contributes to the widest part of the layout in that dimension. This component may be rotated, reflected, shifted, or exchanged with another component. A modification is accepted if the area is reduced. In [La79], the initial placement is produced by dividing a square of area the same as the total area of the components into rectangles of the Figure 2.7: Tradeoff of wire length versus area. total area: 15x15 = 225 total wire length: 2 total area: 17x11, = 187 total wire length; 16 where wire width is 1 unit, minimum spacing is 1 unit same area as the individual components. This gives an approximate placement which is modified to fit the actual components. Improvements are made to the placement by using rotations and shifts. Routing is taken into account in the improvement phase. The width of each street is estimated based on global routing and included in the area calculation. Brinkmann [Br76] also uses the technique of dividing a large rectangle into smaller ones to find an approximate placement. Routing programs for the most recent systems also try to minimize area rather than wire length. Channel routers are used in [Pr78] and [La79]. Channel routers are easy to use when area is the parameter to be optimized since local routing minimizes street widths. Streets can be allowed to shrink or expand as needed to complete the routing. In [Lo79], components are allowed to take positions above the routing area independent of one another so that area is not wasted unnecessarily by aligning the edges of the components. Wire length remains an important parameter for a layout because it directly affects the quality Carter de 1861 e referê and speed of signals in the circuit. In a situation such as that shown in Figure 2.7 where both size and total wire length cannot be minimized at the same time, a tradeoff must be made. In this context, it may be decided that wire length is most important regardless of the resulting circuit size. At other times, only Sario sigli iz sad sa dami a maximum wire length might be imposed on certain interconnections in the circuit. To make things more complex, we might imagine a situation in which the requirement was that two interconnections have approximately the same length, say when two outputs of one component are the inputs to another component. In short, the desired measure for the worth of a layout can be made very complicated if The planting of them that between the enough factors are considered. Physical quantities may depend on other layout properties such as the ាំស្ត្រាស់ សកសាស៊ីពី សំទៅសេចាប់ density of wires in an area. In [Agu77] and [Ru77], a system is described in which total power is minimized and timing constraints are observed. In [No76], connections whose delay must be minimized can be designated "critical" and treated special. #### 2.2 Alternate Approaches to Layout Automation The approaches described in Section 2.1 separate the placement and routing phases of layout. องสมโดยสมาเขาสมาเกราะทำ แปก The input is a set of components -- either specified as points or rectangles -- and a set of interconnections electric de la companya de la companya de la companya de la companya de la companya de la companya de la compa to be made either among the components as points or among terminal points on the components. There inking i kumpan kungan mbili mba sa kanasa sa are alternate approaches in the literature where the topological aspects of layout are modeled using graph and some rounding the second as in the theory. When finding the layout of a circuit, placement and routing are considered together by finding a BE THE STATE OF THE REPORT OF THE SECOND SECTION planar embedding of a graph modeling the circuit. The models do not necessarily associate one node on the arm in the continue was the first the factors. with each component. Each component may be modeled as a set of nodes and edges, e.g. a cycle. A. A SANDONE OF BUILDING AROUND A STORES Interconnections among a set of terminals may be modeled as a set of edges or a set of nodes and edges. For example, the branch point of a wire may be represented as a point under a Steiner tree representation of connections. Several models have been suggested. A summary can be found in [van76]. Graph embedding techniques have the advantage that placement and routing interact completely at the erange to the constitution of the second of the constitution th topological level. However, geometry -- the size and shape of the components -- is completely ignored e de les la compañas de la compañas da Maria de la compaña de la compaña de la compaña de la compaña de la com and must be accounted for separately. In the standard approach and in the graph embedding approach, the only information about the en vision am de los es baxonadas di Meler Madrif vilo laborif. As layout provided by the input is the set of nets. In an alternate approach based on stick diagrams [Me80], A firm of a 1886 is a called reflected builty up to a real 1 proceed topological information about the desired layout is also provided. In stick diagrams, regions in various sengraja sno ke kumakanaka milik kasal**ahkisi kula** mit disartika art. layers of the integrated circuit are represented as lines. For example, in nMOS technology, a polysilicon and a transfer of the States of Control of the control of the control of line and a diffusion line crossing represent a transistor. The relative positions of components and the ng historia was shi kalima **trakvati.** Ib**andik**iat rindrakvati ito ni general path of each wire are indicated. The layout automation system must expand the "sticks" into The State of Gradity at the course of the course rectangles of the proper dimensions based on design rules, and modify the layout so that components and in mide in domini wires fit together with the proper spacing. Examples of such systems are STICKS [Wi77] and CABBAGE e i de bistima de distribuir de la compania del compania de la compania de la compania de la compania del compania de la del la compania del la compania de la compania de la compania de la compania de la compania del compan [Hs79]. These programs attempt to pack the layout as much as possible while satisfying the design rules and maintaining the original relative positions of components and wires. This technique exploits the an ang Pari seriggyarang Kinggaran ba human designer's ability to do overall layouts, i.e. rough sketches. The program FLOSS [Ch77] also packs a rough manual layout; it works from a hand drawn sketch. Other systems for which the designer does the general layout use a symbolic representation of the layout [Gi76], [Per73]. The symbolic layout is produced by the designer and expanded into a fully specified layout automatically. Other techniques for design automation use special structures such as Programmable Logic Arrays (PLAs), with known layouts. Functional specifications can be automatically converted into PLA implementations [Ay79]. Special interconnection patterns can also be exploited to assist in design [Jo79]. #### 2.3 Summary All of the above techniques have been developed to automate, at least in part, the design of an an ang ang panggan ang ang panggang panggang panggang panggang panggang panggang panggang panggang panggang circuit layouts. The research reported in this dissertation is restricted to placement and routing as er va en described in the first section. Although many algorithms have been designed and tested, little mathematical analysis of the algorithms has been performed. The techniques of complexity theory are not regularly applied to layout problems. One notable exception is the work of So, Ting, Kuh and others to the the trees of the profitors are ([Ku79], [So74], [Ti76], [Ti78], [Ti79], [Ts79]) for printed circuit board routing. In this work, a routing ngang at galakan berasia dan ber problem for printed circuit boards with fixed vias in columns is broken up into several problems, ending Catholic of the pattern of the control of the with a number of instances of a routing problem for a row of terminals on a single layer. The problems 通過的國際 的复数经验 医阴茎的 化多 are analyzed. Necessary and sufficient conditions for a single-row, single-layer routing to be optimal are developed. However, the model is not well suited to integrated circuits. The research reported in this dissertation also focuses on particular subproblems of the layout problem. The problems are motivated by the channel routing model of interconnections. #### Chapter 3: Models for Integrated Circuit Layout 直面细胞基本细胞产生多级基础 电流电池 · 最好值,要最轻的证明,并不在此的不好识别的人概要的证明,如此的证据的对象的人。 As is evident in the discussion in the previous chapter, most models of circuits for layout use points or rectangles to model components. In the research presented in Chapters 4 through 6, we use rectangles. The model is described precisely below. In the second section of this chapter, we describe a graph model in which components are points. We discuss its use in proving bounds on the area required by circuits with certain interconnection patterns. #### 3.1 The Geometric Model We have chosen to use a rectangle model of components because it captures the geometric and topological aspects of the layout problem. Components are rectangular in shape and variable in size. Wires lie on any of several layers and are of uniform width. The model was guided by the design rules presented in [Me80] for nMOS technology, but it is intended to be applicable to many technologies. Formally, the model is as follows. The input for a layout problem will consist of a set of components and a set of nets. Each namen a la Manneri les thems **describ prair parti parti l**egit legicità di la Carle de component, C<sub>i</sub>, will be a rectangle with given dimensions x<sub>i</sub> and y<sub>i</sub>. At given locations on the boundary of र १० ५० र १ मही तर एको तम् हिन्दा राज्यात । यह अपने रेप को मेल में स्टब्स के वित्तार के प्राप्त है । यह दा प्रा a component $C_i$ are terminals $t_{ii}$ , $j = 1,...,n_i$ , where $n_i$ is the number of terminals on component $C_i$ . Each नेता अनुसर ६ । अने अपने आधीर्ष मानु अधीर्ष कार्यक्षिण हा कि अधीर ही कि इस्ति नामा है। net is a set of terminals, and the nets are pairwise disjoint. Each net represents a collection of terminals i o governo projekturgam i jem stigme prod **gaddidaad b**orne**iffine b**ala **ynseepole** i boddians gas which must be electrically connected. The layout problem is to place the components on a plane and ila le la citatrajana e beksektist ejena (ekon) ekonomi ili keristisia. form the interconnections specified by the nets in the minimum possible area. The interconnections can all motorial and anti-choice repolitor due individue in occupatival distributions in be made in any of N layers, where N is specified a priori. The layers are numbered 1 through N; layer i is by the compact multiple traded of electroscienticitis. considered to be adjacent to layers i+1 and i-1, for $2 \le i \le N-1$ . Typically, N is two. The wires used to interconnect terminals have a uniform width w. There is a minimum spacing between wires on a layer, between components, and between a wire and a component of s. The interconnections form paths which lie on the surface of the plane not covered by the components. These paths go between nodes. A node is a terminal or an additional point on one or more layers of the plane. Each path segment between two nodes has a designated layer. The paths induce a partition of the nodes into disjoint sets of connected nodes; each set contains terminals from exactly one net. Path segments in different layers may intersect. Path segments in the same layer cannot intersect unless they are associated with the same net and intersect at a node. Additional points are allowed as nodes so that one wire may split into several and so that a path may change layers. A path may change from layer i to layer j, for any i and j between 1 and N, at any node; however, the node must lie on all layers between layers i and j, inclusive. The path is considered to the on each of these layers at the node. Therefore, no path associated with a different net can intersect like node on any of the layers between i and j. The set of nodes and paths in any one layer represent an embedding of a planar graph. The area of the layout is the area of the smallest rectangle on the plane which will simplese all components and wires. Figure 3.1 illustrates. The model described above takes into account both the geometric and topological aspects of the integrated circuit layout problem. The wires are explicitly given width. The restriction that components be rectangular is used solely to somewhat simplify the problem aspecially the calculation of free area for routing. We believe that restricting components to have rectangular shapes still provides a model that is applicable to the layout of VLSI circuits. For those logical components whose layouts would fill only a small portion of a rectangle, e.g. "L"-shaped layouts, it may be feasible to break the component into two or three physical components with more nearly rectangular layouts. One of the advantages of the model is that it is applicable to various levels of modularization. Components may represent transistors, logic gates, or even arithmetic units, allowing any conceptual level of design. A hierarchical approach to layout such as that used by Preas and Gwyn [Pr78] can easily be taken using this model. Additional parameters can be allowed as inputs to the layout problem in this model. Upper Figure 3.1: Illustration of the geometric model for layout design. W,X,Y,Zorare nets of the control There are only 2 layers. \_\_\_\_\_ indicates layer 2 bounds on the length and width or the total area used by the layout may be given. Alternatively, a bound on the aspect ratio, i.e. the ratio of the length to the width, of the layout area may be given. This would restrict the shape of the layout. Any of these limitations may be necessary to insure that the layout produced is usable. For example, suppose a minimum area solution resulted in an extremely long and thin chip. The shape could prohibit the fabrication or the probability of the chip. Another possibility is that the circuit being laid out is a component at a higher level in a design hierarchy. The bounds might be derived from requirements at the higher level. Bounds such as these provide a range of solutions within which the algorithm should work. It is also possible to introduce wire length requirements to the layout problem. As mentioned in Chapter 2, wire length is an important factor in signal quality. Individual nets may be given upper bounds on the total wire length of their interconnections. (Wire length can be measured using the center line through a wire.). Relationships between the wire lengths for various nets may also be imposed. In the following chapters, we will put restrictions on the types of layouts allowed within the above model. We will restrict ourselves to rectilinear paths and orthogenal orientations for components. Directions "horizontal" and "vertical" perpendicular to each other will be chosen, and all components will be placed so that their boundaries are parallel to one of these directions. Also, all paths will be composed of horizontal and vertical segments. This restriction is imposed for two reasons. First, the problems examined are motivated by currently used layout algorithms. As discussed in the previous chapter, the rectilinear metric is almost always used. Second, the sestriction limits the number of possible solutions and makes the problems easier to analyze. Most of the time, we also restrict horizontal and vertical segments to be on different layers. Given that only horizontal and vertical-wire segments are being used, this assumption not only greatly simplifies the description of allowable paths (those that may intersect at a point but do not overlap) but is reasonable when only two layers are available. In this case, two segments running in parallel on different layers, one on top of the other, prohibit any path from crossing from one side of these segments to the other. Any pair of terminals, one on each side of the segments, which must be connected will require a path which goes around the parallel segments. Therefore, one would rarely want such segments. Also, if the segments are extremely long, they may cause electrical problems due to capacitance. Restricting adjacent layers to contain segments in perpendicular directions eliminates these potential problems. There are several technical aspects of the layout problem which the model does not take into account. We discuss those below and indicate how the model can be extended or modified to include them. In actual component dosign, a component may have several terminals to which a particular connection can be made. These terminals may be either physically equivalent, i.e. they are connected inside the component, or logically equivalent, i.e. you early tell them apart functionally. An example of logically equivalent terminals is the set of input terminals of an "and" gate. The problem of deciding which terminal to use in connecting a particular net is called the *pin assignment problem* for printed circuit boards, and is usually solved before a routing algorithm is used [HTM]. Both physically equivalent and logically equivalent terminals can be modeled as sets of terminals. A set of equivalent terminals, rather than an individual terminal, would be a member of the set defining a net. A set of physically equivalent terminals would appear in only one net, while a set of logically equivalent terminals would appear in only one net, while a set of logically equivalent terminals would appear in several nets, but no more ness than the number of terminals in the set. When layout is complete, each set of interconnected terminals must contain at most one terminal from each set of logically equivalent terminals. This model of equivalent terminals is used by van Cleemput fvan761. In integrated circuits, layers are made of different materials and different design rules may apply. Allowing the width of wires and separation between objects to vary between layers is a minor modification to our model, although the resulting layout problem is more difficult. However, the design rules for various layers can be more complicated than the model will allow. For example, in nMOS technology [Me80], wires in diffusion and polysilicon layers cannot cross. Therefore, we model them as one layer. However, the minimum distance between two wires on either polysilicon or diffusion is different from the minimum distance between wires of different layers. This is not provided for in our model. We must use the largest of the actual required separations. Luckily, the metal layer in mMOS can cross both diffusion and polysilicon wires. If it could only cross one, then the actual layout problem would not be captured by our model: two layers could be used for interconnection, but a concept of "coloring" wires in each layer according to the design rules would have to be added. Another design rule which we have not accounted for concerns changing layers. When a conducting path changes layers, a contact cut must be made to connect the layers through an insulating layer. The contact cut requires a square area larger that the width of the wire. This is not taken into account in our area calculation. Even worse, some wires on a particular layer may be significantly wider than other wires on the same layer; depending on the electrical load on the wire. For example, the wires supplying power to a large circuit usually look like the human arterial system: there is a large main wire and a network of wires of decreasing size-reaching every part of the circuit. We recognize that our model overtooks many details of actual layout design, but we feel that it is a reasonable approximation of the major issues in the layout of integrated circuits. #### 3.2 A Graph Embedding Model We now discuss the uses of a model in which a circuit is represented as a graph. We call this model the graph embadding model. Each component is represented as a node in the circuit graph, and each connection to be made is represented as an edge between two nodes. The layout problem is defined as the problem of embedding the circuit graph in a two-dimensional grid graph. An embedding maps each node representing a component to a node of the grid. This mapping is one-to-one. Each edge representing a connection is mapped to a path in the grid graph. This path can only contain two grid nodes which correspond to component nodes — those that correspond to the endpoints of the edge being العراضية المرافي العراز العراز العرازات embedded. Paths corresponding to distinct edges are allowed to intersect at grid nodes but are not allowed to use the same grid edge. To make such an embedding possible, each node in the circuit graph is restricted to having at most four edges adjacent to it, i.e. it is restricted to be of degree at most four. Such an embedding is an edge-disjoint homeomorphic embedding. Given an embedding, there are two measures of area which we will use. The first, which we will call the node area and denote $\Lambda_N$ , is a count of the number of grid nodes used as images of components and on paths which are images of edges. The second, called rectangle area and denoted $\Lambda_R$ , is the number of nodes contained in a rectangle whose boundaries lie on grid edges and which circumscribes all nodes used in the embedding, i.e. all nodes counted by $\Lambda_N$ . For a circuit graph, C, let $\Lambda_N(C)$ and $\Lambda_R(C)$ denote the minimum node area and rectangle area, respectively, over all embeddings of C. Obviously, $\Lambda_N(C) \leq \Lambda_R(C)$ . An embedding is shown in Figure 3.2. The model presented above can be viewed as describing a layout which uses only horizontal and vertical wire segments (segments in the two grid directions), each direction on a separate layer. Alternatively, the nodes of the grid may be viewed as representing unit area squares on a plane. Each grid edge adjacent to a node represents a boundary of the corresponding square. Two paths which use a node need not actually intersect on the plane — they may run diagonally, cutting across opposite corners of the corresponding square. (See Figure 3.3.) Thompson [Th80] uses a model of circuit layout which divides the plane into such unit squares. He views connections as running primarily in a single layer of metal. Crossovers are achieved by using short runs in a second layer such as polyablicon. The graph embedding model lumps all terminals of a component into one point. The point-to-point connections made by wires are pre-determined and represented in the graph for the circuit. If Steiner tree interconnections of the terminals are desired (i.e. branching wires), these must be explicitly modeled in the circuit graph by adding a node for each branch point and edges from each branch point to components or other branch points. The model does not represent the area required by components or Figure 3.2: Embedding a graph. Figure 3.4: Effects of the order of ternihials around a component. Figure 3.3: A node representing a square. represents: the fact that terminals have a specific order around the component. This order causes some routings for the graph embedding model to be impossible in the geometric model (see Figure 3.4.). However, the model is very useful for investigating the implications of certain interconnection patterns. When node area is uses, each circuit node contributes exactly one unit of area. Therefore, any circuit graph with n nodes which requires more than O(n) area must have an interconnection pattern which requires a lot of area to route. Lower bounds on the amount of area required by a graph are proven using this measure of area. Note that any lower bound on node area is a lower bound on rectangle area. Thompson [Th80] proves a lower bound on the area required to embed a graph as a function of the minimum bisection width of a subset of the nodes of the graph. Given a graph G and a subset, S, of the nodes of G, a set of edges in G bisects S if the removal of these edges partitions the nodes of G into two sets such that - i) LISI/21 of the nodes of S are in one set and FISI/27 are in the other set; - ii) After removing the edges, there are no paths between nodes in different sets. Let w<sub>S</sub> be the size of the smallest set of edges bisecting S in G. Then: Theorem (Thompson [Th80]): Given a graph G with nodes of degree at most four, and a subset, S, of the nodes, $$A_N(G) \ge (w_8^2)/4$$ . Proof: See [Th80]. An *n*-superconcentrator is a graph with n designated input nodes and n designated output nodes such that for any sets of k input nodes and k output nodes, $1 \le k \le n$ , there are k node disjoint paths connecting the k input nodes to the k output nodes [Val75]. For the set of input nodes, I, $w_1 \ge Ln/2J$ . Therefore, for any n-superconcentrator, G, $A_N(G) \ge (n-1)^2/16$ . Since for any n, there are n-superconcentrators with O(n) nodes, all of bounded degree [Pi77], there are graphs with n nodes which require node area $\Omega(n^2)$ . Note that although the superconcentrators in [Pi77] do not have degree at most four, it is straightforward to reduce the degree of each node by adding a node for each edge. The number of nodes added is then bounded by the original number of edges. Figure 3.5 illustrates. Thompson also defines average and worst case information complexities of a function. He derives a lower bound on the average or worst case time required by a graph to compute a function: ## average (or worst case) time ≥ (1/w<sub>1</sub>)(average (respectively worst case) information complexity of the function) where I is a special set of input nodes in the graph. Combining the results, Thompson obtains a lower bound on $A_N x$ (average time)<sup>2</sup> for a graph which computes an n-point discrete fourier transform of $Ln/8J^2log^2n$ ; he derives a lower bound of $\Omega(n^2log^2n)$ on $A_N x$ (worst case time)<sup>2</sup> for a graph which sort n numbers. The reader should refer to [Th80] for details. Bounds for other functions have also been derived by various authors using Thompson's technique, e.g. [Abs80], [Sav79]. Upper bounds can also be obtained on the area to embed various classes of graphs. Upper bounds are derived for rectangle area, $A_R$ . Any such upper bound is also an upper bound for node area, $A_N$ . First observe that any graph with n nodes, each of degree at most four, can be embedded in rectangle area at most $6n^2+3n$ . We give here a modification of the proof presented in [Val79]. This modification improves the bound from a (3n)x(3n) square area to a (3n)x(2n+1) rectangular area. Since n-superconcentrators require $\Omega(n^2)$ area, proportional to $n^2$ area is both necessary and sufficient for embedding an n-superconcentrator. The embedding achieving rectangle area of at most $6n^2 + 3n$ is shown in Figure 3.6. The directions used below refer to the figure. The nodes are embedded in one vertical column of the grid — one node every three grid points. There is a column for each edge, either to the left or the right of the column in which the nodes are embedded. The path representing an edge must reach the column for the edge from each node representing an endpoint of the edge. Therefore, each path includes two horizontal Figure 3.5: Reducing the degree of nodes in a superconcentrator. As a state of the dels the degree of the nodebed play one (a) the to the term of the control All sets of node disjoint paths are preserved (a) The second of secon Figure 3.6: Embedding an arbitrary graph in $O(n^2)$ area. segments between the column containing the embedded nodes and the column for the edge. The first (last) segment of the path is either one of these segments or a vertical edge from the endpoint of the path to the grid node just above or just below it. If a vertical edge begins the path, the horizontal segment going left or right from the second node is used. As long as the edges of the circuit graph can be partitioned into two sets — those whose columns are to the left of the node column, and those whose columns are to the right—such that at most three edges adjacent to a node are in the same set, such an embedding exists. The edges of any graph whose nodes are of degree at most four can be colored using at most six colors such that no two edges adjacent to the same node are the same color [Be66, p.262]. Let edges with odd numbered colors have columns to the left, and those with even numbered colors have columns to the right. At most three edges adjacent to a node are on the same side, as desired. Since any graph with n nodes, all of degree at most four, can have at most 2n edges, there are at most 2n+1 columns. Other classes of graphs can be embedded in less than $\Omega(n^2)$ area. Valiant [Val79] and Leiserson [Lei80] have independently shown that given a graph G with a nodes, each of degree at most four, if G is planar, then $\Lambda_R(G)$ is $O(n\log^2 n)$ ; if G is a tree, then $\Lambda_R(G)$ is O(n). Valiant actually shows that an O(n) embedding for a tree can be achieved without allowing paths to cross. It is an open question whether there is an n node planar graph which requires $\Omega(n\log^2 n)$ area. Leiserson proves a general result which relates a separator theorem for any class of graphs to an upper bound on the rectangle area required to embed any graph of the class. The results which we have reviewed above illustrate that the graph embedding model is useful for proving bounds on layouts for particular classes of graphs and for proving time/space bounds for function implementation. Using it, we can identify easy and hard interconnection patterns to route. In the rest of this dissertation we will be interested in algorithms to actually do the layout. Therefore, we will use the rectangle model described in Section 3.1. ### Chapter 4: Complexity of Layout Problems Regardless of the exact formulation of the layout problem, we are interested in finding an efficient algorithm which computes an optimal layout. If this is not possible, we would like an efficient algorithm which computes an optimal layout much of the time and a good layout the rest of the time. This algorithm may actually be a collection of algorithms to solve subproblems which together give a layout. Again, we would like the algorithms for the subproblems to find solutions efficiently. However, most problems associated with circuit layout are NP-complete. The definition of an NP-complete problem is given below. From a practical point of view, the NP-completeness of a problem indicates that it is probably impossible to find an efficient algorithm which solves the problem. ## 4.1 NP-Completeness and Heuristic Algorithms Two major classes of problems in complexity theory are the classes P and NP. A problem is in P (NP) if there is a deterministic (nondeterministic) Turing machine and a polynomial P such that the Turning machine solves any instance of the problem with an input of length n in a number of steps no greater than P(n). The length of an input is the length of its representation as a character string in a predetermined character set. We will not formally define Turing machines here. The interested reader should see [Ah74]. Very often, a nondeterministic Turing machine solves a problem by "guessing" a solution and testing to see if its guess is actually a solution. When the problem of interest is a minimization (or maximization) problem, it is reformulated so that candidate solutions can be tested independent of each other. In the new problem, a parameter k is part of the input. A solution to the new problem is a feasible solution to the old problem for which the quantity to be minimized (maximized) is less than (respectively greater than) k. A feasible solution is one which satisfies all requirements of the old problem except optimality. For example, if the original problem is to find a minimum area layout of a circuit, the new problem, given the circuit and parameter k, is to find a layout of the circuit of area less than k. Under this formulation, a feasible solution can be tested by criteria depending only on the feasible solution to see if it is a solution. When the minimization formulation of the problem is used, feasible solutions must be compared against each other to find the actual solutions. The number of steps taken by a deterministic Turing machine is polynomially related to the number of steps taken under a model of computation that corresponds to the instruction set of a computer if the lengths of numbers operated on is taken into account [Ah74]. Therefore, any problem which is in P can be solved in a polynomial number of steps by an algorithm in a high level programming language. The number of steps executed by an algorithm is referred to as the time taken by the algorithm. We would like all the problems we need to solve to be in P. Benchmark to digita taskishiri ku kubat gira da kacama malabi sa The question of whether P = NP is one of the major open questions in complexity theory. It is believed that $P \neq NP$ . Problems in NP have been found whose months in P implies NP = P. These problems are called NP-complete problems. A problem is NP-hard if for each problem in NP, there is a Larry Carl (fr. Farrantia) is provide the carry space of the carry polynomial P and a transformation computable by a deterministic Turing machine in a polynomial a different and proper and the state of the second number of steps which transforms an instance of the problem in NP with an input of length n to an instance of the NP-hard problem with an input of length P(n). An NP-complete problem is one which is र कार अक्षांना गाँउ अक्षां जो से अक्षां के अंदर्भ के अंदर्भ की अंदर्भ की अंदर्भ के अंदर्भ के अन्य अ NP-hard and is in NP. There are no known deterministic algorithms for NP-complete problems which n. He is hell tall had had statement that fill a state in the electric short a short and fill take a number of steps polynomial in the length of the input. The fact that such well studied problems as and a tribut second lagrency finite and finite and the least the finite and the second second and the second a integer programming and the travelling salesman problem are NP-complete [Gar79] strongly suggests a is the control of the line of the control that $NP \neq P$ . Therefore, proving a problem NP-complete is very strong evidence that any algorithm the commence of the second state of the second seco which solves the problem will be time consuming. The most common way to prove that a problem is NP-complete is to find a reduction from a known NP-complete problem to the new problem which can be executed in deterministic polynomial time. Then, since the composition of polynomials is still a polynomial, all problems in NP can be reduced through the known NP-complete problem to the new problem in deterministic polynomial time. and Asset to fine When a problem has been proven NP-complete, the usual course is to try to find an algorithm which runs in polynomial time and finds a solution most of the time. When the problem proven NP-complete is the parameterized version of an optimization problem, an algorithm which can find solutions close to optimal, if not optimal is desired. Heuristics are developed which can direct the algorithm towards a good solution. When the algorithm is searching through a large (exponential) number of possible solutions, heuristics remove large numbers of possible solutions based on the likelihood that none will be optimal solutions. Very often a heuristic algorithm is tested and validated by empirical evidence. All of the algorithms discussed in Chapter 2 are heuristic algorithms for placement and routing. All have been judged by comparing the solutions they produce to manually produced solutions for the same problems. These algorithms are also compared to each other to judge their worth. In contrast, in this thesis, heuristic algorithms for optimization problems are judged by the relationship of the solutions they produce to optimal solutions. Consider a layout problem in which minimum area is desired. Let area alg (C) be the area of the layout for circuit C found by a particular algorithm. Let area opt (C) be the minimum area layout of C. Then define the worst case performance of the algorithm, denoted we alg (n), as the maximum over all circuits of size n of area alg (C)/area opt (C). Define the average case performance of the algorithm, avg alg (n), as the average over all circuits of size n of area alg (C)/area opt (C), where the average is taken with respect to a predetermined distribution of circuits. The size of a circuit can be defined in various ways depending on the circuit model, e.g. the number of terminals, or the sum of the number of components and the number of nets. The size should be defined so that the length of the input specifying the circuit to the algorithm is polynomial in the size. Average case performance is more likely to correspond to the observed performance of an algorithm, especially if the average is taken over "realistic" circuits. However, it is often very difficult to analyze. In this dissertation, the analysis is limited to worst case performance. If a lower bound on area opt and an upper bound on area at can be derived, an upper bound on $wc_{alg}$ can be concluded. Ideally, a bound of $1+\varepsilon$ for small $\varepsilon$ is desired. In reality, we are happy with any constant bound. However, as will be seen in Section 4.4, even constant bounds are not achieved by some common algorithms. In the next section, we will review NP-completeness statults for problems related to layout. In Section 4.3, we will consider two subproblems of channel routing and show that they are both NP-complete. In Section 4.4 we will analyze a heuristic algorithm for one of the problems shown NP-complete in Section 4.3. ## 4.2 Placement and Routing: NP-complete Formulations In this section we consider the complexity of the problems resulting from the decomposition of the layout problem into placement and routing. un traing air traightain aireachan air mar tomhairt a tha 1960 a tha 1960 air and the substrate of a larger and a property of the first of the second contract sec #### 4.2.1 The point model: some known results. First consider the placement problem used for printed circuit boards and standard cells. Recall that components are modeled as points and total wire length is the quantity to be minimized by the layout. The quadratic assignment problem is one formulation of placement: ## Quadratic Assignment Problem to relice of the damper to be against the proportion of the Given: Locations 1 through m and distance matrix $\{c_{ij} | 1 \le k, h \le m\}$ between locations; components 1 through n and connection matrix $\{c_{ij} | 1 \le i, j \le n\}$ between components. i graficeo-regalesi (. 1944) si pri si pri re **bonñoù** ed klimbresik hall archarac barr barr bar and he havens was by their secure of apprehensions and secure he Find: A one-to-one mapping, p, of components to locations such that $COST(p) = sum over i \neq j$ from 1 to n of $(c_{ij}d_{p(i)p(j)})$ is minimized. Sahni and Gonzales [Sah76] prove that the parameterized quadratic assignment problem is NP-complete. In fact, they prove that unless NP = P, there is no approximation algorithm for quadratic assignment for which there is an $\epsilon > 0$ such that for all instances of the problem: $$COST(p_{alg})/COST(p_{out}) \le 1 + \epsilon$$ . The proof does rely on instances of the problem with connection matrix values $c_{ij}$ greater that en. Consider only instances of the problem for which the $c_{ij}$ are limited to a bounded range of values. The proof that the existence of an approximation algorithm for which the ratio of $COSF(p_{alg})$ to $COSF(p_{opt})$ is no more than $1 + \epsilon$ implies NP = P no longer holds for all $\epsilon$ . However, the restricted problem is NP-complete as long as the $c_{ij}$ are allowed to take on the value 0 or 1. The quadratic assignment problem is a formulation of the placement problem in which all point-to-point connections are specified. The cost c<sub>ij</sub> represents the number of connections between components i and j. The distance d<sub>kh</sub> is the estimate of the wire length needed to connect terminals at locations k and h. If a placement problem formulation in which an estimate of the length of wire needed to connect whole nets is used, the quadratic assignment problem is a special case in which all nets are of size two [Sah80]. Therefore, the quadratic assignment problem reduces to this formulation of the placement problem. It follows that this formulation of the placement problem in NP-complete. Let us now turn to routing. If a Steiner tree interconnection pattern is desired for each net, then even finding the connection paths for one net is NP-complete. Formally: #### The Steiner Tree Problem Given: A set of points, P, in the plane with integer coordinates. Find: A set of integer points, Y, such that the minimum length spanning tree of PUY is minimal over all sets of integer points containing P. One of two measures of distance may be used: - (i) The discretized Euclidean length: $\Gamma((x_1-x_2)^2+(y_1-y_2)^2)^{\frac{1}{2}}$ where $(x_1,y_1)$ and $(x_2,y_2)$ are the points. The problem is then the Discretized Euclidean Steiner Tree Problem; - (ii) The rectilinear metric: $|x_1-x_2|^2+|y_1-y_2|$ , giving the Rectilinear Steiner Tree Problem. For either metric, the Steiner tree problem is NP-complete. If the standard Euclidean metric is used, the problem is NP-hard, but is not known to be in NP (Gar79). The minimum spanning tree using either metric can be solved in polynomial time [Ah74]. For the rectilinear metric, the length of the minimum spanning tree is at most 3/2 the length of a minimum Steiner tree [Hw78]. Therefore, any algorithm to find minimum length spanning trees is a heuristic algorithm for finding minimum length Steiner trees with a worst case ratio of length over length of 3/2. A discussion of heuristic algorithms for the Rectilinear Steiner Tree Problem can be found in [Hw78]. The above two problems apply to the model of layout in which components are points and minimum length wiring is desire. Other problems related to this model are also NP-complete. Ting et. al. [Ti79] show that a via assignment problem encountered in their approach to routing is NP-complete. A summary of a number of NP-complete problems associated with this model can be found in [Sah80]. ### 4.2.2 The rectangle model: a new result. The above NP-complete results do not directly apply to the model of layout in which components are rectangles and minimum area is desired. We shall now prove that even when no interconnections are needed, the placement of rectangular components to minimize area is NP-complete. Since this is a special case of the layout problem when interconnections are required, the more general layout problem is NP-complete. The proof we present below does requires some entra assumptions in addition to those given in the description of our model in Chapter 3. All components and the circumscribing rectangle determining the area must be oriented so that each of their sides is in the direction of one of two perpendicular axes. This does put some restrictions on the placements allowed but is consistent with the concept of "horizontal" and "vertical" being special directions which wires follow. We also restrict all points and dimensions to be integer valued so that the problem has discrete solutions. Problem P1: Discrete Inyout with no interconnections Given: A set of n rectangles and an integer, A. For $1 \le i \le n$ , each rectangle $r_i$ has dimensions $h_i$ and $w_i$ which are positive integers. Question: Is there a placement of the rectangles on the plane with a cartesian coordinate system imposed so that: - (i) Fach boundary is parallel to one of the coordinate system axes; - (ii) Corners of the sectangles lie on integer points in the plane; - (iii) No two rectangles overlap; - (iv) The boundaries of any two rectangles are separated by at least a unit distance; - (v) There is a rectangle in the plane which circumscribes the placed rectangles, has boundaries parallel to the axes, and is of area at most A. The boundary of the circumscribing rectangle is allowed to contain boundaries of placed rectangles. Lemma 4.1: Problem PI .- discrete layout with no interconnections -- is NP-complete. Proof: Consider only placements for which the lowest leftmost corner of any rectangle is at (0,0). All other placements are just translations of these. The coordinates of the lower left corner of each rectangle and the orientation of the rectangle, i.e. whether the side of length $h_i$ is in the x direction or the y direction, determine its position. Since the coordinates of each lower left corner can only take on integer values between 0 and A- $h_i$ -1, there are at most $A^2$ choices for each point. A nondeterministic Turing machine can guess a possible placement and write it down. Given a placement, conditions (i) through (v) can be checked deterministically in polynomial time. This shows that P1 is in NP. The proof that P1 is NP-hard is accomplished by reducing the Bin Packing Problem to it. Since the Bin Packing Problem is NP-complete [Gar79], this proves that P1 is NP-hard. ### The Bin Packing Problem: Given: A set of n items, each of size c<sub>i</sub> a positive integer; also, positive integers B and C, the number of bins and bin capacity, respectively. Question: Is there an assignment of items to bins so that for each k, 1 \le k \le R, the sum of c, over all items assigned to bin k does not exceed C? Given any instance of the Bin Packing Problem, we will construct an instance of P1 as follows. There will be n+1 rectangles. One, called R, will be of size h by w, where w=(2B+1)C-1 and h=2Bw+1. The remaining rectangles will directly correspond to items. Rectangle $r_i$ will have dimensions $h_i=(2B+1)c_i-1$ and $w_i=1$ . The bound on area will be A=wk+2Bw. Note that the length of the input to the Bin Packing Problem is $\Omega(n+\log C+\log B)$ . The dimensions of the rectangles and $\Lambda$ can be calculated taking no more than polynomial time in this length. We must show that there is a placement satisfying conditions (i) through (v) if and only if there is an assignment of the items to it bins without exceeding capacity C in any bin. Given a bin packing, Figure 4.1 illustrates a satisfactory placement. We now argue that any placement satisfying (i) through (v) corresponds to a legal bin packing. Figure 4.2 will illustrate. Without loss of generality, let the side of ing an early which in the highest the beautiful given arises in the conrectangle R of dimension h be in the y direction. Let "left" denote towards lower numbers in the x in a color of more and court to and interest in the contract of the color co direction and "right" denote towards higher numbers; let "above" denote towards higher numbers in the and the second of the contraction contractio y direction and "below" denote towards lower numbers. At any point, the width of the layout in the x ा ने हर है। अनुसारक किए ने के देखकी जाने देखका जा कर 2000 में जनने दर्श direction is strictly less than w+1. If not, then area $\geq h(w+1) = hw+2Bw+1 > A$ . Therefore, all and a problem of read which is seen as in the second control of the control of the control of the control of the rectangles, r, must lie above or below R. None can extend a unit beyond the length h sides of R, so that ing the second of the control of the second for each rectangle, some line in the y direction intersects the rectangle and R. No rectangle, r., is oriented The second process of the properties and the properties of the second process sec so that its long side is in the y direction. Otherwise, the dimension of the circumscribing rectangle in the y direction would be at least $h+(2B+1)c_1-1+1$ , where the added 1 accounts for the separation between gradustina a filozofici o este estima interesti interesti interesti interesti interesti interesti interesti di rectangle boundaries. Then: $$arca \ge w(h+(2B+1)c_i) \ge w(h+2B+1) > A$$ and (v) is not missied. We now know that all rectangles, r<sub>p</sub> are oriented so that their long sides are in the x direction. Any placement can be modified slightly without increasing the area so that the r<sub>i</sub> form rows above and Figure 4.2: A packing given a placement. there may be t; s at the top and at the bottom below R. (Any $r_i$ and $r_j$ less than two units above or below R must be separated from each other by at least one unit to the left and right. These can be shifted to form the first rows above and below R. The next rows are formed analogously above the upper boundary of the row above R and below the lower boundary of the row below R. Figure 4.2 illustrates.) Each row of rectangles can be considered the packing of a bin. If the rows correspond to a legal bin packing, we are done. Suppose there are K rows. Then the dimension of the circumscribing rectangle in the y direction must be h + 2K, since there must be a unit space separating rows from each other and R. If K > B, then area > w(h + 2B) = A, contradicting (v). Therefore, at most B bins are used. It remains for us to show that the sum of sizes $c_i$ of the items in any bin is at most C. Each item corresponds to a rectangle with $h_i = (2B + 1)c_i - 1$ . $\Sigma$ ((2B+1)c<sub>i</sub>-1) + the number of rectangles in the row - 1 rectangles in row $\leq$ the width of a row of rectangles $\leq$ w+1 = C(2B+1) giving $(2B+1)(\text{sum of } c_i \text{ for } r_i \text{ in row}) - 1 < C(2B+1)$ Therefore, $(\text{sum of } c_i \text{ for } r_i \text{ in row}) < C + 1/(2B+1)$ Since all c, and C are positive integers, the above implies (sum of c, in one bin under corresponding bin packing) $\leq C$ as desired. Corollary 4.1: The modification of P1 removing the minimum spacing requirement is NP-complete. Proof: The same proof is used. Rectangle R has dimensions h = Bw + 1 and w = C(B+1). For each i, $r_i$ has dimensions $h_i = c_i(B+1)$ and $w_i = 1$ . The NP-hardness part of this proof does not require that any of the dimensions be integers or that the rectangles be placed so that their corners are on integer points of the coordinate system. Lemma 4.1 is presented with spacing required between rectangles to closely mirror the problem in circuit layout. The dimension of each component can be increased by one unit to account for the required spacing implicitly. This will cause the circumscribing rectangle to be one unit too large in each dimension. When spacing is not explicitly required, the problem remains NP-complete even if the aspect ratio of the circumscribing rectangle is bounded. ## Problem P2-a: Layout with hounded aspect ration and no interconnections Given: A set of n rectangles and a positive number A. For $1 \le i \le n$ , each rectangle, $r_p$ has dimensions h, and w. Question: Is there a placement of the rectangles on the plane with a cartesian coordinate system imposed so that: - (i) Each boundary is parallel to one of the coordinate system axes; - (ii) No two rectangles overlap; - (iii) There is a rectangle in the plane which circumscribes the placed rectangles, has boundaries parallel to the axes, is of area at most A, and has aspect ratio (long side)/(short side) at most $\alpha$ , where $\alpha$ is a rational number not less than one. The boundary of the circumscribing rectangle is allowed to contain boundaries of placed rectangles. ## Lemma 4.2: Problem P2-a is NP-hard for any a a rational number not less than one. Proof: The proof is a reduction from bin packing similar to the proof of Lemma 4.1. Given $c_i$ for $1 \le i \le n$ , C, and B, construct R with $w = \alpha C(B+1)$ and $h = w/\alpha - B$ . Each $r_i$ is of dimensions $h_i = \alpha c_i(B+1)$ and $w_i = 1$ . The bound on area is $A = w^2/\alpha = \alpha C^2(B+1)^2$ . The aspect ratio implies that the larger side of the desired circumscribing rectangle is at most $(\alpha A)^{1/2} = w$ . Therefore, assuming R is oriented as in the proof of Lemma 4.1, none of the $r_i$ can lie to the left or right of R. The rest of the proof is analogous to that for Lemma 4.1 and is left to the reader. Corollary 4.2: If the bound on aspect ratio, $\alpha_n$ is allowed as an imput for problem P2- $\alpha_n$ , the problem remains NP-hard. Proof: The construction in the proof of Lemma 4.2 can be computed in time polynomial in the length of the representation of a; therefore, a can be an input. Lemmas 4.1 and 4.2 prove that the layout problem we are studying is NP-complete even in the degenerate case when only placement is required. Given a placement of components, is the routing problem NP-complete? We do not have a proof of a general NP-completeness result for routing. However, in the next section, we will present two NP-completeness results for subproblems encountered in channel routing. ## 4.3 NP-completeness in Channel Routing In this section, we will prove that two problems encountered in a channel routing approach are NP-complete. Recall that in the channel routing approach, the routing area is divided into horizontal and vertical streets. Terminals lie along the sides of the streets. Each street is made up of a set of parallel channels in the direction of the street. Each channel is wide enough to account for the width of a wire and the required separation between wires. First the interconnection pattern of paths through the streets is chosen (street routing). Then, within each street, the actual routes of the path segments assigned to the street -- using the channels -- is determined (channel assignment). The goal is to minimize the overall layout area. (See Figure 4.3.) The street routing problem can be represented as a graph problem. There will be a node in the graph for each position along a street at which there is are terminals. (Two terminals directly across from each other on a street are represented by one node.) There will also be a node in the graph for each intersection of streets. Each edge of the graph represents a portion of a street between two positions at which there are terminals of intersections. For each net, we wish to finite a subgraph which is a tree whose nodes consist of the nodes representing terminals of the net and nodes representing street intersections. This is a Steiner tree problem on the graph. The intersection nodes in the graph are analogous to the continue on a report of the series in a week or the expression as the tell of the Figure 4.3: Streets and channels in the channel routing approach. indicates the streets surrounding the components added nodes in the plane for a Rectilinear or Euclidean Steiner tree problem. However, we do not wish to find a minimum length Steiner tree in the graph for each net. Using a minimum length Steiner tree for each net does not necessarily yield a minimum area layout. The eventual area of a layout must be estimated when the Steiner tree for each net is being chosen. The second phase -- channel assignment -- determines the actual area of the layout. We are assuming each street is of variable width. The number of channels used in a street directly corresponds to the street width. Channel assignment determines the actual paths in the plane realizing the interconnection pattern determined by street routing. The paths are composed of horizontal and vertical segments. The path segments within each street are determined independently except that the segments for paths which change streets must be connected at the intersections. The horizontal segments in a horizontal street (and vertical segments in a vertical street) lie in channels. Each channel is the region between two lines parallel to the street direction; the lines are spaced so that there is room for one wire width and required separation between wires in any channel. Wire segments perpendicular to the street direction are used to connect segments in channels to each other and to terminals. ### 4.3.1 Channel assignment within a street. The first NP-completeness result which we present proves that, with certain restrictions, the routing of paths in a street so that the number of channels used is minimized is NP-complete. The restrictions are that (i) all terminals to be interconnected lie in the street (i.e. there are no street intersections to worry about), and (ii) each set of wires interconnecting one net uses exactly one channel. This problem is represented as follows. ### Problem P3: Channel Assignment within a Street Given: A line segment, S, containing equally spaced points numbered 1 through h, and a set of terminals, T. The line segment represents the street. Each terminal, t, is an ordered pair (i,b), where i is a number between 1 and h indicating the position of the terminal along the street and b is an element of $\{0,1\}$ representing the side of the street on which the terminal lies. For any two terminals $(i,b_1)$ and $(j,b_2)$ with $b_1 = b_2$ , $|i-j| \ge s$ , where s is a positive integer chosen a priori. Integer supersents the wire width plus required separation. Also given are a collection of n disjoint sets of terminals, N<sub>1</sub> for $1 \le i \le n$ , representing nets, and a parameter k. Question: For each not, $N_i$ , let $a_i$ be the position of the terminal of lowest position in $N_i$ and $z_i$ be the position of the terminal of highest position. Let $\{a_i,a_i\}$ denote the set of all points on S between $a_i$ and $z_i$ inclusive. Is there a mapping, ch, which assigns each net a number between 1 and k inclusive such that for any $N_i$ and $N_i$ , $1 \le i,j \le n$ : - (i) If terminal $(x,0) \in N_i$ and terminal $(y,1) \in N_i$ and $|x-y| \le x$ , then $ch(N_j) \le h(N_j)$ ; - (ii) If $[a_i, z_i] \cap [a_i, z_i]$ is not empty, then $ch(N_i) \neq ch(N_i)$ . The mapping, ch, represents the assignment of a wire segment between points $a_i$ and $z_i$ to one of k channels for each net $N_i$ . The interval $[a_i, z_i]$ is called the interval of net $N_i$ . Segments perpendicular to the direction of the street are assumed to go from each terminal in the net $N_i$ to the segment in the channel. The restriction that these segments must not overlap is represented as condition (i) above. If there are no terminals satisfying the hypothesis of condition (i), i.e. condition (ii) is the only relevant condition, then Problem P3 is the interval coloring problem. The interval coloring problem is: given a finite set of intervals on a line and a positive integer, k, assign a color (positive integer) to each interval so that no two overlapping intervals have the same color and no more than k colors are used. Nets define intervals, and "channel" is just another name for "color". The interval coloring problem can be solved by a polynomial time algorithm [Gav72] [Has71]. The solution to this problem uses the same number of channels as the maximum over all points on S of the number of nets whose interval [a<sub>p</sub>z<sub>p</sub>] intersects the point. Therefore even if we allow wires for one net to use more than one channel, the solution found using one channel is optimal. Without additional restrictions, the channel assignment problem stated as Problem P3 is NP-complete. The same was a second of the same s and of the property of the party of the control Lemma 4.3: Problem P3 is NP-complete. Proof: In polynomial time, a nondeterministic Turing Machine can guess a mapping and check to see if (i) and (ii) are satisfied. Therefore, the problem is in NP. We will reduce the circular arc coloring problem; an NP-complete problem [Gar], to Problem P3 to prove that P3 is NP-hard. The circular arc graph coloring problem is similar to the interval coloring problem except that arcs on a circle rather than intervals on a line are use. Figure 4.4 illustrates. The Circular Arc Coloring Problem Given: A finite set of arcs of a circle and a positive integer, k. Question: Is there an assignment of colors numbered 1 though k to the arcs such that any two ares which overlap are assigned different colors? Arcs which intersect only at their endpoints are not considered as overlapping. Since arcs which intersect at endpoints are not overlapping, we may modify any set of arcs so that no arcs have endpoints in common (see Figure 4.4). The actual length of the arcs is irrelevant. Therefore, for n arcs, we can number the endpoints from 1 through 2n while traversing the circle in some direction. Each arc will be represented as an ordered pair, (i.j.), listing the endpoints of the arc as encountered in the traversal of the circle. Oliven an instance of the circular arc coloring problem with n arcs and k colors, we will produce an instance of the channel assignment problem with a channels and (2n+c(2k-c+1)) nets, where c is the number of arcs which contain arc (2n,1). Each net will contain exactly two terminals. Intuitively, we cut the circle between points 2n and 1 and stretch it out straight. This results in n+c intervals, since c arcs have been cut in two. The n-2c intervals which do not have endpoints on the cut fine will become the intervals of nets. Considering only these nets, any legal assignment of channels will be a legal assignment of colors to the corresponding arcs and vice versa. Figure 4.4 Circular Arc coloring. colored with two colors (Each color can be represented as a distance above the circle.) arcs (a<sub>1</sub>,b<sub>1</sub>); (a<sub>2</sub>,b<sub>2</sub>); (a<sub>3</sub>,b<sub>3</sub>); (a<sub>4</sub>,b<sub>4</sub>) traversing clockwise. Figure 4.5 Construction of a channel assignment problem given a circular are coloring problem. represents the interval of a net. Arrows point to the side of the street containing the terminal. There are 2c intervals with endpoints on the cut line -- two intervals per arc. If we can insure that both intervals of each pair are assigned the same channel, then each channel assignment will correspond to an arc coloring. We do this by extending the intervals beyond the cut line and adding nets which force the pairs to be assigned to the same channel. Figure 4.5 gives the construction. The formal definition is given below. The points on which terminals lie will be numbered from -c(k-(c-1)/2)+1 to 2n+c(k-(c-1)/2) rather than from 1 to 2n+c(2k-c+1). For each arc (a,z) which does not contain (2n,1), there will be a net $\{(a,0),(z,1)\}$ . Order the arcs which do contain (2n,1) by increasing starting point. For the i<sup>th</sup> such arc, there will be 2(k-i+1) nets—half have terminals within the negatively numbered points and that have terminals within the points numbered above 2n. For arc $(a_i, z_i)$ , $1 \le z_i$ $(a_i \le 2n$ , the i<sup>th</sup> arc containing (2n,1), the nets are defined as follows. For $1 \le i \le c$ , let $$sum(i) = \sum (k-j) = i (k-1/2(i-1))$$ $$j = 0$$ $$p_i = -sum(c) + sum(i-1)$$ $$p_i^+ = 2n + sum(c) - sum(i-1)$$ Then, there are nets: $$\begin{split} N_{ij} &= \{(p_i^- + 1, 0), (z_i, 1)\} \text{ and } \\ N_{ij} &= \{(p_i^- + j, 1), (p_i^- + j + 1, 0)\} \text{ for } 1 \leq j \leq k-i; \\ N_{i0}^+ &= \{(a_i, 1), (p_i^+, 0)\} \text{ and } \\ N_{ij}^+ &= \{(p_i^+ - j, 0), (p_j^+ - j + 1, 1)\} \text{ for } 1 \leq j \leq k-i; \end{split}$$ For each i between 1 and c, the application of condition (i) in the definition of problem P3 results in two chains of k-i inequalities for the set of nets corresponding to the i<sup>th</sup> arc containing (2n,1) - one chain of inequalities $ch(N_{ij}) < ch(N_{ij+1})$ and one of inequalities $ch(N_{ij}) < ch(N_{ij+1})$ for $0 \le j \le k$ -i-1. Therefore, nets $N_{10}$ and $N_{10}$ must be assigned channel 1 if no more than k channels are to be used. Nets $N_{20}^{-}$ and $N_{20}^{+}$ can be assigned channel 1 or 2 without violating condition (i), but their intervals overlap those for nets $N_{10}^{-}$ and $N_{10}^{-+}$ , respectively. Therefore channel 2 is the only choice for nets $N_{20}^{-}$ and $N_{20}^{-+}$ . Proceeding in this way, we see that nets $N_{10}^{-}$ and $N_{10}^{-+}$ must be assigned channel i if no more than k channels are to be used. Given any channel assignment for the complete set of nets, we color the arcs which do not contain (2n,1) by the same numbered color as the corresponding channel. For the i<sup>th</sup> arc which contains (2n,1), we use the number of the channel assigned to nets $N_0$ and $N_1$ . For any pair of arcs which overlap, there is at least one corresponding pair of nets whose intervals overlap. Therefore, any legal channel assignment corresponds to a legal coloring using the same number of colors as channel. Given a coloring of the arcs using at most k colors, we can assign the nets to channels as follows. Permute the colors so that the $i^{th}$ are containing arc (2n,1) is assigned the $i^{th}$ color. Assign nets $N_{ij}$ and $N_{ij}^{+}$ to channel i+j, for $1 \le i \le c$ and $0 \le j \le k+i$ . The remaining nets are assigned the same numbered channel as the color of the corresponding arc. Between points 1 and 2n, intervals overlap if and only if their corresponding arcs overlap. Elsewhere, no $N_{ij}^{-}$ and $N_{pq}^{-}$ for i < p overlap except $N_{i0}^{-}$ and $N_{pq}^{-}$ . However, $ch(N_{i0}^{-}) = i < p+q = ch(N_{pq}^{-})$ , so no $N_{ij}^{-}$ and $N_{pq}^{-}$ which overlap are assigned the same channel. An analogous argument shows that the $N_{ij}^{-}$ are properly assigned to channels. Therefore, for each coloring, there is a legal channel assignment using the same number of channels as colors. Since the construction uses only nets with two terminals, we have: Corollary 4.3: Problem P3 restricted to nets containing exactly two terminals is NP-complete. ## 4.3.2 Channel assignment with intersections. The second problem which we prove NP-complete deals strictly with the ordering of paths in intersections so that the resulting street widths minimize the area. This problem is somewhat similar to the channel assignment problem in that if two paths are approaching an intersection from the same direction in a street, and one path needs to go left at the intersection and the other needs to go right, then they cannot share the same channel in the new street unless they are in the proper order when they reach the intersection. The problem will be modeled using a graph to represent the streets. Subgraphs which are trees will be used to represent the interconnection patterns resulting from street routing. Let the street graph, S, be some subset of a two dimensional grid graph. Each node in S is labeled with integer coordinates (p,q). Fach edge is either horizontal; i.e., between nodes (p,q) and (p+1,q), or vertical, i.e. between nodes (p,q) and (p,q+1). The graph S is partitioned into streets. Each street is a path in S using only vertical or only horizontal edges. A horizontal street goes between nodes (i,k) and (j,k) for some k and i(j; a vertical street goes between nodes (k,i) and (k,j) for some k and i(j, A node represents an intersection of two or more streets. An edge represents the portion of a street between two intersections. The interconnection pattern for each not is represented by a tree in S. Each tree, T, will be called a net tree. We would like to assign each occurrence of an edge in a net tree to a channel. Let ch be a mapping from each occurrence of an edge in a net tree to a positive integer. The integer indicates the number of the channel containing that edge in the street to which the edge belongs. We require: - (1) If edge e of S is in distinct trees $T_{1}$ and $T_{2}$ then the occurrence of e in $T_{1}$ is assigned to a different channel than the occurrence of e in $T_{2}$ . (No overlapping wires.) - (2) If $e_1$ and $e_2$ are adjacent edges in a net tree T, and $e_1$ and $e_2$ belong to one street in S, then $ch(e_1) = ch(e_2)$ . (A connection path cannot change channels within a street.) - (3) Suppose horizontal edges $e_1 = ((p-1,q),(p,q))$ and $e_2 = ((p,q),(p+1,q))$ , for some p and q, belong to a street, s. Furthermore, suppose $e_1$ and $e_2$ belong to distinct that trees $T_1$ and $T_2$ , respectively. If $ch(e_1) = ch(e_2)$ , then: - (i) e<sub>2</sub> is not in T<sub>1</sub> and e<sub>1</sub> is not in T<sub>2</sub>. (This follows from (1) and (2) above.) - (ii) Suppose that vertical edges ((p,q),(p,q+1)) and ((p,q-1),(p,q)) are in one street, and each of $T_1$ and $T_2$ contains at least one of the edges. Then $ch(x_1) < ch(x_2)$ , where $x_1$ represents any occurrence in $T_1$ of either of the edges and $x_2$ represents any occurrence in $T_2$ of either of the edges. This condition insures that the wire segment represented by horizontal edge ((p-1,q),(p,q)) does not overlap the wire segment represented by horizontal edge ((p,q),(p+1,q)). - (4) Analogous to (3) but for vertical edges ((p,q-1),(p,q)) and ((p,q),(p,q+1)) in $T_1$ and $T_2$ , respectively. If $x_1$ and $x_2$ are occurrences of horizontal edges ((p-1,q),(p,q)) and/or ((p,q),(p+1,q)) in $T_1$ and $T_2$ , respectively, then $ch(x_1) < ch(x_2)$ . We want to find an assignment of edges in trees to channels so that the resulting overall area is minimized. The assignment is called the intersection channel assignment since the intersections induce the constraints on the assignment of channels within each street. Area will be measured as follows. For a given channel assignment, ch, let width(ch,q) be the sum over all vertical streets containing a node (i,q) for some i, of the number of channels used in the street; let height(ch,p) be the sum over all horizontal streets containing a node (p,j) for some j, of the number of channels used in the street. Let width(ch) be the maximum of width(ch,q) over all integers, q, appearing as the second coordinate of some node in the street graph S. If the maximum is zero, then width(ch) is one. Let height(ch) be the maximum of height(ch,p) over all integers p which appear as the first coordinate of some node in S. If the maximum is zero, then height(ch) is one. Then, area(ch) is defined as the product of width(ch) and height(ch). Using the representation presented above, we have the following problem, illustrated in Figure 4.6. ### Problem P4: The Intersection Channel Assignment Problem Given: A street graph, S, partitioned into streets; a collection of net trees, $T_i$ ; and a positive integer, A. Figure 4.6 The Intersection Changel Assignment Problem # an optimal assignment # not an optimal assignment esalo, ota 19 anni da ario appartuali, na profibet Question: Is there an assignment, ch, of the occurrences of edges in net trees to channels in streets which satisfies conditions (1) through (4) above such that area(ch) $\leq A$ . Lemma 4.4: The intersection channel assignment problem in NP-complete. Proof: Conditions (1) through (4) and the area of an assignment can be checked by a nondeterministic Turing machine in polynomial time. Therefore, the problem is in NP. We show that the problem in NP-hard by reducing 3-satisfiability [Gar79] to it: ### 3-Satisfiability Given: A boolean expression composed of the conjunction of k clauses, $c_i$ , for $1 \le i \le k$ . Each clause is the disjunction of three distinct literals, where a literal is a boolean variable or its complement, i.e. $c_i = (y_{i1} \lor y_{i2} \lor y_{i3})$ , where $y_{ij}$ is x or $\neg x$ for some variable $\vec{x}$ . and the contract of the first of the second WE FOR A COLUMN A Question: Is there an assignment of truth values to the boolean variables such that the expression is satisfied? Given an instance of the 3-satisfiability problem with k clauses and v variables, we will construct an instance of the intersection channel assignment problem with 2k + 1 horizontal streets, 2v + 1 vertical streets and A = 2vk(3v + 3). Let S be the (2v + 1) by (2k + 1) grid graph with nodes numbered from (0,0) through (2v,2k). Each path from (0,i) to (2v,i) is a horizontal street, for any i between 0 and 2k; each path from (i,0) to (i,2k) is a vertical street, for any i between 0 and 2v. Certain streets are associated with the clauses and variables of the boolean expression as follows: - (a) With each clause $c_i$ , $1 \le i \le k$ , associate the horizontal street from (0,2i-1) to (2v,2i-1) and name it $C_i$ . The remaining horizontal streets are unnamed. - (b) For each variable $x_j$ , $1 \le j \le v$ , associate the vertical street from (j-1,0) to (j-1,2k), named $X_j$ , and the vertical street from (2v-j+1,0) to (2v-j+1,2k), named $X_{ij}$ . westered in a court of the But the second (c) The one remaining vertical street, that from (v,0) to (v,2k), is named street M. We construct two net trees for each variable. For variable $x_j$ , the first net tree, $T_j$ , contains the following edges: - (1) In street $C_i$ , for each i from 1 to k: all edges on the path from (j-1,2i-1) to (2v-j+1,2i-1); - (2) In street $X_j$ : the edge from (j-1,0) to (j-1,1) and for all even i between 1 and k-1 inclusive, the two edges on the path from (j-1,2i-1) to (j-1,2i+1). Also, if k is even, the edge from (j-1,2k-1) to (j-1,2k); - (3) In street $X_{j0}$ : for all odd i between 1 and k-1 inclusive, the two edges on the path from (2v-j+1, 2i-1) to (2v-j+1, 2i+1). Also, if k is odd, the edge between (2v-j+1, 2k-1) and (2v-j+1, 2k); - (4) In street M: if variable x, appears in clause c, uncomplemented, then if i is even, the edge between (v,2i-1) and (v,2i); if i is odd, the edge between (v,2i-2) and (v,2i-1). If x, appears in c, complemented, then if i is even, the edge between (v,2i-2) and (v,2i-1); if i is odd, the edge between (v,2i-1) and (v,2i-1) and (v,2i). The second net tree for variable x, Tio contains the following edges: - (1) In street C<sub>i</sub>, for each i from 1 to k: all edges on the path from (j-1,2i-1) to (2v-j+1,2i-1); - (2) In street X<sub>j</sub>: for all odd i between 1 and k-I inclusive, the two edges on the path from (j-1, 2i-1) to (j-1, 2i+1). Also, if k is odd, the edge between (j-1, 2k-1) and (j-1, 2k); - (3) In street $X_{j0}$ : the edge from (2v-j+1,0) to (2v-j+1,2i+1) and for all even 1 between 1 and k-1 inclusive, the two edges on the path from (2v-j+1,2i+1) to (2v-j+1,2i+1). Also, if k is even, the edge from (2v-j+1,2k-1) to (2v-j+1,2k): - (4) In street M: if variable $x_j$ appears in clause equincomplemented, then if i is odd, the edge between (v,2i-1) and (v,2i); if i is even, the edge between (v,2i-2) and (v,2i-1). If $x_i$ appears in c<sub>i</sub> complemented, then if i is odd, the edge between (v,2i-2) and (v,2i-1); if i is even, the edge between (v,2i-1) and (v,2i). The following observations are useful. For any coordinate $p_i$ call an edge from (p,2i-1) to (p,2i-2) an edge down from street $C_i$ ; call an edge from (p,2i-1) to (p,2i) an edge up from $C_i$ . Each net tree $T_j$ contains a path which begins with an edge in street $X_j$ down from street $C_j$ . The path goes through $C_1$ to street $X_{j0}$ , goes up from $C_1$ to street $C_2$ , and through $C_2$ to street $C_2$ . The path continues snaking through the horizontal streets associated with the clauses, using street $C_j$ to go from street $C_j$ to street $C_{i+1}$ when i is even, and using street $C_i$ to $C_i$ to $C_{i+1}$ when i is odd. Each net tree $C_i$ also contains a path which snakes through the $C_i$ , but in the opposite direction: it begins with an edge in $C_i$ down from $C_i$ ; uses street $C_i$ to change from $C_i$ to $C_{i+1}$ when $C_i$ is even; and uses street $C_i$ to change when $C_i$ is in the same direction from $C_i$ as the edge of the tree in street $C_i$ if $C_i$ appears complemented, the edge in street $C_i$ is in the same direction from $C_i$ as the edge of the tree in street $C_i$ . Figure 4.7 gives an example of the construction. For each i between 1 and k, each net tree contains the edges between horizontal positions v-1 and v+1 in street $C_i$ . Therefore, there must be a channel in each $C_i$ for each net tree, giving a height of 2vk for any channel assignment. Any channel assignment which gives area at most 2vk(3v+3) must give width at most 3v+3. At each intersection of street M with a street $C_i$ , there are six net trees which contain edges of M incident on the node for the intersection — one pair of net trees, $T_j$ and $T_{j0}$ , for each variable $x_j$ appearing in $c_i$ . (We assume that each variable occurs at most once in a clause. Note that $x \vee \neg x \vee y$ is always satisfied.) Half of the edges are up, from street $C_i$ and half are down. Therefore, at least three channels are required for M. Three channels will be used in M exactly when the boolean expression of interest is satisfiable. Figure 4.7 Construction of the proof of Lemma 4.4. Espression: $$(x_1 \lor x_2 \lor \neg x_3) & (\neg x_1 \lor x_2 \lor x_3)$$ $$C_1 & C_2$$ Truth assignment: $x_1 = x_2 = x_3 = \text{true}$ . If the width is required to be at most 3v + 3, and street M contributes at least three, then the width contributed collectively by streets X<sub>j</sub> and X<sub>j0</sub> for 1≤j≤v, must be at most 3v. Consider the intersections of streets X, and X, with street C, for i odd. The edge from (j-1,2i-2) to (j-1,2i-1) in X, belongs to T<sub>i</sub>, and the edge from (j-1,2i-1) to (j-1,2i) in **X** belongs to T<sub>i</sub>. Therefore, if only one channel is used in X<sub>j</sub>, the channel in C<sub>i</sub> containing the edges of P<sub>i</sub> must have a lower number than the channel in C<sub>i</sub> containing the edges of Tip (by condition (4) given for channel assignments). However, in Xin the edge from (2v-j+1,2i-2) to (2v-j+1,2i-1) belongs to $T_{ij}$ and the edge from (2v-j+1,2i-1) to (2v-j+1,2i) belongs to T. If only one channel is used in X., the channel in C. containing edges of T. must have the lower number. Therefore, only one of X<sub>i</sub> and X<sub>i0</sub> can contain exactly one channel. If i is even, the edges which belong to T, and Tin are just interchanged from that above; we reach the same conclusion. Therefore, each pair of streets, X and X contribute at least three channels to the width. However, if collectively at most 3v channels are contributed by streets X, and X, 1515v, at most three channels must be used by each pair. Note that T<sub>i</sub> and T<sub>i</sub> "fit together" so that they can always share one channel in X<sub>i</sub> or one channel in X<sub>10</sub>. In the other of X<sub>1</sub> and X<sub>10</sub>, edges of T<sub>1</sub> are assigned to one channel and edges of T<sub>10</sub> are assigned to a different channel. We will associate a channel assignment in which X, has one channel with a value assignment in which $x_i$ has value "true"; we will associate a channel assignment in which $X_{j0}$ has one channel with a value assignment of "false" for x<sub>i</sub>. Given an assignment of truth values to the variables $x_j$ such that the boolean expression is satisfied, the corresponding channel assignment giving width 3v+3 is as follows. For each j between 1 and v, if $x_j$ is true, one channel is used in $X_j$ ; if $x_j$ is false, one channel is used in $X_{j0}$ . Street M contains three channels. We first determine what trees will share channels at each intersection of M and some $C_j$ . Note that only adjacent edges in a tree are required to use the same channel: edges of a tree which are in the same street but not part of the same path in the street are not required to use the same channel. Let $x_j$ be a variable whose occurrence in $c_j$ is true under the given assignment of truth values. The edges of $T_j$ and $T_{i0}$ in M at $C_i$ can share a channel. This follows from the fact that if $x_j$ is uncomplemented in $c_i$ , the edges are in the same direction as those in $X_p$ and the edges in $X_j$ share a channel; if $x_j$ is complemented, the edges are in the same direction as those in $X_{j0}$ and the edges in $X_{j0}$ share a channel. However, other pairs of edges in M at $C_i$ will not be able to share a channel if the corresponding literal in $c_i$ is false. Therefore, let the edge for $x_j$ in M down from $C_i$ share with the edge-up from $C_i$ for a second variable $x_j$ in $c_i$ . Let the edge down for $x_p$ share with the edge up for the third variable $x_q$ . Finally, the edge down for $x_q$ shares with the edge up for $x_j$ . The new constraints on the assignment of channels in $C_i$ induced by this sharing are consistent with the constraints from any sharing at the intersections of $C_i$ with $X_j$ , $X_p$ , $X_q$ , $X_{j0}$ , $X_{p0}$ , and $X_{q0}$ . (See Figure 4.8.) The edges in the horizontal streets between the $C_1$ are not contained in any net tree. Therefore, the only condition relevant at the intersections of M with these streets is that a path in a net tree which goes through an intersection cannot change channels. There will be a path in a net tree through such an intersection if the same literal appears in clauses $c_1$ and $c_{1+1}$ , causing an edge up from $C_1$ and down from $C_{1+1}$ in one of $T_1$ and $T_{10}$ . The edges are adjacent at (v.2i). The assignment so channels in M can be determined as follows. At the intersection of M with the street at horizontal position Q arbitrarily amign each edge in a net tree incident on the intersection to a different channel aroung channels. It through 3. These edges are edges down from street $C_1$ . At $C_1$ , the edges down have been assigned; assign the edges up so they share with the edges down as previously determined. Now consider the intersection of M with the street between $C_1$ and $C_2$ . The channels for any tree paths from $C_1$ to $C_2$ have been determined at $C_1$ ; the remaining edges down from $C_2$ can be assigned arbitrarily. In this way, edges down from each $C_1$ can be assigned channels at the intersection of M with the horizontal street before $C_1$ so that paths do not change channels. Edges up are assigned at the intersection of M with $C_1$ so that channels are shared properly. Thus, if there is any assignment of boolean variables satisfying the boolean expression, there is a channel assignment, ch, with width(ch) = 3v+3 and arou(ch) = $2vk(3v+3) \le A$ as desired. Figure 4.8 Compatibility of the constraints at the intersection of C<sub>1</sub> with M and with X<sub>1</sub> and X<sub>10</sub>. "Ty down" represents the tree among Ty and Ty0 which contains the edge down from $C_i$ for variable $x_y$ - y equal to j, p, or q. "Ty up" represents the tree containing the edge up. U V indicates the channel in C<sub>i</sub> containing edges of net tree U must have a lower number than that containing edges of net tree V. indicates that the order could be either way. For any choice of direction for the two constraints are consistent. Given a channel assignment with area(ch) $\leq A$ , we must show how to construct a boolean assignment which satisfies the expression. We have shown that any such channel assignment induces a truth assignment by choosing $x_i$ true if $X_i$ contains only one channel and $x_i$ false if $X_{ij}$ contains only one channel. It remains to show that this assignment satisfies the expression. We know that exactly three channels are used in M. At each intersection of M with some C<sub>i</sub>, edges from two different net trees must share each channel. Among the trees containing edges in M at the intersection with C, consider the tree whose edges in C<sub>i</sub> are in the lowest numbered channel. This tree must contain an edge in M down from C<sub>i</sub>. Suppose the tree is for variable x<sub>i</sub>. The channels of C<sub>i</sub> assigned to edges in T<sub>i</sub> and T<sub>i0</sub> are in the proper order for the edges of T<sub>i</sub> and T<sub>i</sub> in M to share a channel. If x<sub>i</sub> appears uncomplemented in c<sub>i</sub>, then the edges in M at $C_i$ for $T_j$ and $T_{j0}$ are in the same direction as those in $X_j$ at $C_i$ . It must be street $X_j$ which contains only one channel. Therefore $x_i$ is true, and $c_i$ is satisfied by literal $x_i$ . If $x_i$ appears complemented in c<sub>i</sub>, then the edges in M at C<sub>i</sub> for T<sub>i</sub> and T<sub>i0</sub> are in the same direction as those in X<sub>i0</sub>. It must be street X<sub>i0</sub> which contains only one channel. The value of x<sub>i</sub> is false, and c<sub>i</sub> is satisfied by literal ¬x<sub>j</sub>. A literal satisfying each clause can be identified by looking at the intersection of M with the street for the clause. Therefore, there is an assignment of truth values to the variables such that the boolean expression is satisfied. In the construction used to prove Lemma 4.4, height(ch) is the same for any channel assignment. Therefore, we conclude: Corollary 4.4: The modified Intersection Channel Assignment Problem in which one desires a channel assignment, ch, such that width(ch) $\leq$ D for some given integer, D, (or such that height(ch) $\leq$ D) is NP-complete. Lemma 4.4 shows that even ignoring terminals, assigning channels is a difficult problem. In the next section we again consider channel assignment within one street. A heuristic algorithm is analyzed. An example is constructed to show that the algorithm can be made to do arbitrarily badly with respect to the optimal solution. ## 4.4 Heuristics for Channel Assignment In this section, we discuss heuristics for the channel assignment problem within a street. We begin with a general discussion of various restrictions whose removal changes the optimal channel assignment. Then we present a heuristic algorithm and its analysis for Problem P3, the version of the problem proven NP-complete. The ordering on $ch(N_i)$ required by condition (i) of the statement of Problem P3 can be represented using a directed graph, which we will call the constraint graph. There will be one node for each net. If $ch(N_i) < ch(N_j)$ is required under condition (i) of the statement of Problem P3, then there is an edge directed from the node for $N_i$ to the node for $N_j$ . It is possible for the graph to be cyclic. In this case, there is no channel assignment for the problem as statest (P3). If each net can use more than one channel — by using wire segments in the direction perpendicular to the street direction to connect segments in different channels — then a channel assignment may exist. The segments perpendicular to the street direction are called jogs. Jogs used for different nets, like any other wire segments in the same direction for different nets, must be separated by the minimum spacing. Even when jogs are allowed at any point along a street, the charnel assignment problem may not have a solution. Figure 4.9 gives an example. However, if jogs are allowed anywhere between points on a street rather than only at the points, then the channel assignment problem is solvable in polynomial time [Ka79]. The model of a street used in [Ka79] differs slightly from the formulation used in P3. For this discussion, we only need note that, in [Ka79] terminals on apposite sides of the street are either at the same point along the street or are at points separated by at least minimum spacing. Allowing jogs between points implies that an arbitrarily large number of segments Figure 4.9 Channel assignment problem with no solution even if jogs are allowed. Nets: $$\{(1,0), (3,1)\} = N_1 \quad \{(2,0), (4,1)\} = N_3 \quad \{(1,1), (3,0)\} = N_2 \quad \{(2,1), (4,0)\} = N_4$$ If $N_1$ or $N_2$ uses a jog at 2 to change channels, neither $N_3$ nor $N_4$ can use a jog at 3: Figure 4.10 Inserting jogs between points to effectively eliminate terminals across from one another. perpendicular to the street direction may be inserted between any two terminals. The length of the street — the dimension of the street in the direction of the street — must be variable, rather than just the street width being variable. The algorithm in [Ka79] minimizes only street width, not the area of the street. Let the maximum overlap of a set of nets be the maximum over all points on the street of the number of nets whose intervals overlap at the point. An optimal solution under the model in [Ka79] requires at most one more channel than the maximum overlap of the nets. Recall that if there are no constraints due to condition (i), the channel assignment problem P3 is solvable in polynomial time. Assume, as in [Ka79], that terminals are either at the same position along the street or have sufficient space between them. By using one channel and inserting a jog between every pair of consecutive terminal positions, we can effectively separate the terminals on opposite sides of the street so that there are no constraints under condition (i). (See Figure 4.10.) We at most double the street length. Then the number of channels needed (excluding the channel we used to create the effect of suitably spaced terminals) is the same as the maximum overlap of the nets. This implies that one never need to lengthen the street by more than double to get a channel assignment which uses within one channel of the minimum. Since we are thinking of a street as running along the boundary of a component and we do not wish to think of each component as expandable, lallowing streets to lengthen is not satisfactory. An alternative is to use jogs in a street intersection. When a jog is needed, a segment which goes to the end of the street and out into the street intersection is used. In the intersection, the path can use a segment (the jog) in the perpendicular street to change to another channel in the original street, and reenter the region of the street it was previously in. This type of solution requires that perpendicular streets be available and that nots are allowed to have wire segments in two channels of a street at the same position along the street. Figure 4.11a illustrates. Such a routing not only affects the width of the street containing the <sup>1.</sup> In fact, some components are expandable [Jo79]. terminals being interconnected, but affects the width of the perpendicular street as well. Using this scheme, any collection of nets consisting of terminals in one street can be interconnected. Figure 4.11b shows the general connection pattern. Let us return to our original channel assignment problem, problem P3. The proof of NP-completeness of problem P3 relies on the fact that jogs are not allowed. Figure 4.12 shows that even for instances of the problem derived from circular arc coloring, allowing jogs reduces the number of channels needed. We do not have a proof that the channel assignment problem with jogs in NP-hard, although the author believes that this is the case. We now present a simple houristic algorithm for problem P3 and analyze the quality of the solutions it produces. This algorithm is the basic algorithm used in [Deu76] when jogs are not allowed. We will assume that the constraint graph produced by applying condition (i) does not have any cycles so that a solution without jogs does exist. We first define the level of a node in the constraint graph: - (1) All nodes which have no edges into them are on level 1. - (2) Assuming that the nodes on levels 1 through k-1 are defined for k>1, a node is on level k if all edges entering it are from nodes on lower levels and it is not on a lower level. Since the constraint graph is acyclic, the levels are well-defined. They can be computed by starting with level 1 nodes and following edges. We will say that a net is at level k if its node is at level k. The nodes from which there are edges to a given node are called *predecessors* of the given node, and the corresponding nets are called *predecessors* of the given not. Order the nets in increasing order by the position, a, of their terminals of lowest position. If there are no constraints due to condition (i), then the following algorithm finds the optimal solution: <sup>1.</sup> This analysis is part of joint research with Errol Lloyd. Figure 4.11 Problems requiring jags. # A. Solving the Problem of Pigure 4.9 # B. A general solution. One u-shaped path per net. Top terminals connect to the top half of the u; bottom terminals connect to the bottom half. Figure 4.12: Use of jogs when not necessary. We look at a problem derived from a set of circular arcs: circular arcs (8,5), (3,7), (4,2), (6,1) # Low to High Fill [Has71] Repeat for each channel, beginning with channel 1 and increasing the channel number by one for each new channel until all nets are assigned: Choose the first unassigned net under the above ordering. Assign it to a new channel. Repeat until the channel is full: Choose the first unassigned net whose terminal of lowest position is at a higher position than the terminal of highest position in the last net to be assigned to the channel. Assign this net to the channel. Claim [Has71]: When there are no constraints due to condition (i), the above algorithm uses exactly the number of channels as the maximum overlap of the nets. Proof: Look at the lowest point, a<sub>0</sub>, in the interval of the first net, N, assigned to the last channel. Each lower numbered channel must contain a net whose interval contains point a<sub>0</sub>. If there were a channel, k, which did not contain such a net, then the net N should have been placed in this channel. This follows because, in the net ordering, N is before the nets, if any, which were placed in channel k after point a<sub>0</sub>. Therefore, all lower numbered channels are assigned nets whose intervals contain point a<sub>0</sub>. The overlap at this point is equal to the number of channels used. Algorithm "Low to High Fill" will be the basis of the algorithm to assign nets to channels when constraints are present. Channels are again filled beginning with channel 1. At any point during execution of the algorithm, let the set, A, of available nets contain those nets which are unassigned and all of whose predecessors are assigned. "Low to High Fill" is modified to choose nets to be assigned only 化二甲基甲基乙基醇三甲基酚磺胺甲基酚酚 医皮肤 化电影开发 机等 <sup>1.</sup> In [Has71], Hashimoto and Stevens use a different approach to prove the algorithm correct. They prove a different but equivalent claim. The proof given here is based on the proof used by Gavril [Gav72] to prove that his algorithm for chordal graph coloring is correct. from set A (the nets are ordered as before) and to update A after every assignment. Lemma 4.5: For an instance I of Problem P3, let $d_k$ be the maximum over all points on the street of the number of nets at level k whose interval overlaps the point. Let h be the highest level. Then (number of channels)<sub>alg</sub>(I)/(number of channels)<sub>opt</sub>(I) $\leq$ min(h, avg( $d_k$ )) $\leq$ n<sup>1/2</sup> where the algorithm is the modified Low to High Fill algorithm using set A, avg( $d_k$ ) is the average of the $d_k$ for $1 \leq k \leq h$ , and n is the number of nets in instance I. **Proof:** We know that (number of channels) out $\{i\} \ge \max_{i \in I} \{i\}$ maximum of the $d_k$ and h. We prove that (number of channels) $a_{k+1}(l) \leq \sum_{k+1}^{h} (d_k)$ . Let $C_1$ denote the set of channels containing a net at level 1. For k > 1, let $C_k$ denote the set of channels with higher numbers than the last channel containing a net at level k - 1 and which contain nets at level k. The number of channels used by the algorithm is the sum of the $|C_k|$ over all levels. For a particular k, $C_k$ may be empty. If $C_k$ is not empty, look at the first net at level k placed in the highest numbered channel in $C_k$ ; denote this net $N_k$ . Let $a_k$ be the lowest point in its interval. If any channel in $C_k$ is not assigned a net whose interval contains $a_k$ , then $N_k$ should have been placed in this channel. This follows because $N_k$ must have been in A when the channel was assigned — all level k - 1 nets are places before channel after point $a_k$ . Therefore, $|C_k| \leq d_k$ . We have: (number of channels) $$_{alg}(I)/(number of channels)_{alg}(I) \le h(avg(d_k))/max(h,max(d_k))$$ $\le max(h,avg(d_k)) \times min(h,avg(d_k))/max(h,max(d_k)) \le min(h,avg(d_k))$ The number of nets at level k is at least d<sub>k</sub>. Therefore, $$\sum_{k=1}^{h} (d_k) = h \times avg(d_k) \le n \quad and \quad \min(h, avg(d_k)) \le n^{\frac{1}{2}}.$$ Sangle a militar a remaining the san and If the above bound on the worst case performance of the algorithm is accurate, the algorithm can do very badly. In fact, the bound is the same as would be obtained for an algorithm which applies "Low to High Fill" on nets of a fixed level, one level at a time, requiring all nets on level k $(1 \le k \le h)$ to be assigned to lower numbered channels than those of level k+1. Nonetheless, our bound is tight. Lemma 4.6: There is an instance of Problem P3 for which modified Low to High Fill using set A produces a channel assignment whose ratio to the optimal solution is $\Omega(n^{\frac{1}{12}})$ for $avg(d_k) = h = n^{\frac{1}{12}}$ . Proof: Figure 4.13 gives the instance. There are $n^{\frac{1}{2}}$ groups of nets, each of which forms an $n^{\frac{1}{2}}$ level chain. For all level numbers, k, between 1 and h inclusive, the intervals of all nets on level k overlap, giving $d_k = n^{\frac{1}{2}}$ . The algorithm assigns only one net to each channel. However, there is a channel assignment which does not assign nets to channels in a low to high order. Among nets on a level, some nets which are later in the net ordering are assigned to lower numbered channels than nets which come before them in the ordering. The nets which came first in the ordering can share channels with nets which are on higher levels than they are, but whose predecessors have already been assigned. Refer to the figure for details. The analysis presented above illuminates the fact that an algorithm used in practice is capable of finding very bad solutions. It forms a point of comparison for more "clever" or more complicated algorithms. The "low to high fill" method can be used as the basis of an exponential time algorithm to find optimal channel assignments by trying all possible choices for each assignment rather than taking the first net in an ordering [Ker73]. In the next chapter, we present an algorithm for a special case of channel routing which is not NP-complete. The problem is to route interconnections among two point nets whose terminals lie on the outside of one rectangle. This problem is reminiscent of the circular arc coloring problem, which is NP-complete. However, the paths around the outside of the rectangle are allowed to change "color" as they go around corners, i.e. the order of paths need not be the same on adjacent sides of the rectangle. This order can change because horizontal and vertical wire segments are allowed to cross. Once the paths have been routed through the four streets surrounding the rectangle, the channel assignment is four instances of the interval coloring problem. In the next chapter, we show how to do the street routing optimally. ## Chapter 5: An Algorithm for Routing Terminals on a Rectangular Component #### 5.1 Preliminaries We will now present an algorithm which finds an optimal solution to the following routing problem in polynomial time. Figure 5.1 presents an example. #### The One Rectangle Routing Problem Given: a rectangular component with terminals around its outside edge. Terminals lie on positions which have at least unit spacing along the edge of the rectangle. The unit spacing represents the width of a wire plus the minimum spacing between wires. A list of nets, each containing a pair of terminals which must be connected, is given. Each terminal belongs to exactly one net. Find: An optimal routing of the wires between pairs. Paths must be composed of line segments which are parallel to some side of the rectangle. Distinct paths may cross at right angles; however, parallel segments belonging to distinct paths must be separated by the minimum spacing. (We are assuming that there are two layers for interconnection. One layer is used for the line segments in one direction, and the other layer is used for line segments in the second direction.) All paths must lie outside the rectangular area of the component. An optimal routing is one which minimizes the area of the smallest rectangle which circumscribes the component and all routing paths. The sides of the circumscribing rectangle must be parallel to those of the component. Place the rectangular component on a cartesian coordinate system so that its sides are parallel to the axes. Arbitrarily choose one axis direction to be horizontal and one vertical. Label the horizontal sides of the rectangle as top and bottom, the vertical sides as left and right. Each pair of terminals can be connected either by a path which goes clockwise from one of the terminals or by a path which goes counterclockwise from the terminal. The directions of the connections determine a set of intervals which path segments will use along each side. Once the intervals to be used by each path along a side are determined, minimizing the length added out from the side is a channel Terminals are represented as points on the rectangle boundary. Pairs of terminals with the same number must be connected. An optimal solution is shown. to the east, and to the fire, he is made a specific make the first to himself it is The height added to the top by this solution is 2 units; the height added to the bottom is 3 units. The width added to the left side is 3 units; the width added to the right side is 2 units. a corner as far as needed to meet each other without violating any spacing requirements. Therefore, the channels in the street along one side of the component can be assigned independent of the assignments within other streets. Within each street, there are no constraints due to terminals across from one another (condition (i) of problem P3 in Chapter 4). We have four instances of the interval coloring problem—one for each side. Therefore, the length added out from each side is equal to the maximum overlap of the intervals on that side, and it suffices to use paths which only change direction to round a corner of the component or to connect to a terminal. We must determine the direction of each connecting path so that the resulting area will be minimized. We consider two types of connections. Pairs of terminals on the same side or adjacent sides of the component are called local connections. For these, it suffices to choose the direction which goes around the fewest sides. A path which goes the long way adds at least one unit in each dimension to the circumscribing rectangle. A path which goes the short way cannot add more than this. Therefore it is never better to go the long way around. The second type of connection contains those which go from the left side to the right side or the top to the bottom. The choice of direction for top-bottom connections is independent of the direction of left-right connections and vice versa, since regardless of the direction used for a top-bottom connection, one unit is added to the horizontal dimension of the circumscribing rectangle. Therefore, we have two instances of the following problem: #### **Top-Bottom Routing Problem** Given: Terminals on a top and a bottom, a set of top to bottom connections, and some local connections on these sides. Find: A direction -- left or right -- for each top to bottom connection so that the resulting total vertical dimension is minimized. Each connection is made by going around to the left or the ### right as indicated by the direction. In this description, connections which originally went from the top or bottom to an adjacent side are considered to be local connections which go to the very edge of the top or bottom. (The portion used to turn the corner is not of interest here.) To solve the above problem, we first reduce it to a problem of assigning 0/1 values to elements of two vectors so that a matching on the vectors is maximized. The vectors represent the top and bottom terminals. The representation removes unnecessary information about local connections. The value of "0" or "1" represents the direction --left or right-- of the connection at a terminal. The matching identifies which segments will share the same channel in the final routing. The major phases of the algorithm to solve the new assignment problem are as follows: - 1. Partition each vector into regions within which matching can be localized. After assigning values within a region, the regions are recalculated. The algorithm iteratively assigns values within these regions until there is only one unassigned region for each of the top and bottom vectors. - 2. For the remaining top region and bottom region, the algorithm assigns values from left to right along the top region, maintaining certain properties of the number of "0"s and "1"s in portions of the vectors. These properties guarantee a maximum matching for the vectors. It may happen that not all properties can be satisfied simultaneously when some element is assigned a value. At this point, we say a failure has occurred. When a failure occurs, the algorithm may still be able to determine an assignment which suffices to guarantee a maximum matching. However, it may be necessary to try both values for the assignment. - 3. When the algorithm must try both choices for an assignment, it does not treat both choices equivalently. For the choice of value "1", the algorithm is applied recursively to complete the 我可以知识强权决定 化水质铸造性化系统 assignment. For the choice of value "0", the algorithm is applied with modification. We assume that the choice of "0" leads to a better solution, not just as good a solution, as the choice of "1". Therefore, as soon as there is evidence that the "0" choice will lead to no better a solution, the algorithm may stop pursuing this choice. In particular, if the situation in which the algorithm must try both choices reoccurs, it will turn out that the second "0" choice leads to no better solution that the first "1" choice. This second "0" choice can be eliminated. Only the "1" choice is used, and the search done by the algorithm is thus bounded. The technique of bounded search is crucial to the algorithm. Without the ability to bound the search, the algorithm would have exponential running time. The following sections described in more detail the algorithm outlined above. During this description, a large amount of notation will be introduced. The appendix to this chapter summarizes that notation. #### 5.2 Reduction to Maximizing Matchines The Top-Bottom Routing Problem is reduced to the problem of assigning values within two vectors to maximize a matching. The vectors, T(1,m) and B(1,n), represent the terminals at the top and bottom, respectively, numbered from left to right. (We will drop the T and B when it is clear from context whether we are referring to a top or bottom terminal.) A value of "0" or "1" is used to represent the direction of the connection at each terminal. Define the value function VT: $T(1,m) \rightarrow \{0,1,7\}$ as follows: - VI(i) = 0 if the direction of the path from terminal i to its pair is to the left - 1 if it is to the right - ? if it is undetermined Value function VB: $B(1,n) \rightarrow \{0,1,?\}$ is defined the same way. Let p: $T(1,m)\cup B(1,n)\to T(1,m)\cup B(1,n)\cup \{*\}$ be the pairing function. Terminal p(x) should be connected to terminal x. When p(x) = \*(don't care), the terminal x is connected to a terminal on an adjacent side. Initially, the pairing function is known, and the value functions have 0 or 1 values only for the local connections. We would like to find VT and VB for which all directions are determined and are consistent with the initial values given. We will say that a value function V' is consistent with V if V'(i) = V(i) whenever $V(i) \neq ?$ . We also say that V' extends V. For any pair of value functions we require that V''(i) = V(i) if p(T(i)) = B(j). We would like to balance paths to the left with those to the right on the top and bottom simultaneously. Let $m_{V\Gamma}$ be a function matching terminals in T(1,m) of value "0" to terminals with higher index in T(1,m) and of value "1", i.e. matching paths to the left and right. The function is formally defined as a one-to-one partial function from T(1,m) to itself such that if $m_{V\Gamma}(i)=j$ , then VT(i)=0, $V\Gamma(j)=1$ , and j > i. Define $m_{VB}$ similarly. For a particular VT and VB, let $M_{V\Gamma}$ be the maximum over all matching functions $m_{V\Gamma}$ of [range( $m_{V\Gamma}$ )], i.e. $M_{V\Gamma}$ is the maximum number of matches among T(1,m) for a given VT. Parameter $M_{VB}$ is defined similarly. We will reduce our routing problem to a problem of assigning "0"s and "1"s to maximize $M_{\rm VT} + M_{\rm VB}$ . This is justified by the following lemma: Lemma 5.1: Given a Top-Bottom Routing Problem with local connections represented by value functions $VT_0$ and $VB_0$ , the problem of finding $VT_f$ and $VB_f$ under which all directions are defined and for which the vertical dimension of the circumscribing rectangle is minimized is equivalent to the problem of finding $VT_f$ and $VB_f$ such that $M_{VT_f} + M_{VB_f}$ is maximized over all VT and VB which map T and are T and Proof: The matching function $m_{V\Gamma_f}$ corresponds to determining which horizontal segments above the top side will share the same channel. If $m_{V\Gamma_f}(i) = j$ , then in some channel, the segment ending (going left to right) at terminal i is followed directly by the segment which starts at terminal j. Each channel begins with a segment running from the top left edge (a segment for which $\forall T_f(i) = 0$ and $p(i) \in B(i,n)$ or p(i) = 0) or with a segment which begins at a 1-valued terminal which is unmatched under $m_{VT_f}$ . The rightmost segment in each channel is a segment running to the top right edge $(\forall T_f(i) = 1 \text{ and } p(i) \in B(i,n) \text{ or } p(i) = 0)$ or a segment ending at a 0-valued terminal which is unmatched under $m_{VT_f}$ . Figure 5.2 illustrates possible configurations. The channels can be in any order above the side; therefore, we have found an assignment of segments to channels for each $m_{VT_f}$ . The argument can be reversed so that we find a matching for each assignment to channels. There is a one-to-one correspondence between channel assignments and matchings giving: number of channels used = number of segments going to left edge + number of unmatched "I"s = number of segments going to right edge + number of unmatched "U"s This gives: total vertical dimension corresponding to matching functions $m_{VT_g}$ and $m_{VR_g}$ - = ht. of rectangle (denoted h) + # channels at top + # channels at bottom - = h + # top segments going to left edge + # unmatched all satisfies and a segment of the - + # buttom segments going to right edge + # unmatched "0"s at buttom - = h + # top segments going to left edge + # "1"s at top |range(m<sub>VT</sub>| - + # bottom segments going to right edge + # " sat bottom | range(myg) Note that: # "1"s at top = # top segments going to right adge + # top segments going to neither adge and therefore: # top segments going to left: + # "1"s at top = # top segments. This gives: minimum total vertical dimension over all matching functions for $VT_f$ and $VB_f$ Where C = (# local connections top and bottom) + 2(# top-bottom connections) is a constant for any instance of the routing problem. Therefore, minimizing the height is equivalent to magnitude and the standard of the continuous connections. We now present the various phases of the algorithm which finds value functions VT, and VB, Figure 5.2: Possible configurations when filling channels. maximizing $$(M_{VT_f} + M_{VB_f})$$ . #### 5.3 Defining Regions The following description will be in terms of T. An analogous development is assumed for B. Let T(x,y) denote the terminals from x to y inclusive. In any left interval, T(1,i), for any $i \le m$ , we would like to maintain the property that no more than half the terminals are 1-valued. This property must hold if every 1-valued terminal in (1,i) is to be matched to a 0-valued terminal with smaller index. If more that half the terminals in a left interval are 1-valued; some of these 1-valued terminals cannot be matched under any $m_{VT}$ . If at least half the terminals are 1-valued, any ?-valued terminals in the interval should become 0-valued if we are to maximize the number of matches. Similarly, we want no more than half 0-valued terminals in any right interval to insure that these 0-valued terminals can be matched. Because of the local connections, it is not always possible to maintain these properties on any left or right interval. Instead, we define regions within which these bounds hold. Within these regions matching can be localized. For a given value function, VT, let ZEROS<sub>VT</sub>(S) = {i \in S|VT(i) = 0}; ONES<sub>VT</sub>(S) = {i \in S|VT(i) = 1}; UNDET<sub>VT</sub>(S) = {i \in S|VT(i) = ?} where SCT. Define the property OK-1(VT,x,y) (similarly OK-0) which is true if and only if $|ONES_{VT}(x,y)| \le L!/(y-x+1)J$ . (We us the notation "ONES<sub>VT</sub>(x,y)" rather than "ONES<sub>VT</sub>(T(x,y))") Also define Pull-1(VT,x,y) (similarly Full-0) which is true if and only if $|ONES_{VT}(x,y)| \ge \Gamma!/(y-x+1)T$ . Property Full-"b"(VT,x,y) is true when at least half the terminals in T(x,y) have value "b" under VT. Note that Full-1(VT,x,y) and OK-1(VT,x,y) can be simultaneously true only if $|ONES_{VT}(x,y)| = \frac{1}{2}(y-x+1)$ , an integer. Full-1(VT,1,i) indicates that the terminals in UNDET<sub>VT</sub>(1,i) should become 0-valued if we are to maximize the number of matches. We will now define the regions of T within which matching can be localized. Left-regions under VT are formed scanning T from 1 to m. A new region begins when the previous region has at least half "1"s. Similarly, right-regions under VT are intervals of T scanning from m to 1 and counting "0"s. First define functions $I_{VT}$ and $r_{VT}$ mapping T to itself: $$l_{\text{VT}}(1) = 1$$ $l_{\text{VT}}(i) = i$ if Full-1(VT, $l_{\text{VT}}(i-1)$ , $i-1$ ) $l_{\text{VT}}(i-1)$ otherwise, for $i > 1$ $$r_{VT}(m) = m$$ $r_{VT}(j) = j$ if Full-0(VT,i+1, $r_{VT}(i+1)$ ) $$r_{VT}(i+1) \text{ otherwise, for } j \le m$$ The function $l_{\rm VT}$ induces an equivalence relation on T(1,m) under which two terminals i and j are equivalent if and only if $l_{\rm VT}(i) = l_{\rm VT}(j)$ . The resulting equivalence classes are intervals of T(1,m); the lowest element of each is an element of the range of $l_{\rm VT}$ . These equivalence classes are called *left-regions* under VT. A new region begins when the previous region has at least half "1"s. Similarly, $r_{\rm VT}$ induces right-regions under VT which are intervals whose highest element is an element of the range of $r_{\rm VT}$ . Lemma 5.2: (a)OK-1(VT, $$I_{VT}(i)$$ ,i) unless $I_{VT}(i) = i$ and VT(i)=1 (b)OK-0(VT, $i$ , $r_{VT}(i)$ ) unless $r_{VT}(i) = i$ and VT(i)=0. Proof: We will prove (a). The proof for (b) is analogous. On all the left-regions except possibly the last, at least half the terminals are 1-valued. Such regions are called *full* left-regions. We define a delimiter for the full regions. Let $L_{VT}$ =m if Full-1(VT, $l_{\rm VT}$ (m),m) and $l_{\rm VI}$ (m)-1 otherwise. Let $R_{\rm VI}=1$ if Full-0(VT, $l_{\rm rVI}$ (1)) and $r_{\rm VI}$ (1)+1 otherwise. Lemma 5.3 shows that the full left-regions can overlap the full right-regions only when all terminals in the interval of overlap are 0/1-valued. Lemma 5.3: If $L_{VI} \ge R_{VI}$ , then $|UNDET_{VI}(R_{VI}, L_{VI})| = 0$ , i.e. all directions are known on $(R_{VI}, L_{VI})$ . **Proof:** Assume $L_{V\Gamma} \ge R_{V\Gamma}$ . The proof counts the number of "0"s and "1"s necessarily in $(R_{V\Gamma}, L_{V\Gamma})$ using the definitions of $R_{V\Gamma}$ and $L_{V\Gamma}$ . Case 1: $|\mathsf{ONES}_{VI}(\mathsf{R}_{VI},\mathsf{L}_{VI})| \geq |\mathsf{ZEROS}_{VI}(\mathsf{R}_{VI},\mathsf{L}_{VI})|$ . Suppose $\mathsf{L}_{VI}$ is in range( $\mathsf{r}_{VI}$ ). Then $(\mathsf{R}_{VI},\mathsf{L}_{VI})$ is a set of right-regions of T. Since Full-0 is true for any right-region of terminals greater than $\mathsf{R}_{VI}$ , it must be that Full-0(VT, $\mathsf{R}_{VI},\mathsf{L}_{VI}$ ). However, we have assumed that there are at least as many "1"s as "0"s in $(\mathsf{R}_{VI},\mathsf{L}_{VI})$ . This gives $|ONES_{VT}(R_{VT}L_{VT})| = |ZEROS_{VT}(R_{VT}L_{VT})| = M(L_{VT}R_{VT}+1)$ and no elements of $(R_{VT}L_{VT})$ have value "?" under VT. We now suppose Lyr is not in the range of ryr. Then $$|ZEROS_{VI}(L_{VI}+1,r_{VI}(L_{VI}+1))| < \Gamma \frac{1}{2} (r_{VI}(L_{VI}+1)-(L_{VI}+1)+1)$$ and $r_{VI}(L_{VI}+1)=r_{VI}(L_{VI})$ . Interval $(R_{VI},r_{VI}(L_{VI}))$ is a set of right-regions and Full-0 holds. This gives: $$|\mathsf{ZEROS}_{VI}(\mathsf{R}_{VT},\mathsf{L}_{VI})| = |\mathsf{ZEROS}_{VI}(\mathsf{R}_{VT},\mathsf{r}_{VI}(\mathsf{L}_{VI}))| - |\mathsf{ZEROS}_{VI}(\mathsf{L}_{VT}+\mathsf{L},\mathsf{r}_{VI}(\mathsf{L}_{VI}))|$$ 经企为通知证据的基本经验的,这个的分别。5000 $|ZEROS_{VT}(R_{VT}L_{VT})| > \Gamma \% (r_{VT}(L_{VT}) - R_{VT} + 1) \text{ I.e. } Full-0(VT, R_{VT}L_{VT}) \text{ again implying no elements of } (R_{VT}L_{VT}) \text{ have value "7" under VT.}$ Case 2: $|ONES_{VI}(R_{V\Gamma}L_{VI})| < |ZEROS_{VI}(R_{V\Gamma}L_{VI})|$ This case is proven in the same manner as case 1 with the roles of $L_{VI}$ and left-regions interchanged with the roles of $R_{VI}$ and right-regions. The matching within the full left-regions $(1,L_{\rm VI})$ or the full right regions $(R_{\rm VI},m)$ can be maximized independent of the rest of T. Our algorithm uses this property to break up the problem. # Theorem 5.1 formally states the independence of the regions. Theorem 5.1: Let $V\Gamma_0$ be a given initial value function defining $R_{V\Gamma_0}$ and $L_{V\Gamma_0}$ . For every $V\Gamma$ consistent with $V\Gamma_0$ and every matching function $m_{V\Gamma}$ , there is a matching function $m'_{V\Gamma}$ agreeing with $m_{V\Gamma}$ on 0-valued terminals in $(1_{V\Gamma_0} + 1_{,m})$ but matching each 0-valued terminal in $(1, L_{V\Gamma_0})$ to a 1-valued terminal in $(1, L_{V\Gamma_0})$ and such that $|\text{range}(m'_{V\Gamma})| \geq |\text{range}(m_{V\Gamma})|$ . Analogously, there is a matching function $m''_{V\Gamma}$ agreeing with $m_{V\Gamma}$ on 1-valued terminals in $(1, R_{V\Gamma_0} - 1)$ but matching each 1-valued terminal in $(R_{V\Gamma_0}, m)$ to a 0-valued terminal in $(R_{V\Gamma_0}, m)$ and whose range is at least as large. **Proof:** We will only prove the existence of $m'_{V\Gamma}$ . The proof of the existence of $m''_{V\Gamma}$ is similar. The proof is by induction on the number of left-regions in $(1, I_{V\Gamma_0})$ . Basis: $(1,L_{V\Gamma_0})$ contains at most one region. If $L_{VT_0} = 0$ , then $(1, L_{VT_0})$ is empty. If $L_{VT_0} = 1$ , then $VT_0(1) = 1$ and $(1, L_{VT_0})$ does not contain any "0"s. In either case, the theorem is trivially true, If $L_{VT_0} > 1$ , then $l_{VT_0}(L_{VT_0}) = 1$ . Therefore, $OK-1(VT_0,1,L_{VT_0})$ . We also have Full-1( $VT_0,1,L_{VT_0}$ ) and can conclude $|ONES_{VT_0}(1,L_{VT_0})| = \frac{1}{2}L_{VT_0}$ . For any value function, a maximum matching can be found by scanning T from 1 to m. Each time a "1" is encountered, it is matched to the lowest numbered yet unmatched "6". As long as there are no more "1"s than "0"s in an interval (1,i), there will be an yet unmatched "6" in (1,i-1) to match a 1-valued terminal i. In the case we are considering, $OK-1(VT_0,1,i)$ holds for every $i \le L_{VT_0}$ . Therefore every 1-valued terminal can be matched to a 0-valued terminal if all 7-valued terminals in $(1,L_{VT_0})$ become 0-valued. Since there are $\frac{1}{2}L_{VT_0}$ 1-valued terminals in $\frac{1}{2}L_{VT_0}$ under $\frac{1}{2}L_{VT_0}$ such a matching function would match all 0-valued terminals in $\frac{1}{2}L_{VT_0}$ . For each 0-valued or ?-valued terminal under $\frac{1}{2}L_{VT_0}$ , we can associate the 1-valued terminal to which it would match under the above matching. Given any value function $\frac{1}{2}L_{VT_0}$ consistent with $\frac{1}{2}L_{VT_0}$ and any matching function, $\frac{1}{2}L_{VT_0}$ the desired matching function $\frac{1}{2}L_{VT_0}$ can be created by matching each 0-valued terminal in $(1, I_{V\Gamma_0})$ to the the 1-valued terminal it is associated with and using the matching defined by $m_{V\Gamma}$ for 0-valued terminals in $(L_{V\Gamma_0}, m)$ . Since $m'_{V\Gamma}$ matches at least as many "0"s as $m_{V\Gamma}$ , |range( $m'_{V\Gamma}$ )| is at least as large as |range( $m_{V\Gamma}$ )|. (Recall that matching functions are one-to-one so that the domain of a matching function is the same size as its range.) Induction: Consider the first left-region. By the argument used in the basis, the matching of the "O"s in this region can be restricted to the "1"s in the region. Therefore, we may remove the first region and consider the remaining terminals as a new problem. The left-regions of the remaining terminals are unchanged, and we apply the inductive assumption. The desired m'VI uses the matching of "0"s in the first left-region to "1"s in the first left-region and the matching of "0"s in the remaining full left-regions to "1"s in the remaining full left-regions obtained by the inductive assumption. ## 5.4 Assignment within Regions Given Theorem 5.1, we can show that it suffices so consider functions VT consistent with VT<sub>0</sub> which assign all ?-valued terminals in $(1,L_{\rm VT_0})$ the value "0" and all ?-valued terminals in $(R_{\rm VT_0},m)$ the value "1". This also holds for VB. Theorem 5.2 states this formally. The algorithm for assigning values begins by assigning these values to the originally ?-valued terminals and their pairs. When a conflict arises, i.e. $T(i)\in (1,L_{\rm VT_0})$ and $B(j)=p(T(i))\in (R_{\rm VB_0},n)$ or $T(i)\in (R_{\rm VT_0},m)$ and $B(j)=p(T(i))\in (1,L_{\rm VB_0})$ , either choice can be made. We choose to define the algorithm so that assignments are made in the following order: - 1. Make all ?-valued terminals in (1.L $_{ m VI}_{ m 0}$ ) and their bottom pairs 0-valued. .. - 2. Make all ?-valued terminals in (R<sub>VI</sub>, in) and their bottom pairs 1-valued. - 3. Make all ?-valued terminals in (1, L<sub>VB</sub>) and their top pairs 0-valued. - 4. Make all 2-valued terminals in ( $R_{VB_n}$ , n) and their top pairs 1-valued. This order gives a proference to assignments dictated by the top regions. Executing each of 1 through 4 defines new functions VT and VB which extend VT<sub>0</sub> and VB<sub>0</sub> and which may be extended to value functions achieving the maximum matching $(M_{VS_f} + M_{VB_f})$ . These new functions may induce new left-regions and right-regions. The algorithm repeatedly computes new $L_{VI}$ , $R_{VI}$ , $L_{VB}$ , and $R_{VB}$ , and applies one of 1 through 4, each in turn, until there are no full regions containing ?-valued terminals. Theorem 5.2: Let VT and VB be any pair of value functions consistent with $VT_0$ , $VB_0$ , and the pairing function, p. If there are ?-valued terminals under $VT_0$ in $(1,L_{VT_0})$ , then there are functions VT' and VB' consistent with $VT_0$ , $VB_0$ , and p, such that all ?-valued terminals in $(1,L_{VT_0})$ under $VT_0$ are 0-valued under VT', and $M_{VT'} + M_{VB'} \ge M_{VT} + M_{VB'}$ . Similarly, there are value functions assigning ?-valued terminals in $(R_{VT_0},m)$ , $(1,L_{VB_0})$ , or $(R_{VB_0},n)$ the values "1", "0", or "1" respectively, without decreasing the sum of the matchings on T and B. Proof: We will only prove the statement for fixed values in $(1,L_{\rm VF_0})$ . Proofs for the other intervals are analogous. Suppose there are ?-valued terminals in $(1,L_{\rm VF_0})$ under VT<sub>0</sub>. If these terminals are 0-valued under VT, we are done. Suppose there are such terminals which are not 0-valued under VT. Define value function VT' to agree with VT on $(L_{\rm VF_0}+1,m)$ but assign each ?-valued terminal under VT<sub>0</sub> in $(1,L_{\rm VT_0})$ the value "0". Let $m_{\rm VT}$ be a maximum matching function for VT which matches each 0-valued terminal in $(1,L_{\rm VT_0})$ under VT to a 1-valued terminal in $(1,L_{\rm VT_0})$ . By Theorem 5.1, such a $m_{\rm VT}$ must exist. Let $m_{\rm VT}$ be a matching function for VT' which agrees with $m_{\rm VT}$ on $(L_{\rm VT_0}+1,m)$ and matches each 0-valued terminal in $(1,L_{\rm VT_0})$ . Again, $m_{\rm VT}$ is guaranteed to exist by Theorem 5.1. Then, $|\text{range}(m_{VT'})| - |\text{range}(m_{VT})| = |\text{ZEROS}_{VT'}(1,L_{VT_0})| - |\text{ZEROS}_{VT}(1,L_{VT_0})| = d.$ But, $M_{VT'} \ge |\text{range}(m_{VT'})|$ and $M_{VT} = |\text{range}(m_{VT})|$ . Therefore, $M_{VT'} - M_{VT} \ge d$ . The corresponding VB' differs from VB by changing at most d "?"s or "1"s to "0"s. This can destroy at most d range values of any $m_{VB'}$ giving $M_{VB} - M_{VB'} \le d$ . Therefore $M_{VT'} + M_{VB'} \ge M_{VT} + M_{VB'}$ . It remains to assign 0/1 values to any ?-valued terminals of $T(L_{VT_e}+1,R_{VT_e}-1)$ and $B(L_{VB_e}+1,R_{VB_e}-1)$ , where $VT_e$ and $VB_e$ are the last extensions of $VT_0$ and $VB_0$ obtained by the above assignments. Intervals $T(L_{VT_e}+1,R_{VT_e}-1)$ and $B(L_{VB_e}+1,R_{VB_e}-1)$ may be empty. Given Theorem 5.1, we can maximize the sum of matches within $T(L_{VT_e}+1,R_{VT_e}-1)$ and $B(L_{VB_e}+1,R_{VB_e}-1)$ independent of the full regions. We remove the full regions and define a new T and B containing only the remaining terminals. We get a problem on vectors T(1,m') and B(1,n') with new pairing function, p', and new initial value functions, $VT'_0$ and $VB'_0$ , such that $$\begin{split} m' &= R_{V\Gamma_e} - L_{V\Gamma_e} - 1 \text{ and } n' = R_{VB_e} - L_{VB_e} - 1, \\ V\Gamma'_0(i) &= V\Gamma_e (L_{V\Gamma_e} + i) \text{ and } VB'_0(i) = VB_e (L_{VB_e} + i). \end{split}$$ The pairing function, p', corresponds to the the old pairing function when both terminals of a pair are within $(L_{V\Gamma_e}+1,R_{V\Gamma_e}-1)$ and $(L_{VB_e}+1,R_{VB_e}-1)$ . If a terminal within one of these intervals was originally paired with a terminal outside the intervals, then the new function pairs this terminal with \* (don't care). Any terminal originally paired with a terminal outside these intervals is 0-valued or 1-valued under $VT_e$ or $VB_e$ . Since the pairing function is only needed for ?-valued terminals, changing the pairing of such a terminal to \* does not change the problem. The new value functions, $VT_0$ and $VB_0$ , induce exactly one left-region and one right-region on each of T(1,m') and B(1,n'). None of these regions is full. The assignment to the new vectors is made using procedure SCAN-ASSIGN. The procedure is called with inputs (T(1,m'),B(1,n'),p',VT'<sub>0</sub>,VB'<sub>0</sub>). This procedure scans the top vector, reassigning top-bottom pairs of ?-valued terminals the value "0" whenever this assignment would not create a bottom right interval, B(s,n), with more that half 0-valued terminals. When such an interval would result, SCAN-ASSIGN tries to reassign the terminals the value "1". If this would create a top left interval, T(1,q), with more than half I-valued terminals, the procedure stops and returns "FAIL". The procedure which called SCAN-ASSIGN must handle the failure. If no failure occurs, SCAN-ASSIGN continues assigning until there is a full top right-region, i.e. a right interval with at least half "0"s. Any remaining ?-valued terminals are assigned the value "1". The procedure SCAN-ASSIGN is defined in Figure 5.3. The following two lemmas and theorem prove the correctness of SCAN-ASSIGN(T(1,m),B(1,n),p, $VT_0$ , $VB_0$ ) when it succeeds in assigning "0"s or "1"s to all ?-valued terminals in T(1,m) and B(1,n). Lemma 5.4: Let $VT_i$ and $VB_i$ be the value functions assumed by variables VT and VB after considering terminal T(i) in either the first or second FOR loop during the execution of $SCAN-ASSIGN(T(1,m),B(1,m),p,VT_0,VB_0)$ . If $VT_0$ and $VB_0$ do not define any full regions on T or B, then for any i, $1 \le i \le m$ , $VT_i$ and $VB_i$ satisfy the following properties: For all k, $1 \le k \le m$ , OK-1(VT, 1,k) For all k, $1 \le k \le n$ , OK-0(VB, k,n) Furthermore, if SCAN-ASSIGN(T(1,m),B(1,m),p,VT<sub>0</sub>,VB<sub>0</sub>) enters the second FOR loop, then for all value functions VT<sub>j</sub> and VB<sub>j</sub>, where T(j) is considered in this loop, $|ZEROS_{VT_i}(1,m)| = \Gamma \frac{1}{2}mT$ . Proof: We first prove the properties for functions defined in the first FOR loop by induction on i. Basis: The properties are true for $VT_0$ and $VB_0$ by hypothesis. Induction: By inductive assumption, the properties are true after considering terminal T(i-1), ie. after the $(i-1)^{th}$ execution of the loop. Function $VT_i$ differs from $VT_{i-1}$ at most for T(i). For this to affect $OK-1(VT_i,1,k)$ for some k, the following must hold: $|ONES_{VT_{i-1}}(1,k)| = L \frac{1}{2}k L J$ , $k \ge i$ , $VT_{i-1}(i) = ?$ , and $VT_i(i) = 1$ . However, if $|ONES_{VT_{i-1}}(1,k)| = L \frac{1}{2}k L J$ , then $VT_i(i)$ is not assigned "1". Therefore, for all k, $OK-1(VT_i,1,k)$ holds. For $OK-0(VB_i,k,n)$ not to hold for some k, it must be that: $|ZEROS_{VB_{i-1}}(k,n)| = L \frac{1}{2}k(n-k+1)J$ , $k \le p(i)$ , $VB_{i-1}(p(i)) = ?$ , and $VB_i(p(i)) = 0$ . However, if $|ZEROS_{VB_{i-1}}(k,n)| = L \frac{1}{2}k(n-k+1)J$ , then $VT_i(i)$ and $VB_i(p(i))$ are not assigned "0". Therefore, $OK-0(VB_i,k,n)$ must hold for all k. If SCAN-ASSIGN enters the second FOR loop, let h be the first value of j considered in this loop. Execution of the first FOR loop is completed because Full-0( $VT_{h-1}$ ,q,m) holds for some q. We Figure 5.3: Definition of SCAN-ASSIGN ``` SCAN-ASSIGN(T(1,m),B(1,n),p,VT_{p},VB_{p}) ``` /Assigns 0/1 to ?-valued terminals in T and B where $L_{VT_0} = 0$ , $R_{VT_0} = m+1$ , $L_{VB_0} = 0$ , $R_{VB_0} = n+1$ , and no region is full./ ``` VT := V\Gamma_0 \text{ and } VB := VB_0 FOR \ i = 1 \ STEP \ 1 \ UNTIL \ (3q)Full-0(VT,q,m); IF \ VI(i) =? \ THEN IF \ (3s) \ p(i) \in B(s,n) \ and \ [ZEROS_{VB}(s,n)] = L \frac{1}{2}(n-s+1)J \ THEN IF \ (3q) \ i \leq q \leq m \ and \ |ONES_{V2}(1,q)| = L \frac{1}{2}(1,q) ``` know that $h \ge 2$ and $q \le h$ since $VT_{h-1}$ agrees with $VT_0$ on Wh,m) and for no k does Full-0( $VT_0,k,m$ ) hold. If $q \ne 1$ , we know $OK-1(VT_{h-1},1,q-1)$ holds by the first part of this proof. This implies that $|ZEROS_{VT_{h-1}}(1,q-1)| \ge \Gamma \frac{1}{2}(q-1)$ since no terminals in T(1,h-1) are ?-valued under $VT_{h-1}$ . Then $$|ZEROS_{VT_{h-1}}(1,m)| \ge \Gamma\frac{1}{2}(q-1)^{\frac{1}{2}} + \Gamma\frac{1}{2}(m-q+1)^{\frac{1}{2}} \ge \Gamma\frac{1}{2}m^{\frac{1}{2}}.$$ However, $|ZEROS_{VT_{h-2}}(1,m)| < \Gamma \frac{1}{m}$ ; otherwise, the first loop would not have been repeated for h-1. Therefore, $|ZEROS_{VT_{h-1}}(1,m)| = \Gamma \frac{1}{m}$ . The second FOR loop does not assign any "0"s. Therefore, for all $VT_j$ , $h \le j \le m$ , $|ZEROS_{VT_j}(1,m)| = \Gamma \frac{1}{m}$ . Also, since $OK - 0(VB_{h-1},k,m)$ holds for any k, $OK - 0(VB_j,k,m)$ holds for any k. It remains to show that $OK-1(VT_j,1,k)$ holds for all k. Since any $VT_j$ agrees with $VT_{h-1}$ on T(1,h-1), $OK-1(VT_j,1,k)$ must hold for k<h. Suppose that for some $k \ge h$ and some $j \ge h$ , $OK-1(VT_j,1,k)$ does not hold. Since the number of 1-valued terminals increases on each iteration, $OK-1(VT_m,1,k)$ must also not hold. Since no terminal is ?-valued under $VT_m$ , $|ZEROS_{VT_m}(1,m)| = \Gamma \frac{1}{2}m \Gamma$ implies $|ONES_{VT_m}(1,m)| = L \frac{1}{2}m \Gamma$ ; therefore $k \ne m$ . For any $$i \ge h$$ , $|ZEROS_{VT_m}(i,m)| = |ZEROS_{VT_0}(i,m)| < \Gamma \frac{1}{2}(m-i+1)$ 7. This implies $$|ONES_{VI_{m}}(k+1,m)| > L1/6(m-(k+1)+1)J.$$ Then $$|ONES_{VI_m}(1,k)| < L\%mJ - L\%(m-k)J$$ , giving $|ONES_{VT_m}(1,k)| < \Gamma \frac{1}{2}k$ , contradicting our assumption that $OK-1(VT_m,1,k)$ does not hold. Lemma 5.5: For any call to SCAN-ASSIGN as in Lemma 5.4, the first FOR loop is completed with i≤m. **Proof:** If the lemma does not hold, then after i = m in the first FOR loop: $$|ZEROS_{VI_m}(1,m)| < \Gamma \frac{1}{m}$$ However, by Lemma 5.4, $$|ONES_{VI_m}(1,m)| \le L \frac{1}{m}$$ , implying $|ZEROS_{VI_m}(1,m)| \ge \Gamma \frac{1}{m}$ . Theorem 5.3: Let $VT_0$ and $VB_0$ be initial value functions which induce no full regions on T and B. If SCAN-ASSIGN(T(1,m), B(1,n), p, $VT_0$ , $VB_0$ ) returns (SUCCESS, $VT_m$ , $VB_m$ ), then $VT_m$ and $VB_m$ maximize $M_{VT} + M_{VB}$ over all VT and VB consistent with $VT_0$ and $VB_0$ . Proof: Since $OK-1(V\Gamma_m,1,k)$ holds for $1 \le k \le m$ and $|ONES_{V\Gamma_m}(1,m)| = L 1/2 m J$ , $M_{V\Gamma_m} = L 1/2 m J$ , the maximum possible. Since $OK-0(VB_m,k,n)$ holds for $1 \le k \le n$ , each 0-valued bottom terminal can be matched, and $M_{VB_m} = |ZEROS_{V\Gamma_m}(1,n)|$ . If $M_{VB_m} = L 1/2 n J$ , then no larger matching is possible and we are done. Suppose $M_{VB_m} < L 1/2 n J$ . Let $M_{VB_m} = L 1/2 n J - \Delta$ , $\Delta > 0$ . Let $V\Gamma$ and VB be any other value functions consistent with p, $V\Gamma_0$ , and $VB_0$ . We must show that $$M_{VT} + M_{VB} \le M_{VT_m} + M_{VB_m} = L\frac{1}{2}nJ - \Delta + L\frac{1}{2}mJ$$ Since the initially ?-valued terminals are assigned 0/1 values in top-bottom pairs, we have: $$|ZEROS_{VI_0}(1,m)| - |ZEROS_{VB_0}(1,n)| = |ZEROS_{VI_{m}}(1,m)| - |ZEROS_{VB_{m}}(1,n)|$$ $$= \Gamma \% m - L \% n + \Delta \quad \text{and} \quad$$ $|ZEROS_{VI}(1,m)| - |ZEROS_{VI}(1,m)| = |ZEROS_{VB}(1,n)| - |ZEROS_{VB}(1,n)|.$ Let $|ZEROS_{VI}(1,m)| = |ZEROS_{VI}(1,m)| =$ $$\begin{aligned} |ZEROS_{VB}(1,n)| &= |ZFROS_{VT}(1,m)| \cdot (|ZFROS_{VT}(1,m)| \cdot |ZFROS_{VB}(1,n)|) \\ &= \lceil \frac{1}{2}m \rceil + z \cdot (\lceil \frac{1}{2}m \rceil \cdot \lfloor \frac{1}{2}m \rfloor + \Delta) = \lfloor \frac{1}{2}m \rfloor + z \cdot \Delta. \end{aligned}$$ Case 1: z<0. Then $$M_{VT} + M_{VB} \le |ZEROS_{VI}(1,m)| + |ZEROS_{VB}(1,n)|$$ $\le \Gamma / 4m + z + L / 4m + r \triangle < L / 4m + L / 4m + \Delta$ . Case 2: $$0 \le z$$ . Then $M_{VT} + M_{VB} \le |ONFS_{VT}(1,m)| + |ZEROS_{VB}(1,n)|$ $\le L\frac{1}{2}mJ-z+L\frac{1}{2}mJ+z-\Delta = L\frac{1}{2}mJ+L\frac{1}{2}mJ-\Delta$ . It remains for us to deal with a return of (FAIL k,VT,VB). Before describing how a failure is handled, we prove that for certain value functions, the value of a terminal can be switched from one of "0" or "1" to the other without decreasing M<sub>VT</sub>+M<sub>VB</sub>. These value functions are consistent with the initial value functions used as input to SCAN-ASSIGN and assign only "0"s and "1"s. They assign an imbalance of "0"s and "1"s in some left or right interval. We will use this ability to change the values of terminals to prove that the value functions returned by SCAN-ASSIGN do not assign too many or too few "0"s and "1"s to achieve an optimal matching. Lemma 5.6: Let $VT_0$ and $VB_0$ be initial value functions which define no full regions on T and B. Let VT and VB be any function consistent with $VT_0$ , $VB_0$ , and p under which no terminal is ?-valued. A) For any x, $1 \le x \le m$ , if $\{ONES_{VI}(1,x)\} - \{ZEROS_{VI}(1,x)\} > 1$ , then there is a terminal i in T(1,x) such that $VT_0(i) = ?$ , VT(i) = 1, and the functions VT' and VB' obtained by changing the values of i and p(i) to "0" are such that $M_{VT'} + M_{VB'} \ge M_{VT} + M_{VB}$ . B) For any y, $1 \le y \le n$ , if $|ZEROS_{VB}(y,n)| - |ONES_{VB}(y,n)| > 1$ , then there is a terminal i in B(y,n) such that $VB_0(i) = ?$ , VB(i) = 0, and the functions VT' and VB' obtained by changing the values of i and p(i) to "1" are such that $M_{VT'} + M_{VB'} \ge M_{VT} + M_{VB'}$ . Proof: We will only prove A. The proof for B is analogous. Since $|ONES_{VI}(1,x)|-|ZEROS_{VI}(1,x)|>1$ , there are at least two "1"s in T(1,x) unmatched under any matching function for VT. Let there be a matching function achieving $M_{VT}$ for which the terminal of lower index among two unmatched "1"s is ?-valued under $VT_0$ . We will show by contradiction that such a matching function must exist. Given such a matching function, change the value of the terminal of lower index to "0", changing the value of its bottom pair as well. This defines VT' and VB'. The new "0" can match the second unmatched "1", giving $M_{VT'} = M_{VT} + 1$ . In B(1,n), at most one range value of any matching function has been destroyed; therefore, $M_{VB'} \ge M_{VB} - 1$ . It follows that $M_{VT'} + M_{VB'} \ge M_{VT} + M_{VB}$ . We now show that the matching function described above must exist. Figure 5.4 illustrates. Suppose that for any matching function achieving $M_{VT}$ , at most one 1-valued terminal in T(1,x) which was initially ?-valued (i.e. under $VT_0$ ) is unmatched, and that if there is such a terminal, it is the terminal Figure 5.4: The proof of Lemma 5.6. # Claim exists: ### Otherwise: any 1-valued terminal under VT<sub>0</sub> is matched to a 0-valued terminal here Figure 5.5: Definition of C. of highest index among all unmatched "1"s in T(1,x). Then, for any matching function achieving $M_{VP}$ there is at least one 1-valued terminal under VT<sub>0</sub> which is unmatched. Consider all matching functions achieving $M_{VT}$ for which a maximum number of 1-valued terminals in T(1,x) are matched. Among these, M. THE SERVICES OF SERVICE STATE SAN consider only those functions for which a maximum number of originally ?-valued, now 1-valued terminals in T(1,x) are unmatched (either zero or one such terminals). For each of these, note the indices of any terminals in T(1,x) which are unmatched and are 1-valued under $VT_0$ . Choose the matching function for which the highest index, say u, is obtained. No initially ?-valued terminal in T(1,u-1) which is 1-valued under VT is unmatched. Each 0-valued terminal in T(1,u-1) must match an initially 1-valued I Mills on Waters terminal in T(1,u-1). Otherwise, the matching function could be modified so that the offending 0-valued and a basista of the brist of the terminal matched u. If the offending O-valued terminal had been unmatched, this would produce a larger right which has but the city of the matching; if it had matched a terminal in T(x+1,m), this would produce a matching under which more 1-valued terminals in T(1,x) are matched. If the offending terminal had matched an initially ?-valued terminal in T(1,x), matching it to u would produce a matching with a larger number of unmatched 1-valued terminals which are ?-valued under $V\Gamma_0$ ; if it had matched a terminal in T(u+1,x) which is COMPRESSION SPECKED AS A SUPER-1-valued under VT<sub>0</sub>, this would produce a matching with an unmatched terminal which is 1-valued under VT<sub>n</sub> and of higher index than u. Any of these possibilities contradict our choice of matching function. There can be no matched 1-valued terminals in T(1,u-1) which are initially ?-valued nor any unmatched terminals of this type. All 1-valued terminals in T(1,u-1) are initially 1-valued. Since all 0-valued terminals in T(1,u-1) must match 1-valued terminals in T(1,u-1), we have: $$|ZEROS_{VI}(1,u-1)| \le |ONES_{VI}(1,u-1)| = |ONES_{VI_0}(1,u-1)|$$ Since no terminals are ?-valued under VT, this implies $|ONES_{VI_0}(1,u-1)| \ge \Gamma \frac{1}{2} (u-1) \frac{1}{2}$ , contradicting the hypothesis that there are no full regions under $VI_0$ . #### 5.5 Handling a "failure" Let us suppose that SCAN-ASSIGN returns (FAIL k, VT, VB). We will now describe what is Consider and Combined Physical Indiana and the American State of the Combined t known about T, B, and their value functions after SCAN-ASSIGN returns. We will use this information of the entransfer of the continuity conti to handle the failure. Procedure SCAN-ASSIGN returns the value, k, of loop variable i when el glacione glacak useg og singe k**eddes bådenka**ndare elle (a. 1911 – Alexand SCAN-ASSIGN encountered the failure. Therefore we know that VT<sub>k-1</sub> and VB<sub>k-1</sub> are the value functions returned. There is a $q \ge k$ such that $|ONES_{VT}| = L / 4qJ$ . We will use the smallest such q. There is an s satisfying the following three properties: (i) $p(k) \ge s$ , (ii) $|ZEROS_{VB}| (s,n) = s$ ว และ ประเทศที่ เขาเกราะ (สาย โดย โดย โดย โดยได้เหลือ เรียนเลาะ โดยได้นี้ เป็นเป็นเดิมได้เกิดได้ ได้ โดย เราะ L½(n-s+1), and (iii) each terminal in T(1,k-1) which has been assigned the value "1" by SCAN-ASSIGN is paired with a terminal in B(s,n), i.e. ONES, (1,k-1)-ONES, (1,k-1)⊆p(B(s,n)). This last property follows from the fact that SCAN-ASSIGN only assigns VT(i)=1 when, at the time, ा राज है। वार क्षेत्रक वार का का का का का का है जिसे हैं अपने की किसी की के स्वारी के स्वारी का की है। है है $(\exists s)(p(i) \in B(s,n))$ and $|ZEROS_{VB}(s,n)| = L\frac{1}{2}(n-s+1)J$ . Therefore, we can associate an $s_i$ with each galdal lar lett H. Ledaum ara Latin in die ei ?-valued T(i) made 1-valued. Once $|ZEROS_{VB_i}(s_i,n)| = L\frac{1}{2}(n-s_i+1)J$ for some $VB_i$ , the number of "0"s n centeria in branco di la giatoria di fili di indiana in the interval does not change in later iterations. Therefore, choosing the smallest of any such s, and the rational files and their a declaration of his AP a**thesis absence with his** built all more thanks s, associated with the failure, we have an index, s, which satisfies all three properties. In fact, we use the u radam keundala kebidakan mengantah dibidah kebasaran dia 1777 yang beberapa largest s which satisfies all three properties. Note that $VT_{k-1}(k) = VB_{k-1}(p(k)) = ?$ . We also know that each terminal in B(s,n) which is assigned a "0" or a "1" value by SCAN-ASSIGN must be paired with a to make the first of the American indicate the reason of the conservaterminal in T(1,k-1), since no initially ?-valued terminals in T(k,m) or their bottom pairs have been othermic or the him denimed bushed the age with the difficult reassigned. Let C be the set of initially ?-valued terminals in T(1,q) whose pairs are in B(s,n) (Figure 5.5). For an optimal assignment at the top, all remaining ?-valued terminals in T(1,q) should become "0"s. For an optimal assignment at the bottom, all remaining ?-valued terminals in B(s,n) should become "1"s. Obviously, on C these are conflicting goals. We wish to find 0/1 assignments for the remaining ?-valued terminals in C (including k) for which some extension will achieve the maximum sum of matches. To do this, we first prove that we need only consider value functions which assign a number of "0"s or "1"s in C and the anticologists of the control of the second of the control within a certain range. Let: $$c_1 = |ONES_{VT_{k-1}}(C)|$$ $$c_0 = |ZEROS_{VT_{k-1}}(C)|$$ $$c_1 = |UNDET_{VT_{k-1}}(C)|$$ To simplify the statement of the lemmas to follow, let "assume the standard state after SCAN-ASSIGN returns 'FAIL'" mean: "Assuming $VT_0$ and $VB_0$ are initial value functions which define no full regions on T and B, let SCAN-ASSIGN(T(1,m), B(1,n), p, $VT_0$ , $VB_0$ ) return (FAIL k, $VT_{k-1}$ , $VT_{k-1}$ ). Let q, s, C, $c_0$ , $c_1$ , and $c_7$ be as we have defined them above." Lemma 5.7: Assume the standard state after SCAN-ASSIGN returns: "FAIL". Let VT and VB be arbitrary value functions consistent with $VT_0$ , $VB_0$ , and p under which there are no ?-valued terminals. If $c_7 > 1$ , then for any x, $1 \le x \le c_7 - 1$ , there are value functions VT' and VB' also consistent with $VT_0$ , $VB_0$ , and p under which there are no ?-valued terminals such that: $$M_{VT}' + M_{VB}' \ge M_{VT} + M_{VB}$$ and $|ONES_{VT}'(C)| = c_1 + x$ If $c_1 = 1$ , there are such functions VT' and VB' for which $c_1 \le |ONES_{v,T'}(C)| \le c_1 + 1$ . **Proof**: If s > 1, let D be the set of initially ?-valued terminals in T(1,q) whose pairs are in B(1,s-1). Then: $|ONES_{VI}(1,q)| = |ONES_{VI}(C)| + |ONES_{VI}(D)| + |ONES_{VI_0}(1,q)|$ Note that $|ONES_{VI_{k-1}}(1,q)| = |ONES_{VI_0}(1,q)| + |ONES_{VI_{k-1}}(C)| = L \frac{1}{2} q J$ Case 1 $|ONES_{VI}(C)| \ge c_1 + x$ , where if $c_1 = 1$ , x = 1. We use induction on $|ONES_{VI}(C)| + |ONES_{VI}(D)|$ . Basis: $|ONES_{VI}(C)| + |ONES_{VI}(D)| = c_1 + x$ . Then $|ONES_{VI}(C)| = c_1 + x$ as desired. Induction: Assume that the lemma holds if: $$c_1 + x \le |ONES_{VI}(C)| + |ONES_{VI}(D)| < c_1 + x + i, i > 0.$$ When $|ONES_{VI}(C)| + |ONES_{VI}(D)| = c_1 + x + i$ , then: $|ONES_{VI}(1,q)| = c_1 + x + i + |ONES_{VI_0}(1,q)| = L \frac{1}{2} + x + i \ge L \frac{1}{2} + 2$ If $|ONES_{VI}(C)| = c_1 + x$ , we are done. Otherwise, we use Lemma 5.6-A. $$|ONES_{VI}(1,q)| - |ZEROS_{VI}(1,q)| \ge 1.44qJ + 2 - (\Gamma 1/4qT - 2) \ge 3$$ Therefore, by Lemma 5.6-A, there are VT and YB' such that $M_{VT'} + M_{VB'} \ge M_{VT} + M_{VB}$ and $$|ONES_{VT}(C)| + |ONES_{VT}(D)| = |ONES_{VT}(C)| + |ONES_{VT}(D)| - 1.$$ By inductive assumption, there are VI" and VB" such that $M_{VI''} + M_{VB'} \ge M_{VI'} + M_{VB'}$ and $|ONES_{VI''}(C)| = c_1 + x$ . Case 2 $$|ONES_{VI}(C)| \le c_1 + x$$ , where if $c_7 = 1$ , $x = 0$ . Then $|ZEROS_{VI}(C)| \ge c_0 + c_1 + c_2 - (c_1 + x) = c_0 + c_2 - x$ . If q < m, let E be the set of initially ?-valued terminals on B(s,n) whose pairs are in T(q+1,m). Let p(C) denote the bottom pairs of C. $$|ZEROS_{VB}(s,n)| = |ZEROS_{VB}(p(C))| + |ZEROS_{VB}(E)| + |ZEROS_{VB_n}(s,n)|$$ We know $|ZEROS_{VB_{k-1}}(s,n)| = |ZEROS_{VB_{k-1}}(p(C))| + |ZEROS_{VB_0}(s,n)| = L1/(n-s+1).1$ We use induction on $|ZEROS_{VB}(p(C))| + |ZEROS_{VB}(E)|$ . Basis: $|ZEROS_{VB}(p(C))| + |ZEROS_{VB}(E)| = c_0 + c_7 - x$ . Then $|ZEROS_{VB}(p(C))| = c_0 + c_7 - x$ , implying $|ONES_{VB}(p(C))| = c_1 + x$ as desired. Induction: Assume the lemma holds for: $$c_0 + c_7 - x \le |ZEROS_{VB}(p(C))| + |ZEROS_{VB}(E)| < c_0 + c_7 - x + i$$ , for $i > 0$ . When $|ZEROS_{VB}(p(C))| + |ZEROS_{VB}(E)| = c_0 + c_7 - x + i$ : $$|ZEROS_{VB}(s,n)| = c_0 + c_7 - x + i + |ZEROS_{VB_0}(s,n)| = L\frac{1}{2}(n-s+1)J + c_7 - x + i \ge L\frac{1}{2}(n-s+1)J + 2.$$ If $|ZEROS_{VB}(p(C))| = c_0 + c_7 - x$ , then $|ONES_{VI}(C)| = c_1 + x$ as desired. Otherwise, we use Lemma 5.6-B. $$|{\rm ZEROS}_{\rm VB}(s,n)| - |{\rm ONES}_{\rm VB}(s,n)| \ge L \frac{1}{2} (n-s+1) \, {\rm J} + 2 \cdot (\Gamma \frac{1}{2} (n-s+1) \, {\rm J} - 2) \ge 3.$$ By Lemma 5.6-B, there are VT' and VB' such that $M_{VB'} + M_{VB'} \ge M_{VB} + M_{VB}$ and $$|\mathsf{ZEROS}_{\mathsf{VB}'}(\mathsf{p}(\mathsf{C}))| + |\mathsf{ZEROS}_{\mathsf{VB}'}(\mathsf{E})| = |\mathsf{ZEROS}_{\mathsf{VB}}(\mathsf{p}(\mathsf{C}))| + |\mathsf{ZEROS}_{\mathsf{VB}}(\mathsf{E})| - 1.$$ By inductive assumption, there are VT" and VB" such that $M_{VT''} + M_{VB''} \ge M_{VT'} + M_{VB'}$ and $|ONES_{VT''}(C)| = c_1 + x$ . We now know that there are value functions schleving the maximum sum of matches which assign at least as many "1"s in C as SCAN-ASSIGN has similared. However, we do not know if the particular choices for 1-valued terminals will lead to an optimal solution. We will now show that except for possibly one terminal which we can easily find, SCAN-ASSIGN has made good choices. Note that the distinction made between $c_1 \neq 1$ and $c_2 \geq 1$ as in Lamma 5.7 is necessary. When $c_2 \geq 1$ , regardless of the extensions of $VT_0$ and $VB_0$ being considered, there are enough necessarily unmatched "1"s in T(1,q) or unmatched "0"s in B(s,n) so that we can trade off top matches against bottom matches. When $c_2 = 1$ , we know that an optimal assignment will have at least one unmatched "1" on T(1,q) or one unmatched "0" on B(s,n). However, we cannot trade these against each other because when a "0" is unmatched in B(s,n), the extra "0" in T(1,q) may match a "1" in T(q+1,m). Similarly, an extra "1" in B(s,n) may match a "0" in B(1,s-1). The algorithm must try both choices — leaving one unmatched "1" in T(1,q) or one unmatched "0" in B(s,n). The terminal which may have been incorrectly assigned is the smallest numbered 1-valued terminal in B(s,n) which is 7-valued under $VB_0$ . Assume this is terminal i. Given two terminals, u and v, with u < v in T and p(u) < p(v) in B, assigning "0" to u and "1" to v is always preferable to assigning "1" to u and "0" to v. This is because any terminal which v or p(v) can match when they are 0-valued, u or p(u), respectively, can match when they are 0-valued; any terminal which can be matched to u or p(u) when they are 1-valued. Therefore, assigning "0" to u and p(u) and "1" to v and p(v) gives at least as large a matching on each of T and B as the opposite assignment. Therefore, if terminal v and its pair are smaller numbered than p(k) and k, respectively, then it is better to have terminal 1 0-valued and terminal k 1-valued than vice versa. However, the preferred assignment is not consistent with $VB_{k-1}$ and $VB_{k-1}$ . We define new value functions which maintain the important properties of $VT_{k-1}$ and $VB_{k-1}$ , but allow the preferred assignment. $$VT_{fx}(\mathbf{k}) = 1 \qquad VT_{fx}(\mathbf{p}(\mathbf{h})) = ?$$ $$VB_{fx}(\mathbf{p}(\mathbf{k})) = 1 \qquad VB_{fx}(\mathbf{h}) = ?$$ Note that $OK-1(VT_{fx},1,x)$ holds for all x, $1 \le x \le q$ , and $OK-0(VB_{fx},y,n)$ holds for all y, $s \le y \le n$ . The "standard state after SCAN-ASSIGN returns 'FAIL'" will now include h. $VT_{fx}$ , and $VB_{fx}$ as just defined. We will now prove that $VT_{fx}$ and $VB_{fx}$ are satisfactory extensions of $VT_0$ and $VB_0$ , i.e. they will lead to a pair of value functions which achieve the maximum matching $M_{VT_f} + M_{VB_f}$ . Lemma 5.8 gives properties of extensions of $VT_{fx}$ and $VB_{fx}$ which will be needed to prove that it is sufficient to consider only these extensions. Lemma 5.8: Assume the standard state after SCAN-ASSIGN returns "FAIL". Let VT and VB be extensions of $VT_{fx}$ and $VB_{fx}$ . A. If there is a ?-valued terminal in T(p(h),q) under $VT_{fi}$ which is 1-valued under VT, then, for any 1-valued terminal, i, in T(1,q) under VT, there is a matching function $m_{VT}$ achieving $M_{VT}$ which matches each 0-valued terminal in T(1,q) to a 1-valued terminal in T(1,q) and leaves i unmatched. B. If there is a ?-valued terminal in B(s,h) under $VB_{fx}$ which is 0-valued under VB, then, for any 0-valued terminal, j, in B(s,n) under VB, there is a matching function $m_{VB}$ which matches Figure 5.6: Definition of h. each 1-valued terminal in B(s,n) to a 0-valued terminal in B(s,n) and leaves j unmatched. Proof of A: If suffices to show that $|ZEROS_{VI}(x,q)| < |ONES_{VI}(x,q)|$ for all x in T(1,q). If this is true, then we can produce a matching on T(1,q) which matches each "0" in T(1,q) to a "1" in T(1,q) and does not match a given 1-valued terminal i. We define the matching function by scanning T(1,q) from q to 1 and matching each "0" executive to a yet unmatched "1" of higher index other than i. The fact that $|ZEROS_{VI}(x,q)| < |ONES_{VI}(x,q)|$ guarantees that we will find such an unmatched "1" for each "0". This matching can be extended to a matching which achieves $M_{VI}$ by letting the matching agree with any matching function achieving $M_{VI}$ on "0"s in T(q+1,m). For any x in T(1,q), we will show that $|ZEROS_{VI}(x,q)| \leftarrow |ONES_{VI}(x,q)|$ by showing that more than half the terminals in T(x,q) are 1-valued under VT. Suppose $1 \le x \le p(h)$ . Then, by hypothesis: $$|ONES_{VI}(x,q)| \ge |ONES_{VI_{f_n}}(x,q)| + 1.$$ For x=1, $|ONES_{VT_{fix}}(1,q)| = L\frac{1}{2}qJ$ . For x>1, we know that $OK-1(VT_{fix},1,x-1)$ holds. Therefore, $|ONES_{VT_{fix}}(x,q)| \ge L\frac{1}{2}qJ + L\frac{1}{2}(x-1)J \ge L\frac{1}{2}(q-x+1)J$ and $|ONES_{VT}(x,q)| \ge L\frac{1}{2}(q-x+1)J + 1$ , as desired. Suppose $p(h) \le q$ . We claim that $|ONES_{VI_{A_k}}(1,x-1)| \le L \frac{1}{2}(x-1)J$ . If true, this gives: $|\mathsf{ONES}_{\mathsf{VT}}(x,q)| \geq |\mathsf{ONES}_{\mathsf{VT}_{\mathsf{fx}}}(x,q)| > \mathsf{L}_{\mathsf{M}}(x-1)\mathsf{J} \geq \mathsf{L}_{\mathsf{M}}(q-x+1)\mathsf{J} \text{ as desired.}$ If x > k, then $|\mathsf{ONES}_{\mathsf{VT}_{\mathsf{fx}}}(1,x-1)| = |\mathsf{ONES}_{\mathsf{VT}_{\mathsf{k}-1}}(1,x-1)| < \mathsf{L}_{\mathsf{M}}(x-1)\mathsf{J}$ . Otherwise, we would have chosen x-1 as q, but $x \leq q$ . If $x \leq k$ , then since x > p(h), $p(h) \neq k$ . In this case, $$|ONES_{V\Gamma_{fx}}(1,x-1)| = |ONES_{V\Gamma_{k-1}}(1,x-1)| - 1$$ since $VT_{fx}(p(h)) = ?$ and $VT_{k-1}(p(h)) = 1$ and the value functions agree everywhere else on T(1,k-1). Since $OK-1(VT_{k-1},1,x-1)$ holds, $|ONES_{VT_{ac}}(1,x-1)| < L \frac{1}{2}(x-1)J$ . Proof of B: It suffices to show that $|ONES_{VB}(s,y)| < |ZEROS_{VB}(s,y)|$ for all y in B(s,n) so that all 1-valued terminals in B(s,n) can be matched to 0-valued terminals in B(s,n) without using j. Produce the matching by scanning from s to n. As in the proof of A, we prove that more than half the terminals in B(s,y) are 0-valued. For $y \ge h$ , by hypothesis: $$|ZEROS_{VB}(s,y)| \ge |ZEROS_{VB_{fk}}(s,y)| + 1.$$ We know $|ZEROS_{VB_{qq}}(s,n)| = L^{1/4}(n-s+1)J$ and, if $y \le n$ , $OK-O(VB_{fr}y+1,n)$ holds. Therefore: $$|ZEROS_{VB_{fit}}(s,y)| \ge L \frac{1}{2}(n-s+1)J \cdot L \frac{1}{2}(n-(y+1)+1)J \ge L \frac{1}{2}(y-s+1)J$$ and $|ZEROS_{VB}(s,y)| \ge L \frac{1}{2}(y-s+1)J + 1$ , as desired. For y < h, B(y+1,n) contains p(k) and all 1-valued terminals in C under $VB_{k-1}$ . Therefore, if $|ZEROS_{VB_{k-1}}(y+1,n)| = L^{1/2}(n-y)J$ , we would have chosen y+1 as s, but $y \ge s$ . Therefore: $$|ZEROS_{VB_{fi}}(y+1,n)| = |ZEROS_{VB_{fi}}(y+1,n)| \le L \mathcal{U}(n-y) \perp \text{ and}$$ $$|ZEROS_{VB}(s,y)| \ge |ZEROS_{VB_{fi}}(s,y)| > L \mathcal{U}(n-s+1) \perp \cdot L \mathcal{U}(n-y) \perp \ge L \mathcal{U}(y-s+1) \perp \cdot \square$$ Now that we have Lemma 5.8, we can prove a two part theorem which, when combined with Lemma 5.7, proves that in our search for an optimal pair of value functions, it is sufficient to consider only value functions consistent with extensions of $VT_{fx}$ and $VB_{fx}$ . If $c_{\gamma} > 1$ , the extensions give all terminals on T(1,q) and B(s,n) "0" or "1" values; if $c_{\gamma} = 1$ , the terminals h and p(h) are the only terminals in these intervals whose values are not fixed. Theorem 5.4-A: Assume the standard state after SCAN-ASSIGN returns "FAIL". Let VT and VB be value functions consistent with VT<sub>0</sub>, VB<sub>0</sub> and p for which no terminal is ?-valued and such that $|ONES_{VI}(C)| = c_1 + c_2 \cdot 1$ . There are extensions, VT<sub>ex</sub> and VB<sub>ex</sub> of VT<sub>fx</sub> and VB<sub>fx</sub> consistent with p and such that: - (i) no terminals are ?-valued - (ii) all ?-valued terminals in T(1,q) under VT<sub>fx</sub> with pairs in B(1,s-1) are 0-valued under VT<sub>ex</sub> - (iii) all ?-valued terminals in B(s,n) under VB<sub>fx</sub> with pairs in T(q+1,m) are 1-valued under VB<sub>ex</sub> (iv) if $c_2 > 1$ , all ?-valued terminals in C under $VT_{fx}$ except p(h) are 1-valued under $VT_{ex}$ (c<sub>2</sub>-1 of them), and $VT_{ex}(p(h)=0$ $$(v) M_{VT_{ex}} + M_{VB_{ex}} \ge M_{VT} + M_{VB}.$$ Proof: Properties (ii) through (iv) define all valued of $VT_{ex}$ and $VB_{ex}$ on T(1,q) and B(s,n) except those of p(h) and h when $c_7 = 1$ . If $c_7 = 1$ , we will begin by letting $VT_{ex}(p(h)) = VB_{ex}(h) = 0$ . This may be changed. The values of terminals in T(q+1,m) and B(1,s-1) whose pairs are in B(s,n) and T(1,q) are also defined by ii and iii. Note that ii, iii, and iv deal with disjoint sets of terminals so no conflicts arise. Figure 5.7 illustrates. We still must specify the values under $VT_{ex}$ and $VB_{ex}$ of terminal pairs with one terminal of each pair in T(q+1,m) and the other in D(1,s-1). For ?-valued terminals under $VT_{fx}$ and $VB_{fx}$ of this type, $VT_{ex}$ and $VB_{ex}$ agree with VT and VB. We prove $M_{VT_{ex}} + M_{VT_{ex}} \ge M_{VT} + M_{VB}$ by comparing the maximum matchings in various intervals and combining. Let $m_{VB}$ be a matching function achieving $M_{VB}$ and $m_{VT}$ be a matching function achieving $M_{VT}$ . I. Consider B(1,s-1). Let $S_1$ be the set of ?-valued terminals in B(1,s-1) under $VB_0$ with pairs in T(1,q) which are 1-valued under VB. The only terminals in B(1,s-1) which are ?-valued under $VB_0$ and 1-valued under $VB_{ex}$ are those which are ?-valued under $VB_{fx}$ with pairs in T(q+1,m). This follows from the fact that any ?-valued terminal under $VB_0$ which is 1-valued under $VB_{fx}$ is in B(s,n), and any ?-valued terminal under $VB_{fx}$ with a pair in T(1,q) is 0-valued under $VB_{ex}$ . Functions $VB_{ex}$ and VB agree on all ?-valued terminals in B(1,s-1) under $VB_0$ with pairs in T(q+1,m). Therefore, every 1-valued terminal in B(1,s-1) under $VB_{ex}$ is 1-valued under $VB_0$ . Each of these terminals which is matched under $VB_0$ can be matched to the same 0-valued terminal under a matching function for $VB_{ex}$ . Let $VB_{ex}$ be a matching function for $VB_{ex}$ which matches "1"s in B(1,s-1) as $VB_0$ does. The matching under $VB_0$ on B(s,n) will be defined later. We have: Figure 5.7: Assignments under Theorem A4-A. |matched "1"s in B(1,s-1) under $m_{VB_{ext}}$ | = |matched "1"s in B(1,s-1) under $m_{VB}$ | - |matched "1"s in $S_1$ under $m_{VB}$ | |matched "1"s in B(1,s-1) under $m_{VB}$ | - |S<sub>1</sub>|. II. Consider T(q+1,m). Let $S_0$ be the set of ?-valued terminals under $VT_0$ in T(q+1,m) with pairs in B(s,n) which are 0-valued under VT. Each 0-valued terminal in T(q+1,m) under $VT_{ex}$ is 0-valued under $VT_0$ or is ?-valued under $VT_0$ and $VT_{fx}$ and is paired with a terminal in B(1,s-1). Therefore, every 0-valued terminal in T(q+1,m) under $VT_{ex}$ is 0-valued under VT. Every such "0" which is matched under $T_{ex}$ are matching function for $T_{ex}$ . We let $T_{ex}$ be such a matching function on T(q+1,m). |matched "0"s in T(q+1,m) under $m_{VT_{ex}}|$ = |matched "0"s in T(q+1,m) under $m_{VT}|$ - |matched "0"s in S<sub>0</sub> under $m_{VT}|$ |matched "0"s in T(q+1,m) under $m_{VT_{qq}} \ge |matched$ "0"s in T(q+1,m) under $m_{VT} - |S_0|$ . III. Consider B(s,n). Since $VB_{ex}(h) = 0$ , Lemma 5.8-B holds. Under $VB_{ex}$ , each 1-valued terminal in B(s,n) can be matched to a 0-valued terminal in B(s,n) while leaving any particular 0-valued terminal unmatched. Complete the matching $m_{VB_{ex}}$ defined on B(1,s-1) in I by such a matching on B(s,n) which leaves h unmatched. We know that: $$|ONES_{VT}(C)| = |ONES_{VT_{ext}}(C)| = c_1 + c_7 - 1$$ and |1-valued terminals in B(s,n) under $VB_{ex}$ which are ?-valued under $VB_0$ and have pairs in T(q+1,m) - = |terminals in B(s,n) which are ?-valued under $VB_n$ and have pairs in T(q+1,m) - = |1-valued terminals in B(s,n) under VB which are ?-valued under VB<sub>0</sub> and have pairs in T(q+1,m)| + |S<sub>0</sub>| Then, |matched "1"s in B(s,n) under $m_{VT_{ex}}| = |ONES_{VB_{ex}}(s,n)| = |ONES_{VB}(s,n)| + |S_0|$ $\geq |matched "1"s in B(s,n) under m_{VR}| + |S_0|.$ IV. Consider T(1,q). We must consider c, = 1 and c, > 1 separately. For $c_i > 1$ , there are other terminals in UNDEF (C) beside h. These terminals must lie in T(k+1,q), since no terminal except h in T(l,k) in 1-valued under $VF_{q_k}$ . Since $k \ge p(h)$ and these terminals are 1-valued under $VF_{q_k}$ . Lemma 5.8-A applies. Under $VF_{q_k}$ every "0" in T(l,q) can be matched to a "1" in T(l,q). Complete the matching $m_{VF_{q_k}}$ defined on $T(q_{q_k}, l, m)$ in H with such a matching on T(l,q). We know that: $$|ZEROS_{VI}(C)| = |ZEROS_{VI}(C)| + c_0 + 1;$$ O-valued terminals in T(1,q) under VI which are 2-valued under VI and have pairs in B(1,s-1) - = [0-valued terminals in T(1,q) under VF which are ?-valued under VF<sub>0</sub> and have pairs in B(1,s-1)] + |S<sub>1</sub>|. Then | matched "0"s in T(1,q) under $m_{VT_{ex}} = |ZEROS_{VT_{ex}}(1,q)| = |ZEROS_{VT_{ex}}(1,q)| + |S_1|$ $\geq |matched "0"s in <math>T(1,q)$ under $m_{VT} + |S_1|$ . Combining I through IV gives: $M_{VT_{ex}} + M_{VB_{ex}} \ge |range(m_{VT_{ex}})| + |range(m_{VB_{ex}})| \ge |range(m_{VT})| + |range(m_{VB})| = M_{VT} + M_{VB}$ For $c_7 = 1$ , since all ?-valued terminals in T(1,q) under $VT_{fx}$ are 0-valued under $VT_{ex}$ . ONES<sub>VT\_{ex}</sub> $(1,q) = ONES_{VT_{fx}} (1,q)$ . Since $OK:1(VT_{fx},1,x)$ holds for all x in T(1,q), all $L!_2qJ$ "1"s in T(1,q) can be matched to "0"s in T(1,q) under $VT_{ex}$ . The definition of $m_{VT_{ex}}$ on T(1,q) is such a matching. |matched "0"s on T(1,q) under $m_{VT_{ext}} = L \frac{1}{2} = |ZFROS_{VT_{ext}}(1,q)| - (F \frac{1}{2} - L \frac{1}{2} - L \frac{1}{2})|$ $\geq |ZEROS_{VT}(1,q)| + |S_1| - (F \frac{1}{2} - L \frac{1}{2} - L \frac{1}{2})|$ $\geq |matched "0"s in T(1,q) under <math>m_{VT}| + |S_1| - (F \frac{1}{2} - L \frac{1}{2} - L \frac{1}{2})|$ (i) If q is even, the above implies, with I through III, that $M_{VT_{ex}} + M_{VB_{ex}} \ge M_{VT} + M_{VB}$ as desired. (ii) If any of the inequalities deduced in I through IV is suffet, we have $$M_{VT_{ex}} + M_{VB_{ex}} > M_{VT} + M_{VB} \cdot (\Gamma / q - \Gamma / q) + Implying$$ $$M_{VT_{ex}} + M_{VB_{ex}} \ge M_{VT} + M_{VB} \text{ as desired.}$$ Assume (i) and (ii) do not hold. If $|S_1| = 0$ , then It follows that under $m_{VT}$ more than half the terminals in T(1,q) are matched "0"s. This can only be true if an element of ZEROS $_{VT}(1,q)$ matches an element of QNPB $_{VT}(q+1,m)$ under $m_{VT}$ . Each 1-valued terminal in T(q+1,m) under VT is 1-valued under $VT_{ex}$ . Also, any 1-valued terminal in T(q+1,m) matched under $m_{VT_{ex}}$ is matched to a "0" in T(q+1,m) and is matched to the same "0" by $m_{VT}$ . Therefore, there must be an unmatched 1-valued terminal under $m_{VT_{ex}}$ in T(q+1,m). We can extend $m_{VT_{ex}}$ so that the unmatched "0" in T(1,q) matches this "1". Then, |matched "0"s in T(1,q) under $m_{VT}$ | = |matched "0"s in T(1,q) under $m_{VT}$ | and $M_{VT}$ + $M_{VB}$ $\geq M_{VT} + M_{VB}$ . We now suppose that $|S_1| > 0$ . We know that |matched "1"s in B(1,s-1) under $m_{VB_{ex}}$ | = |matched "12s in B(1,s-1) under $m_{VB}$ | - |S<sub>1</sub>|. Also |ZEROS<sub>VB</sub>(1,s-1)| = |ZEROS<sub>VB</sub>(1,s-1)| + |S<sub>1</sub>|. Therefore |ZEROS<sub>VB</sub> (1,0-1)|-|matched "1"s in B(1,5-1) tinder m<sub>VB</sub> |= $|ZBROS_{VB}(1,p+1)| + |S_1| - |matched "I's in B(1,p-1) under m_{VB}| + |S_1| \ge 2$ . There must be an unmatched "0" in B(1,s-1) under $m_{VB_{ex}}$ , since no "0" in B(1,s-1) matches a "1" in B(s,n) under this matching function. When $m_{VB_{ex}}$ was defined on B(s,n) in RI, h was left unmatched. Modify $VT_{ex}$ and $VB_{ex}$ so that $VT_{ex}(p(h)) = VB_{ex}(h) = 1$ . On ZEROS<sub>VT</sub> (q+1,m), $m_{VT_{ex}}$ is unchanged. On T(1,q), Lemma 5.8-A now applies and we can define $m_{VT_{ex}}$ so that all D-valued terminals on T(1,q) are matched to 1-valued terminals on T(1,q). |matched "0"s in T(1,q) under $m_{VT_{av}} = L \frac{1}{2} q J$ as before. On ZEROS<sub>VB</sub> (1,s-1), m<sub>VB</sub> is extended so that a previously unimatched "0" is matched to h. On ZEROS<sub>VB</sub> (s,n), m<sub>VB</sub> is unchanged. Therefore $M_{VT_{ex}} + M_{VB_{ex}} \ge |range(m_{VT_{ex}})| + |range(m_{VB_{ex}})| + |range(m_{VT})| + |range(m_{VB})| = M_{VT} + M_{VB}$ . Note that we have changed $VT_{ex}$ and $VB_{ex}$ so that p(h) and h are 1-valued. Now that we have Theorem 5.4-A, we know how to handle "failures" when c,>1. Lemma 5.7, there are value functions VT<sub>f</sub> and VB<sub>f</sub> consistent with p, VT<sub>0</sub> and VB<sub>0</sub> for which पहले हिर्देश हो र लेकार्य संस्कृति हो होता है। से प्राप्त है $|ONES_{V\Gamma}(C)| = c_1 + c_2 - 1$ and which achieve the maximum of $M_{V\Gamma} + M_{VB}$ over all functions consistent with p, VT<sub>0</sub> and VB<sub>0</sub>. Given these functions, we apply Theorem 5.4-A to deduce that there are extensions of VT<sub>fx</sub> and VB<sub>fx</sub> consistent with p and satisfying (i) through (iv) of Theorem 5.4-A which achieve $M_{VT_c} + M_{VB_c}$ . Therefore, for all terminals which are 0-valued or 1-valued under $VT_{fx}$ and $VB_{fx}$ , we can fix their values to be those under VT and VB . For all terminals whose values are dictated by (ii) through (iv) of Theorem 5.4-A, we can fix their values as dictated. We now have new initial value functions under which no terminal in T(1,q) or B(s,n) is ?-valued. We are guaranteed that there are value functions which achieve $M_{VT_f} + M_{VB_f}$ and which are consistent with these new initial value functions. Some terminals within T(q+1,m) and B(1,s-1) may have acquired 0/1 values under the new initial value functions. We can recursively apply the algorithm, beginning with computation of left-regions and right-regions, to find extensions of the new initial value functions which maximize the sum of matches on T(1,m) and B(1,n). The number of ?-valued terminals has decreased, since at least h and p(h) have changed from ?-valued to 0-valued. When $c_1 = 1$ , we first need a modified version of Theorem 5.4-A. When $c_1 = 1$ , Lemma 5.7 only guarantees that there are functions achieving the maximum matching, $M_{V\Gamma_f} + M_{VB_f}$ , with $|ONES_{V\Gamma_f}(C)|$ equal to $c_1$ or $c_1 + 1$ . Theorem 5.4-A can only be applied if $|ONES_{V\Gamma_f}(C)| = c_1$ . Therefore, we prove a modified version, Theorem 5.4-B, for the case that $|ONES_{V\Gamma_f}(C)| = c_1 + 1$ . Theorem 5.4-B: Assume the standard state after SCAN-ASSIGN returns "FAIL". Let $c_1 = 1$ . Let VT and VB be value functions consistent with VT<sub>0</sub>, VB<sub>0</sub> and p for which no terminal is ?-valued and such that $|ONES_{VT}(C)| = c_1 + 1$ . There are extensions, VT<sub>ex</sub> and VB<sub>ex</sub> of VT<sub>fx</sub> and VB<sub>fx</sub> consistent with p and such that: - (i) no terminals are ?-valued. - (ii) all ?-valued terminals in T(1,q) under $VT_{fx}$ with pairs in B(1,s-1) are 0-valued under $VT_{ex}$ , - (iii) all ?-valued terminals in B(s,n) under $VB_{fx}$ with pairs in T(q+1,m) are 1-valued under $VB_{ex}$ , (iv) $$M_{VT_{ex}} + M_{VB_{ex}} \ge M_{VT} + M_{VB}$$ . Proof: We need to modify the proof to Theorem 5.4- $\Lambda$ so that the roles of intervals T(1,q) and B(s,n) are interchanged. Functions $VT_{ex}$ and $VB_{ex}$ are defined as before except that we begin with $VT_{ex}(p(h)) = VB_{ex}(h) = 1$ . I. In B(1,s-1), as for Theorem 5.4-A. II. In T(q+1,m), as for Theorem 5.4-A. III. In B(s,n). Lemma 5.8-B does not hold, but ZEROS<sub>VB<sub>ex</sub></sub>(s,n) = ZEROS<sub>VB<sub>fx</sub></sub>(s,n). Therefore, OK-0(VB<sub>ex</sub>,y,n) holds for all y in B(s,n). All L½(n-s+1)J "0"s in B(s,n) can be matched to "1"s in B(s,n) under VB<sub>ex</sub>. As in the proof of Theorem 5.4-A: $$|ONES_{VB}(s,n)| + |S_0| = |ONES_{VB_{ex}}(s,n)|$$ and $|Imatched|$ "1"s on B(s,n) under IV. Lemma 5.8-A does hold and we can define m<sub>VI</sub> so that p(h) is left unmatched and |matched "0"s on T(1,q) under $m_{VT_{ex}}$ | = |ZEROS<sub>VT\_ex</sub>(1,q)| = $$|ZEROS_{VI}(1,q)| + |S_1| \ge |matched| "0"s in T(1,q) under m_{VI}| + |S_1|$$ We have $M_{VT_{ex}} + M_{VB_{ex}} \ge M_{VT} + M_{VB} - (\Gamma / (n-s+1) - L / (n-s+1) - L / (n-s+1) - L / (n-s+1) = 0$ . If n-s+1 is even or any of the inequalities in I through IV are strict, we have the desired result. Assume not. If $|S_0| = 0$ , then $$L \frac{1}{2}(n-s+1)J + 1 = [matched "1"s on B(s,n) under m_{vn}].$$ There must be a 0-valued terminal in B(1,s-1) which matches a 1-valued terminal in B(s,n) under $m_{VB}$ . This 0-valued terminal under VB is also 0-valued under VB<sub>ex</sub> and unmatched under $m_{VB_{ex}}$ . We can modify $m_{VB_{ex}}$ so that the unmatched "1" in B(s,n) matches this "0", giving $$M_{VT_{ex}} + M_{VB_{ex}} \ge M_{VT} + M_{VB}$$ If $|S_0| > 0$ , we modify $VT_{ex}$ and $VB_{ex}$ . We have: $$|ONES_{VT_{ex}}(q+1,m)|$$ - |matched "0"s in $T(q+1,m)$ under $m_{VT_{ex}}$ = $$|ONES_{VT}(q+1,m)| + |S_0|$$ (|matched "0"s in $T(q+1,m)$ under $m_{VT}| - |S_0| \ge 2$ We conclude that there is an unmatched 1-valued terminal in T(q+1,m) under $m_{VT_{ex}}$ . Let $VT_{ex}(p(h)) = VB_{ex}(h) = 0$ . Now p(h) can match the unmatched 1-valued terminal in T(q+1,m) and [range( $m_{VT_{ex}}$ )] is increased by 1. In B(s,n), Lemma 5.8-B now applies and we still have [matched "1"s in B(s,n) under $$m_{VB_{ac}} = L \frac{1}{2} (n-s+1) J$$ We conclude that $$M_{VT_{ext}} + M_{VB_{ex}} \ge M_{VT} + M_{VB}$$ . Given Theorem 5.4-B in addition to Theorem 5.4-A, we can deduce that there are value functions consistent with $VT_{fx}$ and $VB_{fx}$ and satisfying (ii) and (iii) of Theorem 5.4-A/B which achieve the maximum of $M_{VT} + M_{VB}$ over all functions consistent with p, $VT_0$ and $VB_0$ . However, we do not know what the value of p(h) and h should be. If the terminal pair is assigned "0", there is an extra "0" at the top which may be matched to some terminal in T(q+1,m), but one 0-valued terminal in T(q+1,m) and ?-valued terminals in T(1,q) and B(s,n) under $VT_0$ and $VB_0$ , we cannot use a recursive application of the algorithm. Therefore, the algorithm must try both possibilities for h and p(h). If we were to apply the algorithm recursively for each choice, the running time could be exponential in the number of terminals. Therefore, we use a modification of the procedures we have described so far. The main procedure, which given $VT_0$ and $VB_0$ finds VT and VB which maximize $M_{VT} + M_{VB}$ , is called MAX-MATCH. The modified version is called BETTER-MATCH. When we are trying value "1" for h and p(h), we will recursively apply MAX-MATCH, beginning with the computation of left-regions and right-regions on T(1.m) and D(1.n) for initial value functions which extend VI's and VBs as dictated by properties (ii) and (iii) of Theorem 5.4-A/B and under which p(h) and h are 1-valued. When we are trying value "0" for h and p(h), we use initial valued functions which are the same as for the first choice except that p(h) and h are 0-valued. However, in this case, BETTER-MATCH is used. Procedure BETTER-MATCH does not look for a pair of value functions consistent with the new initial value functions and which maximizes $M_{V\Gamma} + M_{VB}$ over all consistent function pairs. Rather, the procedure looks for a pair of consistent functions which maximizes M<sub>VT</sub> + M<sub>VR</sub> over all consistent function pairs and achieves a better matching than any function pair consistent with the initial value functions when p(h) and h are 1-valued. Consider all function pairs which maximize the sum of matchings on T and B over all functions consistent with p, VT and VB t. If under one of these, h and p(h) are 1-valued, then a function pair achieving a better matching with p(h) and h 0-valued does not exist. Looking only for better matchings allows us to bound the search space of Here his survivience but of the the algorithm so that exponential running time is avoided. Note that we could equally well have chosen to look for maximizing matchings under the "0" choice and better matchings under the "1" choice. Lemma 5.9 is used to bound the search. in the contract was a section of the contract Lemma 5.9 (see Figure 5.8): Assume the standard state after SCAN-ASSIGN returns "FAIL". Let $c_7 = 1$ . Let $VT_{ex0}$ and $VB_{ex0}$ be value functions consistent with $VT_{fx}$ , $VB_{fx}$ , p, and (ii) and (iii) of Theorem 5.4-A/B under which p(h) and h are 0-valued. If under some matching function $m_{VB_{ex0}}$ which achieves $M_{VB_{ex0}}$ , there is an unmatched "0" in B(1,s-1), then there are valued functions $VT_{ex1}$ and $VB_{ex1}$ consistent with all of the above and under which p(h) and h are 1-valued such that $$\mathsf{M}_{\mathsf{VT}_{\texttt{ext}}} + \mathsf{M}_{\mathsf{VB}_{\texttt{ext}}} \geq \mathsf{M}_{\mathsf{VT}_{\texttt{ext}}} + \mathsf{M}_{\mathsf{VB}_{\texttt{ext}}}$$ Proof: Let $VT_{ex1}$ and $VB_{ex1}$ agree with $VT_{ex0}$ and $VB_{ex0}$ everywhere except at p(h) and h, where $VT_{ex1}(p(h)) = VB_{ex1}(h) = 1$ . Define $m_{VT_{ex1}}$ to assign exactly those matches which are assigned by $m_{VT_{ex0}}$ and do not involve p(h), where $m_{VT_{ex0}}$ achieves $M_{VT_{ex0}}$ . Then: $$M_{VT_{ex1}} \ge |range(m_{VT_{ex1}})| \ge |range(m_{VT_{ex0}})| \cdot 1 = M_{VT_{ex0}} \cdot 1$$ Since $VB_{ex0}$ is an extension of $VB_{fx}$ and $VB_{ex0}(h) = 0$ , Lemma 5.8-B applies. Let $m'_{VB_{ex0}}$ be a matching function for $VB_{ex0}$ which agrees with $m_{VB_{ex0}}$ for each range value in B(1,s-1) but matches each "1" in B(s,n) to a "0" in B(s,n) while leaving h unmatched. Any "0" in B(1,s-1) which was unmatched under $m'_{VB_{ex0}}$ is unmatched under $m'_{VB_{ex0}}$ . $$|\operatorname{range}(m'_{VB_{ex0}})| = |\operatorname{matched} "1's \text{ on } B(1,s-1) \text{ under } m_{VB_{ex0}}| + |\operatorname{ONES}_{VB_{ex0}}(s,n)|$$ = $|\operatorname{range}(m_{VB_{ex0}})| = M_{VB_{ex0}}$ Let $m_{VB_{ex1}}$ be a matching function for $VB_{ex1}$ which assigns all matches that $m'_{VB_{ex0}}$ assigns, and, in addition, matches h to an unmatched 0-valued terminal in B(1,s-1). Then $$\begin{aligned} \mathbf{M_{VB}_{ex1}} \geq |\mathrm{range}(\mathbf{m_{VB}_{ex1}})| &= |\mathrm{rangc}(\mathbf{m'_{VB}_{ex0}})| + 1 = \mathbf{M_{VB}_{ex0}} + 1 \text{ and} \\ \mathbf{M_{VT}_{ex1}} + \mathbf{M_{VB}_{ex1}} \geq \mathbf{M_{VT}_{ex0}} + \mathbf{M_{VB}_{ex0}} \end{aligned}$$ Lemma 5.9 allows us to modify the assignment procedures described so far while pursuing the 0-valued choice for p(h) and h. While pursuing this choice, suppose that MAX-MATCH would make an assignment giving a new pair of value functions, either after computing left-regions and right-regions or Figure 5.8: Configuration in proof of Lemma 5.9. If: Then becomes: after processing a return of "FAIL" from a call of SCAN-ASSIGN. Suppose we can deduce that there is a pair of value functions, $VT_{60}$ and $VB_{60}$ , consistent with pairing function p and the new value functions, which maximizes $M_{VT} + M_{VB}$ over all such consistent functions but for which a matching function achieving $M_{VB_{60}}$ leaves a "0" in B(1,s-1) unmatched. Then BETTER-MATCH need not consider any pair of functions consistent with the new value functions. None will induce better matchings than the functions found when pursuing the 1-valued choice for p(h) and h. This true because Lemma 5.9 can be applied to $VT_{60}$ and $VB_{60}$ to give functions yielding as good a sum of matchings under which h and p(h) are 1-valued. If the pair of new value functions which has been rejected is one of two choices after a call on SCAN-ASSIGN which returned "FAIL", then this choice is eliminated. BETTER-MATCH never needs to pursue two choices. If the pair of new value functions is defined by assignment to left-regions and right-regions, then by Theorem 5.2, any pair of value functions maximizing $M_{VT} + M_{VB}$ over value functions consistent with p and the new value functions also maximizes $M_{VT} + M_{VB}$ over value functions consistent with p and the old value functions. Since there is such a function under which a "0" in B(1,s-1) can be left unmatched, no function consistent with the old valued function will produce a sum of matchings better that that produced under the 1-valued choice for h and p(h). Procedure BETTER-MATCH need not pursue functions consistent with the old value functions either. Therefore, BETTER-MATCH returns "(null,null)", indicating that the 0-valued choice for h and p(h) will not yield a larger sum of matchings than the 1-valued choice. For the same reason, BETTER-MATCH returns "(null,null)" if c<sub>4</sub>>1 when SCAN-ASSIGN returns "FAIL". The main procedure, MAX-MATCH, is presented in Figure 5.9. Our original routing problem is solved by calling MAX-MATCH(T(1,m),B(1,n),p,VT<sub>0</sub>,VB<sub>0</sub>), where VT<sub>0</sub> and VB<sub>0</sub> represent the local connections. The modified procedure, BEITER-MATCH, used when processing a 0-valued choice for p(h) and h, is presented in Figure 5.10. Suppose BEITER-MATCH(T(1,m),B(1,n),p,VT<sub>0</sub>,VB<sub>0</sub>) returns Figure 5.9: The main procedure. ``` MAX-MATCH(T(1,m),B(1,n),p,VT_0,VB_0) /Finds extensions of VT_0 and VB_0 which maximize M_{VT} + M_{VB} over all such extenstions/ VT := VT_0 and VB := VB_0 Compute Lvr. LvR. Rvr. and RvR DO WHILE there are any ?-valued terminals in T(1,L_{\rm WT}), T(R_{\rm WT}m), B(1,L_{\rm VB}), and B(R_{\rm WB}m) FOR each ?-valued terminal x in T(1,L<sub>VT</sub>) DO VI(x) := 0 and VII(p(x)) := 0 END Compute R_{VI}, and R_{VR} /assigning "0"s only affects right-regions/ FOR each ?-valued terminal x in T(R<sub>vr</sub>m) DO VF(x):=1 and VF(x):=1 END /assigning "I"s only affects teft-region/ Compute Lyr and Lyr FOR each ?-valued terminal y in B(1,L_{VR}) DO VB(y):=0 and VI(p(y)):=0 END Compute R<sub>VI</sub>, and R<sub>VR</sub> FOR each ?-valued terminal y in B(R<sub>VB</sub>,n) DO VB(y):=1 and VT(p(y)):=1 END Compute Lvr and LvR END IF there are any ?-valued terminals in T(L_{VT}+1,R_{VT}-1) and B(L_{VR}+1,R_{VR}-1) THEN DO m' := R_{VT} \cdot L_{VT} \cdot 1 n' := R_{VR} - L_{VR} - 1 VT' is such that VT'(x) = VT(I_{VT} + x) VB' is such that VB' = VB(L_{VB} + x) p' maintains pairs corresponding to those under p when both terminals of the pair are in T(1,m') and B(1,n'). For other terminals in T(1,m') and B(1,n'), p' assigns "*" (STATUS, VT_r, VB_r) := SCAN-ASSIGN(T(1,m'),B(1,n'),p',VT',VB') IF STATUS = "FAIL k" THEN DO. VT' := VT_{and} VB' := VB_{a} Calculate q, s, h, and c, who is, in provided pickets are de- IF p'(h) \neq k THEN VT'(k): =1 and VB'(p'(k)): =1 FOR all x in T(1,q) other that p(h) mid k DO IF VT'(x) = ? THEN IF p(x) is in f(1,s-1) THEN VT'(x) := 0 and VB'(p'(x)) := 0 ELSE VT'(x) := 1 and VB'(p'(x)) := 1 ``` **END** ``` FOR all y in B(s,n') DO IF VB'(y) = ? and p'(y) is in T(q+1, m') THEN VB'(y):=1 and VF'(p'(y)):=1 END IF c, > 1 THEN DO VB'(h):=0 and VF'(p'(h)):=0 (VT_nVB_n):=(MAX-MATCH(T(1,m'), B(1,n'),p',VT',VB')) END IF c_7 = 1 THEN DO VB'(h):=1 and VT'(p'(h)):=1 (VT_{r1},VB_{r1}):=MAX-MATCH(T(1;m'),B(1;n'),p',VT',VB')) VB'(h) := 0 and VT'(p'(h)) := 0 (VT_{nb},VB_{nb}):=BETTER-MATCH(T(1,m'),B(1,n'),s-1,p',VT',VB')) IF V\Gamma_{r0} = \text{null OR } M_{V\Gamma_{r0}} + M_{VB_{r0}} \le M_{V\Gamma_{r1}} + M_{VB_{r1}} THEN (VT_r, VB_r) := (\widetilde{V}T_{r1}, VB_{r1}) ELSE (VT_r, VB_r) := (VT_{r0}, VB_{r0}) END END FOR each x in T(L_{VT}+1,R_{VT}-1) DO V\Gamma(x) := V\Gamma_{\Gamma}(x-L_{VT}) END FOR each y in B(L_{VB}+1,R_{VB}-1) DO VB(x) := VB_r(x-L_{VR}) END END RETURN(VT, VB) ``` **END MAX-MATCH** Figure 5.10: The modified procedure for 0-valued choices. # BETTER-MATCH(T(1,m),B(1,a),d,p,VT, VB,) /If can guarantee that a pair of extensions of $VT_0$ and $VB_0$ which maximize $M_{VT} + M_{VB}$ over all such pairs of extensions allow an unmatched $\Phi$ -valued terminal in B(1,d) under a maximum matching, then returns (null,null). Otherwise, returns a pair of extensions of $VT_0$ and $VB_0$ . If d < n, it is assumed that B(d+1,n) has no ?-valued terminals under $VB_0$ and that matchings on B(d+1,n) and B(1,d) can be maximized independent of each other./ ``` VT := VT_0 and VB := VB_0 Compute L<sub>VI</sub>, L<sub>VB</sub>, R<sub>VI</sub>, and R<sub>VB</sub> IF there is a right-region in M(I,d) containing exactly one terminal THEN RETURN(null,null) - No. 1940年代1942 DO WHILE there are any ?-valued terminals in T(1,L_{VT}), T(R_{VT},m), B(1,L_{VR}), and B(R_{VR},n) FOR each ?-valued terminal x in T(1,L<sub>vT</sub>) DO IF p(x) is in B(R<sub>VB</sub>,d) THEN RETURN (pull, pull) ELSE VI(x): =0 and VB(p(x)): =0 END Compute R<sub>VI</sub>, and R<sub>VR</sub> IF there is a right-region in B(1,d) containing exactly one terminal THEN RETURN(null,null) FOR each ?-valued terminal x in T(R<sub>VI</sub>,m) DO VI(x) := 1 and VB(p(x)) := 1 END Compute Lyr and Lyn FOR each ?-valued terminal y in B(1,L<sub>VB</sub>) DO VB(y) := 0 and VI'(p(y)) := 0 END Compute R<sub>VP</sub>, and R<sub>VR</sub> IF there is a right-region in B(1,d) containing exactly one terminal THEN RETURN(null,null) FOR each ?-valued terminal y in B(R<sub>vp</sub>,n) DO VB(y) := 1 and VT(p(y)) := 1 END Compute L<sub>VI</sub> and L<sub>VB</sub> IF there are any ?-valued terminals in T(L_{VT}+1,R_{VT}-1) and B(L_{VR}+1,R_{VR}-1) THEN DO m' := R_{Vr} \cdot L_{Vr} \cdot 1 n' := R_{VR} - L_{VR} - 1 VI' is such that VT'(x) = VT(I_{VT} + x) VB' is such that VB'(x) = VB(L_{VB} + x) p' maintains pairs corresponding to those under p when both terminals of the pair are in T(1,m') and B(1,n'). For other terminals in T(1,m') and B(1,n'), p' assigns "*" (STATUS, VT_1, VB_2) := SCAN-ASSIGN(T(1,m'),B(1,n'),p',VT',VB') ``` ``` IF STATUS = "FAIL L" THEN DO VT' := VT_r and VB' := VB_r Calculate q. s. h. and c. IF c, > 1 THEN RETURN (null, null) IF c, = 1 THEN DO FOR all x in T(1,q) DO IF VI'(x) = ? THEN IF p'(x) is in B(1,s-1) THEN VT'(x) := 0 and VB'(p'(x)) := 0 ELSE VI'(x): = 1 and VB'(p'(x)): = 1 /Only k and p'(k) are assigned by the ELSE statement/ END FOR all y in B(s,n') DO IF VB'(y) = ? and p'(y) is in T(q+1,m') THEN VB'(y):=1 and VI'(p'(y)):=1 END (VT_r, VB_r) := (BETTER-MATCH(T(1,m'), B(1,n'), n',p',VT', VB')) IF VT = null THEN RETURN (null, null) END FOR each x in T(L<sub>VT</sub>+1,R<sub>VT</sub>-1).DO VT(x) := VT_r(x-L_{VT}) END FOR each y in B(L_{VB}+1,R_{VB}-1) DO VB(x) := VB_{(x-L_{VB})} END ``` END **END BETTER-MATCH** RETURN(VT, VB) $VT_f$ and $VB_f$ which are not "null". Functions $VT_f$ and $VB_f$ may not maximize $M_{VT} + M_{VB}$ over all value functions consistent with p, $VT_0$ and $VB_0$ . However, if $M_{VT_f} + M_{VB_f}$ is better that the maximum sum of matches for the 1-valued choice, then $M_{VT_f} + M_{VB_f}$ does maximize $M_{VT} + M_{VB}$ . Lemma 5.10 states the property of $M_{VT_f} + M_{VB_f}$ which allows us to make this conclusion. Lemma 5.10: Let $VB_0$ and d be such that if d < n, no terminal in B(d+1,n) is ?-valued, $[ZEROS_{VB_0}(d+1,n)] = L^{1}/2(n-d)J+1$ , and for any 0-valued terminal j in B(d+1,n), each 1-valued terminal in B(d+1,n) can be matched to a 0-valued terminal in B(d+1,n) while leaving j unmatched. If BETTER-MATCH(T(1,m), B(1,n), $d_p$ , $VT_0$ , $VB_0$ ) returns "(null,null)", then there is a pair of value functions consistent with p, $VT_0$ and $VB_0$ maximizing $M_{VT} + M_{VB_0}$ over all such consistent functions and for which there is a matching function achieving the maximum matching on B which does not match all "0"s in B(1,d). If BETTER-MATCH(T(1,m), B(1,n), $d_p$ , $VT_0$ , $VB_0$ ) returns "( $VT_p$ , $VB_p$ )" with $VT_p$ and $VB_p$ not "null", then there are no ?-valued terminals under $VT_p$ and $VB_p$ and if there are $VT_p$ and $VB_p$ consistent with p, $VT_0$ and $VB_0$ for which $M_{VT} + M_{VB} > M_{VT_p} + M_{VB_p}$ , then there are $VT_p$ and $VB_p$ consistent with p, $VT_0$ and $VB_0$ for which $M_{VT_p} + M_{VB_p} > M_{VT_p} + M_{VB_p}$ , then there are $VT_p$ and $VB_p$ for which $M_{VT_p} + M_{VB_p} > M_{VT_p} + M_{VB_p}$ and under which some matching function achieving $M_{VB_p}$ leaves a "0" in B(1,d) unmatched. Proof: By induction on the number of recursive calls to BEITER-MATCH before the call being considered returns. Basis: no recursive calls. By Theorem 5.2, any extensions of $VT_0$ and $VB_0$ defined in the WHILE loop, which assigns within full regions, can be extended to functions maximizing $M_{VT} + M_{VB}$ over all functions consistent with p, $VT_0$ and $VB_0$ . We know that if d < n, d+1 must be the end of a full right-region. Otherwise, we would have: $|ZEROS_{VB_0}(d+1,r(d+1))| < \Gamma \frac{1}{2}(r(d+1)-d)$ , which implies $|ONES_{VB_0}(d+1,r(d+1))| > L \frac{1}{2}(r(d+1)-d)$ . In this case, not all "1"s in B(d+1,n(d+1)) could be matched to "0"s in B(d+1,n), contradicting part of the hypothesis. It follows that $d \ge R_{VB_0}$ . Let VB and VT be equal to VB<sub>0</sub> and VT<sub>0</sub> or extensions of VB<sub>0</sub> and VT<sub>0</sub> defined in the WHILE loop. If under VB there is a right-region containing exactly one terminal, say j, then: $$|ZEROS_{VB}(j,d)| = |ZEROS_{VB}(j+1,d)|+1 \ge \Gamma \frac{1}{2} \left(\frac{d-j}{2} + 1\right) \frac{1}{2} + 1$$ For any extension of VB, there will be a matching function maximizing the matching on B which matches all "1"s in B(d+1,n) to "0"s in B(d+1,n) and which leaves a "0" in B(j,d) unmatched. Therefore, "(null, null)" is seturned correctly when there is a right-region containing exactly one terminal. If under some VT defined as above there is a ?-valued terminal in $T(1,L_{\rm VI})$ with its pair in $B(R_{\rm VB},d)$ , then by Theorem 5.2, there are extensions, $VI_{\rm ex}$ and $VB_{\rm ex}$ , of VB and VT which assign this terminal the value "0" and maximize the sum of matchings on T and B over all extensions of VT and VB. Then: $$|ZEROS_{VB}(R_{VB},d)| \ge \Gamma \frac{1}{2} (d-R_{VB}+1)^{\frac{1}{2}} + 1.$$ There is a matching function achieving $M_{VB}$ which matches all "1"s in B(d+1,n) to "0"s in B(d+1,n) and leaves a "0" in $B(R_{VB},d)$ unmatched. Therefore, "(null,null)" is returned correctly. Let $VT_w$ and $VB_w$ be the final extensions of $VT_0$ and $VB_0$ upon exiting the WHILE loop. If no terminals in $T(L_{VT_w}+1,R_{VT_w}-1)$ and $B(L_{VB_w}+1,R_{VB_w}-1)$ are ?-valued, then $VT_w$ and $VB_w$ maximize $M_{VT}+M_{VB}$ over all functions consistent with p, $VT_0$ and $VB_0$ . There are no functions consistent with p, $VT_0$ and $VB_0$ for which $M_{VT}+M_{VB}>M_{VT_w}+M_{VB_w}$ , and $(VT_w,VB_w)$ is correctly returned. If there are ?-valued terminals, let T(1,m') and B(1,n') be intervals. $T(L_{VT_w}+1,R_{VT_w}-1)$ and $B(L_{VB_w}+1,R_{VB_w}-1)$ when renumbered. Let $VT_w'$ and $VB_w'$ be the restrictions of $VT_w$ and $VB_w$ to these intervals. Let $VT_r$ and $VB_r$ be a pair of valued functions which maximizes the sum of matchings on T(1,m') and B(1,n') over all functions consistent with p', $VT_w'$ and $VB_w'$ . Theorem 5.1 guarantees that the function pair which agrees with $VT_r$ and $VB_r$ on $T(L_{VT_w}+1,R_{VT_w}-1)$ and $B(L_{VB_w}+1,R_{VB_w}-1)$ and agrees with $VT_w$ and $VB_w$ elsewhere maximizes the sum of matchings on T and B over all functions consistent with p, $VT_w$ and $VB_w$ . When SCAN-ASSIGN returns (SUCCESS, $VT_w$ , $VB_w$ ), $VT_w$ and $VB_w$ maximize the sum of matchings on T(1,m') and B(1,n') and do not leave any terminals ?-valued. Therefore, BETTER-MATCH correctly returns functions which maximize $M_{VT} + M_{VB}$ over all functions consistent with p, $VT_0$ and $VB_0$ . If SCAN-ASSIGN returns (FAIL k.VT., VB.) and c, >1, then by Lemma 5.7 and Theorem 5.4-A, there is a terminal s and a pair of functions, VT and VB, which maximizes the sum of matchings on T(1,m') and B(1,n') over all functions consistent with p', VT, and VB, and for which [ZEROS<sub>VB</sub>(s,n')] = 1.14(n's+1).1+1. Any matching function for VB, leaves a "0" in B(s,n') unmatched. The functions on T(l,m) and B(l,n) which agree with VT, and VB, on $T(l_{VT_w}+l_{i}R_{VT_w}-l)$ and $B(l_{VB_w}+l_{i}R_{VB_w}-l)$ and agree with VT, and VB, elsewhere maximize $M_{VT}+M_{VB}$ over all functions consistent with p, VT<sub>0</sub> and VB<sub>0</sub>. Given these functions, there is a maximum matching on B which matches each T1" in $T(R_{VB_w},n)$ to a "0" in $T(R_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invoking Theorem 5.1) and leaves a "0" in $T(l_{VB_w},n)$ (invo If the recursive call to BETTER-MATCH using T(1,m'), B(1,n'), n', p', $VT_{x1}$ and $VB_{x1}$ returns "(null,null)", then by inductive assumption, there is a pair of valued functions which maximizes the sum of matchings on T(1,m') and B(1,n') over all functions consistent with p', $VT_{x1}$ and $VB_{x1}$ and for which a maximum matching of B(1,n') leaves a "0" in B(1,n') immatched. Either this pair of functions also maximizes $M_{VT} + M_{VB}$ over all function pairs consistent with p', $VT_x$ and $VB_x$ or a function pair for which p'(h) and h are 0-valued maximizes $M_{VT} + M_{VB}$ over these functions. In either case, there is a value function pair which maximizes $M_{VT} + M_{VB}$ over all functions consistent with p', $VT_w'$ and $VB_w'$ and for which a "0" in B(s,n') is unmatched under some maximum matching for B(1,n'). As for the case when $c_1 > 1$ in the proof of the basis above, "(null,null)" is correctly returned by the original call to BETTER-MATCH. Suppose the recursive call to BFITER-MATCH returns $VT_r$ and $VB_r$ which are not "null". Those functions are consistent with $VT_w'$ and $VB_w'$ since $VT_{x1}$ and $VB_{x1}$ are consistent with $VT_w'$ and $VB_w'$ . Given any pair of value functions on T(1,m') and B(1,n'), let the expansions of these functions denote those functions on T(1,m) and B(1,n) which agree with the given functions on $T(L_{VT_w}+1,R_{VT_w}-1)$ and $B(L_{VB_w}+1,R_{VB_w}-1)$ and agree with $VT_w$ and $VB_w$ elsewhere. The original call to BEITER-MATCH returns $VT_r$ and $VB_r$ which are the expansions of $VT_r$ and $VB_r$ . Since no terminal is ?-valued under $VT_r$ and $VB_r$ , none is ?-valued under $VT_r$ and $VB_r$ . Suppose there is a pair of functions, VT and $VB_r$ consistent with $VT_w$ and $VT_v$ and $VT_v$ . By Theorem 5.2, there are functions $VT_1$ and $VT_1$ consistent with $VT_v$ and $VT_v$ and $VT_v$ for which $VT_v$ and $VT_v$ . Since pairs $VT_1$ . $VT_v$ and $$M_{VT_1'} + M_{VB_1'} > M_{VT_r} + M_{VB_r}$$ where $VT_1'$ and $VB_1'$ are the restrictions of $VT_1$ and $VB_1$ . By Lemma 5.7 and Theorems 5.4-A and 5.4-B, there are functions, $V\Gamma_2'$ and $VB_2'$ , consistent with p', $V\Gamma_x$ and $VB_x'$ and assigning no "?"s such that: $$M_{VT_{2}'}+M_{VB_{2}'} \ge M_{VT_{1}'}+M_{VB_{1}'}>M_{VT_{1}}+M_{VB_{1}'}$$ Case 1: Suppose $V\Gamma_2'$ and $VB_2'$ are consistent with $V\Gamma_{x0}$ and $VB_{x0}$ . For $VB_2'$ , there is an unmatched "0" under any matching function on B(1,n'). Let $V\Gamma_2$ and $VB_2$ be the expansions of $V\Gamma_2'$ and $VB_2'$ . Then $M_{V\Gamma_2} + M_{VB_2} \ge M_{V\Gamma_1'} + M_{VB_1'}$ , and there is a matching function achieving $M_{VB_2}$ which leaves a "0" in $B(L_{VB_2} + 1, R_{VB_2} - 1) \subseteq B(1,d)$ unmatched. Functions $V\Gamma_2$ and $VB_2$ are the desired $V\Gamma_2'$ and $VB_2'$ . Case 2: Suppose VT'<sub>2</sub> and VB'<sub>2</sub> are consistent with VB<sub>x1</sub> and VB<sub>x1</sub>. By inductive assumption, there are functions VT'<sub>g</sub> and VB'<sub>g</sub> consistent with p', VB<sub>x1</sub> and VB<sub>x1</sub> for which $M_{VT'_2} + M_{VB'_3} \ge M_{VT'_2} + M_{VB'_2}$ and under which some matching function achieving $M_{VB'_3}$ leaves a "0" in B(1,n') unmatched. Since VB<sub>x1</sub> and VB<sub>x1</sub> are extensions of VT'<sub>w</sub> and VB'<sub>w</sub>, VT'<sub>g</sub> and VB'<sub>g</sub> are consistent with VT'<sub>w</sub> and VB'<sub>w</sub>. Let VT<sub>g</sub> and VB<sub>g</sub> be the expansions of VT'<sub>g</sub> and VB'<sub>g</sub>. They are consistent with VT<sub>0</sub> and VB<sub>0</sub>. We have: $$M_{VT_g} + M_{VB_g} \ge M_{VT_2} + M_{VB_2} \ge M_{VT_1} + M_{VB_1}$$ Under $VB_g$ , there is a matching function achieving $M_{VB_g}$ which leaves a "0" in $B(E_{VB_g} + 1, R_{VB_g} - 1)$ , a subset of B(1,d), unmatched. Thus, $VT_g$ and $VB_g$ are the desire functions. Lemma 5.10 completes the technical development needed to verify the correctness of the algorithm. Theorem 5.5 states the correctness of the main procedure, MAX-MATCH. Theorem 5.5: Let $VT_0$ and $VB_0$ be value functions consistent with pairing function p and such that the only ?-valued terminals are members of top-bottom pairs. Then MAX-MATCH(T(1,m),B(1,m),p. $VT_0$ , $VB_0$ ) returns functions $VT_f$ and $VB_f$ consistent with p, $VT_0$ and $VB_0$ which maximize $M_{VT} + M_{VB}$ over all such consistent functions. Proof: The correctness follows from the development presented in this chapter. The formal argument is by induction on the number of recursive calls; it is similar to that used in proving Lemma 5.10 except that verifying the correct return of "(null,null)" is not necessary. We only present the argument used when $c_7 = 1$ after a call to SCAN-ASSIGN. It is under this condition that BETTER-MATCH is called by MAX-MATCH and Lemma 5.10 is needed in the proof. Assume the standard state after the call to SCAN-ASSIGN in MAX-MATCH returns "FAIL". Let $VT_x$ and $VB_x$ be the extensions of the functions returned by SCAN-ASSIGN under which all terminals in T(1,q) and B(s,n') except h and p'(h) are 0-valued or 1-valued. Let $VT_{x0}$ and $VB_{x0}$ extend $VT_x$ and $VB_x$ so that p'(h) and h are 0-valued, and $VT_{x1}$ and $VB_{x1}$ extend them so that p'(h) and h are 1-valued. By inductive assumption, the functions $VT_{r1}$ and $VB_{r1}$ returned by MAX-MATCH(T(1,m'), $B(1,n'),p',VT_{x1},VB_{x1}$ ) maximize $M_{VT}+M_{VB}$ over all functions consistent with p', $VT_{x1}$ and $VB_{x1}$ . Suppose BETTER-MATCH(T(1,m'),B(1,n'),s-1,p',VT<sub>x0</sub>,VB<sub>x0</sub>) returns "(null,null)". Then there is a pair of functions $VT_{ex0}$ and $VB_{ex0}$ consistent with $VT_{x0}$ and $VB_{x0}$ maximizing $M_{VT} + M_{VB}$ over all functions consistent with p', $VT_{x0}$ and $VB_{K0}$ for which there is a matching function achieving $M_{VB_{ex0}}$ which does not match all "0"s in B(1,x-1). By Lemma 5.9, there are value functions $VT_{ex1}$ and $VB_{ex1}$ consistent with p', $VT_{x1}$ and $VB_{x1}$ such that $M_{VT_{ex1}} + M_{VB_{ex1}} \ge M_{VT_{ex0}} + M_{VB_{ex0}}$ . Then: $$\mathbf{M}_{\mathsf{VT}_{r1}} + \mathbf{M}_{\mathsf{VB}_{r1}} \geq \mathbf{M}_{\mathsf{VT}_{\mathsf{ex1}}} + \mathbf{M}_{\mathsf{VB}_{\mathsf{ex1}}} \geq \mathbf{M}_{\mathsf{VT}_{\mathsf{ex0}}} + \mathbf{M}_{\mathsf{VB}_{\mathsf{ex0}}},$$ and $VT_{r1}$ and $VB_{r1}$ maximize $M_{VT} + M_{VB}$ over all functions consistent with p', $VT_x$ and $VB_x$ . The expansions of $VT_{r1}$ and $VB_{r1}$ maximize $M_{VT} + M_{VB}$ over all functions consistent with p, $VT_0$ and $VB_0$ . Suppose BETTER-MATCH(T(1,m'),B(1,n'),s-1,p',VT<sub>x0</sub>,VB<sub>x0</sub>) returns VT<sub>r0</sub> and VB<sub>r0</sub> which are not "null". Suppose VT<sub>r0</sub> and VB<sub>r0</sub> do not maximize $M_{VT} + M_{VB}$ over all functions consistent with p', VT<sub>x0</sub> and VB<sub>x0</sub>. Let VT<sub>ex0</sub> and VB<sub>ex0</sub> be value functions consistent with p', VT<sub>x0</sub> and VB<sub>x0</sub> which do maximize $M_{VT} + M_{VB}$ over all such functions. Then, $M_{VT} + M_{VB} > M_{VT} + M_{VB}$ . By Lemma 5.10, there are functions VT<sub>g</sub> and VB<sub>g</sub> consistent with p', VT<sub>x0</sub> and VB<sub>x0</sub> for which $M_{VT} + M_{VB}$ . $\geq M_{VT} + M_{VB} = M$ unmatched. By Lemma 5.9, there are value functions $VT_{g1}$ and $VB_{g1}$ consistent with p', $VT_{x1}$ and $VB_{x1}$ such that: $$\begin{split} & M_{VT_{g1}} + M_{VB_{g1}} \geq M_{VT_g} + M_{VB_g} \geq M_{VT_{ex0}} + M_{VB_{ex0}} & \text{Therefore,} \\ & M_{VT_{r1}} + M_{VB_{r1}} \geq M_{VT_{g1}} + M_{VB_{g1}} \geq M_{VT_{ex0}} + M_{VB_{ex0}} > M_{VT_{r0}} + M_{VB_{r0}}. \end{split}$$ Functions $VT_{r1}$ and $VB_{r1}$ maximize $M_{VT} + M_{VB}$ over all functions consistent with p', $VT_x$ and $VB_x$ and are correctly expanded and returned by MAX-MATCH. If $VT_{r0}$ and $VB_{r0}$ do maximize $M_{VT} + M_{VB}$ over all functions consistent with p', $VT_{x1}$ and $VB_{x1}$ , then the maximum of $M_{VT_{r0}} + M_{VB_{r0}}$ and $M_{VT_{r1}} + M_{VB_{r1}}$ maximizes $M_{VT} + M_{VB}$ over all functions consistent with p', $VT_x$ and $VB_x$ . The pair of functions achieving the maximum (with ties broken in favor of $VT_{r1}$ and $VB_{r1}$ ) are correctly expanded and returned by MAX-MATCH. ### 5.6 Running Time of the Algorithm We will find an upper bound on the running time of the algorithm without making assumptions about the details of the data structures used to implement the algorithm. We will assume that each of the following takes one step: addition to a counter, testing the value of a scalar variable, and assignment to a scalar variable. Assignment to a function variable is assumed to be assignment to one scalar variable for each domain value of the function; this allows a function to be represented as an array. The number of steps taken by the algorithm will be measured as a function of the number of terminals (m + n) and the number of ?-valued terminal pairs uniter the initial value functions. Recall that the algorithm assumes that all ?-valued terminals are members of top-bottom pairs. Let u dengte the number of ?-valued pairs. Procedure SCAN-ASSION does not use any other procedures, including recursive calls to itself. The second secon <sup>1.</sup> This follows the unit cost model for random access machines of Abo, Hoperoft, and Ullman [Ah74]. The logarithmic cost model, which takes into account the number of bits required to represent a number, would multiply any running time by log\_(m+n). The procedure considers each terminal in T at most once. For each ?-valued terminal it considers, it must test the sizes of acts of 0-valued terminals and 1-valued terminals and make an assignment when possible. If the sets are recomputed for each ?-valued terminal considered, the number of steps used to process one ?-valued terminal is at most k(m+n) for some constant k. The stopping test of the first FOR loop must also be computed, but only when an assignment has been made. This test also requires computing the size of a set of terminals. Let SC(m,n,u,a) denote the maximum number of steps taken by SCAN-ASSIGN on any m top terminals and n bostom terminals with u ?-valued terminal pairs when a ?-valued terminals pairs are assigned by SCAN-ASSIGN. Then $SC(m,n,u,a_s) \le k_1(a_s+1)(m+n)$ , for some constant $k_1$ . If $a_s < u$ , SCAN-ASSIGN returns "FAIL" and $(a_s + 1)$ is the number of ?-valued terminal pairs considered. Let SC(m,n,u) denote the maximum number of steps taken by SCAN-ASSIGN on any m top terminals and n bottom terminals with u ?-valued terminal pairs. $SC(m,n,u) \le maximum \text{ over all } a_i \le u \text{ of } SC(m,n,u,a_i) \le k_1(u+1)(m+n) = O(u(m+n)).$ Procedure BETTER-MATCH may use procedure SCAN-ASSIGN and a recursive call to itself. First BETTER-MATCH computes the full left-regions and right-regions and assigns to ?-valued terminals within them. This is done in the WHILE loop. After assigning within any of $(1,L_{VT})$ , $(R_{VT}m)$ , $(1,L_{VB})$ , and $(R_{VB}n)$ , some regions must be recomputed. This takes O(m+n) steps. Once regions are computed, testing for the conditions under which "(null-null)" is returned takes only a constant number of steps. Suppose $a_{w}$ initially ?-valued terminals are assigned $a_{v}$ values in the WHILE loop. Then its execution, including the initial computation before entering it, takes at most $a_{v}$ and $a_{v}$ in the part of steps. After the WHILE loop is completed, if any 7-valued pairs remain, SCAN-ASSIGN is called. Preparation for calling SCAN-ASSIGN takes O(m'+n') steps to assign to VT', VB', and p'. If SCAN-ASSIGN returns "SUCCESS", then merging the functions that SCAN-ASSIGN returns with the functions on the full regions to create the functions returned by BETTER-MATCH takes O(m'+n') steps. If SCAN-ASSIGN returns "FAIL", it takes O(m'+n') steps to assign to VT and VB' and calculate q, s, h, and c<sub>1</sub>. If c<sub>2</sub>=1, BETTER-MATCH is called recursively, returning functions or "null"s. If the recursive call returns functions, then the processing required to return their expansions takes O(m'+n') steps. Let B(m,n,u) be the maximum number of steps taken by any call to BETTER-MATCH with m top terminals, n bottom terminals, and u initially ?-valued pairs. Such a call to BETTER-MATCH under which a initially ?-valued pairs are assigned in the WHILE loop and a pairs are assigned by SCAN-ASSIGN takes at most $k_2(a_w+1)(m+n) + SC(m',n',u-a_w,a_s) + B(m',n',u-a_w-a_s-1) + k_3(m'+n')$ steps for some constant $k_3$ , where B(m,n,-1)=0 by definition for any m and n. Using the bound for SC gives: $\leq k_2(a_w+1)(m+n)+k_1(a_s+1)(m'+n')+k_3(m'+n')+10(m',n',u-a_w-a_s-1) steps.$ We know $1\leq m'\leq m$ and $1\leq n'\leq n$ . For $k_4=k_1+k_2+k_3$ , the number of steps is bounded above by $k_4(a_w+a_s+1)(m+n)+10(m',n',u-a_w-a_s-1).$ expression 1 Then $B(m,n,u) \le (maximum over all m' \le m, n' \le n, and 0 \le a_+ + a_- \le u \text{ of expression 1})$ Order all triples (m,n,u), for $m \ge 1$ , $n \ge 1$ , and $u \ge 0$ , lexicographically. We prove by induction on this ordering that for all such triples, $B(m,n,u) \le k_s(u+1)(m+n)$ , for some constant $k_s$ . The basis, (1,1,0), is trivial. For any triple, (m,n,0), $B(m,n,0) \le k_s(m+n)$ , so $k_s \ge k_s$ suffices. Let the proposition be true for any triple lexicographically smaller that (m,n,u) where $u \ge 0$ . We know that $1 \le m' \le m$ and $1 \le n' \le n$ whenever BETTER-MATCH is recursively called. When $a_w + a_s \le u$ , $0 \le u - a_w - a_s - 1 \le u - 1$ , and we can use the inductive assumption. When $a_w + a_s = u$ , $u - a_w - a_s - 1 = -1$ but $k_s(u - a_w - a_s)(m' + n') = 0$ is still an upper bound on the contribution of the recursive call to BETTER-MATCH, since no recursive call is made. Therefore: expression $$1 \le k_4(a_w + a_x + 1)(m + n) + k_5(u - a_w - a_x)(m' + n')$$ expression $$1 \le k_5(a_w + a_s + 1)(m+n) + k_5(u-a_w-a_s)(m+n)$$ using $k_5 \ge k_4$ and $m' + n' \ge m+n$ $\le k_5(n+1)(m+n)$ and $B(m,n,u) \le k_q(u+1)(m+n) = O(u(m+n))$ . We now turn to MAX-MATCH. The bound on the number of steps used in the WHILE loop is of the same order as that for BETTER-MATCH, by the same argument. Procedure MAX-MATCH does not do the testing to return "(null,null)" which BETTER-MATCH does, but this only adds a constant per assignment to the number of steps used by BETTER-MATCH, and only affects the constant for MAX-MATCH's bound. The pre-processing for SCAN-ASSIGN and the post-processing when SCAN-ASSIGN returns "SUCCESS" have the same bound on the number of steps used as that for BETTER-MATCH. However, if SCAN-ASSIGN returns "FAIL", a recursive call to MAX-MATCH and possibly a call to BETTER-MATCH are used. The pre-processing and post-processing for these calls take O(m'+n') steps. Let M(m,n,u) be the maximum number of steps used by any call to MAX-MATCH with m top terminals, a bottom terminals, and u initially ?-valued pairs. Then for constants $k_6$ and $k_7$ $$M(m,n,u) \le \max_{i=1}^{n} m' \le m, n' \le n, \text{ and } 0 \le a_w + a_s \le u \text{ of}$$ $$k_0(a_w + 1)(m + n) + S(m',n',u \cdot a_w,a_s) + B(m',n',u \cdot a_w \cdot a_s \cdot 1) + M(m',n',u \cdot a_w \cdot a_s \cdot 1) + k_1(m' + n') = \exp_{a_w} (m' (m'$$ We now prove by induction on the ordering of triples that $$M(m,n,u) \le k_g(u+1)^2(m+n)$$ for some constant $k_g$ . The basis and cases where u = 0 are straightforward if $k_8 \ge k_6 + k_7$ . Suppose the proposition is true for all triples lexicographically less than (m,n,u) where u > 0. Then expression $$2 \le k_6(a_w + 1)(m + n) + k_1(a_g + 1)(m' + n') + k_5(u - a_w - a_g)(m' + n') + k_g(u - a_w - a_g)^2(m' + n') + k_7(m' + n')$$ $$\le k_9(u + 1)(m + n) + k_8(u - a_w - a_g)^2(m + n) \qquad \text{for } k_9 = k_6 + k_1 + k_5 + k_7$$ Letting $k_g \geq k_g$ , we have expression $$2 \le k_n(u^2+u+1)(m+n) \le k_n(u+1)^2(m+n)$$ and $$M(m,n,u) \le k_8(u+1)^2(m+n) = O(u^2(m+n)) = O((m+n)^3)$$ We have shown that our algorithm runs in a number of steps at worst proportional to the cube of the number of terminals. and the first of the first of the second second and the first of the section ### 5.7 Summary In this chapter, we have prescrited an algorithm which routes the connections between pairs of terminals located on the outside of a rectangular component. The routing assumes horizontal and vertical wires are on separate layers and uses minimum area. The problem is reduces to an assignment problem on vectors. Procedure MAX-MATCH solves the problem on vectors using procedures BETTER-MATCH and SCAN-ASSIGN. Procedures SCAN-ASSIGN and BETTER-MATCH have $O((m+n)^2)$ running time and MAX-MATCH has $O((m+n)^3)$ running time. Procedure MAX-MATCH is actually used twice in routing terminals on a rectangle—once for top-bottom connections and once for left-right connections. Before using MAX-MATCH, local connections are assigned directions in a predetermined manner. Assigning the local connections uses O(t) computation steps, where t is the number of terminals on the rectangle. Each call to MAX-MATCH uses $O(t^3)$ steps. Therefore, given a rectangular component with t terminals around its boundary, the problem presented in Section 5.1 can be solved in polynomial time $O(t^3)$ . Paragraph Control of the Control of the Control of the Control of the Control of the Control of the Control of enakan kalendari Kabupatèn The way to the first with the first of f ## Appendix to Chapter 5: Notation The notation used in Chapter 5 is listed here, with definitions, in order of appearance. ràsti personalia, e Contraction of the contraction of the and the state of t T(1,m) denotes top terminals 1 through m. B(1,n) denotes bottom terminals 1 through n. VT: $T(1,m) \rightarrow \{0,1,?\}$ and VB: $B(1,n) \rightarrow \{0,1,?\}$ indicate the directions of connections from top and bottom terminals, respectively, as follows: VT / VB (i) = 0 if the direction of the path from terminal i to its pair is to the left - 1 if it is to the right - ? if it is undetermined p:T(1,m)UB(1,n) $\rightarrow$ T(1,m)UB(1,n)U{\*} is the pairing function indicating what terminals should be connected: For the following definitions, there are analogous definitions for B and VB: $m_{VT}$ : $T \to T$ and is the function matching terminals in T(1,m) of value "0" to terminals with higher index in T(1,m) and of value "1", given value function VT. If $m_{VT}(i) = j$ , then VT(i) = 0, VT(j) = 1, and i < j. in the more and it is the highest the best of the second o er all a salar and the first fire the commence of the salar $M_{VT}$ is the maximum over all matching functions, $m_{VT}$ , for a value function VT, of [range( $m_{VT}$ )]. ZEROS<sub>VI</sub>(S) = { $i \in S|VT(i) = 0$ } for a value function VT and S $\subseteq T$ . ONES<sub>VI</sub>(S) = $\{i \in S | VT(i) = 1\}$ for a value function VT and S $\subseteq T$ . UNDET<sub>VT</sub>(S) = { $i \in S|VT(i) = ?$ } for a value function VT and SCT. ONES<sub>VI</sub>(x,y) is simplified notation for ONES<sub>VI</sub>( $\Gamma(x,y)$ ), etc. OK-1(VT,x,y) (similarly OK-0) is true if and only if $|ONES_{v1}(x,y)| \le LV_0(y-x+1)J$ . Full-1(VT,x,y) (similarly Full-0) is true if and only if $|ONES_{VF}(x,y)| \ge \Gamma \frac{1}{2} (y-x+1) \gamma$ . $l_{VT}:T \to T$ and $r_{VT}:T \to T$ are used to define left-regions and right-regions on T under a given value function VT: $$l_{VT}(1) = 1$$ $l_{VT}(i) = i$ if Full-1(VT, $l_{VT}(i-1)$ ,i-1) $l_{VT}(i-1)$ otherwise, for $i > 1$ $r_{VT}(m) = m$ $$r_{VI}(m) = m$$ $r_{VI}(j) = j$ if Full-0(VT,i+1, $r_{VI}$ (i+1)) $r_{VI}$ (i+1) otherwise, for j < m Left-regions are defined as the equivalence classes induced by the equivalence relation on T(1,m) under which two terminals i and j are equivalent if and only if $l_{VT}(i) = l_{VT}(j)$ . Right-regions are the equivalence classes induced by the equivalence relation using rvr. $L_{VT}$ is defined to equal m if Full-1(VT, $l_{VT}$ (m),m) and to equal $l_{VT}$ (m)-1 otherwise. (1, $L_{VT}$ ) are the full left-regions of T under VT. $R_{VT}$ is defined to equal 1 if Full-0(VT,1, $r_{VT}$ (1)) and to equal $r_{VT}$ (1)+1 otherwise. ( $R_{VT}$ m) are the full A the Paris June revention of the contract of the contract of right-regions of Tunder VT. The following definitions are made after SCAN-ASSIGN returns "(FAIL k, VT<sub>k-1</sub>, VB<sub>k-1</sub>)": The second distribution of the second of aligned and the separate of the parties of the con- and the second of o q is the smallest $$i \ge k$$ such that $|ONES_{VT_{k-1}}(1,i)| = L1/kiJ$ . s is the smallest j satisfying the following three properties: han liv anappantinato, kuri jiriyalakiya i kisi i kaliki (i) $$p(k) \ge j$$ ; (ii) $$|ZEROS_{VB_{k-1}}(j,n)| = L \frac{1}{2}(n-j+1) J;$$ (iii) each terminal in T(1,k-1) which has been assigned the value "1" by SCAN-ASSIGN is paired with a terminal in B(j,n). in the structure of the relation to the complete of the complete of C is the set of initially ?-valued terminals in T(1,q) whose pairs are in B(s,n). $$c_1 = |ONES_{VT_{k-1}}(C)|$$ $$c_0 = |ZEROS_{VT_{k-1}}(C)|$$ $$c_{?} = |UNDET_{VT_{k-1}}(C)|.$$ h is defined as follows: Let i be the terminal of smallest index in B(s,n) which is ?-valued under $VB_0$ and 1-valued under $VB_{k-1}$ . Certainly, $p(i) \le k$ . If $i \le p(k)$ , then h=i; otherwise, h=p(k). $V\Gamma_{fx}$ and $VB_{fx}$ are defined as follows: If h=p(k), then $V\Gamma_{fx}=V\Gamma_{k-1}$ and $VB_{fx}=VB_{k-1}$ . If $h\neq p(k)$ , then $V\Gamma_{fx}$ and $VB_{fx}$ agree with $V\Gamma_{k-1}$ and $VB_{k-1}$ except at h, k, p(h), and p(k), where: $$\begin{split} VT_{fx}(k) &= 1 & VT_{fx}(p(h)) &= ? \\ VB_{fx}(p(k)) &= 1 & VB_{fx}(h) &= ?. \end{split}$$ #### Chapter 6 Discussion of the Algorithm ### **6.1 Removing Assumptions** In this chapter we make some observations about the algorithm we have just presented. The algorithm is of the channel routing variety. The streets to be used by each connection path are chosen by the algorithm. Only one choice is considered for local connections. For top-bottom and left-right connections, the procedure MAX-MATCH makes the decision. Three assumptions are crucial to the working of the algorithm: (1) only pairs of terminals need to be connected; (2) there are only four routing areas—one parallel to each side of the rectangle; (3) the segments from each terminal to the routing area (perpendicular to the routing direction in this area) cannot conflict regardless of their lengths. We do not necessarily need to have terminals around the outside of one rectangle as long as the above assumptions are satisfied. In fact, terminals may also lie along a rectangular boundary circumscribing the rectangular component, i.e. on the opposite side of the routing area, as long as these terminals can be projected on the rectangle to obtain an instance of the original problem (see Figure 6.1.). This configuration might be found when connecting terminals of a functional component or set of functional components to bonding pads. The bonding pads lie along the outside edges of the chip, and the routing regions lie between these pads and the rest of the integrated circuit. Minimizing the area used for the interconnections minimizes the size of the chip. When the third assumption above is removed, we have already seen in Chapter 4 that the resulting routing problem is NP-complete even for one routing channel with terminals along its side. When either of assumptions (1) or (2) is removed, our division into local connections and opposite side connections no longer limits the choice of directions for each connection. Consider the problem for one rectangular component when it may be necessary to interconnect three or more terminals. When three terminals are involved, there are three choices instead of two for the type of path used to interconnect the Figure 6.1: Alternate configuration of terminals for our algorithm. indicates a terminal between adjacent terminals as the boundary is traversed. Any path which follows two adjacent pieces of the boundary can be used to connect the terminals. When the terminals are on three different sides, two of the possible path types use more sides that the third. However, we cannot eliminate these possibilities. It is not true that the third choice is always as good as the other two. Figure 6.2-A gives a counterexample. Connecting these terminals involves three sides of the rectangle. Therefore, we have lost the independence of the two dimensions which we had when routing tup-bottom and feft-right connections. If three terminals are only on two sides, even the path type using all four sides cannot be eliminated. This is shown in Figure 6.2-B. It is an open problem whether the one component routing problem can be solved in polynomial time when sets of three or more terminals need to be interconnected. Consider retaining the restriction that only pairs of terminals are interconnected but allowing terminals to lie on one of two rectangular components which are placed side by side. Ignore for the moment any connection which involves terminals which lie on the boundaries of the street between the two rectangles. The resulting routing problem is similar to the one component routing problem except that paths can take a "short cut" through the street between the two rectangles. However, this option adds many choices of paths to be used to interconnect terminals. The top and bottom sides are no longer independent of the left and right sides. Figure 6.3 illustrates the difficulties presented by the short cut. The complexity of this routing problem is also open. ### 6.2 A Special Case for Procedure MAX-MATCH The procedure MAX-MATCH, with its sub-procedures, solves the optimization problem defined in Section 5.2. We now prove that when all terminals are 7-valued under the initial value functions, there is always an assignment of "0"s and "1"s such that the sum of maximum matchings is Figure 6.2: The One Component Routing Problem with more than two terminals in nets. Terminals with the same number should be connected. A: Three terminals on three sides -- three choices. B: Three terminals on two sides - three choices. Figure 6.3: Extending the routing problem of Chapter 5 to two components. within one of the upper bound obtained when the specific pairing of top terminals to bottom terminals is disregarded. We always assume that all ?-valued terminals are members of top-bottom pairs under p. When all terminals are ?-valued under the initial value functions, the number of top terminals, m, must equal the number of bottom terminals, n. Lemma 6.1 Let $V\Gamma_0$ and $VB_0$ assign "?"s to all terminals in T and B. Then MAX-MATCH(T(1,m),B(1,m),p,V $\Gamma_0$ ,VB0) returns $V\Gamma_f$ and $VB_f$ such that $M_{V\Gamma_f} + M_{VB_f} \ge m-1$ . Proof: Initially, there are no full regions and SCAN-ASSIGN is called. If SCAN-ASSIGN returns (SUCCESS, $V\Gamma_p$ , $VB_p$ ), then $\Gamma$ 'am' 1-terminals are 0-valued in each of T and B. We know OK-0( $VB_p$ , 1,m) holds. Therefore, m is even. The matchings give $M_{VP} \pm M_{VB} = 1/4m + 1/4m = m$ . Suppose SCAN-ASSIGN returns (FAII: $k_i V T_i$ , $V D_i$ ). Let $q_i$ , $s_i$ $h_i$ and $c_i$ be as defined in Chapter 5. All terminals in $T(k_i m)$ are 2-valued under $V T_i$ . Therefore, $q = k_i$ , $c_i = 1$ , and $k_i$ is odd. We know: Combining (a) and (b) gives: Lik(m-s+1) $J = \frac{1}{2}(m-s) = \frac{1}{2}(k-1)$ and $\frac{1}{2}EROS_{VB_r}(s,m) = \frac{1}{2}EROS_{VT_r}(1,k-1)$ . Either $VB_r(s) = 1$ , or s = p(k). Otherwise, $|ONES_{VB_r}(s+1,m)| = |ONES_{VB_r}(s,m)| = \frac{1}{2}(m-s)$ and B(s+1,m) contains p(k) and the pairs of all "1"s in T(1,q). Therefore, we would have chosen s+1 rather than s. In either of the two possible cases, s = h. Let $VT_{x1}$ and $VB_{x1}$ be the value functions defined when the "1" value is tried for h. We know $|ZEROS_{VB_{x1}}(s+1,m)| = \frac{1}{2}(m-s)$ and $OK-O(VB_{x1},x,m)$ holds for all x in B(s,m). Therefore, B(s+1,m) is a set of full right-regions with no ?-valued terminals. All $\frac{1}{2}(m-s)$ "0"s in $\frac{1}{2}(s+1,m)$ can be matched. Unless s=1 or s=2, interval $\frac{1}{2}(1,s)$ contains one left-region and one right-region, neither of which are full. Since $\frac{1}{2}(n-s) = \frac{1}{2}(n-s)$ and $\frac{1}{2}(n-s) = \frac{1}{2}(n-s)$ holds for all x in $\frac{1}{2}(1,k)$ is a set of full right-regions in which $\frac{1}{2}(k-1)$ "0"s can be matched to "1"s in $\frac{1}{2}(1,k)$ . When s=1, in which case k=m, we have $$M_{VT_{x1}} + M_{VR_{x1}} = \frac{1}{2}(m-1) + \frac{1}{2}(m-1) = m-1$$ and the desired result follows. When s > 1, T(k+1,m) contains one left-region and one right-region, neither of which are full. If s = 2, B(1,2) is a full left-region and B(1) is ?-valued under $VB_{x1}$ . Terminals B(1) and p(B(1)) = T(m) are assigned the value "0" by the recursive call to MAX-MATCH. Terminal B(1) can match B(2), but T(m) is unmatched. The resulting maximum matchings sum to $\frac{1}{2}(m-2) + 1 + \frac{1}{2}(m-2) = m-1$ as desired. If s > 2, let T(k,m) be the renumbering of T(k+1,m). Procedure SCAN-ASSIGN is called for T(1,m') and B(1,s) using the appropriate restrictions of $VT_{x1}$ and $VB_{x1}$ . If the second call to SCAN-ASSIGN returns (SUCCESS, VI'<sub>r1</sub>, VB<sub>r1</sub>), then $|ZEROS_{VB_{r1}}(1,s)| = |ZEROS_{VT_{r1}}(1,m')| = \Gamma \% m' 7 \text{ and } M_{VT_{r1}} + M_{VB_{r1}} = L \% m' 7 = m'.$ Combining this with the matchings on the full regions gives $M_{VT_{fl}} + M_{VB_{fl}} = m' + k(m-s) + k(k-1) = m-k + k-1 = m-1$ where $VT_{fl}$ and $VB_{fl}$ are the expansions of $VT_{rl}$ and $VB_{rl}$ which agree with $VT_{xl}$ and $VB_{xl}$ on $T_{xl}$ and If the second call to SCAN-ASSION returns (FAIL k'/VI', VB'), let q', s', and h' be the special terminals for this call. Under the initial value functions for this call to SCAN-ASSIGN, all terminals in T(1,m') and all terminals in B(1,s-1) are ?-valued and s is 1-valued. Therefore, q' = k', k' is odd, and $$|ONES_{VB}'(s',s-1)| = |ONES_{VI}'(1,k'-1)| = \frac{1}{2} \frac{1}{2$$ VR, is mil. Six bound and Man n which need not be considered e restange. ### (3 The Un of Layers #### CONTRACTOR OF THE PARTY The algorithm presented in Chapter 5 minutes is existing attents in which only horizontal and vertical wire arguments are used and for which harizontal and vertical arguments are on different layers. The use of only vertical and horizontal wire arguments for routing arrived a certangle is commonable, since the paths will follow the boundary of the rectangle. The assumption that vertical and horizontal wires are Figure 6.4: The One Component Routing Problem with no local connections. Terminals labeled with the same number should be connected. 3.701他 m 计预记息 connections county interests. Substituting the substitution of two plans of the paper pa is actually achievable for a very simple couring problem shown in Figure 6.6. Can be extended at the ends of the side wideout equation conflicts. Any number of terminals can be like up anotherible facility back a side of the sid start later. The exchange causes the two originally connected segments to overlap but no other segments to overlap. (See Figure 6.5) In this way, we can modify the routing so that there is only one horizontal segment for each connection. This segment must lie above the terminal or terminals to which it needs to be connected since some segment originally did, and segments have not been shortened, just merged. The vertical segments which connect to terminals can be added on the vertical layer. Also, after the modification has been made to each side, segments which minst connect at the corners can be lengthened or shortened to do so without causing conflicts. This upper bound on the amount of height added by using separate layers for the two directions is actually achievable for a very simple routing problem shown in Figure 6.6. The argument used to prove Lemma 6.2 holds for any region above a component side as long as all wires are either perpendicular or parallel to the side, segments of arbitrary length extending from different terminals perpendicular to the side will not intersect, and segments which run parallel to the side can be extended at the ends of the side without causing conflicts. Any number of terminals can be interconnected, not only pairs. # 6.3.2 Routing abound a rectangle using more than two layers: 8 dea of the second secon Let us now consider the One Component Routing Problem when there are more than two layers for interconnection. To take advantage of the entra layers while retaining the limitation to horizontal and vertical wire segments, some parallel segments innut lie one on adjoint another. The connection of segments on different layers is no longer trivial. When two segments on different layers must connect, no segment intersecting the connection point can lie on any layer between the layers of the connecting segments without also being connected to them. The simplest way of avoiding unwanted connections is to put segments which connect on adjacent layers: Suppose that we do wish to retain the "separate direction - separate layer" strategy for more Figure 6.5: Construction for Lemma 6.2. | • | | | | | | | | |------------------|---------------------------------------|----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------|--------------|------------------------------| | | , | , | g saman i sym i bye.<br>T | organistic oper Adhay " | . Ethy Roph i | | | | | | | e de la serie de la compansión de la compansión de la compansión de la compansión de la compansión de la compa | anne mitalia i viri<br>⊯ | one chai | nnel, two la | yers | | | | 1 | ener - Length September | e paka setum | ****<br>********************************* | | | | | | | • | ž. | | | | | becomes | | • | T. S. | | | | | | | | | | v<br>Alamini | the second | | | | | | .* | | - | | | | | | · · · · · · · · · · · · · · · · · · · | | • . | | two cl | hannels | | | | • | | | , | | · | | | | | | | | | | | | | | | | | | | | | | | | | - | | | | | | | | Same Aller | . 11. | San Style | | *** | | * | | | | | | | | | erging segments | for one cons | nection. | | • | | | | | for | layer conne | etion - | avely comments. | as per s <del>eas</del> | No. | Т. | | | | - | | | eresident and the second | | | | | | | | | | * | for vert | ical connection | | | | | Samuel Company | | | for vert | ical connection | | after splitting: | | * | Silving Section (1980) And Secti | | | for vert | ical connection | | | | | The second secon | | | for vert | ical connection | | | | | Silver of State St | | | for verti | ical connection - right of 1 | | | right of 1 | | | | | for verti | | | | | | right of 2 | | | for verti | | | | | | right of 2 | The second secon | | for verti | | | | | | right of 2 | | | for verti | - right of 1 | | | | | right of 2 | | | for verti | | | after splitting: | right of 1 | | right of 2 | | | for verti | · | | | right of 1 | | right of 2 | | | • | - right of 1 | | after splitting: | right of 1 | | | - Programme Company | | • | - right of 1 | | after splitting: | right of 1 | | right of 2 | - Programme Company | | • | right of 1 | | after splitting: | right of 1 | | | - Programme Company | | • | - right of 1 | | after splitting: | right of 1 | | | - Programme Company | | • | right of 1 | | after splitting: | right of 1 | | | - Programme Company | | • | right of 1 | Figure 6.6: Instance of One Component Routing Problem realizing upper bound of Lemma 6.2. When one layer for horizontal segments and one layer for vertical segments: When no restriction on the direction of segments in either layer: than two layers. We will let all odd numbered layers be for horizontal segments and all even numbered layers be for vertical segments. Each layer is adjacent to layers containing wires in the other direction. Let "track" denote a perficular layer in a channel. When there is only one layer for each direction, the number of channels used for routing along a side of the component is the same as the number of tracks used. When there are H layers for sequired. More than this number of channels required is as little as 1/H of the number of tracks required. More than this number of channels may be needed to allow segments to connect at corners properly. The connections at corners place constraints on the layers used. All layers in a channel may not be usable. It is important to observe that there is no longer guaranteed to be an optimal routing in which each path uses at most one segment running along each side of the rectangular component. Permitting a path to use more than one segment along a side allows the path to change layers. In this way, layers may be better utilized. Pigure 6.7 illustrates. Let us suppose see do wish to restrict the routing to one segment along each side per path. We can choose which segments will share the same track; e.g. using the matching function of Chapter 5. Given any such choice, we must usely the resulting sets of segments to channels and layers in the channels so that the connections at corners are properly made. A legal assignment minimizing the area of the layout is desired. We have broken up the anignment of path segments to tracks into two phases: determining what segments will share the same track, and determining what track they will share. When there is only one layer for each direction, the second part of this problem is trivial — any assignment will do. If we require that all pairs of connecting segments be on adjacent layers, then we can model the constraints imposed by the connections between segments using a directed graph. The graph contains one node for each set of segments sharing a track. Let a node representing a set of segments above the top of the component be called a top node. Similarly, the terms bottom node, left node, and right node denote nodes representing sets of segments on the indicated sides. There is an edge between two nodes Figure 6.7: Layer assignment for the One Component Routing Problem. It is no longer true that one channel per street for each net is always optimal. Given 4 layers, no assignment of layers uses only one channel on each side. One attempt is shown. Versus a successful assignment Graph representation of track connections for first track assignment when the sets of segments represented by the nodes contain segments which must be connected at a corner. Edges are directed as follows: from a top node to a left node, representing a connection in the top left corner; from a left node to a bottom node, supresenting a connection in the bottom left corner; from a right node to a top node, representing a connection in the bottom right corner; from a right node to a top node, representing a connection in the top right corner. The graph consists of disjoint chains and cycles. Figure 6.7 shows the graph for the segments of that example. Since connections are always between adjacent layers, it suffices to determine the layer for each set of segments. The actual channel used by each set of segments can be arbitrarily chosen; just as they could for only two layers. The only restriction is that there be only one set of segments on each layer in any channel. Therefore, we wish to assign a layer number to each node of the graph so that left and right nodes have odd numbers and top and bottom nodes have even numbers. Adjacent nodes in the graph must be numbered with adjacent numbers. For any given side, each occurrence of a number will be in a different channel. We wish to find a numbering which minimizes the area of the corresponding layout. Let us now consider graphs consisting only of disjoint cycles. We restrict our attention to these graphs to illustrate how cyclic constraints can be handles without worrying about graphs with different numbers of tracks on different sides, which can occur if chains are present. A method of assigning layers to nodes in a graph consisting only of cycles is as follows. Choose some cycle; begin with some top node of this cycle. Assign layer 1 to this node. Moving in both directions away from this node, assign the nodes along the cycle — one node in each direction at each step — in the following pattern. Going in the direction of the edges (clockwise), beginning with i=1, assign in consecutive steps: i+1 to the next left node; i to the next bottom node; i+1 to the next right node; i+2 to the next top node. Increase i by two and repeat the pattern. Going apposite to the direction of the edges (counterclockwise), beginning with i=1, in consecutive steps assign: i+1 to the next right node; i to the next bottom node; i+1 to the next left node: i+2 to the next bottom node; i+1 to the next left node: i+2 to the next top node. Increase i by two and repeat the battern. The first node reached by both directions will either be a top or a bottom node. Since these are being assigned the same number at the same step, the cycle is completed properly. If the step assigning 21/1 to a right node clockwise and a left node counterclockwise is reached before the cycle is finished, a pattern of descending numbers is used. For clockwise, beginning with i=2H-1, assign: i to the next top node; i-1 to the next left node; i to the next bottom node; i-1 to the next right node. Decrease i by two and repeat. For counterclockwise, beginning with i=2H-1, assign: i to the next top mode; ill to the next right mode; i to the next bottom node; i-1 to the next left node. Decrease i by two and repeat. Again, if the step assigning 2 to a right node clockwise and a left node counterclockwise is reached before the cycle is finished, the original pattern of ascending numbers is repeated. By repeating the two patterns through increasing and decreasing numbers alternately, a cycle of any size can be labeled. (See Figure 6.8). After the first cycle is finished, a second cycle can be labeled beginning within the next step of the pattern which assigns to a top or bottom node. If the next step which assigns to one of the two sides assigns to a top node; then an arbitrary top node of the new cycle is chosen as the starting point. If the step assigns to a bottom node, then an arbitrary bottom node is chosen as the starting point. In this way, each cycle can be assigned in turn. Each pass through increasing or decreasing labels uses one channel on each side. At most one layer is unused in any channel. One pass through increasing numbers assigns numbers to H groups of four nodes. Each group consists of consecutive top, left, hottom; and right modes on a cycle. One pass through decreasing numbers assigns numbers to H-1 such groups. Therefore the number of channels uncom any side under this assignment method is at most livery as one could be assignment method is at most livery as a second relative to where n(cycle) is the number of groups of four nodes in the cycle. Note that when a graph consists of only cycles, it contains the same number of nodes for each side. Therefore aquation 1 is equal to where n is the number of nodes for each side. The number of channels used on any side is at least 1/H of the number of nodes for the number of character used on a side by an optimal settlens out it as the assignment urbaique to the number of character used on a side by an optimal settlens out it as most 451/211-1 + 4H/n<sub>s</sub>. For 2 ≤ H < 1/12n<sub>s</sub>, this ratio is less than three. His reasonable to expect n<sub>s</sub> to be number that his recurrence we do expect no benefits and the connections, and therefore sequences that havers for interconnect. To Custor 5 universes the number of If subgreather blockwes the sets of segments which will share tracks. formating may not lead to a minimum acca reading. An aff sati to smaller a more legens to be used in grane channels, resulting in less channel segments may not werthings for To the top and bettom or left and right may not be optimal ejum mand-aut policy by to optimize the distribution of top reachs and buttern with the used. It usly minimizes their som. The "wrong" distribution might result in traff-unliked chargeds. In addition our orginal choice of pada for ichel consections is no longer sufficient to find a V.S. come V. Lendron band R. pan gantabalisti resti v shows that the optimal nartine its side at mid all four sides of the comprehent. #### 6.4 Summary The algorithm presented in Chapter 5 finds an optimal course when corsin restrictions are placed on the problem and on the allowed routing pades. In this chapter we have discussed the represents of removing some of these restrictions. When we allow solutions which use only even layers but allow herizomal and sertical segments on both layers or which use more than two layers, the algorithm as longer always finds as optimal solution. In these cases the algorithm may be considered a heuristic algorithm. The algorithm liquids the range of solutions it considers but finds an optimal solution within this limited set. When two layers are used for both increased and vertical experience, we have of the number of nodes for that side. Therefore, the ratio of the number of channels used on a side by the assignment technique to the the number of channels used on a side by an optimal assignment is at most $4H/2H-1 + 4H/n_s$ . For $2 \le H < 1/12n_s$ , this ratio is less than three. It is reasonable to expect $n_s$ to be much larger that H, since we do expect to have many more interconnections, and therefore, segments, than layers for interconnect. The algorithm presented in Chapter 5 minimizes the number of tracks used in routing around a rectangle. The algorithm produces the sets of segments which will share tracks, called a packing of the segments. However, the packing may not lead to a minimum area routing. An alternate packing of the segments may allow more layers to be used in some channels, resulting in less channels being used. Even the distribution of tracks to the top and bottom or left and right may not be optimal. The algorithms for top-bottom routing does not try to optimize the distribution of top tracks and bottom tracks used. It only minimizes their sum. The "wrong" distribution might result in half-utilized channels. In addition, our original choice of paths for local connections is no longer sufficient to find an optimal routing. Figure 6.9 shows that the optimal routing may require that a local connection use a path around all four sides of the component. ### 6.4 Summary The algorithm presented in Chapter 5 finds an optimal routing when certain restrictions are placed on the problem and on the allowed routing paths. In this chapter, we have discussed the repercussions of removing some of these restrictions. When we allow solutions which use only two layers but allow horizontal and vertical segments on both layers or which use more than two layers, the algorithm no longer always finds an optimal solution. In these cases, the algorithm may be considered a heuristic algorithm. The algorithm limits the range of solutions it considers but finds an optimal solution within this limited set. When two layers are used for both horizontal and vertical segments, we have Figure 6.9: The One Component Routing Problem with more than two layers. ## Connection for 5 goes around 4 sides. adds 2 units to horizontal dimension adds 2 units to vertical dimension ## Connection for 5 goes around 2 sides. adds 3 units to horizontal dimension adds 3 units to vertical dimension horizontal layer 1 horizontal layer 2 vertical layer 1 vertical layer 2 shown in Section 6.3 that the amount of length added to each dimension by the routing produced by the algorithm is never more that twice that added by the optimal routing. When separate layers are used for horizontal and vertical segments but there is more than one layer for each direction, the algorithm minimizes the number of tracks used in each direction. However, a new problem is encountered. Each set of segments sharing a track must be assigned to a channel and layer so that connections at corners can be made properly and the area of the resulting layout in minimized. Given sets of segments to be assigned, we do not know an algorithm to find an optimal assignment. Also, given a collection of segments, we do not know how to pack them into tracks so that the resulting area, rather than the resulting number of tracks, is minimized. Finally, given a One Component Routing Problem, we do not know how to choose directions for connection paths so that the resulting area, rather than the resulting number of tracks used, is minimized. ### Chapter 7 Suprmary and Directions for Research Angle Ang In this thesis, we have examined problems encountered in circuit layout from the perspective of complexity theory. Our goal has been to gain better understanding of the problems and discover better techniques for their solution. We have presented three proofs of NP-completeness — one for an orthogonal layout problem with rectangular components of arbitrary size, one for the channel assignment problem in a street, and one for the intersection channel assignment problem, i.e. channel assignment over all streets and intersections. We also have analyzed a common heuristic algorithm for channel assignment within a street. This analysis shows that the ratio of the number of channels used in the solution produced by the algorithm to the optimal number is not bounded by any constant. This algorithm and its analysis provide a reference point against which other algorithms can be compared. The development of better algorithms for channel assignment remains a topic for research. The proof of NP-completeness of the channel assignment problem holds only when each net is required to use at most one channel, i.e. channel assignment without jogs. The complexity of the channel assignment problem with jogs remains open. The author conjectures that the problem is NP-complete. In practice, jogs are allowed within a street. Therefore, the analysis of known channel assignment algorithms which use jogs and the development of better algorithms are useful areas of research. The lower bounds on optimal channel assignments without jogs do not apply when jogs are allowed. Analysis of solutions when jogs are allowed is necessary to determined useful lower bounds. In Chapter 5, we have presented an algorithm which finds the optimal channel routing for a special case of the layout problem in polynomial time. Among other requirements, the layout problem must involve only one rectangular component, and all nets must contain exactly two terminals. We have shown in Chapter 6 that when any restrictions on the problem are removed, the algorithm is no longer guaranteed to find the optimal solution. Assumptions about satisfactory routing paths no longer hold. When nets are allowed to have more than two terminals but all other restrictions of Chapter 5 hold, the complexity of the layout problem is open. A modification of the algorithm presented in Chapter 5 or a completely different algorithm may find an optimal solution in polynomial time. The author has been unable to find such an algorithm or produce a proof of NP-hardness. An alternate relaxation of the restrictions allows two components which lie side by side, but does not allow terminals on the adjacent sides. The restriction to two terminal nets is retained. The ability of paths to run between the two rectangles is added. This version of the problem has also defied analysis. More generally, the techniques used in the algorithm may be used as heuristics for the general routing problem. Further research is necessary to determine the quality of these techniques as heuristics. An interesting combinatorial problem is suggested by the routing problem of Chapter 5. เมษาสาย สเมียชี้เราได้จะไป ผู้สายเหมือนสโด Circular arc coloring and routing around a rectangle resemble each other except that, for the rectangle, বি, ১৮১৫ ইটাইড চেডিড ট্রাকের একটাইর এইটারিটেড paths can go one of two ways around the rectangle and paths can change "color" at any corner. Suppose લાંક સ્ટાપ્ટલામાં દેવિવામાં ફ્રીક્ષાને નિર્ભાષ્ટ્રમાં હતી રાજમાં મેળજીક વર્ષોદ્રો હવી સામાન we extend circular are coloring so that "ares" can go around the circle in either direction. More precisely, เรียกสุด เพียงให้เกา และเหมือน และ โดยใหม่สาดให้ แ**สะที่ คามของเปลี้ยสู่ขอ**ง รีสาใต้เหมือนความแ we modify the circular arc coloring problem as follows. Pairs of points on a circle are given. The problem างการการ เมื่อการหน่ายเพิ่ม การเรียนสู่สิดส์ **โดยกลมที่**จะได้นำ สิดทุพย์เมื่อวิดี ยา และเพิ่ม โดย ได้ยาการ การ is to choose one of the two arcs determined by each pair and color the chosen arcs so that the number of tide troop esta obasilya desiriki dilik olasa bila bora siyar sisali asila bila bara siyar sisali asili dabila colors used is minimized. Since circular arc coloring is NP-complete, we suspect that this modification is and the collision were the contract which were a contract to the collision of the collision and the collision and as well. and the contract of the section of the section of We have chosen to use a model which represents components as rectangles and wires as objects with uniform width and uniform interwire spacing. The design rules for integrated circuit design [Me80] are more complicated. Widths and spacings wary from layer to layer. Under the model we use, the widest of the required widths and spacings must be use. This simplification removes the complications of particular technologies from the topological and geometric aspects of the layout problem. Problems which vary significantly from our model, such as power and ground wiring, require separate algorithms. We assume that no topological information about the desired layout is given with the problem specification. This is in contrast with the stick diagram approach reviewed in Chapter 2, which has topological information as part of the input. Both techniques have been found useful in practice, but lead to different problems. The study of algorithms for problems encountered in alternate approaches to layout from that in this thesis is an area of complementary research. In Chapter 3, in addition to presenting the model we have used, we discussed a graph theoretic model for layout. This graph theoretic model has proven very useful in obtaining bounds on the area required by interconnection patterns represented by various classes of graphs. There are many open problems in this area of research. Two open problems of great interest are to determine tight upper and lower bounds on the area needed to embed an arbitrary planar graph and on the area needed to embed a shuffle exchange graph [Lei80]. [Th80]. An alternate cost measure to those presented in Chapter 3 has recently been proposed by Storer [Su80]. He also proves the NP-hardness of optimally embedding a graph in the grid both under the cost measures defined in Chapter 3 and under his measure. In the problems we have considered, placement and routing are restricted to be orthogonal. Research is needed on the cost of this assumption. How much space is lost by allowing only horizontal and vertical wires? How much is gained by adding a third direction for wires, e.g. 45 degree wires. Tompa [To80] has shown that far, one street containing terminals on both sides such that all connections are of the form "connect the i<sup>th</sup> terminal on one side to the i<sup>th</sup> terminal on the second side", the optimal routing uses arcs of circles as wire paths. We have also assumed that horizontal and vertical wires lie on different layers. We have indicated how much this assumption costs for the problem of Chapter 5. How much this assumption costs in general is an open problem. When there are only two layers for interconnect, running two wires in parallel, one on top of the other, on separate layers across a chip separates the two sides of the chip. This suggests that restricting each layer to wires in one direction is desirable. At present, most integrated circuit designs are done with only two layers for interconnect. However, in the future, more layers may be available. When more than two layers are available restricting routing to two orthogonal directions may be very costly in terms of area of layouts. Should more directions be introduced and one direction be used for each layer? Should the "different layer, different direction" technique be abandoned completely? For more than two layers, a major area of research is to determine the restrictions which simplify the routing problem sufficiently to allow development of good algorithms and which result in good layouts in relation to the layouts produced without the restrictions. The model we have used considers only the area of layouts. In practice, other characteristics of circuit layouts, such as circuit timing and power consumption, are of interest. Research is needed into ways of modeling these parameters. Very few algorithms have been developed which take these parameters into account in any way. We would also like to have a less restrictive model of components. The extension of the model to include logically and physically equivalent terminals was discussed in Chapter 3. We would also like the ability to define variable size components—components which could stretch or shrink depending on the requirements of the layout. A very useful concept is that of semi-restricted regions for routing. These would be regions where some but not all layers are free to be used by wires. This concept in combination with stretchable cells could allow wires in certain layers to run over cells after the cells have been stretched to make room for the whet. These concepts have been used by some systems, but new representations and algorithms exploiting these representations are needed. In the thesis, we have separated placement and routing in the conventional fashion. The interaction of placement and routing is very complicated. One area of research is the development of algorithms which do not separate placement and routing or which at least allow placement and routing to interact more. The graph theoretic techniques mentioned in Section 2.2 do consider placement and routing simultaneously. However, these models do not represent the grainetry of the components. We have seen in Chapter 4 that packing rectangles in minimum area is in itself NP-cumplete. Therefore, the are separated, it is not clear whether it is important to get a good geometric packing or a packing which facilitates routing. At one entreme, a placement which is a good packing of components as rectangles can be sought. This, of course, is NP-complete. At the other entreme, the placement can be governed by the desired interconnections between components with little of no regard for how the rectangles fit together. One area for future remarch is the study of the tradeoffs between the two aspects of placement. Good criteria for judging whether a placement does facilitate the routing are needed. The criteria currently used most often estimate wire length, but not routing area. We have intended our model to be independent of the particular technology of interest, although we have used the design rules of [Me80] for nMOS as our guide. These design rules are intended to scale down as the fabrication technology becomes capable of creating smaller and smaller objects. Therefore, our model should remain useful despite the fact that actual parameters of designs change frequently. In this thesis, we have been concerned with the quality of algorithms to solve problems encountered in layout. The criteria of complexity theory are not the only criteria useful in judging an algorithm to be used in practice. Algorithms should be easy to use and maintain. They should allow some interaction with the human designer. The algorithm presented in Chapter 5 finds optimal solutions. Therefore, interaction with the designer is not as important as it is for heuristic algorithms, where the designer's insight is an important input. Nevertheless, the algorithm of Chapter 5 can be written to allow the designer to predetermine the paths of some wires. It will find the optimal routing using these paths. Another criterion for judging algorithms is the number of good solutions which an algorithm can produce efficiently. An algorithm which can present a designer with several good choices will be preferable to one which presents one good solution, unless the one solution is substantially better than any of the choices. Even among algorithms which find optimal solutions, one which obtains more than one optimal solution in approximately the same running time may be proferred. Criteria other than the optimization criteria may be used to choose among optimal solutions. For example, the algorithm of Chapter 5 is biased towards assigning "0"s to left top terminals. A dual version of the algorithm biased towards assigning "1"s to bottom right terminals might also be run to find an alternate solution. Complexity analysis can assist in the development of better algorithms for layout problems. However, the usability of an algorithm must also be considered in evaluating new algorithms for layout design. and the control of th tiga en la como como esta esta esta esta en la como de dela como de la como de la como de la como de la como de la como dela como de la como de la como de la como de la como de la como dela como de la ang mang pang mang panggan nang panggan ng manggalagi. Salagikan merinda milihat ganag sagat panggan banggan n and the control of th and the second second to the control of the second of the second of the control of the second of the second of त्र प्राप्त कर का प्राप्त के कि तुर्वमूच अम्बद्धा के वाराध्यक्ष के अध्यक्ष अभिनेत्र अभिनेत्र के विद्यालया है ह รางกับ ( ) แล้ว การ ( ) ครามหมาย หลุด ( ) การกระบบการ<del>สมองใน โดยความการ (สมองให้ความการ</del> The gradient was a second to the second and the second and the and the contraction of contr The street of the street of the second of the second of the grant at the teach and make at the fill them the ### Appendix: Basic Definitions In this thesis, the concepts of graph theory will frequently be used to describe problems. All definitions can be found in [Liu68]. We review them here. A graph, G = (V, E), consists of a finite set of nodes (or vertices), V, and a set of edges, E. Each edge, $e = (v_1, v_2)$ , is a pair of nodes. Edge e is said to be incident on vertices $v_1$ and $v_2$ ; vertices $v_1$ and $v_2$ are the endpoints of e. Vertices $v_1$ and $v_2$ are adjacent to each other. If the edges are ordered pairs, the graph is directed; otherwise, it is undirected. In a directed graph, an edge $(v_1, v_2)$ goes from (or out of) $v_1$ to (or into) $v_2$ . If a node has d edges incident to it, then the node is of degree d. A path in a graph is a sequence of edges: $$(v_1, v_2), (v_2, v_3), (v_3, v_4), ..., (v_n, v_{n+1}).$$ The nodes on the path are the nodes $v_1$ , $v_2$ , ..., $v_{n+1}$ . The path is of length n. Nodes $v_1$ and $v_{n+1}$ are the endpoints of the path; the path goes from $v_1$ to $v_{n+1}$ . If no node appears in the sequence of edges more than once, the path is said to be simple. We will always mean "simple path" when we say "path". A path is a cycle if $v_1 = v_{n+1}$ . A simple cycle is a path on which only $v_1 = v_{n+1}$ appears more than once. A graph is acyclic if there are no cycles in the graph. An acyclic undirected graph is called a tree. A subgraph, $S = (V_S, E_S)$ , of a graph, $G = (V_G, E_G)$ , is a graph whose set of nodes, $V_S$ , is a subset of $V_G$ and whose set of edges, $E_S$ , is a subset of $E_G$ containing only edges whose endpoints are in $V_S$ . A graph, G, is *planar* if there is a mapping from the nodes of G to points in the plane and from edges of G to curves in the plane such that: - (1) The endpoints of the curve corresponding to an edge are the images of the endpoint of the edge; - (2) No two curves intersect at any point other than their endpoints; - (3) A curve does not intersect the image of a node unless that node is an endpoint of the edge corresponding to the curve. Informally, a planar graph is one which can be drawn in the plane without crossing edges. An m by n two-dimensional grid graph is an undirected graph with node set: $$V = \{(i,j) \mid i \text{ and } j \text{ are positive integers, } 1 \leq i \leq m \text{ and } 1 \leq j \leq n\}$$ and edge set: $$E = \{(v_1,v_2) | v_1 = (i,j) \text{ and } v_2 = (i,j+1) \text{ for } 1 \le i \le m \text{ and } 1 \le j \le n-1$$ or $v_1 = (i,j) \text{ and } v_2 = (i+1,j) \text{ for } 1 \le i \le m-1 \text{ and } 1 \le j \le n\}.$ To indicate the growth rate of functions when discussing the performance of an algorithm, we need the following notation. Given real-valued functions f and g on the same domain, f is O(g) if there is a positive constant, c, such that for all but a finite number of domain values, $f(x) \le cg(x)$ . The O notation is used when discussing upper bounds on a function. A statement of the form "the running time is O(g(n))" gives an upper bound on the running time. When discussing lower bounds, the $\Omega$ notation is used. Given real-valued functions f and g on the same domain, f is $\Omega(g)$ if there is a positive constant, c, such that for all but a finite number of domain values, $f(x) \ge cg(x)$ . Then "the running time is $\Omega(g(n))$ " is stating a lower bound on the running time. The above definitions are of the concepts most frequently used in this thesis. Other definitions are presented as needed. ### Annotated Bibliography [Ab72] Abel, Luther C., "On the Ordering of Connections for Automatic Wire Routing," *IEEE Transactions on Computers*, Vol. C-21, No. 11, Nov. 1972, pp. 1227-1233. Presents experimental evidence that the success of routing one connection at a time (point-to-point) on one layer using a maze router is independent of the order in which connections are attempted. Performance is measured in terms of the total of the rectilinear distances between the endpoints of the connections successfully completed. The maze router includes a heuristic to avoid running a path most to a row of terminals and blocking all the terminals. - [Abs80] Abelson, H.; Andreae, P., "Information Transfer and Area-Time Tradeoffs for VLSI Multiplication," Communications of the ACM, Vol. 23, No. 1, Jan. 1980, pp. 20-23. - [Agu77] Agule, B.J.; Lesser, J.D.; Ruchli, A.E.; Wolff, P.K. Sr., "An Experimental System for Power/Timing Optimization of LSI Chips," Proceedings of the Fourteenth Design Automation Conference, IEEE, June 1977, pp. 147-152. Presents (with [Ru77]) a system to produce layouts of integrated circuits when power and timing are considered. Given a circuit, a layout minimizing the total power required to drive the circuit while satisfying timing constraints in desired. and the manifest the state of t [Ah74] Aho, A.; Hopcroft, J.; Uilman, J., The Design and Analysis of Computer Algorithms, Addison-Wesley Publishing Co., Reading, Mans., 1974. A good text on the analysis of algorithms. 据的东西建筑的城市,**第**100 的复数 1974。 - [Ay79] Ayres, Ron, "Silicon Compilation A Hierarchical Use of PLAs", Proceedings of the Sixteenth Design Automation Conference, IEEE, June 1979, pp. 314-326. Presents a method of converting a specification of a function in synchronous logic (a high level language) to an implementation of the function on a chip using one or more PLAs. - [Ba79] Baker, B.S.; Coffman, E.G. Jr.; Rivest, R.L., "Orthogonal Packing in Two Dimensions," Department of Electrical Engineering and Computer Science TRCS79-1, U. of California at Santa Barbara (also to appear in SIAM J. on Computing). Presents the analysis of a heuristic algorithm for a restricted placement problem for rectangles. Rectangles have a predetermined orientation. was first returned. (34) (44) (14) (14) - [Be66] Berge, C., The Theory of Graphs and its Applications, John Wiley & Sons, Inc., New York, 1966. - [Br76] Brinkmann, Mlynski, "Computer-Aided Chip Minimization for IC-Layout", Proceedings of the IEEE International Symposium on Circuits and Systems, April 1976, pp. 650-653. Presents a method of placing rectangular components on a sectangle. The goal is to minimize unused space. The method is to divide the large rectangle into smaller rectangles, and modify the small rectangles to the size of the actual components. [Ch77] Cho, Y.E.; Korenjak, A.J.; Stockton, D.E., "Floss: An Approach to Automated Layout for High-Volumn Designs," *Proceedings of the Fourteenth Design Automation Conference*, IEEF, June 1977, pp. 138-141. Describes FLOSS, a program for the automatic packing of the hand sketch of a circuit 44 de 1.71.14 layout. [Des72] Design Automation of Digital Systems, Volumn 1: Theory and Techniques, Melvin Breuer, ed. Prentice-Hall, Inc. Englewood Cliffs, N.J. 1972. Reviews various problems within design automation. Logic synthesis, simulation and test generation are covered as well as all phases of layout. A good introduction, but addresses primarily printed circuit layout. - [Deu76] Doutsch, D.N., "A Doglog' Channel Router," Proceedings of the Thirteenth Design Automation Conference, IEEE, 1976, pp. 425-433. - Describes a channel assignment algorithm when jogs are allowed. - [Fe76] Feller, A., "Automatic Layout of Low-Cost Quick-Turnaround Random-Logic Custom I.SI Devices," Proceedings of the Thirteenth Design Automation Conference, IFFF, June 1976, pp. 79-85. Presents the standard cell approach used by RCA. Components, called cells are placed in rows. of cost on the analysis of algorithms from - [Gar79] Garey, M.R.; Johnson, D.S., Computers and Intractability: A Guide to the Theory of NP-Completeness, W.H. Freeman and Co., San Prancisco, 1979. Gives a good presentation of the theory of NP-completeness. Techniques for proving problems NP-complete and methods of dealing with NP-complete problems are described. A list of over 300 NP-complete problems is given in the appendix. - [Gar] Garey, M.R.; Johnson, D.S.; Miller, G.L.; Papadimitriou, C.H., "The Complexity of Coloring Circular Arcs and Chords", SIAM Journal on Algebraic and Discrete Methods, to appear. - [Gav72] Gavril, F., "Algorithms for Minimum Coloring, Maximum Clique, Minimum Covering by Cliques, and Maximum Independent Set of a Chordal Graph, STAM Journal of Computing, Vol. 1, No. 2, June 1972, pp.180-187. The class of chardal graphs includes the class of interval graphs. Channel assignment under certain conditions and interval graphs boldsting are equivalent (see Chapter 4). [Gi76] Gibson, D.; Nance, S.; "St.IC - Symbolic Exposit of Integrated Circuits," Proceedings of the Thirteenth Design Automation Conference, IEEE, 1976, pp. 434-446. Presents a system under which designers use symbols to represent mask topologies. A design specified by symbols closely corresponds to a wild drawing. A symbolic design is automatically transformed into a full geometric specification. [Gu79] Gupta, U.; Lee, D.; Leung, J., "An Optimal Solution for the Channel Assignment Problem", IEEE Transactions on Computers, Vill C 28, No. 17, Nov. 1979, pp. 207-810. Presents an algorithm with (Antign) manufag time for solving the channel assignment problem when there are no communities due to textulisal serial from one another. The algorithm is generalized to solve a two-dimensional problem. [Han72] Hanan, M.; Kurtzberg, J., "A Review of the Placement and Quadratic Assignment Problems," SIAM Review, Vol. 14, No. 2, April 1972, pp. 324-342. A review of algorithms for the problem of placing points to optimize some function A TOTAL ASAL THE KINNERS AND THE THORK AND THERE IS NOT THE PARTY OF THE and a special case, called the quadratic assignment problem. A summary of experiments done with some programs is given. agents within all reget to a co - [Han76] Hanan, M.; Wolff, P.K.; Agule, B.J., "Some Experimental Results on Placement Techniques," Proceedings of the Thirteenth Design Automation Conference, 1998, June 1976, pp. 214-224. Presents the results of an empiracal study of placement algorithms. Both random and constructive initial placements are combined with various iterative improvement techniques. - [Has71] Hashimoto, A.; Stevens, J., "Wire Routing By Optimizing Channel Assignment within Large Apertures," Proceedings of the Eighth Design Automation Workshop, IEEE, 1971, pp. 155-169. This paper introduced the channel routing technique. The general method is outlined and an algorithm for channel assignment when there are no constraints due to terminals is given. - [He78] Heller, W.; Mikhail, W.; Donath, W., "Prediction of Wiring Space Requirements for LSI," Journal of Design Automation and Fault-Tolerant Computing, Vol. 2, 1978, pp. 117-144. Presents a method of estimating the space required for wiring using a grid and an array of locations. A stochastic model is used. - [Hi69] Hightower, David, "A Solution to Line-Routing Problems on the Continuous Plane", Proceedings of the Sixth Design Automation Workshop, SHARE, ACM, and IEEE, 1969, pp.1-24. This paper introduces the escape line technique for finding a path between two points. - [Hi74] Hightower, David, "The Interconnection Problem: A Tutorial," Computer, Vol.7, No.4, April 1974, pp. 18-32. A survey of the routing problem, primarily for printed circuit boards. Discusses pin assignment, wire list determination, layering, ordering, and "wire layout" (routing as we have discussed it). Includes a good summary of the major routing algorithms. Has a large bibliography. - [Hit69] Hitchcock, R.B., "Cellular Wiring and the Cellular Modeling Technique", Proceedings of the Sixth Design Automation Workshop, SHARE, ACM, and IEEE, 1969, pp.25-41. Introduces the cellular approach to routing. - [Hs79] Hsuch, Min-Yu; Pederson, Donald, "Computer-Aided Layout of LSI Circuit Building-Blocks", Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS), 1979, PP.474-477. This paper describes CABBAGE, a program which converts a schematic stick representation of an integrated circuit layout into a full geometric representation. Once a geometric representation is obtained, the program compacts the circuit layout. - [Hw78] Hwang, F.K., "The Rectilinear Steiner Problem," Journal of Design Automation and Fault-Tolerant Computing, Vol. 2, No. 4, Oct. 1978, pp. 303-310. Presents a review of results on the Rectilinear Steiner Problem. - [Jo79] Johannson, D., "Bristle Blocks: A Silicon Compiler", Proceedings of the Sixteenth Design Automation Conference, IEEE, June 1979, pp. 310-313. Presents a system for the design of LSI circuits base on a particular chip architecture. A hierarchical design approach is used. A number of representations are exploited. [Ka79] Kawamoto, T.; Kajitani, Y., "The Minimum Width Routing of a 2-Row 2-Layer Polycell-Layout", Proceedings of the Sixteenth Design Automation Conference, IEEE, June 1979, pp. 290-296. Presents a polynomial time algorithm to find a minimum channel assignment when jogs are allowed anywhere, i.e. when the street length can expand. Basil National Selection of Selection [Kel77] Kelly, Michael; Smith, Robert, "Analytical and Experimental Analysis of Routing Algorithms", Proceedings of the Eleventh Asilomar Conference on Gircuits, Systems, and Computers, IEEE, Nov. 1977, pp.362-369. Presents an empirical study of maze, line search, and channel routers. Data is difficult to interpret. [Ker73] Kernichan, B.; Schweikert, D.; Persky, G., "An Optimal Channel-Routing Algorithm for Polycell Layouts of Integrated Circuits," Proceedings of the Tenth Design Automation Workshop, IEEE, 1973, pp. 50-59. Presents an algorithm which finds an optimal channel assignment. The algorithm searches through all possible solutions and has exponential running time. [Ko69] Kodres, U.R., "Logic Circuit Layout," Proceedings of the Joint Conference on Mathematical and Computer Aids to Design, ACM/SIAM/IEFF, Oct. 1969, pp. 165-191. A survey paper which uses a graph theoretic model for circuit layout. Other subproblems of the layout problem in addition to placement and routing are discussed. - [Ku79] Kuh, E.S.; Kashiwabara, T.; Fujisawa, T., "On Optimal Single-Row Routing," IEEE Transactions on Circuits and Systems, Vol CAS-26, No. 6, June 1979, pp. 361-368. See entry for [So74]. - [La79] Lauther, Ulrich, "A Min-Cut Placement Algorithm for General Cell Assemblies Based on a Graph Representation", Proceedings of the Sixteenth Design Automation Conference, IEEE, June 1979, pp.1-10. Describes a method of placement of rectangular components of arbitrary size. A square of the same area as the total area of the components is partitioned into pieces of the same area as the individual components. In this way, the relative positions of the components in the layout are determined. [Lee61] Lee, C. Y., "An Algorithm for Path Connections and Its Applications", IRE Transactions on Electronic Computers, VFC-10, September 1961, pp.346-365. Presents an algorithm for finding the optimal path between two point in a graph using any of several optimality criteria. This algorithm is the foundation of the maze routing technique. [Lei80] Leiserson, Charles, "Area-Efficient Graph Layouts (for VLSI)" to appear in Proceedings of the Twenty-First Annual Symposium on Foundations of Computer Science, IEEF, Oct. 13-15, 1980. Presents upper bounds on the area needed to embed graphs in the grid. The bounds are for graphs belonging to classes with a separator theorem. - [Liu68] Liu, C.L., Introduction to Combinatorial Mathematics, McGraw-Hill Book Co., New York, 1968. A presentation of graph theory concepts can be found here. - [Lo79] Loosemore, K.J., "Automatic Layout for Integrated Circuits", Proceedings of the IEEE International Symposium on Circuits and Systems, 1979/pp.665-668. This paper describes an automatic layout system developed for the GABLIC design automation system. Components are variable size rectangles. They are placed in horizontal regions (more flexible that rows) with streets for routing in between. The components can be placed above the routing area independently, as that less area is wasted. Entra Reserve Medicial of Page 1966, Salve Herodor [Mc80] Mcad, Carver; Conway, Lynn, Introduction to VLSI Systems, Addison-Wesley Publishing Co., Reading, Mass., 1980. This text presents the fundamentals of designing digital systems in nMOS technology. A systems approach rather than an electronics approach is taken. This is an excellent reference for anyone who wishes to learn how to design chips. - [Na78] Nan, N.; Feuer, M. "A Method for the Automatic Wiring of LSI Chips," Proceedings of the IEEE International Symposium on Circuits and Systems, May 1978; pp. 11-16. Uses the channel touting technique of dividing souting into global and local problems. - [No76] Noto, Richard; Wagner, Barry, "Computer Aided Design for Complex Circuits", US Army Electronics Research and Development Command Technical Report DELEF-TR-76-1398-F. Presents a program which contains a method for placing and routing critical paths, whose delay must be minimized. The program produces output pin delay characteristics as well as a layout. - [Per73] Persky, G.; Gummel, H.K., "Grafos -- A Symbolic Routing Language," Proceedings of the Tenth Design Automation Workshop, IEEE, 1973, pp. 173-181. Presents a language for specifying connections. The designer symbolically represents a routing as a program specifying what line segments should be drawn between what points. The program realizes the specification. ดู (ค.ศ. 1986) ความสารที่ ค.ศ. (1986) ค.ศ. 1986 (ค.ศ. 1986) ค.ศ. (ค.ศ. 1986) ค.ศ. (ค.ศ. 1986) ค.ศ. (ค.ศ. 1986) [Per77] Persky, G.; Deutsch, D.N.; Schweikert, D.G., "LTX -- A Minicomputer-Based System For Automated LSI Layout", Journal of Design Automation and Fault-Tolerant Computing, Vol. 1, No. 3, May 1977, pp. 217-255. Presents Bell Laboratory's system for integrated circuit design using standard cells. Presents Bell Laboratory's system for integrated circuit design using standard cells. Many interesting algorithms have been developed for this system. They are outlined in this paper. - [Pi77] Pippenger, N., "Superconcentrators," SIAM Journal on Computing, Vol. 6, No. 2, June 1977, pp. 298-304. - [Po79] Porter, Edwin, "An Automatic Layout System for VLSI", Proceedings of the Eighteenth IEEE Computer Society International Conference (COMPCON), IEEE, February 1979, pp. 9-14. Describes the DIAL system, a system intended for VLSI layout. The system is designed to be completely automatic, i.e. to produce VLSI layouts without human assistance. - [Pr78] Preas, B.T.; Gwyn, C.W., "Methods for Hierarchical Automatic Layout of Custom LSI Circuit - Masks," Proceedings of the Fifteenth Design Automation Conference, June 1978, pp. 206-212. Presents a layout system which works with components which are rectangles of arbitrary size. A hierarchical approach to circuit design is used. - [Pr79] Preas, B.T.; vanCleemput, W.M., "Placment Algorithms for Arbitrarily Shaped Blocks," Proceedings of the Sixteenth Design Automation Conference, IEEE, June 1979, pp. 474-480. Describes a method of exhaustively searching for an optimal placement of rectangles of arbitrary size. Also presents a heuristic initial placement algorithm. An iterative improvement algorithm is discussed briefly. - [AsC] Proceedings of the Asilomar Conferences on Circuits, Systems, and Computers, sixth (1972) through thirteenth (1979). (Formerly Asilomar Conference on Circuits and Systems, first (1967) through fifth (1971).) 1968, 1977, 1978 copywrites by IEEE, remaining copywrites by Asilomar Conference. These conferences contain sessions on design automation. galing an analysis of John and State of Black and Bank and Light of Sales and A [DAC] Proceedings of the Design Automation Conferences, IEEE, twelvth (1975) through seventeeth (1980). (Formerly the Design Automation Workshops. Proceedings of second (1965) through eleventh (1973) available.) Sponsored by SHANE from 1965 through 1971. Joint sponsorship of ACM with SHARE and/or IEEE from 1967 through 1980. Major conference on design automation systems and techniques. Many examples of working design automation systems can be found. [CAS] Proceedings of the IEEE International Symposia on Circuits and Systems, 1974 through 1980 (formerly International Symposium on Circuit Theory), IPEE. These symposia usually contain several sessions on design automation. [Ru77] Ruchli, A.E.; Wolff, P.K. St.; Goertzel, G., "Analytical Power/Timing Optimization Technique for Digital System," Proceedings of the Fourteenth Design Automation Conference, IEEE, June 1977, pp. 142-146. Presents (with [Agu77]) a system to produce layouts of integrated circuits when power and timing are considered. Given a circuit, a layout minimizing the total power required to drive the circuit while satisfying timing constraints is distred. [Sah80] Sahni, S.; Bhatt, A., "The Complexity of Design Automation Problems," Proceedings of the Seventeenth Design Automation Conference, 1888, June 1980, pp. 402-411. Bereit and A. W. Land C. Madrid Land Market and Marchine and John Stein Stein and School and Stein Stein - Presents a survey of NP-hard problems related to design automation. The layout problems discussed use a point model for components. - [Sah76] Sahni, S.; Gonzalez, T.; "P-Complete Approximation Problems," Assiral of the ACM, Vol. 23, No. 3, July 1976, pp. 555-565. It is shown that, for several NP-complete optimization problems, no polynomial-time algorithm can find solutions which are guaranteed to have objective function values within a constant multiple of the optimal unless NP = P: Quadratic uniquation is one of the problems. [Sav79] Savage, J.E., "Area-Time Tradeoffs for Matrix Multiplication and Related Problems in VLSI Models," Brown University Department of Computer Science Technical Report No. CS-50, 1979. AND THE WORLD STATE AND A SECOND CONTROL OF THE SECOND [Sc76] Schweikert, Daniel, "A 2-Dimensional Placement Algorithm for the Layout of Electrical Circuits," Proceedings of the Thirteenth Design Automation Conference, IEEE, 1976, pp.408-416. The algorithm presented uses constructive initial placement and improvement by pairwise exchange. [So74] So, H.C., "Some Theoretical Results on the Routing of Multilayer, Printed-Wiring Boards," Proceedings of the IEEE International Symposium on Circuits and Systems, 1974, pp. 296-303. Presents a decomposition of the routing problem for multi-layer printed circuit boards, resulting in several instances of a routing problem for one row of points — the single-row, singe-layer routing problem. In this problem, space on either side of the row and in between the points can be used by wire paths. Sufficient conditions for routability of a collection of nets are studied. Theoretical research on the single-row, single-layer routing problem has been continued by several people. See [Ku79, T176, T178, T179]. - [St80] Storer, J.A., "The Node Cost Measure for Embedding Graphs on the Planar Grid," Proceedings of the Twelfth Annual ACM Symposium on Theory of Computing, 1980, pp. 201-210. - [Su73] Sutherland, Ivan; Oestreicher, Donald, "How Big Should a Printed Circuit Board Be?" IEEE Transactions on Computers, May 1973, pp.537-542; Presents an theory for predicting what size printed circuit boards should be for successful circuit layout. - [Ti78] Ting, B.S.; Kuh, E.S., "An Approach to the Routing of Multilayer Printed Circuit Boards," Proceedings of the IEEE International Symposium on Circuits and Systems, May 1978, pp.902-911. Presents an method which is an extension of that of So [So74]. [Ti79] Ting, B.S.; Kuh, E.S.; Sangiovanni-Vincentelli, A., "Via Assignment Problem in Multilayaer Printed Circuit Boards," *IEEE Transactions on Circuits and Systems*, Vol. CAS-26, No. 4, April 1979, pp. 261-271. Formalizes the via assignment problem resulting from the engaged of So [So 74]. The Formalizes the via assignment problem resulting from the approach of So [So74]. The NP-completeness of the formalized problem is proven. Heuristics are discussed. - [Ti76] Ting, B.S.; Kuh, E.S.; Shirakawa, I., "The Multilayer Routing Problem: Algorithms and Necessary and Sufficient Conditions for the Single-Row Single-Layer Case," *IEEE Transactions on Circuits and Systems*, Vol. CAS-23, No. 12, Dec. 1976, pp. 768-778. See entry for [So74]. - [Th80] Thompson, C.D., "A Complexity Theory for VLSI," Ph.D. thesis, Department of Computer Science, Carnegie-Mellon University, 1980. Presents a new measure of the complexity of a function in terms of the chip area and computing time of a VLSI realization of the function. This thesis has initiated a large amount of research using the measure. [To80] Tompa, Martin, "An Optimal Solution to a Wire-Routing Problem," *Proceedings of the Twelfth Annual ACM Symposium on Theory of Computing*, 1980, pp. 161-176. Addresses the problem of connecting terminals in order across a street, i.e. the leftmost terminal on one side is connected to the leftmost on the other side, etc. - [Ts79] Tsukiyama, S.; Kuh, E.S.; Shirakawa, I., "An Algorithm for Single-Row Routing with Prescribed Street Congestion," Proceedings of the IEEE International Symposium on Circuits and Systems, 1979, pp. 466-469. See entry for [So74]. - [Val75] Valiant, L., "On Non-Linear Lower Bounds in Computational Complexity," Proceedings of the Seventh Annual ACM Symposium on Theory of Computing, 1975, pp. 45-53. Introduces superconcentrators. - [Val79] Valiant, L., "Universality Considerations in VLSI Circuits," results presented at the IEEE International Conference on Information Pheory, Grignano, Italy, June 25-29, 1979. [van76] van Cleemput, W.M., "Mathematical Models for the Circuit Layout Problem," *IEEE Transactions on Circuits and Systems*, Vol. CAS-23, No./12, Dec 1976, pp. 759-767. Contains a brief summary of various models of circuits using graphs. A model developed by the author is presented. and the second - [van] van Cleemput, W.M. Computer Aided Design of Digital Systems A Bibliography, Computer Science Press, Inc. Vol.s I and H (1976), Vol. 111 (1978). A large bibliography of papers on any topic related to design automation, organized by topics. Includes author and keyword indices. Vol. I covers publications through Dec. 1974; Vol. III. covers Jan. 1975 through May 1976; Vol. 111 cover May 1976 through Oct. 1977. The bibliography is not annotated. - [Wi77] Williams, J.D., "SFICKS A New Approach to J.Sl Design," M.S. Thesis, Dept. of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, June 1977. Presents an program to convert hand sketched stick diagrams of circuits into layouts which satisfy design rules. Layouts are also compacted without changing relative positions of components. ত । বা বা বাবের বিষ্ণার করে। তার বিষ্ণার বিষ্ণার বিষ্ণার বা বাই এই বাই বিষ্ণার বিষ্ণার বিষ্ণার বিষ্ণার বিষ্ণার an ang ling and nagangga kalalas ang king king king king kabang makilidi. At it akaban tan berasa িয়া হ'ব বিভাগ হৈছে। তিনি বিভাগ কৰি চাৰ্কাৰ স্থানিক কিন্তু কৰি কিন্তু কৰি কৰিছে। তেওঁ কিন্তু কৰি কৰিছে কিন্তু বিভাগ বিভাগ বিভাগ বিভাগ কৰিছে বিভাগ বিভাগ বিভাগ বিভাগ বিভাগ কিন্তু কৰিছে কিন্তু কৰিছে। কৰিছে কিন্তু কৰিছে কিন্ TABLE OF THE STREET, BASE erature de la legación de la રિકાર કે પ્રિકાર પ્રાપ્ત એ કે કરી તે એ કે કે પ્રાપ્ત કરી મોટા કરો અને કે કે કે જ કે પ્રાપ્ત કે પ્રાપ્ત કરાવા છ - આ કે કે 100 આપ્ર કરે પ્રાપ્ત કે કે કે કે કે કે પ્રાપ્ત કે કે કે કે કા કાલા અને કે કો આવા છે. આ કાર્યા કે કે ### **Biographical Note** The author was born on June 26, 1952 in Middletown, Connecticut, where she spent her childhood. Following her graduation from Middletown High School, she attended Cornell University, enrolling in the College of Arts and Sciences. She received her A.B. degree cum laude in physics and with distinction in all subjects from Cornell in June 1974. In the following September, she began her graduate work in the Department of Electrical Engineering and Computer Science at Massachusetts Institute of Technology. She joined the Theory of Computation research group in the Laboratory for Computer Science, where she worked under the supervision of Ronald Rivest. She completed her M.S. and E.E degrees in June 1977. The author is married to Michael Lipkowitz. She will continue her residence in Massachusetts while joining the Computer Science faculty of Brown University as a visiting assistant professor.