Last update:
Sat Oct 14 13:25:31 MDT 2017
E. C. Anderson and
Y. Saad Solving sparse triangular systems on
parallel computers . . . . . . . . . . . 73--96
Z. Bai and
J. Demmel On a Block Implementation of Hessenberg
Multishift QR Iteration . . . . . . . . 97--112
J. Prost and
M. Becker Modeling Methodology for Performance
Evaluation of Parallel Architectures a
Case Study, LCAP . . . . . . . . . . . . 563--601
Takayuki Ito and
Kunihiko Fukushima and
Sei Miyake Realization of a neural network model
Neocognitron on a hypercube parallel
computer . . . . . . . . . . . . . . . . 1--16
H. Meijer and
S. Akl Parallel Binary Search with Delayed Read
Conflicts . . . . . . . . . . . . . . . 17--22
O. A. McBryan Optimization of Connection Machine
Performance . . . . . . . . . . . . . . 23--48
S. H. Cheon and
J. W. Cho Hybrid Combinators for Distributed
Evaluation of Functional Programs . . . 49--68
S. Raman and
L. M. Patnaik An Annealing-Based Circuit Partitioner
for Hypercube Architecture: Design and
Performance Evaluation . . . . . . . . . 49--68
R. A. Pearson and
P. E. Stokes Vector Evaluation in Induction
Algorithms . . . . . . . . . . . . . . . 85
J. Erhel Sparse Matrix Multiplication on Vector
Computers . . . . . . . . . . . . . . . 101--116
J. P. Sheu and
W. T. Chen and
H. M. Su A Hierarchical Multiprocessor Structure
Based on Multistage Networks . . . . . . 117--132
J.-M. Drouffe and
K. J. M. Moriarty FORTRAN Program for the Four-Dimensional
Ising Model . . . . . . . . . . . . . . 133--142
S. Doi and
A. Lichnewsky Some Parallel and Vector Implementations
of Preconditioned Iterative Methods on
Cray-2 . . . . . . . . . . . . . . . . . 143--180
P. Pandey and
C. Kenney and
A. J. Laub A Parallel Algorithm for the Matrix Sign
Function . . . . . . . . . . . . . . . . 181--192
C. Eisenbeis and
W. Jalby and
A. Lichnewsky Compiler Techniques for Optimizing
Memory and Register Usage on the Cray 2 193
C. Farhat Redesigning the Skyline Solver for
Parallel/Vector Supercomputers . . . . . 223--238
D. E. Womble and
B. C. Young A Model and Implementation of Multigrid
for Massively Parallel Computers . . . . 239--256
C. Siva Ram Murthy Parallel Iterative Solution of Large
Linear Systems on Hypercubes . . . . . . 257--264
S. Srinivas and
A. Basu and
K. G. Kumar and
others Studies on the Performance of a Parallel
Iterative Algorithm on Transputer Arrays 265
M. Schutz and
S. Wulfert and
S. Leutwyler Monte Carlo Simulations of Molecular
Clusters: From Scalar to Parallel . . . 289--310
S. F. McCormick and
U. Rüde On Local Refinement Higher Order Methods
for Elliptic Partial Differential
Equations . . . . . . . . . . . . . . . 311--334
R. Sarno and
V. C. Bhavsar and
E. M. A. Hussein Generation of Discrete Random Variables
on Vector Computers for Monte Carlo
Simulations . . . . . . . . . . . . . . 335--350
S. Raman and
L. M. Patnaik and
R. Mall Parallel Implementation of Circuit
Simulation . . . . . . . . . . . . . . . 351--374
J. W. Park and
S. R. Maeng and
J. W. Cho Line Accessible Memory System for Image
Processing . . . . . . . . . . . . . . . 375
S. Vandewalle and
R. van Driessche and
R. Piessens The Parallel Performance of Standard
Parabolic Marching Schemes . . . . . . . 1--30
K. Gallivan and
W. Jalby and
A. Malony and
others Performance Prediction for Parallel
Numerical Algorithms . . . . . . . . . . 31--62
Tao Li Parallel imprecise iterative deepening
for combinatorial optimization . . . . . 63--76
T. Arita and
M. Sowa High Speed Synchronization for a
Statically Scheduled Superscalar
Processor . . . . . . . . . . . . . . . 77
S. W. Hammond and
R. Schreiber Efficient ICCG on a Shared Memory
Multiprocessor . . . . . . . . . . . . . 1--22
M. D. Feng and
C. K. Yuen A Transputer-based Parallel Lisp
Implementation . . . . . . . . . . . . . 23--42
G. H. Chen and
M. S. Chern A. Pipelined Algorithm for
Multiple-Choice $ 0 a n d 1 $ Knapsack
Problem . . . . . . . . . . . . . . . . 43--48
K. Kuwahara Flow Simulation on Supercomputers and
its Visualization . . . . . . . . . . . 49
K. R. Bennett and
G. Fairweather A Parallel Boundary Value ODE Code for
Shared-Memory Machines . . . . . . . . . 71--86
P. Arbenz and
H. P. Luthi and
J. E. Mertz and
W. Scott Applied Distributed Supercomputing in
Homogeneous Networks . . . . . . . . . . 87--108
H. K.-C. Chang and
J. J.-R. Chen A. Parallel Algorithm for the Knapsack
Problem with Memory/Processor Tradeoff $
M^2 P = O(2^{n and 2}) $ . . . . . . . . 109--120
O. Kang and
B. M. Kim and
H. Yoon and
others Heuristic Subcube Allocation with Task
Migration in Hypercube Systems . . . . . 121--142
M. Mu and
J. R. Rice Row Oriented Gauss Elimination on
Distributed Memory Multiprocessors . . . 143--168
C. V. Sastry and
J. Pradhan Lower Bounds to the External Pathlength
of a Lopsided Binary Tree . . . . . . . 169--178
K. Bogineni and
P. W. Dowd An Optically Interconnected Distributed
Shared Memory System: Architecture and
Performance Analysis . . . . . . . . . . 179--212
C.-P. Chung and
W.-Y. Lin Vectorization of Sorting Algorithms . . 213--232
K. Zhang An Experiment with a Logic Program
Execution Model on the Transputer . . . 233
K. Lee On the Floating Point Performance of the
i860\TM Microprocessor . . . . . . . . . 251--268
K. Lee Achieving High Performance on the
i860\TM Microprocessor with Naspack
Subroutines . . . . . . . . . . . . . . 269--288
S. Arunkumar and
T. Chockalingam Randomized Heuristics for the Mapping
Problem . . . . . . . . . . . . . . . . 289--300
H.-C. Chou and
C.-P. Chung Upper Bound Analysis of Scheduling
Arbitrary-Delay Instructions on Typed
Pipelined Processors . . . . . . . . . . 301--312
J. H. Nang and
S. R. Maeng An AND-parallel Computational Model and
its Abstract Machine for the Parallel
Execution of Functional Logic Language 313--331 (or 313--332??)
R. B. Shu and
M. S. Kankanhalli and
W. G. Wang Worst and Average Case Evaluation of
Heuristics for Multi-Processor
Scheduling . . . . . . . . . . . . . . . 333--344
N. H. Naik and
V. K. Naik and
M. Nicoules Parallelization of a Class of Implicit
Finite Difference Schemes in
Computational Fluid Dynamics . . . . . . 1--50
D. H. Bailey Experience with Parallel Computers at
NASA Ames . . . . . . . . . . . . . . . 51--62
G.-R. Hoffmann Weather Forecasting and Parallel
Processing: A View from ECMWF . . . . . 63--70
E. V. Prasad and
A. K. Sarje Performance Evaluation of Multiprocessor
System Modelled as $t$-out-of-$s$ System 71--88
J. Y. Yang and
L. Bic and
A. Nicolau A Mapping Strategy for MIMD Computers 89
Arvind and
S. Brobst The Evolution of Dataflow Architectures:
From Static Dataflow to P-RISC . . . . . 125--154
T. Toffoli and
N. Margolus Programmable Matter: Concepts and
Realization . . . . . . . . . . . . . . 155--170
R. S. Nikhil The Parallel Programming Language Id and
its Compilation for Parallel Machines 171--223 (or 171--224??)
J. N. Kok Traces, Histories and Streams in the
Semantics of Nondeterministic Data Flow 225--242
G. W. Sabot Paralation Programming . . . . . . . . . 243--270
A. Nicolau Massive Parallelism and Fine-Grain
Parallelism: Are They Incompatible? . . 271--292
J. P. Katoen and
M. Rem Recognizing $K$-Rotated Segments . . . . 293
M. Furnari Memory Systems and Massive Parallel
Symbolic Computation . . . . . . . . . . 307--326
D. R. Lester and
G. L. Burn An Executable Specification of the
HDG-Machine . . . . . . . . . . . . . . 327--378
T. Gross An Overview of Programming the iWarp
System . . . . . . . . . . . . . . . . . 379--402
W. Wilcke MIMD Computers for Scientific
Applications . . . . . . . . . . . . . . 403--412
C. D. Polychronopoulos Parallel Programming Issues . . . . . . 413--474
M. Garber The TC2000 System --- A Large Scale,
Shared Memory, Multiprocessor . . . . . 475--490
D. L. Waltz Massively Parallel AI . . . . . . . . . 491
L. Dagum Parallel Integer Sorting with Medium and
Fine-Scale Parallelism . . . . . . . . . 503--522
R. Schreiber An Assessment of the Connection Machine 523--536
E. Rothberg and
A. Gupta An Evaluation of Left-Looking,
Right-Looking and Multifrontal
Approaches to Sparse Cholesky
Factorization on Hierarchical-Memory
Machines . . . . . . . . . . . . . . . . 537--594
T. Bemmerl and
B. Ries Programming Tools for Distributed
Multiprocessor Computing Environments 595--616
A. Khemka and
R. K. Shyamasundar Multiprocessor Scheduling of Periodic
Tasks in a Hard Real-Time Environment 617--636
C. Battista and
S. Cabasino and
F. Marzano and
others The APE-100 Computer: (I) The
Architecture . . . . . . . . . . . . . . 637--656
D. S. Reeves and
E. F. Gehringer Adaptive Routing for Hypercube
Multiprocessors: A Performance Study . . 1
L. W. Dowdy and
M. R. Leuze On Modeling Partitioned Multiprocessor
Systems . . . . . . . . . . . . . . . . 31
A. H. Karp Some Experiences with Network LINDA . . 55
J. Z. Li and
J. Srivastava An Array Assignment Method for Parallel
Computing Systems . . . . . . . . . . . 81
R. Mall and
S. Raman and
L. M. Patnaik A Parallel Simulated Annealing-Based
Channel Router . . . . . . . . . . . . . 101
D. A. Menasce and
S. C. Da Silva Porto and
S. K. Tripathi Static Heuristic Processor Assignment in
Heterogeneous Multiprocessors . . . . . 115
S. L. Min and
J. H. Nam and
M. S. Park and
others Cache-Based Data Distribution
Constrained Scheduling . . . . . . . . . 139
P. Ciarlet Implementation of a Domain Decomposition
Method Well-Suited for (Massively)
Parallel Architectures . . . . . . . . . 157
S. Chatterjee Programming Models, Compilers, and
Algorithms for Irregular Data-Parallel
Computations . . . . . . . . . . . . . . 183--222
Y. Yaacoby and
P. Cappello Bounded Broadcast in Systolic Arrays . . 223
M. W. Berry and
B. N. Parlett and
A. H. Sameh Computing Extremal Singular Triplets of
Sparse Matrices on a Shared-Memory
Multiprocessor . . . . . . . . . . . . . 239--276
W. D. Joubert and
G. F. Carey Embedded Gradient Iterative Solution of
a Class of Nonlinear PDE's on the
Connection Machine . . . . . . . . . . . 277
M. M. Eshaghian and
M. E. Shaaban Cluster-M Parallel Programming Paradigm 287
S. Olariu and
J. L. Schwing and
J. Y. Zhang Data Movement Techniques on
Reconfigurable Meshes, with Applications 311
V. Venkatakrishnan Parallel Computation of $ A x $ and $
A^T x $ . . . . . . . . . . . . . . . . 325
W. F. Wong and
E. Goto A Simulation Study on the Interactions
Between Multithreaded Architectures and
the Cache . . . . . . . . . . . . . . . 343
Y.-H. Shiau and
C.-P. Chung Effects and Handling of Instruction
Class Contention in Superscalar
Processing . . . . . . . . . . . . . . . 357
W.-Y. Lin and
C.-L. Chen A Parallel Algorithm for Solving
Tridiagonal Linear Systems on
Distributed-Memory Multiprocessors . . . 375
W. P. Petersen Lagged Fibonacci Series Random Number
Generators for the NEC SX-3 . . . . . . 387
K. E. Gates and
W. P. Petersen A Technical Description of Some Parallel
Computers . . . . . . . . . . . . . . . 399
J. Nang A Simple Parallelizing Scheme of Genetic
Algorithm on Distributed-Memory
Multiprocessors . . . . . . . . . . . . 451
T. Johnson and
A. Colbrook A Distributed, Replicated, Data-balanced
Search Structure . . . . . . . . . . . . 475
B. Djokic and
J. Ruppert and
I. Stojmenovic Constant Time Digital Geometry
Algorithms on the Scan Model of Parallel
Computation . . . . . . . . . . . . . . 501
H.-H. Wang and
R.-C. Chang A Hybrid Coherence Scheme for Software
Distributed Shared Memory . . . . . . . 519
S. R. Seidel and
M. A. Davis Global Synchronization Algorithms for
the Intel IPSC/860 . . . . . . . . . . . 537
J. Park and
H. Yoon and
H. Lee and
others The Ring-Banyan Network: A Fault
Tolerant Multistage Interconnection
Network for Multiprocessor Systems . . . 557
I. Gottlieb and
A. Herold Task Distribution on a Butterfly
Multiprocessor . . . . . . . . . . . . . 1
P. R. Amestoy and
M. J. Daydé and
Iain S. Duff and
P. Mor\`ere Linear Algebra Calculations on a Virtual
Shared Memory Computer . . . . . . . . . 21
S. C. S. Porto and
C. C. Ribeiro A Tabu Search Approach to Task
Scheduling on Heterogeneous Processors
Under Precedence Constraints . . . . . . 45
B. Hendrickson and
R. Leland and
S. Plimpton An Efficient Parallel Algorithm for
Matrix-Vector Multiplication . . . . . . 73
J.-H. Chung and
H. Yoon and
S. R. Maeng Restriction-Free Adaptive Wormhole
Routing in Multicomputer Networks . . . 89
S. L. Chung and
R. Setiono Efficient Neural Network Training on a
Cray Y-MP . . . . . . . . . . . . . . . 109
M. D. Feng and
Y. Q. Gao and
C. K. Yuen Implementing Linda Tuplespace on a
Distributed System . . . . . . . . . . . 125
J. Song and
R. Shu Parallel and Pipelined Parallel
Consecutive Sums on a Hypercube with
Application to Ray Casting . . . . . . . 145
L. Giraud Block Preconditioned Conjugate Gradient
Methods on a Distributed Virtual Shared
Memory Multiprocessor . . . . . . . . . 161--190
L. Yu. Kolotilina and
A. Yu. Yeremin Factorized Sparse Approximate Inverse
Preconditioning II: Solution of $3$D FE
Systems on Massively Parallel Computers 191--216
T. Nagai Benchmarking Fortran Intrinsic Functions 217--230
Y. M. Wang and
R. C. Chang A Minimal Synchronization Overhead
Affinity Scheduling Algorithm for
Shared-Memory Multiprocessors . . . . . 231--250
A. K. Gupta and
H. Wang On Improving the Performance of Tree
Machines . . . . . . . . . . . . . . . . 251--264
A. Sethi and
S. Biswas and
A. Sanyal Extensions to Cycle Shrinking . . . . . 265--284
A. Tridgell and
R. P. Brent A General-Purpose Parallel Sorting
Algorithm . . . . . . . . . . . . . . . 285--302
Y. Ben-Asher and
A. Schuster and
J. F. Sibeyn Load Balancing: A Programmer's Approach
or The Impact of Task-Length Parameters
on the Load Balancing Performance of
Parallel Programs . . . . . . . . . . . 303
Anonymous Instructions for Typesetting
Camera-ready Manuscripts . . . . . . . . ??
R. T. Scalettar and
K. J. Runge and
J. Correa and
others Simulations of Interacting Many Body
Systems Using p4 . . . . . . . . . . . . 327--350
M. Vijay Fault Tolerant Systolic Evaluation of
Polynomials and Exponentials of
Polynomials for Equispaced Arguments
Using Time Redundancy . . . . . . . . . 351--364
D. Beal and
C. Lambrinoudakis Design of a Processor Element for a High
Performance Massively Parallel SIMD
System . . . . . . . . . . . . . . . . . 365--390
P. Z. Lee Parallel Matrix Multiplication
Algorithms on Hypercube Multiprocessors 391--406
R. L. Cheng and
C. P. Chung An Approximate Agreement Algorithm for
Wraparound Meshes . . . . . . . . . . . 407--420
N. P. Lu and
C. P. Chung Memory System Design in Superscalar
Processing . . . . . . . . . . . . . . . 421--444
J. Pradhan and
C. V. Sastry On Optimal Weighted Binary Trees . . . . 445--464
M. N. Dorojevets and
V. G. Oklobdzija Multithreaded Decoupled Architecture . . 465
Anonymous Instructions for Typesetting
Camera-ready Manuscripts . . . . . . . . ??
S. Ahuja and
A. K. Sarje Processor Allocation in Extended
Hypercube Multiprocessor . . . . . . . . 481--488
M. Hamdi and
Y. Pan and
W. T. Kwong Efficient Image Processing Applications
on the MasPar Massively Parallel
Computers . . . . . . . . . . . . . . . 489--514
C. F. Lin and
S. J. Horng and
T. W. Kao Generalized Rotate Sort on
Mesh-Connected Computers with Multiple
Broadcasting Using Fewer Processors . . 515--530
Tao Li and
Lixin Tao Topological Feature Maps on Parallel
Computers . . . . . . . . . . . . . . . 531--546
X-H Sun and
R. D. Joslin A Parallel Prefix Algorithm for Almost
Toeplitz Tridiagonal Systems . . . . . . 547--576
S. Waser and
H. Burkhart A Case Study of Parallel Processing:
Informatics vs Scientific Computing . . 577--594
A. Ferreira Work and Memory-Efficient Parallel
Algorithms for the Knapsack Problem . . 595--606
Anonymous Instructions for Typesetting
Camera-ready Manuscripts . . . . . . . . ??
B. Bhanot and
J. Janak and
R. Walkup and
V. Sonnad Hierarchical decomposition: A parallel
implementation of the Barnes--Hut tree
algorithm . . . . . . . . . . . . . . . 1--12
C-Y Chang and
J-Y Tzeng and
J-P Sheu Design and implementation of a Fortran
assistant tool for vector compilers . . 13--46
D. P. Vidyarthi and
A. K. Tripathi Precedence-constrained task allocation
in distributed computing systems . . . . 47--56
R. Sinclair Optimization of reciprocals and square
roots on the i860 microprocessor . . . . 57--64
Shyong Jian Shyu and
H. K.-C. Chang and
K.-C. Chou Implementation of a linear quadtree
coding scheme on the parallel virtual
machine . . . . . . . . . . . . . . . . 65--79
J. Sjyu and
H. K-C Chang and
K-C Chou Implementation of a linear quadtree
coding scheme on the parallel virtual
machine . . . . . . . . . . . . . . . . 65--80
E. D. Deamine and
S. Srinivas A. novel routing algorithm for $k$-ary
$n$-cube interconnection networks . . . 81--92
M. Levy and
N. Pesky and
S. Solomon The complex dynamics of a simple stock
market model . . . . . . . . . . . . . . 93
K-C Li and
K. Zhang A Performance Adviser for the
Development of Parallel Programs . . . . 205--228
L. Kazerouni and
B. Rajan and
R. K. Shyamasundar Mapping Linear Recurrence Equations onto
Systolic Architectures . . . . . . . . . 229--270
T. Yang and
A. Gerasoulis Executing Scheduled Task Graphs on
Message-Passing Architectures . . . . . 271--294
R. Sarno and
V. C. Bhavsar and
E. M. A. Hussein A Comparison of Vectorizable Discrete
Sampling Methods in Monte Carlo
Applications . . . . . . . . . . . . . . 295
H-R Tsai and
S-J Horng and
S-S Lee and
others Parallel Sorting Algorithms on a
Hyper-Channel Broadcast Communication
Model . . . . . . . . . . . . . . . . . 307--318
J. Z. Lou and
R. Ferraro A Parallel Three-Dimensional
Incompressible Navier--Stokes Solver
with a Parallel Multigrid Kernel . . . . 319--346
J. Park and
H. Yoon Design and Performance Analysis of
Multistage Interconnection Networks
Using a Recursive Multicast Algorithm 347--362
A. K. Tripathi and
D. P. Vidyarthi and
A. N. Mantri A Genetic Task Allocation Algorithm for
Distributed Computing Systems
Incorporating Problem Specific Knowledge 363--370
T-W Kao and
S-J Horng and
Y-H Guo Constant Time Algorithms for Graph
Connectivity Problems on Reconfigurable
Meshes Using fewer Processors . . . . . 371--386
D-C Lou and
C-G Chang A Cost Optimal Search Technique for the
Knapsack Problem . . . . . . . . . . . . 1--12
S. H. Noh and
A-M Moon A Row Based Parallel Gaussian
Elimination Algorithm for the Connection
Machine CM-2 . . . . . . . . . . . . . . 13--24
J-W Jang and
K. G. lee Fast Parallel Radix Sort Using a
Reconfigurable Mesh . . . . . . . . . . 25--40
G. D. Ghare and
L. M. Patnaik Implementation of ART1 and ART2
Artificial Neural Networks on Ring and
Mesh Architectures . . . . . . . . . . . 41--56
K. N. Balasubramanya Murthy and
C. Siva Ram Murthy A New Bidirectional Cholesky
Factorization Algorithm for Parallel
Solution of Sparse Symmetric Positive
Definite Systems . . . . . . . . . . . . 57--72
W-S Luk and
O. Wing Waveform Krylov Subspace Methods on a
Massively Parallel Computer . . . . . . 73
A-E Al-Ayyoub and
K. Day Parallel Solution of Dense Linear
Systems on the $k$-ary $n$-Cube Networks 85--100
M. Amor and
J. Lopez and
F. Arguello and
others Mapping Tridiagonal System Algorithms
onto Mesh Connected Computers . . . . . 101--126
G. A. Papadopoulos and
F. Arbab Coordination Of Distributed and Parallel
Activities in the IWIM Model . . . . . . 127--160
M. C. Rinard Locality Optimizations for Parallel
Computing Using Data Access Information 161
K-L Chung and
Y-W Chen Mapping Pipelined Divided-Difference
Computations into Hypercubes . . . . . . 181--190
W. W. Johnston Rationale and Strategy for a 21st
Century Scientific Computing
Architecture: The Case for Using
Commercial Symmetric Multiprocessors as
Supercomputers . . . . . . . . . . . . . 191--222
C. K. Sandalci and
C. K. Koc and
S. M. Goodnick Three-Dimensional Monte Carlo Device
Simulation with Parallel Multigrid
Solver . . . . . . . . . . . . . . . . . 223--236
H. Song and
B. Kwon and
J-Y Kim H. Yoon Two Real-Time Flow Controls in Wormhole
Networks . . . . . . . . . . . . . . . . 237--258
K. Gopalan and
C. Siva Ram Murthy New Parallel Algorithms for Direct
Solution of Sparse Linear Systems: Part
I --- Symmetric Coefficient Matrix . . . 259--290
K. Gopalan and
C. Siva Ram Murthy New Parallel Algorithms for Direct
Solution of Sparse Linear Systems: Part
II --- Non-Symmetric Coefficient Matrix 291--310
K. Gopalan and
C. Siva Ram Murthy An Improved Mapping of Cyclic
Elimination onto Hypercubes Using Data
Replication . . . . . . . . . . . . . . 311--336
W. Y. Lee and
S. J. Hong and
J. Kim Dynamic Load Distribution on Meshes with
Broadcasting . . . . . . . . . . . . . . 337--358
K-P Fan and
C-T King Efficient Multicast on Wormhole
Switch-Based NOWP . . . . . . . . . . . 359--380
A. Jayadevan and
L. M. Patnaik Fault-Tolerant Characteristics and
Topological Properties of a Hierarchical
Network of Hypercubes . . . . . . . . . 1--18
T. Rauber and
G. Rünger and
C. Scholtes Scalability of Sparse Cholesky
Factorization . . . . . . . . . . . . . 19--52
C.-C. Wu and
C. Chen Grouping Memory Consistency Model for
Parallel-Multithreaded Shared-Memory
Multiprocessor Systems . . . . . . . . . 53--82
Y. Atif Dynamic Load Assignment of Real-Time
Tasks in Distributed Memory
Multiprocessors . . . . . . . . . . . . 83--114
A. V. Veidenbaum and
Q. Zhao and
A. Shameer Non-Sequential Instruction Cache
Prefetching for Multiple-Issue
Processors . . . . . . . . . . . . . . . 115--140
M. Yaqub and
Q. Shaikh and
S. S. Ahmad A Shared Memory Multiprocessor System
for the Recognition of Solid Objects . . 141--152
M. Giordano and
M. M. Furnari and
R. Napolitano and
A. Spagnolo Constant Propagation in a Hierarchical
Intermediate Program Representation . . 153--184
T. Rauber and
G. Rünger Diagonal-Implicitly Iterated
Runge--Kutta Methods on Distributed
Memory Machines . . . . . . . . . . . . 185--208
H. Hoteit and
R. Sauleau and
B. Philippe and
Ph. Coquet and
J. P. Daniel Vector and Parallel Implementations for
the FDTD Analysis of Millimeter Wave
Planar Antennas . . . . . . . . . . . . 209--234
Vikas B. Bajaj and
Anil K. Sarje A Comparative Study of Two Self Healing
Protocols for ATM Networks . . . . . . . 235--256
Tzung-Pei Hong and
Jyh-Jong Lee Parallel Neural Learning for Control
Problems on a Bus-Based Architecture . . 257--274
S. K. Jena An Improved Lempel--Ziv Algorithm for
Sequential Data Compression . . . . . . 275--284
Sangback Ma Comparisons of the Parallel
Preconditioners on the CRAY-T3E for
Large Nonsymmetric Linear Systems . . . 285--300
Toshinori Sato A Simulation Study of Combining Load
Value and Address Predictors . . . . . . 301--326
Jen-Chih Lin Simulation of Cycles in the IEH Graph 327--342
Kyungwan Nam and
Sunggu Lee and
Jong Kim Path Selection for Real-Time
Communication in Wormhole Networks . . . 343--360
B. Prameela and
L. M. Patnaik Parallel Implementation of Alternate
Quadrant Interlocking Factorisation
Method on Star Topology . . . . . . . . 361--378
Longsong Lin Balanced Generalized Hypercubes:
Complexity and Cost/Performance Analysis 379--398
Longsong Lin Balanced Generalized Hypercubes: Optimal
Communication Algorithms . . . . . . . . 399--426
Hung-Chang Hsiao and
Chung-Ta King and
Wei-Kuo Chen and
Hsian-Hsiung Lin and
Chien-Chao Tseng A Simulation Toolkit for x86-Compatible
Processors $ 3 / 4 $ Xsim . . . . . . . 427--446
Steven Huss-Lederman and
Enrique S. Quintana-Ortí and
Xiaobai Sun and
Yuan-Jye Y. Wu Parallel Spectral Division Using the
Matrix Sign Function for the Generalized
Eigenproblem . . . . . . . . . . . . . . 1--14
Keqin Li and
Xian-He Sun Average-Case Analysis of Isospeed
Scalability of Parallel Computations on
Multiprocessors . . . . . . . . . . . . 15--36
Paul N. Swarztrauber The Vector Multiprocessor . . . . . . . 37--54
Ole Mòller Nielsen and
Markus Hegland Parallel Performance of Fast Wavelet
Transforms . . . . . . . . . . . . . . . 55--74
Michael K. Ng $K$-Means-Type Algorithms on Distributed
Memory Computer . . . . . . . . . . . . 75--92
Juil Oh and
Ju-Wook Jang and
Tack-Don Han Overhead Reduction in the Network
Communication for Web Computing . . . . 93--110
Geyong Min and
Hamid Sarbazi-Azad and
Mohamed Ould-Khaoua Performance Analysis of $k$-ary $n$-Cube
Networks with Pipelined Circuit
Switching . . . . . . . . . . . . . . . 111--127
S. R. Sathe and
P. M. Nawghare Computation Time and Idle Time of Tiling
Transformation on a Network of
Workstations . . . . . . . . . . . . . . 129--149
Jau-Der Shih An Adaptive Fault-Tolerant Wormhole
Routing Algorithm for Hypercubes . . . . 151--166
Jae Bum Lee and
Chu Shik Jhon Improving the Execution Efficiency of
Barrier Synchronization in Software DSM
Through Static Analysis . . . . . . . . 167--188
Jen-Chih Lin Embedding Hamiltonian Cycles, Linear
Arrays and Rings in a Faulty Supercube 189--201
Anil Kumar Tripathi and
Biplab Kumer Sarker and
Naveen Kumar and
Deo Prakash Vidyarthi A GA Based Multiple Task Allocation
Considering Load . . . . . . . . . . . . 203--214
Yunkyung Lee and
Sunggu Lee Path Selection Algorithms for Real-Time
Communication . . . . . . . . . . . . . 215--222
Der-Lin Pean and
Cheng Chen Enhanced Linked-Based Cache Coherence
Protocols with a Hardware Mechanism to
Reduce the Migratory Sharing Overhead 223--252
Anonymous Author Index: Volume 11 (2000) . . . . . 253--254
Christian H. Bischof and
H. Martin Bücker and
Po-Ting Wu Time-Parallel Computation of
Pseudo-Adjoints for a Leapfrog Scheme 1
J. Al-Sadi and
K. Day and
M. Ould-Khaoua A New Fault-Tolerant Routing Algorithm
for $k$-ary $n$-Cube Networks . . . . . 29
Sangback Ma Comparisons of the Parallel
Preconditioners for Large Nonsymmetric
Sparse Linear Systems on a Parallel
Computer . . . . . . . . . . . . . . . . 55
Paul N. Swarztrauber The Communication Machine . . . . . . . 69