Part 1 Research Summary


1.0 Our Research Objectives and Approach

The major goals of our STC are to establish a better scientific foundation for future computer graphics and to help create the basic framework for future interactive graphical and multimedia environments. As we move on to the next generation of computing systems, we will need to improve computational speeds for display, incorporate physical behavior into our models, extend the definitions of our models to incorporate manufacturing processes, move beyond the current generation of WIMP (windows, icons, menus, and pointing) user interfaces into post-WIMP user interfaces, scale our systems to handle more complex environments, reduce design cycle times, and store, manage, access, and transmit larger amounts of data. Finally, it is necessary to guarantee the validity and accuracy of our simulations according to each application's needs, particularly in medical, scientific, and engineering areas.

To achieve these goals, our comprehensive strategic plan is focused on four basic areas of computer graphics: modeling, rendering, user interfaces and high-performance architectures. We have made significant progress in each of these domains in the past year. Three basic applications have helped focus and direct these areas: scientific visualization, virtual environments, and telecollaboration. The Center's research thus expands applications' capabilities while being focused in useful directions by the needs of these applications. The Center also works to improve standards efforts. Its broad knowledge of the state of the art of computer graphics continues to prove helpful in directing standards efforts towards good solutions while preventing premature standardization in open research areas.

The principal characteristic of our research is improvement in the accuracy and fidelity of models used in computer graphics. This goal requires generality in our representations and simulations. As contrasted with the previous two decades of computer graphics research, we are now focusing on experimental validation rather than simply ``looking good.'' We have built test environments and models and are comparing simulations to measured physical behavior in order to determine the precise accuracy of current approaches. This traditional strategy of experimentally validating scientific hypotheses is necessary to establish the fundamental bases for future improvements in computer graphics technology. But this strategy is relatively new to the field of computer graphics, and represents a significant shift in methodology.

A second characteristic is our focus on efficiency. The field is moving inexorably towards more complex environments and their increasing demands for interactivity and time-critical computing, especially for virtual environments, in which significant lag can cause motion sickness. Time-critical computing (or TCC) is the idea that an approximate result delivered on time is better than a fully accurate result delivered too late. Procedures for improving computational speeds and computing within a known length of time depend on predictability and on the determination of error bounds. Thus, the goals of accuracy and computational efficiency are intimately related, indeed inseparable. The drive for efficiency leads to both a better scientific foundation and a stronger framework for future applications.

A third characteristic is our focus on strategic research in support of applications. The Center seeks to provide high-fidelity, high-efficiency techniques for use in applications of the next ten years. For example, the area of virtual environments has driven our research on trackers, but much of this work has been of a fundamental nature, exploring prediction techniques to prevent lag that apply to the entire graphics system. By setting our research in the context of applications, we help address the Center's goal of providing a framework for the future.

Finally, while it is helpful to organize the research of the Center by dividing it into categories, a great deal of synergy and overlap exists between the various research areas. For example, user interface technology begins to affect modeling technology when scientists developing new modeling methods strive to develop techniques with more user-oriented parameters. Many of the high-performance architecture projects are driven by needs in scientific visualization. As discussed below, our recent focus on telecollaboration is proving to be a fertile ground for intersite work.

2.0 Changes in Research Direction

By far the biggest change in our research direction has been the identification of collaboration within an immersive environment as a vital new field of research. We call this telecollaboration in order to distinguish it from previous work in computer-supported collaborative work (CSCW), which has focused on desktop-oriented applications such as shared whiteboards. Our vision is of a shared or multiparticipant immersive environment that provides a sense of presence -- participants should ideally feel as if they are within the shared environment and that the other participants are there as well, a marked difference from the experience of even the best video conferencing setups.

There are two major components to an immersive telecollaborative system: acquisition of (mostly visual and auditory) information at one site, and presentation of the information at a remote site. We feel that the Center is particularly strong in several research areas that are key to providing these system components.

First, the experience that we have in understanding global illumination and in measuring reflectance properties of objects should greatly aid in extracting models from video. A long-range goal of this project is to send geometry as well as video of objects at one site to remote sites where they can be re-displayed from an arbitrary position. Second, the Center is a leader in building the hardware components necessary for immersing a user in a virtual environment: head tracking, image generation and image display. Furthermore, new developments in rendering scenes from images captured from video (so-called image-based rendering) may provide a useful alternative to surface-based rendering methods for telecollaboration. Another key aspect to successful presentation of a shared environment is responsiveness, that is, the immediate display of the actions of each participant. Providing this responsiveness requires research into time-critical computing -- research that is already being performed by the STC. Finally, we have identified the goal of shared mechanical design as a driving application for such a system. The Alpha_1 project at the Center's Utah site will serve as a point of departure for creating a shared design system in which mechanical designers at remote locations can work together.

Research in telecollaboration, while a new topic for the Center, induces a great deal of synergy: it is an area of research that overlaps the specialties of the different sites in the center. Thus, we have decided that research in telecollaboration is an effective, focused direction in which the Center should proceed. The results of this research will be applied immediately and regularly (as the televideo link is today) for all aspects of the Center's operation, from research to administration to education to outreach. We are in the early stages of this research project and the Director plans to support it with funding from the Director's Pool (see the Budget Section).

Telecollaboration is something every participant of the Center has experienced through the televideo link. We all know the problems with the current state of the art, and are excited by its potential. As telecollaboration includes research topics from a wide range of computer graphics, it is something that can be identified as a Center project, as opposed to a project of individual sites of the Center.

3.0 Research Accomplishments of the Past Year

3.1 Modeling

The Center has developed an ever-enlarging sense of what is encompassed by modeling, one that includes not only geometry, constraints, animation, and behavior of objects, but also models of light reflectance and transport phenomena, of interfaces, of inter-object interaction (e.g., developmental modeling), and of perceptual phenomena. This section of the report therefore primarily addresses models of objects, while other kinds of models, such as models of lighting phenomena, may be found later in the report.

A fundamental challenge in computer graphics and visualization is developing adequate models to represent the complex objects that arise in mechanical design and scientific computation. There is a greater need than ever to represent and control complex constrained systems of rigid, flexible, and irregularly shaped objects. Unfortunately, complex model generation is still a major bottleneck.

In domains such as computer aided design, we need geometric models that are rich enough to support all fundamental aspects of scientific and engineering activities. The system should allow the user to concentrate on primary goals without distractions. To that end, long-term Center research goals are to:

In all cases, we seek to investigate modeling deficiencies through developing real-world scenarios. Through our interactions with working designers, we are informed about what operations designers need that have yet to be developed.

New Design Representations

Because of the complexity of geometric design and modeling, no single representation has been universally suitable. Parametric tensor-product NURBS have become increasingly common, however, because of their representational power and the ease of crafting new operators upon them.

One of the defining features of the NURBS representation, the ``tensor product'' formulation, is also one of its drawbacks. While this formulation lends form and structure, making computation and analysis feasible, it also restricts the regions over which the ``closed'' operators can be defined. Boolean operations (intersection, union) that lead to ``trimmed'' surface models are usually the last operations performed in creating a model: once a spline model has been trimmed by Boolean operations, it can no longer be subjected to warping or physically based operations, since those operations have been defined only for complete spline surface models or for polygon-based models.

As a solution to the problem of torn or trimmed surfaces, we have introduced the torn B-spline surface representation, an approach to designing with partial and nonisoparametric feature curves in sculptured surfaces. We call feature curves along which the surface is discontinuous ``tears'' and those along which only the tangent is discontinuous ``creases.'' We are also developing techniques for manipulating and editing the smooth regions, tears, and creases in a homogeneous way. Finally, to make this representation useful, we must extend the results to complex models consisting of many (possibly trimmed) surfaces. We expect further results in the Center's coming sixth year that will demonstrate the use of this representation in modeling applications of practical interest. We believe that such ``torn surface'' representations will form an essential component of the framework for future geometric and physical modeling applications. [ELLE95]

Modeling Surfaces of Arbitrary Topology Using Manifolds

Graphics has used single-patch parameterizations of objects (e.g., the longitude-latitude parameterization of the sphere) in a great many applications, so that phrases like ``uv-coordinates'' have become part of the field's standard vocabulary. Unfortunately, for surfaces with topology other than that of the plane or the torus, such parameterizations must have singularities; techniques built atop those parameterizations will have either intrinsic or numerical problems at the singular points (pattern mapping is a good example: in a pattern-map onto a sphere, 30% of the pattern maps to only about 13% of the sphere).

The Center has developed a new surface model using ideas from differential geometry and topology. This technique uses the structure of manifolds, developed by mathematicians at the start of the century, for modeling. In these objects, multiple overlapping parameterizations (called charts) are used to describe the topology of an object, and functions defined through these multiple parameterizations then associate geometry with the object.

We have developed systems of charts and functions on the charts that generalize the traditional tensor-product B-spline basis functions on the plane, to make surfaces of complex topology and arbitrary levels of parametric continuity [GRIM95a]. And in analogy with the traditional control-mesh structure of B-spline surfaces, our surface model automatically generates smooth parametric structures from polyhedral sketches of arbitrary topology. This simplifies the complicated underlying mathematical structures to the point where they become practical for novice users. Surfaces with complicated shapes can be easily modeled, and all conventional computer-graphics techniques such as pattern mapping can be applied to them. This model thus provides new expressive power in making arbitrary-degree-of-continuity models. Furthermore, the practical implementation of manifolds may help to restructure the way that surface models in computer graphics are described and represented, thus advancing our goal of rebuilding the fundamentals of graphics.

Self-Adjusting Constrained Optimization

A weakness of all parametric modeling techniques is that either the user must specify the parametric information or else the modeling system must provide it, which is typically done via defaults built into the algorithms. This arises in interpolation problems for both curves and surfaces and in animation, and is a serious issue lying below the surface of most parametric schemes. Few attempts have been made to deal with it, leaving the designer and animator to specify and ``tweak'' values. But knowing the right values to use can require a great deal of technical and mathematical expertise. This is a foundational issue: the parametric representations that are fundamental in much of graphics implicitly generate problems for users; the mathematical tractability of such representations makes them appealing, but the problems associated with the representation are pervasive.

Optimization techniques can help the designer ``fill in'' the gaps between the idea and the finished representation. Given an objective function and a set of (user-specified) constraints, a system can find a result that optimizes for the given criteria by searching in the parameter space. Recently, linear optimization techniques have been used in graphics, animation, and modeling.

This research uses nonlinear constrained optimization to give the designer greater control over important parts of a model. We have added the parameter value(s) of the model as variables to the optimization. These nonlinear variables are part of the optimization and so become self-adjusting constraints. This strengthens the applicability and versatility of constrained optimization techniques. Nonlinear variables free the designer from the underlying parametrization of a model and allow concentration on the desired shape. We have combined more traditional techniques, such as shape editing of sweeps and warps, data fitting, and constrained optimization, with the self-adjusting parametric constraints to make the techniques more responsive to user needs. Center researchers have also done ``parameter-free'' constraint work in animation, posing such constraints as ``be sure to pass through this location, but at any time.'' Such new approaches, which free the user from a limitation of the underlying representations, support the goal of rebuilding foundations: we are working on making previously awkward representational paradigms more tractable for the future.
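
To make the idea concrete, here is a minimal sketch (in Python, with an invented curve, target point, and objective -- not the Center's system) of a self-adjusting parametric constraint: the parameter value at which a cubic Bezier curve must pass through a target point is itself a variable of the nonlinear optimization, so the designer never has to choose it.

    # Minimal sketch: a "self-adjusting" parametric constraint.  The parameter
    # value t at which the curve must hit the target is an optimization
    # variable, not a value the designer has to supply.
    import numpy as np
    from scipy.optimize import minimize

    def bezier(ctrl, t):
        """Evaluate a cubic Bezier curve with four control points at parameter t."""
        b = np.array([(1 - t)**3, 3 * t * (1 - t)**2, 3 * t**2 * (1 - t), t**3])
        return b @ ctrl

    ctrl0  = np.array([[0., 0.], [1., 2.], [2., 2.], [3., 0.]])  # initial design
    target = np.array([1.5, 0.5])                                # point to pass through

    def unpack(x):
        return x[:8].reshape(4, 2), x[8]          # control points, free parameter t

    def objective(x):
        ctrl, _ = unpack(x)
        return np.sum((ctrl - ctrl0)**2)          # deviate as little as possible

    def constraint(x):
        ctrl, t = unpack(x)
        return bezier(ctrl, t) - target           # curve(t) == target, for some t

    x0  = np.concatenate([ctrl0.ravel(), [0.5]])
    res = minimize(objective, x0, method="SLSQP",
                   constraints=[{"type": "eq", "fun": constraint}],
                   bounds=[(None, None)] * 8 + [(0.0, 1.0)])
    ctrl_opt, t_opt = unpack(res.x)
    print("curve passes through target at t =", round(t_opt, 3))

Because t is free, the optimizer can satisfy the constraint with a smaller change to the control points than if the designer had fixed t in advance.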

Developmental Modeling

In seeking to develop scientifically based modeling techniques, the Center has created a new type of modeling based on multicellular development. Building on the structured modeling techniques of Barzel's 1992 work, our developmental models combine elements of the chemical, cell lineage, and mechanical models of morphogenesis pioneered by Turing, Lindenmayer, and Odell, respectively; our developmental models are useful both for scientific predictions in computational biology (as described in the Ph.D. thesis [FLEIt95]) and in computer graphics modeling applications (as shown in the Siggraph 95 paper on cellular textures by Fleischer et al. [FLEI95]).

Developmental modeling is a cell-based modeling technique in which discrete cells are controlled by regulatory processes expressed as conditional elements. The internal state of each cell in the model is represented by a time-varying state vector that is updated by piecewise differential equations. The differential equations are formulated as a sum of contributions from different sources, describing gene transcription, kinetics, and cell metabolism. Each term in the differential equation is multiplied by a (usually) smooth conditional expression that models regulatory processes specific to the process described by that term.
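
The following minimal sketch (Python, with invented rate constants and only two state components -- not the Center's actual equations) illustrates the form of such a model: each term of the derivative is gated by a smooth conditional expression, and the state vector is advanced by simple numerical integration.

    # Minimal sketch of a gated, piecewise differential equation for one cell.
    import numpy as np

    def sigmoid(x, k=10.0):
        """Smooth conditional: approximately 0 below a threshold, 1 above it."""
        return 1.0 / (1.0 + np.exp(-k * x))

    def d_state(s):
        """s = [gene_product, metabolite]; each term is gated by a conditional."""
        gene, metab = s
        transcription = 0.8 * sigmoid(metab - 0.5)          # gene turns on when metabolite is plentiful
        consumption   = -0.3 * metab * sigmoid(gene - 0.2)  # metabolite consumed once gene product exists
        supply        = 0.1                                 # constant metabolic supply
        return np.array([transcription - 0.05 * gene,       # production minus decay
                         supply + consumption])

    state, dt = np.array([0.0, 0.0]), 0.01
    for _ in range(2000):                                   # forward-Euler integration
        state = state + dt * d_state(state)
    print("state after integration:", np.round(state, 3))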

The resulting model has a broader range of fundamental mechanisms than other developmental models. Since gene transcription is included, the model can represent the genetic orchestration of a developmental process involving multiple mechanisms.

We show that a computational implementation of the model represents a wide range of biologically relevant phenomena in two and three dimensions. This is illustrated by a diverse collection of simulation experiments exhibiting phenomena such as lateral inhibition, differentiation, segment formation, size regulation, and regeneration of damaged structures. The same techniques are useful both for explanations of biological mechanisms and for computer graphics modeling of complex organic phenomena.

3.2 Rendering

Our research in rendering explores a variety of approaches to the problem of creating a synthetic image as quickly and as accurately as possible. This may involve radical prototypes that rethink the entire approach, as in image-based rendering, in which the traditional polygon is replaced by images. Alternatively, creating an accurate image efficiently may involve careful experimentation to determine the best parameters of a lighting model, as in gonioreflectometer measurements of surface reflection properties. In all cases, the research involves improvements to the fundamental science behind rendering, replacing hacks with physically based algorithms verified by experiments.

Image-Based Rendering

We have been exploring a new method of rendering real-world scenes based on models constructed from photographs of the environment. We have constructed a concise framework for discussing these ``plenoptic models,'' our name for this class of techniques. We demonstrate a new member of this class that renders views of an environment by a simple traversal of a cylindrical model of the scene. This method allows rapid rendering of very complex environments such as a cluttered room or an outdoor scene with foliage. We believe that fast hardware can be built based on this technique that will allow such scenes to be rendered in real time.[MCMI95b] [MCMI95c] [MCMI95a]

Global Illumination

We developed a new method for accurately solving the global illumination problem that, in addition to the diffuse interreflections commonly handled by conventional radiosity methods, can also handle energy transport involving arbitrary non-diffuse surfaces. The method uses density estimation techniques and takes advantage of inherent parallelism in its microscopic view of energy transport. The algorithm has been designed for computing solutions of environments with high geometric complexity (as many as hundreds of thousands of initial surfaces). [SHIR95]
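
The core idea can be illustrated with a minimal sketch (Python, with invented particle data on a single unit patch -- not the Center's algorithm): light-carrying particles are traced into the scene, their hit points on each surface are recorded, and an approximate irradiance function is reconstructed by kernel density estimation over those hit points.

    # Minimal sketch of the density-estimation step on one surface patch.
    import numpy as np

    rng = np.random.default_rng(1)
    hits = rng.uniform(0, 1, size=(5000, 2))      # particle hit points on a unit patch
    hits = hits[hits[:, 0] + hits[:, 1] < 1.2]    # pretend part of the patch is shadowed
    energy_per_particle = 1.0 / 5000              # each particle carries equal energy

    def irradiance(x, y, h=0.05):
        """Estimate irradiance at (x, y) with a flat kernel of radius h."""
        d2 = (hits[:, 0] - x)**2 + (hits[:, 1] - y)**2
        inside = np.count_nonzero(d2 < h * h)
        return inside * energy_per_particle / (np.pi * h * h)

    print("lit region     :", round(irradiance(0.2, 0.2), 3))
    print("shadowed region:", round(irradiance(0.9, 0.9), 3))

The choice of kernel is exactly what the perceptual tests described below are meant to tune.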

Perceptually Based Lighting Studies

We have conducted visual quality perceptual tests to optimize the kernel functions used to construct an approximate irradiance function for each surface using the density estimation results.

Improved Lighting Performance

We extended real-time display of simulated environments with global illumination solutions to larger and more complex models; preprocessing techniques reduce the amount of data sent to parallel rendering engines without any appreciable loss in image quality.

Analytic Lighting

We have developed the first analytic method for computing direct lighting effects involving area light sources and a wide range of surfaces from diffuse to highly directional: such effects include illumination from directional luminaires and view-dependent glossy reflection and transmission. The method greatly extends the repertoire of effects that can be computed in closed form.[ARVO95b]

Lighting Effects In The Human Eye

We developed a quantitative model approximating the scattering and diffraction in the human eye and an algorithm based on this model to add glare effects to digital images; the resulting digital point-spread function is psychophysically based and can substantially increase the ``perceived'' dynamic range of computer simulations containing light sources. Applications include night visibility and predicting the effects of distracting light sources. [SPEN95]

Lighting Measurements

We installed and calibrated CCD equipment to measure physical environments radiometrically, including full spatial and spectral radiances. This equipment greatly improves the Center's capacity to carry out controlled experiments on the nature of lighting effects and to compare simulated effects with the real world.[FOO95]

Correction of Geometric Perceptual Distortion in Pictures

For many years linear perspective has been used as an idealization to project three-dimensional objects and create two-dimensional pictures, as in photography and computer graphics. We have developed an approach for correcting geometric distortions in computer-generated and photographic images. The projection is superior to linear perspective, particularly for wide-angle images, and represents a long-term contribution to more than one technology (both computer graphics and photography). The approach is based on a mathematical formalization of perceptually desirable properties of pictures; the projection is useful both for computer-generated images and for constructing actual lenses for physical cameras. The work is described in [ZORIt95], and the Center has submitted patent applications for the technique.

From a small set of simple assumptions we obtain perceptually preferable viewing transformations and show that these transformations can be decomposed into a perspective or parallel projection followed by a planar transformation. The decomposition is easily implemented and provides a convenient framework for further analysis of the image mapping.

In the context of this work, we prove that two perceptually important properties are incompatible and cannot be satisfied simultaneously. It is impossible to construct a viewing transformation such that the images of all lines are straight and the images of all spheres are exact circles. Perceptually preferable trade-offs between these two types of distortions can depend on the content of the picture. We construct parametric families of transformations with parameters representing the relative importance of the perceptual characteristics. By adjusting the settings of the parameters we can minimize the overall distortion of the picture.

3.3 High-Performance Architectures

Real-world systems can be extremely complex, requiring inordinate amounts of computation to simulate and display. Thus, the Center is exploring high-performance architectures that perform well even with extremely large problems. Our work in high-performance architectures falls into four areas of focus, described in the subsections below: two targeting general-purpose systems and two targeting specific applications.

3.3.1 Software Architectures

Time-critical computing (TCC) is a new approach that can help improve performance in highly interactive graphics systems. The Center has been studying typical performance measurements, throughput and lag: how accurate they are in measuring performance, how they differ, how they can be improved, and, in particular, how time-critical computing can be applied to improve them. Time-critical computing spans broad classes of problems, such as scheduling algorithms, time-critical computation of behaviors, and time-critical rendering. The Center is doing research in all of these.

Extending the Funkhouser-Sequin Algorithm

The graphics community, in exploring TCC, has developed different techniques for determining how much computation and rendering to perform per frame in an interactive graphics environment. Some of these scheduling techniques can ensure constant update rates even in highly dynamic and compute-intensive scenes; the best known is the Funkhouser-Sequin algorithm. However, the Funkhouser-Sequin algorithm cannot handle multiple processors, a serious weakness in light of the increasing prevalence of MP machines. In addition, it is designed specifically for choosing among tasks with a small number of discrete choices of technique (such as a chair modeled with 10, 100, or 1000 polygons). Many tasks, however, can be varied in complexity in a smooth fashion: for example, streamlines used to visualize fluid flow can be varied in length to adapt to different computational resources.

The Center has developed a scheduling algorithm that handles both multiple computational resources and continuously variable tasks. By using gradient search techniques, our algorithm can quickly find a good schedule for tasks. Note that finding an optimal schedule is a known NP-complete problem, so practical schedulers settle for approximations to the optimal solution. Because our algorithm uses gradient search, the algorithm itself is continuously variable in its complexity and accuracy. Thus the scheduler can schedule itself, thereby preventing the scheduling from starving out the application's computation.
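
A minimal sketch of the idea (Python, with invented benefit and cost models -- not the Center's scheduler) is shown below: each continuously variable task gets a quality setting in [0, 1], and a few projected-gradient steps push total benefit up while keeping total cost within the frame's time budget. Because the number of gradient steps is itself adjustable, the scheduler's own cost can be bounded.

    # Minimal sketch: gradient-search scheduling of continuously variable tasks.
    import numpy as np

    benefit_w = np.array([3.0, 1.0, 2.0])    # importance of each task (diminishing returns)
    cost_w    = np.array([4.0, 1.0, 2.0])    # milliseconds of work at full quality
    budget    = 4.0                          # milliseconds available this frame

    def cost(q):
        return np.sum(cost_w * q)

    q = np.full(3, 0.1)                      # start every task at low quality
    for _ in range(50):                      # bounded, adjustable amount of search
        grad = benefit_w / (2.0 * np.sqrt(q + 1e-9))   # derivative of sum(w * sqrt(q))
        q = np.clip(q + 0.02 * grad, 0.0, 1.0)         # gradient step, keep q in [0, 1]
        while cost(q) > budget:                        # project back onto the time budget
            q *= 0.98
    print("qualities:", np.round(q, 2), "  cost:", round(cost(q), 2), "ms")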

Frameless Rendering Using Standard Graphics Hardware

The original paper on frameless rendering made use of pixel-oriented rendering, such as ray tracing. However, most graphics hardware is primitive-oriented, filling pixels by traversing all primitives. We have developed an ``almost-frameless'' rendering technique that uses hardware framebuffers and scan conversion. Rather than choosing pixels to update randomly, we choose pixels in an ordered dither. Each location in the dither corresponds to some portion of the hardware frame buffer. For example, if the hardware frame buffer is divided into four quadrants, the upper-left pixel in every two-by-two block of pixels can be rendered into the upper-left quadrant of the hardware buffer. A scaled copy with three-fourths of the destination pixels masked then transfers pixels from the hardware buffer to the actual screen, updating only one in four pixels. If rendering is pixel-bound, the result is that, while one-fourth as many pixels are drawn per frame, the lag from frame to frame is decreased fourfold.[WLOK95g]
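
The following minimal sketch (Python, with NumPy arrays standing in for the hardware buffers -- not the actual Pixel-Planes code, and assuming a 2x2 dither) shows the bookkeeping: each frame renders one quarter of the pixels into one quadrant-sized region, and the strided assignment plays the role of the scaled, masked copy that scatters those pixels to their true screen positions.

    # Minimal sketch of the ordered-dither update for an "almost-frameless" display.
    import numpy as np

    H, W = 8, 8                                    # toy screen size
    screen = np.zeros((H, W))

    def render_quadrant(phase, frame_value):
        """Stand-in renderer: fills an (H/2 x W/2) buffer for dither phase 0..3."""
        return np.full((H // 2, W // 2), frame_value)

    for frame, phase in enumerate([0, 1, 2, 3]):   # cycle through the dither phases
        dy, dx = divmod(phase, 2)                  # offset within each 2x2 pixel block
        quad = render_quadrant(phase, frame + 1)
        # scaled, masked copy: only the (dy, dx) pixel of every 2x2 block is touched
        screen[dy::2, dx::2] = quad

    print(screen[:2, :4])    # after four frames, every pixel has been refreshed once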

Developing Time-Critical Collision Detection Algorithms

The Center has also developed a time-critical approach to collision detection. Collision detection is used by a variety of applications, ranging from games to walkthroughs to scientific visualization to telepresence. Our technique approximates the shapes of objects at multiple levels of detail by using sets of spheres arranged into hierarchies we call ``sphere-trees.'' Sphere-trees can be built automatically by a preprocess that uses medial-axis surfaces, which represent the shapes of objects in skeletal form. The root of a sphere-tree is a single bounding sphere. Collision detection between two bounding spheres is fast, but inaccurate. By traversing the hierarchy of spheres, we check for collisions of spheres that bound successively smaller portions of the object, leading to collision detection that provides more accurate results given more computation time.
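
A minimal sketch of the traversal (Python, with a hand-built two-level tree -- not the Center's implementation) appears below: pairs of spheres are tested breadth-first, and when a fixed node budget is exhausted the routine returns its current, conservative answer, so that more time buys a more accurate result.

    # Minimal sketch: time-critical collision query over two sphere-trees.
    import numpy as np
    from collections import deque

    class SphereNode:
        def __init__(self, center, radius, children=()):
            self.center = np.asarray(center, dtype=float)
            self.radius = radius
            self.children = children

    def overlap(a, b):
        return np.linalg.norm(a.center - b.center) <= a.radius + b.radius

    def maybe_colliding(root_a, root_b, node_budget=32):
        """True if a collision cannot be ruled out within the node budget."""
        queue = deque([(root_a, root_b)])
        while queue and node_budget > 0:
            a, b = queue.popleft()
            node_budget -= 1
            if not overlap(a, b):
                continue                           # this pair is provably separate
            if not a.children and not b.children:
                return True                        # overlapping leaf spheres: report a hit
            for ca in (a.children or (a,)):        # refine whichever nodes have children
                for cb in (b.children or (b,)):
                    queue.append((ca, cb))
        return bool(queue)                         # budget spent: conservatively report "maybe"

    a = SphereNode([0, 0, 0], 1.0,
                   (SphereNode([0.5, 0, 0], 0.5), SphereNode([-0.5, 0, 0], 0.5)))
    b = SphereNode([1.2, 0, 0], 1.0, (SphereNode([1.2, 0, 0], 0.4),))
    print(maybe_colliding(a, b))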

Using sphere-trees is not only time-critical but also inherently efficient, benchmarking favorably against previous efficient collision-detection algorithms (which are not time-critical) [COHEa95]. The hierarchical nature of sphere-trees eliminates a great deal of redundant computation.

3.3.2 Hardware Architectures

Analog VLSI

On August 15th, 1995, the Center was granted Patent #5,442,583 for Compensated Analog Multiplier Circuits. This type of multiplier is part of our project for performing computer graphics calculations in analog VLSI hardware. In addition, this past year there has been a breakthrough in analog VLSI techniques at the NSF ERC for Neuromorphic Engineering: a circuit and method have been developed for setting and stably storing analog values -- in other words, for creating stable analog memory. This removes one of the key impediments to achieving quantitative calculations in analog VLSI, the lack of stable analog memory. We will be evaluating the breakthrough and seeing how well it fits with teleological circuit approaches. We expect that this may be a key component of analog computations for computer graphics.

3.3.3 Tracking Technology

Tracking continues to be an extremely hard problem, due to the human perceptual system's relative intolerance for lag and inaccuracy. The Center has been attempting to develop techniques to tackle lag, which has been shown to be the largest factor in tracker error. The Center is also developing more useful trackers that are lighter-weight and smaller without sacrificing performance.

Analysis of Head-Motion Prediction

The Center has analyzed the performance of two kinds of prediction methods for head-motion tracking. This information is especially useful when designing tracking hardware for immersive virtual reality. A polynomial extrapolation method with perfect data and a Kalman filter prediction method using noisy data were analyzed in the frequency domain. One result of the analysis is that error grows quadratically with both the prediction interval and the frequency of motion. These analysis methods will allow designers to determine the largest acceptable delay between tracker reporting and image display based on the characteristics of a user's motion in a given application.[AZUM95a] [AZUM95b]
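
As a hedged illustration (Python, with an invented sinusoidal head motion and arbitrary amplitude, frequency, and intervals -- not the analysis in the papers cited above), the sketch below applies a second-order polynomial extrapolator to perfect position, velocity, and acceleration data and shows how the prediction error grows as the prediction interval lengthens.

    # Minimal sketch: polynomial extrapolation of head motion over an interval dt.
    import numpy as np

    def predict(p, v, a, dt):
        """Extrapolate position p with velocity v and acceleration a over dt."""
        return p + v * dt + 0.5 * a * dt**2

    amp, freq = 0.2, 1.5                       # radians and Hz; an invented head sway
    omega = 2 * np.pi * freq
    t = 0.3                                    # current time
    p = amp * np.sin(omega * t)                # exact (noise-free) derivatives
    v = amp * omega * np.cos(omega * t)
    a = -amp * omega**2 * np.sin(omega * t)

    for dt in (0.01, 0.05, 0.10):              # candidate prediction intervals (seconds)
        err = abs(predict(p, v, a, dt) - amp * np.sin(omega * (t + dt)))
        print(f"dt = {dt:.2f} s   error = {err:.5f} rad")

Running the sketch with faster motions or longer intervals shows the rapid error growth that makes a short tracker-to-display delay so important.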

Light-Weight Tracker

We have made progress in both hardware and software for a new light-weight optical tracker for virtual reality systems. UNC and Utah collaborated on the design of a novel optical device, the ``hiball,'' which is designed to spot LED beacons that have been placed on the ceiling, an inside-looking-out approach that will allow tracking in very large spaces. The hiball is a metal housing shaped like a dodecahedron (a solid with twelve faces) that places lenses at six of its faces and holds six photodetectors at the opposing faces. After UNC tracker researchers consulted with Utah's experts in manufacturing, the design was improved. Subsequently, several hiballs have been machined at the University of Utah. (See Plate 3)

Ray-tracing simulation of the hiball optics shows that a single LED can be simultaneously imaged on more than one of the photodetectors, since the hollow interior allows a photodetector to see more than one lens. These multiple sightings mean that the hiball can spot LEDs in a solid angle that is three times larger than originally planned, and thus a greater range of motions can be tracked. The electronics for the system are nearly complete, and we expect this light-weight tracking system to be running in early 1996.[GOTT95]

3.3.4 Radiosity Walkthroughs of Complex Environments

The Center has investigated improving the computation and display of global illumination solutions by leveraging the Center's research in high-performance graphics hardware and in global illumination techniques. The techniques being developed have many uses, particularly in virtual reality applications that display realistic illumination at interactive rates. The team has explored using new algorithmic approaches, special purpose hardware, and parallel processing to generate and display radiosity solutions of building interiors. This research began with an evaluation of the Pixel-Planes hardware and Pixel-Flow simulators on global illumination solutions of complex environments. Weaknesses in current hardware designs were discovered and improvements for future display hardware were suggested. Quality and speed improvements were sought in the display of precomputed radiosity solutions that may influence future global illumination algorithms as well as future display hardware. Parallel global illumination algorithms were designed and implemented, and methods for both multiprocessors and networks of workstations were studied.

Interpolation for Interactive Display of Radiosity Solutions

The common method used to render radiosity solutions on graphics accelerators is linear color interpolation, chiefly because it is directly supported by the hardware and is therefore fast. Unfortunately, this method can lead to artifacts, such as Mach banding, and requires careful meshing to give good results. We have implemented and optimized a second-order color interpolation method on Pixel-Planes 5 using that machine's quadratic interpolation hardware, and have used this method to display quadratically interpolated results from discontinuity meshing radiosity. Although second-order interpolation takes longer to compute, less densely meshed models are required for equivalent display quality, resulting in either a net gain in frame rate or better images for the same rendering time. Although PixelFlow, the next machine from UNC, does not support quadratic interpolation, the pixel processors are substantially faster and have more local memory. This extra capability leads us to believe that we can perform cubic color interpolation on PixelFlow, and we are currently investigating algorithms for doing this.
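
The difference between the two interpolation orders can be seen in a minimal sketch (Python, using the standard six-node quadratic basis on a triangle -- illustrative only, not Pixel-Planes code): linear interpolation uses the three vertex colors, while quadratic interpolation adds the three edge-midpoint colors and can therefore reproduce curved variation, such as brightness interior to the triangle, that linear interpolation flattens out.

    # Minimal sketch: linear vs. quadratic color interpolation over one triangle.
    import numpy as np

    def linear(bary, v):
        """v: colors at the three vertices; bary: barycentric coordinates."""
        return np.dot(bary, v)

    def quadratic(bary, v, e):
        """v: vertex colors; e: edge-midpoint colors (edges 01, 12, 20)."""
        l0, l1, l2 = bary
        basis = np.array([l0 * (2 * l0 - 1), l1 * (2 * l1 - 1), l2 * (2 * l2 - 1),
                          4 * l0 * l1, 4 * l1 * l2, 4 * l2 * l0])
        return basis @ np.concatenate([v, e])

    v = np.array([0.0, 0.0, 0.0])              # dark vertices
    e = np.array([1.0, 1.0, 1.0])              # bright edge midpoints
    center = np.array([1 / 3, 1 / 3, 1 / 3])   # triangle centroid
    print("linear   :", linear(center, v))     # 0.0 -- the interior brightness is lost
    print("quadratic:", round(quadratic(center, v, e), 3))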

Meshing of Radiosity Solutions

The work on higher-order interpolation brought to light the fact that many of the meshes produced by radiosity solutions are less than optimal for display -- there is too much detail in some areas and not enough in others. We have been investigating methods for generating an efficient illumination mesh and then applying them to our ``height field'' situation, where two dimensions are the parametric coordinates of a patch and the third dimension is the illumination over the patch. We have investigated a method proposed by Scarlatos for meshing of height-field data, and are now investigating extensions of Varshney's meshing algorithm [VARSH94]. We have also been developing an algorithm that takes a density estimation radiosity solution and generates an efficient mesh.[SHIR95] This algorithm does not have the benefit of precomputed discontinuities, but it is free to place sample points wherever they are needed to capture the detail of the solution efficiently. This same algorithm is being used to develop meshes with increasing levels of detail for use in time-critical computing. Our work with these mesh decimation and generation algorithms will also consider the possibility of generating meshes that use higher-order lighting interpolation. We can test combinations of algorithms and interpolation methods to maximize image quality versus display time.

Parallel Radiosity Algorithms

We have implemented a parallel-processing version of a global illumination method using density estimation [ZARE95]. The algorithm uses a network of workstations to efficiently compute a global illumination solution using particle tracing. The results of this tracing are then filtered in a parallel local pass, and a mesh is generated for the solution. We have also investigated implementing the ``Path Buffer'' algorithm [WALT95b] on Pixel-Flow. This algorithm uses Kajiya-style path tracing to calculate a view-dependent illumination solution, and has been implemented in software. We have determined that the algorithm will run on Pixel-Flow hardware, and implementation is underway using the Pixel-Flow simulator. The goal is to generate ten or more screen updates per second in a frameless-rendering environment.

3.4 Interaction

3.4.1 Interaction with Complex Design Operators and Data Visualization

Two distinct research efforts have developed from fundamental research in 3D user interfaces. Interactive 3D widgets have been applied to applications such as CAD modeling and data visualization for computational steering. Our 3D CAD modeling widgets have been applied to allow intuitive specification of various design operators, and also to help understand 3D shape and relations in complex scenes.

Much effort goes into creating interpolation, blending, and matching schemes that can be mathematically shown to be smooth (i.e., geometrically continuous), but not much effort goes into helping the designer figure out how to specify such operations. Just as generalized Hermite interpolation is rarely used (since designers have trouble understanding the practical effect of second and higher derivatives at points on curves), generalized blending schemes are even more difficult to understand and specify. Hence we investigated both methodology for the creation of graphical interaction operators, i.e., 3D widgets, and the potential and practical uses for the operators to which widgets might be applied.

We successfully developed 3D widgets to use for real-time direct manipulation techniques in Alpha_1, while we simultaneously assessed the underlying computational operators. For example, for the warp operator, a warp widget was created to show a spherical geometry roughly indicating the warp region. The center of the widget maps directly to the warp operator's ``center'' parameter. The widget has a handle to control the widget's radius, which maps to the ``warp factor'' or power of the warp. [GRIM95c]

The other widget project was motivated by a lecture in the Center's televideo course. The Scientific Computing and Imaging (SCI) group at the Utah site initiated a project in direct manipulation of computational medicine visualizations, in particular, simulation of electric fields in the human torso. Interactive exploration requires a clear relationship between the researcher's manipulations and their effect on the data. Direct manipulation provides the researcher with an intuitive interface, since an element's controls are part of the element, allowing more fluid exploration of scientific data with greater interactivity and ease of use than traditional interfaces have offered to date.

The collaboration involved televideo conferences and investigating the UGA system (the first system to provide interactive 3D widgets) for its applicability. For a variety of reasons, including the multithreaded computational medicine software and the speed necessary to navigate the large data sets, it seemed impractical to attempt to merge the separate software systems, and thus it was necessary to develop custom widgets. UGA was used for prototyping and as a guide for developing these widgets. The goal was to create a suite of 3D widgets specifically for this application, providing greater interactivity than more general widget systems. The widget suite provides direct manipulation for 3D visualization and interaction, facilitating increased interactivity and more effective data exploration. The widget suite has been designed to be extensible so that new widgets can be easily developed as new research interaction needs inevitably develop. The widgets have been designed to meet several criteria: comprehensiveness, extensibility, consistency, completeness, simplicity, and speed.

Optimized Computer-Generated Motions for Animation

In another project, we have continued our work on covariant interpolation. Researchers in computer animation have long been trying to solve the problem of moving objects in user-desired ways with a minimum of user interaction. Objects moving from one place to another follow a path, often determined by a spline. We would like to let the user specify a characteristic of the object's motion and have the animation system choose a motion path that evidences that characteristic.

In this work, we develop an approach that uses constrained optimization to create such motion paths. Some interesting motions have been found. We describe the effects obtainable from this method so that an animator can choose among them intelligently. We found that minimization of the covariant acceleration of all the points in a body leads to motion that is attractive. This motion causes the moving body to anticipate its motion path in order to prevent sudden moves. It also creates very fluid-appearing motions because it tries to avoid sharp turns and sudden stops.[GOLDt94]

Direct Manipulation of Motion Curves

The Center has also developed techniques for direct manipulation of motion curves. By separating the time of an animation (represented as a 2D monotonic curve) from the parameters being animated (such as the position and orientation, represented as paths through space), an animator can specify such high-level animation concepts as ``reach this point at this time'' or ``go faster at this time.'' The user's manipulations are transformed into displacement functions that can be composed with a path to produce simple, predictable changes to the path. [SNIB95]
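
A minimal sketch of the displacement-function idea (Python, with an invented path and bump profile -- not the Center's implementation) is given below: the user's drag at one point of the motion path becomes a smooth, locally supported displacement that is simply added to the original path, producing a predictable local change.

    # Minimal sketch: composing a motion path with a displacement function.
    import numpy as np

    def path(s):
        """Original motion path, parameterized by s in [0, 1]."""
        return np.array([10.0 * s, np.sin(2 * np.pi * s)])

    def bump(s, center=0.5, width=0.2):
        """Smooth weight: 1 at the grabbed parameter value, 0 outside the bump."""
        u = np.clip((s - center) / width, -1.0, 1.0)
        return (1.0 + np.cos(np.pi * u)) / 2.0

    drag = np.array([0.0, 1.5])                # how far the user dragged the grabbed point

    def edited_path(s):
        return path(s) + bump(s) * drag        # displacement composed with the path

    for s in (0.3, 0.5, 0.7):                  # only the middle of the path is affected
        print(s, np.round(edited_path(s), 2))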

3.5 Scientific Visualization

Scientific visualization is a key component of the Center's activities. As a driving application, it both focuses the Center's other research and is research itself.

3.5.1 User Interfaces for Scientific Visualization

Research funded in part by NASA is directed primarily at developing 3D interaction techniques (or 3D widgets) for manipulating tools used to visualize and navigate through scientific visualization environments. We are using a computational fluid dynamics (CFD) dataset, provided by NASA, of airflow past the body of the space shuttle. This dataset was computed on a curvilinear grid and contains velocity data at each sample point.

Positioning Techniques

The positioning techniques include interactive shadows, object handles aligned with the world coordinate system, the object coordinate system, or the computational grid axes, and data-space handles. These techniques are used to constrain translation to one or two dimensions, and are especially useful for moving objects in three dimensions when only 2D displays and input devices are available.

Other Datasets

While most of our development uses the NASA space shuttle dataset, we have also been experimenting with other data sets in other domains to determine how much the visualization domain affects the demands on the user interface. One dataset we have used is a multifield, time-varying simulation of convection currents in the Earth's mantle computed on a rectilinear grid. Another data set is derived from computational medicine and was mentioned earlier in the section on Interaction.

Flux Ball

The flux ball is a method developed by the Center for visualizing the direction of a fluid flow as it passes through a region of space, in our case a spherical region. As fluid flows into or out of the spherical region, we calculate the angle at which it crosses the boundary and compare this with the normal to the sphere's surface. By sampling this angle at a number of points on the surface of the sphere, we can produce contour lines of similar angles. We draw these contours and color them according to the direction of flow and the magnitude of the angle. The final effect is a set of concentric contours around the sphere oriented in the direction of flow. This is a visually compact representation of complex data.
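
The quantity being contoured can be expressed in a minimal sketch (Python, with an invented velocity field standing in for the CFD data -- not the Center's implementation): points are sampled on the sphere, and at each one the angle between the local flow vector and the outward surface normal is computed; contours of similar angle are what the flux ball draws.

    # Minimal sketch: sampling the flow-to-normal angle over a spherical probe.
    import numpy as np

    def flow(p):
        """Stand-in velocity field; a real system samples the CFD dataset here."""
        return np.array([1.0, 0.2 * p[2], -0.2 * p[1]])

    def sphere_samples(n):
        """Roughly uniform points on the unit sphere (spiral method)."""
        k = np.arange(n) + 0.5
        phi = np.arccos(1 - 2 * k / n)
        theta = np.pi * (1 + 5**0.5) * k
        return np.stack([np.sin(phi) * np.cos(theta),
                         np.sin(phi) * np.sin(theta),
                         np.cos(phi)], axis=1)

    center, radius = np.array([0.0, 0.0, 0.0]), 1.0
    for p in sphere_samples(6):                      # a few probe points
        normal = p                                   # unit sphere centered at the origin
        v = flow(center + radius * p)
        cos_a = np.dot(v, normal) / np.linalg.norm(v)
        angle = np.degrees(np.arccos(np.clip(cos_a, -1.0, 1.0)))
        print(f"angle between flow and surface normal: {angle:6.1f} deg")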

Advected Ring

Smoke rings are similar to streamlines but do not represent the entire path of a particle through the dataset. Instead, we arrange a set of particles in a ring and advect them all simultaneously through the dataset. At each integration step, we draw a line connecting all of the sample points together. Thus, at the first integration step, we see a ring-shaped object. As this ring of points is advected through the dataset, it deforms according to the vector field data. In order to maintain the ring's visual continuity, if any two adjacent points move too far apart from each other, new points are introduced to fill the gap. Just as with the rake widget, we can see how points that are initially in close formation diverge as they pass through the dataset, so that features such as vortices and divergences are revealed by the ring's deformation. (See Plate 4).
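
A minimal sketch of the advected ring (Python, with a toy shearing velocity field in place of the shuttle dataset -- not the Center's implementation) appears below: a closed ring of particles is advected with simple Euler steps, and a new point is inserted whenever two adjacent points drift farther apart than a threshold, which keeps the deforming ring visually continuous.

    # Minimal sketch: advecting a ring of particles and refining it as it stretches.
    import numpy as np

    def velocity(p):
        """Toy velocity field with shear so that the ring deforms."""
        return np.array([1.0, 0.5 * p[0], 0.0])

    def advect_ring(ring, dt=0.05, steps=40, max_gap=0.2):
        for _ in range(steps):
            ring = [p + dt * velocity(p) for p in ring]       # one Euler integration step
            refined = []
            for a, b in zip(ring, ring[1:] + ring[:1]):       # walk the closed loop
                refined.append(a)
                if np.linalg.norm(b - a) > max_gap:           # gap too wide: insert a point
                    refined.append(0.5 * (a + b))
            ring = refined
        return ring

    angles = np.linspace(0, 2 * np.pi, 16, endpoint=False)
    ring = [np.array([np.cos(t), np.sin(t), 0.0]) for t in angles]
    print("points before:", len(ring), " after:", len(advect_ring(ring)))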

The smoke-ring technique was developed by an undergraduate student during a summer internship, an example of the Center's success in involving undergraduates in the research process.

3.5.2 Data Analysis for Visualization

Data interpretation is an important step in visualizing any form of measured data. The Center has been exploring the analysis of two widely used forms of medical imaging, ultrasound and magnetic resonance imaging (MRI).

Reducing Noise Artifacts in Ultrasound

The Center has been developing a method for reducing noise in medical ultrasound images. This work could be extremely valuable, since ultrasound is now an important tool in widespread use in many areas of medicine. Ultrasound has advantages over other medical imaging techniques in that it is cheap, portable, non-invasive, and generally safe. Its primary drawback is that ultrasound images are heavily corrupted by noise, or ``speckle.'' The problem of reducing this noise while preserving edges is hard because ultrasound images contain both large- and small-scale features (e.g., heart walls, small arteries) and important details that must be preserved, such as a small difference in grey levels between two adjoining areas that could signify a lesion. It has been observed that in ultrasound movies the degree of detail visible suddenly seems to decrease when the movie is paused. Thus the Center's technique uses interframe coherence of features to determine what is detail and what is noise.

Geometric Model Extraction from Magnetic Resonance Volume Data

In this work we develop a computational framework and new algorithms for creating geometric models and images of physical objects. Our framework combines magnetic resonance imaging (MRI) research with image processing and volume visualization. This work is extensively interdisciplinary, and has been carried out in close collaboration with the MRI team of the Human Brain Project at the Caltech Biological Imaging Center.[LAIDt95]

Within the model extraction computational framework we measure physical objects yielding vector-valued MRI volume datasets. We process these datasets to identify different materials, and from the classified data we create images and geometric models. New algorithms developed within the framework include a goal-based technique for choosing MRI collection protocols and parameters and a family of Bayesian tissue-classification methods.[GHOS95] (See Plate 1)

The goal-based data-collection technique chooses MRI protocols and parameters subject to specific goals for the collected data. Our goals are to make identification of different tissues possible with data collected in the shortest possible time. Our method compares results across different collection protocols, and is fast enough to use for steering the data-collection process.

Our new tissue-classification methods operate on small regions within a volume dataset, not directly on the sample points. Instead of point-sample voxels, we use the finite regions as voxels, and assume that each region contains a mixture of materials. The results of the classification step are tailored to make extraction of surface boundaries between solid object parts more accurate.[LAID95a]

The computational framework for building geometric models allows computer graphics users to create models with internal structure and with a high level of detail more easily. Applications arise in a variety of fields including computer graphics modeling, biological modeling, anatomical studies, medical diagnosis, CAD/CAM, robotics, and computer animation.

Converting Isosurfaces to Smooth Surfaces

By using manifolds, as described in the modeling section, we were able to efficiently produce smooth representations of isosurfaces within volume data to a given level of accuracy. This is the first step in further research on converting representations and visualizations to alternative forms. [GRIM95b]

3.6 Telecollaboration

The Center has been using multiway televideo to facilitate collaboration for several years now. While this has been a success, we would like to be able to do more. Video conferencing lets us talk about our work, but it can be difficult to show it to one another, more difficult than if we were all sitting in the same room. Some of our initial results include improving our abilities to share resources and providing a shared virtual environment.

Remote Interactive Use of Graphics Engines

We are now able to use UNC's unique Pixel-Planes facility directly from Utah's Alpha_1 design system. By creating an interface between Utah's Alpha_1 modeling system and the UNC Pixel-Planes 5 computer, a Pixel-Planes 5 copy of a model under design in Alpha_1 is constantly updated and can be independently viewed. The rapid rendering of sculptured models using Pixel-Planes supports this research. In the long run, we want to expand into virtual environment interfaces for collaborative design, in which the user can move around or within a complex model (immersively) and interactively refine it.

Shared Virtual Environments

We have built a prototype system in which two or more people who are at separate locations can be immersed in a shared virtual environment. The goal of such a system is to allow remote participants to collaborate on tasks such as medical consultation and mechanical design review. Both participants wear a head-mounted display that shows the shared environment (e.g., medical data for a patient) and also shows representations of the other participants. In the prototype system, the representation of one's collaborator is a simple polygonal human model that has real-time video of the other user's face placed on the model's head. The model's movement matches that of the collaborator, and one can watch this person's face as he/she is talking. The video of a participant's face is captured by a camera suspended on the head-mounted display.

As a test scenario, we have placed two such users in a room containing a simulated patient who is to undergo radiation treatment for a tumor. Both of the participants may move about the patient, and they both see the moving representation of each other. The task at hand is to position treatment beams (represented as long tubes) so that they irradiate the tumor without damaging the surrounding tissue. One of the participants can drag the radiation treatment beams to new orientations, and, because the environment is shared, both participants see the beam move to the new location. We envision such a shared environment as an aid during medical consultation when the participants cannot be present physically.

3.7 Standards

The Center has been contributing its experience in 3D graphics to the evolving standard for 3D graphics on the World-Wide Web, VRML (for Virtual Reality Modeling Language). At present, VRML is simply a file format for 3D graphics, but with help from Center researchers, VRML is developing into a full-fledged mechanism for describing geometry and behavior for distributed, multi-participant virtual worlds. Our work in VRML is driven by our early explorations of its capabilities in the setting of a large hypermedia environment.

Large Multimedia Web Sites

WAXweb is a large multimedia web site, including text in four languages, images, sound clips, video clips, and 3D environments, all interlinked. The support software for WAXweb was developed by Center researchers and included the first known use of VRML on the Web. [MEYE95b]

The VRML Architecture Group

Work on WAXweb and the Center's experience with graphics systems and standards efforts led to membership of a Center researcher in the VRML Architecture Group (or VAG), the standards organizers for VRML. The VAG's activities include clarifying the existing specification and describing extensions. Extensions that can be expected for VRML soon include support for multimedia and a revised system model that is more widely portable. The next major release of the VRML specification (VRML 2.0) will support interactive behaviors in the virtual environment, including support for interactive widgets, collision detection, and modeling tools. The subsequent planned release (VRML 3.0) will provide multi-participant distribution. Center researchers have published papers at the first VRML conference about effective ways to provide these capabilities.[MEYE95a] [MEYE95c]

RBML

The Center has also been the only outside consultant on Microsoft's proposal for adding behavior to VRML. The specification of Microsoft's system, called RBML, for Reactive Behavior Modeling Language, has just been publicly released. Microsoft researchers relied on the Center's experience in graphics APIs and behavior specification to provide valuable feedback for the initial release.

4.0 Research Plans for the Coming Year

The Center's plans for the coming year call for re-examining and updating as appropriate our research plans. This year is an ideal time to re-evaluate the Center's direction, as the Directorship has recently moved from Don Greenberg to Andy van Dam (as discussed in Section 5, Management).

4.1 Mathematical Foundations

We have determined that we need to strengthen our focus on research issues in the mathematical foundations of computer graphics. This includes new techniques and approaches to modeling, rendering and simulation, which combine into one general-purpose framework such elements as differential geometry, constrained optimization, integral equations, piecewise differential equations, and the mechanics of solids and the physics of light.

The long-term goals are 1) to explore consequences and aspects of new mathematical paradigms, 2) to develop computational approaches to the new paradigms in software, both sequential and parallel, and 3) to implement some of the algorithms in analog VLSI hardware, for greater speed and interactivity. Workstation manufacturers have decided to provide additional computing equipment for our exploration of next-generation computer graphics paradigms.

Parallel Implementations of Interval Analysis

We are researching methods to utilize parallelism in our interval analysis calculations, in collaboration with the Center for Computational Biology. They have created CC++, which is a parallel version of C++ that seems suited for this application.

Wavelets on Surfaces

We plan to develop wavelet constructions for representing functions defined on surfaces; examples include multi-resolution surfaces for computer graphics modeling, functions for characterizing bi-directional reflectance distribution functions (or BRDFs) of real materials, and wavelet-based methods for global illumination. Wavelets have proven to be powerful bases for use in numerical analysis and signal processing, since they require only a small number of coefficients to represent general functions and large data sets accurately. This allows compression and efficient computations.

Interval Analysis

We also plan to implement and test a parallel version of an interval analysis testbed in CC++ on heterogeneous networks of workstations and on Paragon parallel supercomputers. Interval analysis is a powerful new approach to computer-assisted geometric computation and modeling. The main advantage of the approach is that it allows specification, rendering, and analysis of many kinds of shapes, graphical interactions, and optimization problems found in computer graphics. Interval analysis algorithms seem highly suited to parallel implementation.
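
For readers unfamiliar with the technique, the following minimal sketch (Python, illustrative only) shows the interval arithmetic on which interval analysis is built: evaluating an expression with intervals in place of numbers yields guaranteed, if conservative, bounds on its range over a region, which is what makes the approach attractive for robust rendering and geometric queries.

    # Minimal sketch: naive interval arithmetic.
    class Interval:
        def __init__(self, lo, hi):
            self.lo, self.hi = lo, hi
        def __add__(self, other):
            return Interval(self.lo + other.lo, self.hi + other.hi)
        def __mul__(self, other):
            c = [self.lo * other.lo, self.lo * other.hi,
                 self.hi * other.lo, self.hi * other.hi]
            return Interval(min(c), max(c))
        def __repr__(self):
            return f"[{self.lo}, {self.hi}]"

    def f(x, y):
        return x * x + y * y          # works on ordinary numbers or on Intervals

    # Bounds on f over the box [0, 1] x [-1, 2]:
    print(f(Interval(0, 1), Interval(-1, 2)))   # prints [-2, 5], safely enclosing the true range [0, 5]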

4.2 Modeling

As our modeling work has been proceeding well, we plan to build on our improved mathematical basis and our existing modeling work by addressing the following areas:

Higher-Dimensional Manifolds

We plan to extend manifold technology to higher-dimensional objects, including configuration spaces for complex assemblies, and to develop usable tools for expressing ideas of differential geometry on computer-graphics manifolds.

Continued Work On Covariant Interpolation

We will be continuing to develop splining and interpolation methods in nonlinear spaces, improving their efficiency and applicability.

Continued Work on Correction of Geometric Perceptual Distortion in Pictures

As part of our work on the mathematical foundations of computer graphics, we are developing additional methods to correct geometric distortions in computer-generated and photographic images. This work focuses particularly on additional soft constraints for non-structural conditions that reduce distortion, as well as on the use of conformal mapping techniques.

Continued Work in Structured and Physically Based Modeling

We are researching structured methods for physically based hierarchy, multipoint collisions, and other simulation techniques that will be useful for computer graphics and computational biology.

Texel Research

We will be researching methods for utilizing texels (texture elements) in rendering (such as Kajiya's fur-rendering algorithm), as well as ways to incorporate texels in conventional ray tracers. This will be useful for making scientific visualizations of mammals (most of which have fur) from MRI data.

4.3 Rendering

With the addition of our light measurement laboratory, we will continue our rendering research by extending existing techniques and verifying their accuracy experimentally.

Measure BRDFs

We will measure the first BRDFs for both isotropic and anisotropic sample materials, within experimentally limited ranges of incident angles, and will make the results of these measurements publicly available.

Verify Lighting Accuracy

We will compare CCD-captured, full-spectral images of physical scenes with images simulated from models of the same scenes using global illumination techniques, in order to calibrate our global illumination algorithms.
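
As a simple illustration of the kind of comparison involved, the sketch below computes a relative RMS difference between a captured and a simulated image; the metric, the numpy usage, and the sample values are illustrative assumptions, and the actual calibration procedure (spectral channels, camera response, image registration) is not reproduced here.

    import numpy as np

    def relative_rms_error(measured, simulated):
        """Relative RMS difference between a captured image and a simulated one."""
        measured = np.asarray(measured, dtype=float)
        simulated = np.asarray(simulated, dtype=float)
        diff = simulated - measured
        return np.sqrt(np.mean(diff ** 2)) / np.mean(measured)

    # Hypothetical 2x2 luminance images (cd/m^2) of the same scene.
    captured = [[120.0, 80.0], [60.0, 40.0]]
    rendered = [[118.0, 83.0], [61.0, 38.0]]
    print("relative RMS error:", relative_rms_error(captured, rendered))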

Improve Representations of BRDFs

We will develop more accurate and compact representations of BRDFs for both Monte Carlo and finite-element methods of global illumination. Currently there is no computationally convenient way to capture all of the degrees of freedom of BRDFs, whether through theoretical models or from physical measurements.
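
One simple (and deliberately crude) example of a compact representation is fitting measured reflectance samples to a diffuse term plus a single cosine lobe with a fixed exponent, which reduces a table of measurements to three numbers; the sketch below uses hypothetical data and numpy least squares, and stands in for, rather than represents, the representations we plan to develop.

    import numpy as np

    # Hypothetical measurements: cosine of the angle between the mirror
    # direction and the viewing direction, and the measured reflectance.
    cos_alpha = np.array([1.00, 0.95, 0.90, 0.80, 0.60, 0.30])
    measured  = np.array([0.82, 0.55, 0.42, 0.31, 0.26, 0.25])

    exponent = 20.0                                       # assumed, fixed lobe sharpness
    basis = np.column_stack([np.ones_like(cos_alpha),     # diffuse term
                             cos_alpha ** exponent])      # specular lobe
    (kd, ks), *_ = np.linalg.lstsq(basis, measured, rcond=None)
    print("diffuse %.3f  specular %.3f" % (kd, ks))

    # The fitted pair (kd, ks) plus the exponent is far more compact than the
    # raw table of measurements, at the cost of fidelity.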

Improve Density Estimation

We plan to extend the density estimation method for global illumination so that it is robust with large, complex models, including geometry that is difficult to handle with traditional radiosity techniques.
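
For readers unfamiliar with density estimation in this context, the sketch below estimates illumination at a point on a surface patch from the landing points of Monte Carlo light particles using a cone-shaped kernel; the hit data are hypothetical, and the real method's meshing, boundary handling, and scalability concerns are omitted.

    import math

    def estimate_irradiance(hits, x, y, radius):
        """Kernel density estimate of illumination at (x, y) on a surface patch.

        'hits' is a list of (x, y, energy) particle landing points from a
        Monte Carlo light-propagation pass; a cone kernel weights nearby hits.
        """
        total = 0.0
        for hx, hy, energy in hits:
            d = math.hypot(hx - x, hy - y)
            if d < radius:
                weight = 3.0 / (math.pi * radius ** 2) * (1.0 - d / radius)
                total += energy * weight
        return total

    # Hypothetical particle hits (position in meters, energy in watts).
    hits = [(0.10, 0.12, 0.02), (0.11, 0.09, 0.02), (0.50, 0.55, 0.02)]
    print(estimate_irradiance(hits, 0.10, 0.10, radius=0.25))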

Combine Density Estimation and Discontinuity Meshing

We will investigate combining discontinuity meshing from direct lighting (for shadow boundaries) with density estimation techniques in more smoothly varying intermediate regions.

Perceptual Studies of Lighting Accuracy

We plan to conduct a series of perceptual studies to investigate the relationships between the accuracy of the computational models used in rendering algorithms and the visual fidelity of the resulting images. From these studies, we will derive perceptual error metrics that will help us create efficient rendering algorithms that maintain the highest possible levels of visual quality for given levels of computational resources.

Omission of Lighting Detail

We plan to make interactive display of large architectural models with realistic lighting possible by geometrically simplifying models where the resulting changes in lighting are too small to notice. The expected factor-of-ten increase in performance will allow smooth motion through (virtual) buildings and large vehicle interiors.

4.4 High-Performance Architectures

Our work in high-performance architectures will continue developing time-critical algorithms.

Time-critical Rendering

We will continue development of techniques for time-critical rendering that degrade visual characteristics of less perceptual importance in order to maintain real-time performance. We plan to incorporate the time-critical rendering techniques into a framework for time-critical applications using scheduling algorithms to budget time for rendering and simulation within real-time constraints.
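
A minimal sketch of the budgeting idea follows, with hypothetical object names and cost estimates and time.sleep standing in for actual rendering work; the Center's scheduling algorithms and measures of perceptual importance are not reproduced here.

    import time

    def render_frame(objects, budget_seconds):
        """Render within a fixed time budget, degrading less important objects."""
        start = time.perf_counter()
        for name, importance, full_cost, cheap_cost in sorted(
                objects, key=lambda o: o[1], reverse=True):
            remaining = budget_seconds - (time.perf_counter() - start)
            if remaining <= 0.0:
                print("skipped       ", name)             # out of time: drop the object
            elif remaining >= full_cost:
                time.sleep(full_cost)                      # stand-in for full-detail rendering
                print("full detail   ", name)
            else:
                time.sleep(min(cheap_cost, remaining))     # stand-in for degraded rendering
                print("reduced detail", name)

    # Hypothetical objects: (name, importance, full cost in s, reduced cost in s).
    scene = [("vehicle", 0.9, 0.006, 0.002),
             ("terrain", 0.6, 0.005, 0.002),
             ("sky",     0.2, 0.002, 0.001)]
    render_frame(scene, budget_seconds=0.010)              # roughly a 100 Hz budget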

Reduce Lag in a Tracker

We will increase the reporting rate of an optical tracking system by roughly a factor of 15 by incorporating each new sighting of an LED beacon into the best estimate of position. A Kalman filter eliminates the need to wait until a large number of sightings can be collected together, and will reduce the latency from 45 milliseconds to 2 milliseconds.
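
The sketch below shows the general idea in one dimension: a scalar Kalman filter folds each new (hypothetical) sighting into the running estimate as soon as it arrives, rather than waiting for a batch; the real tracker estimates full six-degree-of-freedom head pose, which this sketch does not attempt.

    def kalman_update(estimate, variance, measurement, meas_variance, process_variance):
        """Fold a single new sighting into the running position estimate."""
        variance += process_variance                  # predict: motion adds uncertainty
        gain = variance / (variance + meas_variance)  # how much to trust the sighting
        estimate += gain * (measurement - estimate)   # correct toward the measurement
        variance *= (1.0 - gain)
        return estimate, variance

    # Hypothetical noisy sightings (millimeters) of a slowly moving beacon.
    estimate, variance = 0.0, 1e6                     # start with no knowledge
    for sighting in [10.2, 10.8, 11.1, 11.9, 12.4]:
        estimate, variance = kalman_update(estimate, variance, sighting,
                                           meas_variance=0.5, process_variance=0.1)
        print("estimate %.2f mm (variance %.3f)" % (estimate, variance))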

4.5 Interaction

Our user interface work will move into virtual environments and strive to produce useful interfaces to some of the Center's new modeling techniques.

3D User Interfaces for VR

We will explore the use of our 3D user interface technology in virtual reality. This project will draw on our extensive knowledge of desktop user interfaces, but will also involve constructing new 3D user interface tools and interaction techniques designed specifically for VR environments.

Non-Exclusive Collaboration

We plan to investigate user interface mechanisms that allow distributed participants to modify the same object simultaneously. Rather than presenting a user interface with a ``lock'' as in chalk-passing protocols, we wish to provide a seamless experience -- when a participant wishes to modify an object, she simply does so, grabbing it and making the changes as in the real world (rather than first needing to grab the ``chalk'' and then grab the object to be modified).

User Interface for a Manifold Surface

We plan to develop a user interface that leverages the power of the manifold-based surface model. Because of the flexibility of the model, general shapes can be built quickly, while still allowing detailed and precise refinements. Previous user interfaces either built general shapes easily or allowed precise refinements, but not both.

4.6 Scientific Visualization

The Center will continue its work in scientific visualization, expanding into new domains and improving on prior results.

Scientific Visualization in Immersive Environments

VR user interfaces for scientific visualization are in many ways similar to user interfaces for other VR applications. For instance, the three basic tasks of picking objects, manipulating objects, and navigating the viewpoint are all important. However, the scientific visualization domain presents application-specific requirements that we must consider. For instance, the visualization tools that we place in a dataset have many parameters that should be accessible to the user of the system. We will investigate what makes a good interface for scientists to these parameters within the limitations of current input and output devices.

Remote Microscope Control

We are developing interactive software to remotely control a high-resolution microscope at the University of California San Diego (CMDA Project) with Professor Mark Ellisman. Scientists will be able to interactively search and focus on portions of specimens using a high resolution electron microscope, receiving volumetric data and surface data at varying resolutions. The project is a collaborative effort between the San Diego Supercomputer Center, the San Diego Microscopy and Imaging Resource (SDMIR) and the Graphics and Visualization Center. The hardware/software environment will be used by NIH researchers throughout the country.

Volume Reconstruction of Ultrasound

An Intravascular Cardiology Project is being conducted jointly by the Center and Stanford's Department of Cardiology. Preliminary results show volume reconstruction of intravascular ultrasound images acquired within the arteries of a beating heart.[LENG95] The results will enable cardiologists to evaluate appropriate procedures ranging from balloon angioplasty to atherectomy to bypass surgery. The work is being conducted with Professor Richard Popp at Stanford.

Improve Tissue Classification

We will continue to develop a wider variety of tissue classification and goal-based methods. For example, we plan to develop classification methods for thin (subvoxel) sheets of tissue and for tracking thin filaments of stained material. (This last method presupposes chemical staining techniques, to be developed in the Caltech Biological Imaging Center, that provide sufficient signal from such small subvoxel structures.) In addition, we will develop methods to automatically calibrate the MRI machine and will also explore different imaging modalities.

Continued Work on Model Extraction from Data and Tissue Classification

As part of the collaborations with the Human Brain Project we will continue our work with high-resolution data and goal-based methods.

4.7 Telecollaboration

Our plan for research in telecollaboration has two focuses: a prototype telecollaboration facility and research into techniques for reproducing environments and objects remotely. Thus, the prototype will focus on real-world applications that require ``collaboration,'' while the long-term research will focus on providing the ``tele'' capability. The research plan is designed to leverage the Center's experience with telecollaboration, as well as its existing televideo infrastructure.

The prototype application, collaborative design of mechanical parts, is designed around stages that successively demonstrate different aspects of the project. The first stage is to run shared whiteboard software (using Alpha_1) side by side with the Center's current televideo capabilities in order to investigate the usefulness of high-bandwidth televideo in collaboration. The second stage will seek to improve those facilities by adding n-way audio; the current televideo facility operates like a speakerphone, in that only one site can speak at a time. We have already developed low-latency n-way audio over TCP/IP using OSF's DCE package, so this stage will be integrative. The third stage is to bring video representations of the participants into a shared immersive environment. All participants will be ``seated'' around a table upon which rests the mechanical part being designed, and each participant will see a representation of his/her collaborators around this table. Initially the participants will be video avatars, with video streams texture-mapped onto 3D polygons, so that remote participants can move around each other and see each other while discussing virtual objects in the environment of a real application. Eventually the goal is to represent the participants as geometry extracted from multiple video cameras. Many software components are in place for this third stage as well, including video texturing.

We feel that telecollaboration will be improved by a sense of presence. Most teleconferencing setups focus on desktop environments, effectively preventing a sense of immersion or presence. Through the use of video avatars, the later stages of the prototype should provide some presence. However, video avatars do not provide a complete environment. To provide virtual presence of remote objects and participants along with real presence of local objects and participants, we must mix real and synthetic imagery. The Center is a leader in this technology, having built the only head-mounted see-through display that uses video to achieve excellent correlation between real and synthetic imagery while producing eye images free of parallax from the video cameras. (See Plate 2.) Note that wall-based immersive environments (such as a CAVE) are inadequate for this purpose, as they do not handle occlusion correctly.

One of our long-term research goals for telecollaboration is to visually reconstruct at one site an environment or object that exists at another site. We intend to do this by exploring the use of multiple video cameras, the properties of light propagation, and the reflectance properties of surfaces. Much of the vision literature describes methods of reconstructing geometry by using feature correspondence between images: similar groups of pixels in different images are matched using correlation, and these matches and their relative offsets provide a measure of depth. What these approaches do not usually address are view-dependent changes in surface appearance due to specularity and other effects.

We wish to determine scene geometry from a collection of images by inverting the equation that governs the propagation of light. This equation is known as the rendering equation. Using the rendering equation to guide geometry reconstruction should allow us to correctly interpret effects such as indirect lighting and specular highlights. The process of inversion is analogous to the reconstruction that is performed in computerized tomography (CT), where a two-dimensional image of tissue density is calculated from a group of measurements taken by a one-dimensional array of x-ray detectors. The problem of geometry reconstruction from images is a more difficult inversion problem than that of CT because visible light is occluded by most physical objects. Although this approach may be difficult to carry out, we feel that our long-term research on this problem should be guided by the physics of light and should not rely on heuristics.
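
In the standard notation of the literature, the rendering equation referred to above can be written (with n the surface normal at x and Omega the hemisphere of directions above x):

    L_o(x, \omega_o) = L_e(x, \omega_o)
        + \int_{\Omega} f_r(x, \omega_i, \omega_o) \, L_i(x, \omega_i) \, (\omega_i \cdot n) \, d\omega_i

Here L_o is outgoing radiance, L_e is emitted radiance, L_i is incoming radiance, and f_r is the BRDF. The reconstruction problem described above amounts to recovering the scene geometry and f_r given samples of L_o observed along camera rays.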

We plan first to try our ideas about geometry extraction on simplified scenes such as the so-called Cornell box. In rendering, we tried to make a synthetic image of a box look as good as a photograph of a box; while this seemed like a simple problem, it quickly made all the deep issues of photorealistic rendering apparent. By analogy, we intend to identify the fundamental issues in reconstructing environments from video input by attempting video extraction of a simple bare room.

Rendering a scene from extracted geometry may require the exploration of different rendering paradigms. One such method is image-based rendering, in which images of a scene from an arbitrary viewpoint are created by warping one or more images taken from a different vantage point. The computations necessary for image-based rendering are simple enough that we think it likely that we can build special-purpose hardware to carry out the image warping task. There may, however, be attributes of geometry extracted from video that point us toward other techniques. The extracted geometry may have a measure of uncertainty associated with each element, and we plan to explore the possibility of using this measure of uncertainty during the rendering process. Furthermore, there may be little structure or connectivity information among the geometric elements extracted from images; geometric primitives such as oriented particles or translucent Gaussian splats may prove to be more natural rendering primitives than polygons for this kind of geometry. The Center has experience with related rendering methods, especially in volume rendering.
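
As a minimal sketch of the reprojection at the heart of image-based rendering, the following warps a single scanline with per-pixel depth to a camera translated sideways; the image, depths, and camera parameters are hypothetical, and the hard parts (hole filling, occlusion resolution, resampling, and any special-purpose hardware) are ignored.

    import numpy as np

    def forward_warp(image, depth, focal, baseline, width):
        """Warp a reference scanline to a camera translated sideways by 'baseline'.

        Each pixel is pushed to its new column using its depth
        (disparity = focal * baseline / depth).
        """
        warped = np.zeros(width)
        for x in range(width):
            disparity = focal * baseline / depth[x]      # pixels of horizontal shift
            new_x = int(round(x - disparity))
            if 0 <= new_x < width:
                warped[new_x] = image[x]                 # later writes win (no z-test)
        return warped

    # A hypothetical 8-pixel scanline: a bright near object in front of a dim far wall.
    image = np.array([0.2, 0.2, 0.9, 0.9, 0.9, 0.2, 0.2, 0.2])
    depth = np.array([5.0, 5.0, 1.0, 1.0, 1.0, 5.0, 5.0, 5.0])    # meters
    print(forward_warp(image, depth, focal=4.0, baseline=0.5, width=8))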

4.8 Standards

The Center's standards work will continue as the VAG develops new ideas for successive VRML revisions. In addition, the Center is considering hosting a VRML Consortium that would provide industry with a forum for expressing its needs in the continuing evolution of VRML. The Center would provide a great deal of leadership and an impartial, vendor-neutral home.
