(78c) Efficient Parallel Optimization On Emerging Computational Architectures
2010 Annual Meeting, Computing and Systems Technology Division, Advances in Optimization
In this research, we have developed two approaches for parallel optimization, one for GPUs and one for distributed clusters. To take advantage of the massively parallel architecture of the GPU, we have developed an interior-point approach that applies an iterative preconditioned conjugate gradient (PCG) method to the doubly-augmented form of the linear KKT system. Because PCG is simple, this technique is widely applicable on fine-grained parallel architectures like the GPU. For coarse-grained architectures like distributed clusters, it is much more important to exploit specific problem structure. In previous work, we presented a Schur-complement decomposition approach for efficient parallel solution of the augmented system required at each iteration of the nonlinear interior-point solver IPOPT. This technique exploits the block-bordered structure found in multiscenario and parameter estimation problems, and it scales extremely well to large numbers of CPUs and blocks. However, the cost of forming the Schur complement scales linearly with the number of coupling variables and can become a prohibitive bottleneck as that number increases. In this work, we present a quasi-Newton approach for updating an approximate Schur complement, dramatically increasing the efficiency of the decomposition technique for problems with many (hundreds of) coupling variables. Both approaches are demonstrated on several test problems and case studies.
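To illustrate why PCG maps well onto fine-grained parallel hardware, the following is a minimal sketch of a preconditioned conjugate gradient iteration. It is not the authors' implementation: it assumes a small dense symmetric positive definite matrix stored as nested lists, a simple Jacobi (diagonal) preconditioner, and the hypothetical helper names `matvec`, `dot`, and `pcg`. The point is that each iteration consists only of a matrix-vector product, a few dot products, and elementwise vector updates, all of which parallelize naturally.

```python
def matvec(A, x):
    """Dense matrix-vector product A @ x (each row is independent work)."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]


def dot(u, v):
    """Inner product of two vectors (a parallel reduction on a GPU)."""
    return sum(a * b for a, b in zip(u, v))


def pcg(A, b, tol=1e-10, max_iter=1000):
    """Solve A x = b for symmetric positive definite A.

    Assumption for illustration: a Jacobi preconditioner M = diag(A).
    Returns the approximate solution and the number of iterations taken.
    """
    n = len(b)
    inv_diag = [1.0 / A[i][i] for i in range(n)]
    x = [0.0] * n
    r = list(b)                                   # residual b - A x for x = 0
    z = [d * ri for d, ri in zip(inv_diag, r)]    # preconditioned residual
    p = list(z)                                   # initial search direction
    rz = dot(r, z)
    b_norm = dot(b, b) ** 0.5 or 1.0              # avoid dividing by zero
    for k in range(max_iter):
        if dot(r, r) ** 0.5 <= tol * b_norm:
            return x, k
        Ap = matvec(A, p)
        alpha = rz / dot(p, Ap)                   # step length along p
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        z = [d * ri for d, ri in zip(inv_diag, r)]
        rz_new = dot(r, z)
        beta = rz_new / rz                        # conjugacy correction
        p = [zi + beta * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x, max_iter


# Small illustrative system (made-up data, symmetric and diagonally dominant).
A = [[4.0, 1.0, 0.0],
     [1.0, 3.0, 1.0],
     [0.0, 1.0, 2.0]]
b = [1.0, 2.0, 3.0]
x, iterations = pcg(A, b)
residual = [bi - yi for bi, yi in zip(b, matvec(A, x))]
print(x, iterations, max(abs(ri) for ri in residual))
```

In a GPU setting, the list comprehensions above would correspond to elementwise kernels and the `dot` calls to reductions. The matrix would typically be sparse, and the preconditioner would be chosen for the structure of the KKT system rather than taken as the plain diagonal used here.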