Papers and Publications for R. Clint Whaley
Journal Publications

"Scaling LAPACK Panel Operations Using Parallel Cache Assignment",
by Anthony M. Castaldo, Siju Samuel and R. Clint Whaley.
ACM Transactions on Mathematical Software (TOMS),
Volume 39, Number 4, pp 22:1-22:30, Article 22, July 2013.

"Reducing Floating Point Error in Dot Product using the Superblock Family
of Algorithms", by Anthony M. Castaldo, R. Clint Whaley and
Anthony T. Chronopoulos.
SIAM Journal on Scientific Computing (SISC), Volume 31, Number 2,
pp 1156-1174, 2008.

"Achieving accurate and context-sensitive timing for code optimization",
by R. Clint Whaley and Anthony M. Castaldo.
Software: Practice & Experience, Volume 38, Number 15, pp 1621-1642,
April, 2008.

"Minimizing Development and Maintenance Costs in Supporting Persistently
Optimized BLAS",
by R. Clint Whaley and Antoine Petitet.
Software: Practice & Experience, Volume 35, Number 2, pp 101-121,
February, 2005.

"Self-Adapting Linear Algebra Algorithms and Software", by
J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet,
R. Vuduc, R. C. Whaley and K. Yelick. Proceedings of the IEEE,
Volume 93, Number 2, pp 293-312, February, 2005.

"An Updated Set of Basic Linear Algebra Subprograms (BLAS)",
by L. Susan Blackford, James Demmel, Jack Dongarra, Iain Duff,
Sven Hammarling, Greg Henry, Michael Heroux, Linda Kaufman,
Andrew Lumsdaine, Antoine Petitet, Roldan Pozo, Karin Remington,
and R. Clint Whaley. ACM Transactions on Mathematical Software,
28(2):135-151, June 2002.

"Automated Empirical Optimization of Software and the ATLAS project"
by R. Clint Whaley, Antoine Petitet and Jack Dongarra.
Parallel Computing, 27(1-2):3-35, 2001.

"Practical Experience in the Numerical Dangers of Heterogeneous Computing",
by L. S. Blackford, A. Cleary, J. Demmel, I. Dhillon, J. Dongarra,
S. Hammarling, A. Petitet, H. Ren, K. Stanley and R. C. Whaley.
ACM Transactions on Mathematical Software, Volume 23, Number 2,
pages 133-147, June 1997.

"ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers
- Design Issues and Performance",
by J. Choi, J. Demmel, I. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet,
K. Stanley, D. Walker, and R. C. Whaley.
Computer Physics Communications, Volume 97, pages 1-15, 1996.

"The Design and Implementation of ScaLAPACK LU, QR, and Cholesky",
by J. Choi, J. Dongarra, S. Ostrouchov, A. Petitet, D. Walker, and
R. C. Whaley. Scientific Programming, Volume 5, pages 173-184, 1996.
Refereed Conference Publications

"Effectively exploiting parallel scale for all problem sizes in LU
factorization" by Md Rakib Hasan and R. Clint Whaley.
In 28th International Parallel & Distributed Processing Symposium
(IPDPS2014), Phoenix, AZ, May 19-23, 2014.

"Vectorization Past Dependent Branches Through Speculation" by
Majedul Haque Sujon, R. Clint Whaley and Qing Yi.
In 22nd International Conference on Parallel Architectures and
Compilation Techniques (PACT2013), pages 353-362,
Edinburgh, Scotland, September 9-11, 2013.

"Achieving Scalable Parallelization For The Hessenberg Factorization" by
Anthony M. Castaldo and R. Clint Whaley.
In IEEE Cluster 2011, pages 65-73, Austin, TX,
September 26-30, 2011.

"Scaling LAPACK Panel Operations Using Parallel Cache Assignment" by
Anthony M. Castaldo and R. Clint Whaley.
In 15th ACM SIGPLAN Annual Symposium on Principles and Practice of
Parallel Programming, pages 223-231, Bangalore, India,
January 9-14, 2010.

"Minimizing Startup Costs for Performance-Critical Threading" by
Anthony M. Castaldo and R. Clint Whaley.
23rd IEEE International Parallel and
Distributed Processing Symposium, pages 1-8, Rome, Italy,
May 25-29, 2009.

"Empirically Tuning LAPACK's Blocking Factor for Increased Performance",
by R. Clint Whaley.
International Multiconference on Computer Science and Information
Technology,
Wisla, Poland, October 20-22, 2008.

"Automated Transformation for Performance-Critical Kernels",
by Qing Yi and R. Clint Whaley. ACM SIGPLAN Symposium on Library-Centric
Software Design, Montreal, Canada. October, 2007.

"Tuning High Performance Kernels through Empirical Compilation"
by R. Clint Whaley and David B. Whalley.
The 2005 International Conference on Parallel Processing (ICPP05),
June 14-17, 2005.

 "Automatically Tuned Linear Algebra Software"
by R. Clint Whaley and Jack Dongarra.
Ninth SIAM Conference on Parallel Processing for Scientific Computing,
March 22-24, 1999, CD-ROM Proceedings.

"Numerical Linear Algebra Problem Solving Environment Designer's Perspective",
Society for Industrial and Applied Mathematics, Philadelphia, PA,
1999.

"Automatically Tuned Linear Algebra Software"
by R. Clint Whaley and Jack Dongarra.
Winner, best paper in systems category, SuperComputing 1998:
High Performance Networking and Computing.

"ScaLAPACK: A Linear Algebra Library for Message-Passing Computers",
by L. Susan Blackford, Jaeyoung Choi, Andrew J. Cleary,
Eduardo F. D'Azevedo, James Demmel, Inderjit S. Dhillon,
Jack Dongarra, Sven Hammarling, Greg Henry, Antoine Petitet,
Ken Stanley, David Walker and R. Clint Whaley.
Proceedings of 1997 SIAM Conference on Parallel Processing
for Scientific Computing, March 1997.

"A Proposal for a Set of Parallel Basic Linear Algebra Subprograms",
by Jaeyoung Choi, J. Dongarra, S. Ostrouchov, A. Petitet, D. Walker
and R. C. Whaley. Second International Workshop, PARA'95, Lyngby, Denmark,
August 1995. Proceedings in
Lecture Notes in Computer Science, Number 1041, pages 107-114,
Springer-Verlag, Berlin - Heidelberg - New York, 1996.

"Two Dimensional Basic Linear Algebra Communications Subprograms",
by Jack Dongarra, Robert A. van de Geijn and R. Clint Whaley,
Proceedings of the Sixth SIAM Conference on Parallel Processing for
Scientific Computing, SIAM Publications, pages 347-352,
Norfolk, Virginia, March 22-24, 1993.
Books

Software Automatic Tuning: From Concepts to State-of-the-Art Results
by K. Naono, K. Teranishi, J. Cavazos, R. Suda (Eds.).
Springer New York Dordrecht Heidelberg London, 2010,
ISBN: 9781441969347.

ScaLAPACK Users' Guide
by L.S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon,
J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker,
R. C. Whaley. SIAM Publications, Philadelphia, 1997, ISBN 0898713978.

Handbook on Parallel and Distributed Processing,
editors: J. Blazewicz, K. Ecker, B. Plateau, D. Trystram.
Springer-Verlag Berlin Heidelberg, 2000, ISBN: 354066416.
Doctoral Dissertation

"Automated Empirical Optimization of High Performance Floating Point Kernels"
by R. Clint Whaley. Defended November 2, 2004.
Advisor: David Whalley
Master's Thesis

"Basic Linear Algebra Communication Subprograms: Analysis and
Implementation Across Multiple Parallel Architectures" by R. Clint Whaley.
May, 1994.
Advisor:
Jack Dongarra
Selected Workshops and Presentations

"ATLAS Version 3.8 : Overview and Status" by R. Clint Whaley.
International Workshop on Automatic Performance Tuning (iWAPT07),
Tokyo, Japan, September 20-21, 2007. Invited speaker with paper and
talk. Proceedings available.

"Automatically Tuned Linear Algebra Software" by R. Clint Whaley,
Workshop on Automatic Tuning for Petascale Systems,
Snowbird, Utah, July 9-12, 2007.

"NSF CRI CNS-0551504, ATLAS Support and Development",
by R. Clint Whaley,
2007 NSF/CISE CRI PI Meeting,
Boston, MA, June 4-5, 2007.
UTSA Technical Reports

"ATLAS Installation Guide",
by R. Clint Whaley. Technical Report CS-TR-2008-002,
University of Texas at San Antonio, January 2008.

"Achieving accurate and context-sensitive timing for code optimization",
by Anthony M. Castaldo and R. Clint Whaley. Technical Report CS-TR-2008-001,
University of Texas at San Antonio, January 2008.

"Automated Transformation for Performance-Critical Kernels",
by Qing Yi and R. Clint Whaley. Technical Report CS-TR-2007-003,
University of Texas at San Antonio, June 2007.

"Error Analysis of Various Forms of Floating Point Dot Products",
by Anthony M. Castaldo and R. Clint Whaley. Technical Report CS-TR-2007-002,
University of Texas at San Antonio, May 2007.
User's Guides, HOWTOs, and miscellaneous.

ATLAS Installation Guide.

"Some notes on using assembly", by R. Clint Whaley.

"A Guide to User Contribution to ATLAS", by R. Clint Whaley. Also available
online as
html.

"A Collaborative Guide to ATLAS Development",
by R. Clint Whaley and Peter Soendergaard. Also available
online as
html.

"A User's Guide to Extract",
by R. Clint Whaley. Also available
online as
html.

"Installation Guide and Design of the HPF 1.1 interface to ScaLAPACK, SLHPF"
by L. S. Blackford, J. J. Dongarra, C. A. Papadopoulos, and R. C. Whaley.
August, 1998.

"ScaLAPACK Evaluation and Performance at the DoD MSRCs"
by L. S. Blackford and R. C. Whaley. UT-CS-98-388, April 1998.

"Installation Guide for the BLACS and its Test Suite" by R. Clint Whaley.

"A User's Guide to the BLACS v1.1",
by J. Dongarra and R. C. Whaley. March, 1995 (last updated May 5, 1997).

"Installation Guide for ScaLAPACK"
by J. Choi, J. Demmel, I. Dhillon, J. Dongarra, S. Ostrouchov,
A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. March, 1995.

"Using BLACS and MPI in ScaLAPACK" by R. Clint Whaley.

"Outstanding Issues in the MPIBLACS" by R. Clint Whaley.

"Some Plebian Extensions to MPI" by R. Clint Whaley.