Technology & Research
Programming Systems Publications
2002
Cierniak, M., Lewis, B., Stichnoth, J., "Open Runtime Platform: Flexibility with Performance using Interfaces", Proceedings of the ACM Java Grande Conference, Seattle, WA, November 2002.

Youfeng Wu, Ryan Rakvic, Li-Ling Chen, Chyi-Chang Miao, George Chrysos, Jesse Fang, "Compiler Managed Micro-cache Bypassing for High Performance EPIC Processors", Micro-35, November 2002.

Tutorial: "ORC 2.0 and Tuning Performance on Itanium" to be presented at Micro-35, November 2002. Organizers: R. Ju, S. Chan, C. Wu, T. Ngai.

Cifuentes, C., Lewis, B., and Ung, D., "Walkabout - A Retargetable Dynamic Binary Translation Framework", Workshop on Binary Translation 2002, Charlottesville, VA, September 2002.

Tutorial: "Open Research Compiler (ORC): Beyond Version 1.0" at PACT, September 2002. Organizers: R. Ju, S. Chan, F. Chow, X. Feng.

Youfeng Wu, "Efficient Discovery of Regular Stride Patterns in Irregular Programs and Its Use in Compiler Prefetching", PLDI 2002, June 2002.

Neal Glew and Jens Palsberg. Type-Safe Method Inlining. In The European Conference on Object-Oriented Programming 2002, Malaga, Spain, pages 525-544 of LNCS 2374, Springer, June 2002.

Birds-of-a-Feather session: "Open Research Compiler (ORC) as a Compiler Research Infrastructure" at PLDI, June 2002. Organizers: R. Ju, S. Chan, C. Wu.

Martín Abadi, Neal Glew, Bill Horne, and Benny Pinkas. Certified Email with a Light On-line Trusted Third Party: Design and Implementation. In The Eleventh World Wide Web Conference, Honolulu, HI, USA, May 2002.

M. Eng, H. Wang, P. Wang, A. Ramirez, J. Fung, and J. Shen, "Mesocode: Optimizations for Improving Fetch Bandwidth of Itanium Processors", Workshop on Complexity Effective Design, in conjunction with ISCA 2002, May 2002.

Youfeng Wu, Mauricio Serrano, Rakesh Krishnaiyer, Wei Li, Jesse Fang, "Value Profile Guided Stride Prefetching for Irregular Code", International Conference on Compiler Construction, April 2002.

Neal Glew. A Theory of Second-Order Trees. In the European Symposium on Programming 2002, Grenoble, France, pages 147-161 of LNCS 2305, Springer, April 2002.

Efficiently Adapting Java Binaries in Limited Memory Contexts Pramod G. Joisha, Samuel P. Midkiff, Mauricio J. Serrano, Manish Gupta International Journal of Parallel Programming, August 2002, Vol. 30 , Issue 4

A Study of Memory Behavior of Java Workloads Yefim Shuf, Mauricio J. Serrano, Manish Gupta and Jaswinder Pal Singh. In Java Microarchitectures (Book): Edited by Vijaykrishnan Narayanan (Penn State University) and Mario L. Wolczko (Sun Microsystem) Book Series: The Kluwer International Series in Engineering and Computer Science : Volume 679: Chapter 2 Kluwer Academic Publishers, Boston, April 2002 (ISBN 1-4020-7034-9).

Quicksilver: A Quasi-static Java Compiler for Embedded Systems Sam Midkiff, Pramod Joisha, Mauricio J. Serrano, Manish Gupta. In Java Microarchitectures (Book): Edited by Vijaykrishnan Narayanan (Penn State University) and Mario L. Wolczko (Sun Microsystem) Book Series: The Kluwer International Series in Engineering and Computer Science : Volume 679: Chapter 4 Kluwer Academic Publishers, Boston, April 2002 (ISBN 1-4020-7034-9)

Youfeng Wu, "Accuracy of Profile Maintenance in Optimizing Compilers", INTERACT-6 with HPCA, Feb 2002.

Wei-Chung Hsu, Howard Chen, Pen-Chung Yew, and Dong-Yuan Chen. On the Predicatability of Program Behavior Using Different Input Data Sets. The 6th Annual Workshop on Interaction between Compilers and Computer Architectures (INTERACT-6), Feb. 2002. (in conjunction with HPCA-8).

Cifuentes, C., Lewis, B., and Ung, D., "Walkabout-A Retargetable Dynamic Binary Translation Framework", Sun Labs Tech Report TR-2002-106, January 2002. Expanded version of the above.

Cifuentes, C., Van Emmerik, M., Lewis, B., and Ramsey, N., "Experience in the Design, Implementation and Use of a Retargetable Static Binary Translation Framework", Sun Labs Tech Report TR-2002-105, January 2002.

"Just-In-Time Java Compilation for the Itanium Processor" by Tatiana Shpeisman, Guei-Yuan Lueh, Ali-Reza Adl-Tabatabai at PACT 02. Sapphire: Copying GC Without Stopping the World Richard L. Hudson and J. Eliot B. Moss Concurrency and Computation: Practice and Experience 2002; 14:1-39.

"Post-pass binary adaptation for software-based speculative precomputation". Authors: steve liao, perry wang, hong wang, gerolf hoflehner, dan lavery, john shen, Proc. of Programming Language Design and Implementation (PLDI), 2002.

"Segmented Alignment: An Enhanced Model to Align Data Parallel Programs of HPF" by G. Hwang, C. Chen, J. Lee, R. Ju, accepted by Journal of Supercomputing.

2001
Tutorial: "Open Research Compiler (ORC) for Itanium Processor Family" at Micro-34, December 2001. Organizers: R. Ju, S. Chan, C. Wu.

Mauricio Serrano and Youfeng Wu, "Memory Performance Analysis of SPEC2000C for the Itanium™ Processor", IEEE 4th Annual Workshop on Workload Characterization, Austin TX, Micro-34, December 2001.

Kalyan Muthukumar, Dong-Yuan Chen, Youfeng Wu, Daniel M. Lavery. Software Pipelining of Loops with Early Exits for the Itanium Architecture. The 1st Workshop on EPIC Architectures and Compiler Technology (EPIC-1), December 2001. (with MICRO-34).

"The Impact of If-Conversion and Branch Prediction on Program Execution on the Intel Itanium Processor,'' Youngsoo Choi, Allan Knies, Luke Gerke and Tin-Fook Ngai, in Proceedings 34th Annual International Symposium on Microarchitecture, Austin, December 1-5, 2001, pp. 182-191.

2000
Li-Ling Chen and Youfeng Wu, "Fast Forward: Aggressive Compiler Optimization with Speculative Multi-Threaded Support", Workshop on Multithreaded Execution, Architecture and Compilation, in conjunction with Micro-33, Dec. 2000.

R. D.-C. Ju, K. Nomura, U. Mahadevan, and L.-C. Wu, "A Unified Compiler Framework for Control and Data Speculation," Proc. of 2000 Int'l Conf. on Parallel Architectures and Compilation Techniques (PACT), pp. 157 - 168, Oct. 2000.

U. Mahadevan, K. Nomura, R. D.-C. Ju, and Rick Hank"Applying Data Speculation in Modulo Scheduled Loops," Proc. of 2000 Int'l Conf. on Parallel Architectures and Compilation Techniques (PACT), pp. 169 - 176, Oct. 2000.

J. Knoop, J.F. Collard, and R. D.-C. Ju, "Partial Redundancy Elimination on Predicated Code," Proc. of 7th Static Analysis Symposium, Volume 1824 of LNCS, pp. 260-279, Springer, 2000.

Youfeng Wu and Yong-fong Lee, "Accurate Invalidation Profiling for Effective Data Speculation on EPIC Processors," ISCA 13th International Conference on Parallel and Distributed Computing Systems, Las Vegas, Nevada, August 8-10, 2000.

J. Knoops, J.F. Collard, and R. D.-C. Ju, "Partial Redundancy Elimination on Predicated Code," To appear in Proc. of 7th Static Analysis Symposium, Santa Barbara, June, 2000.

Michal Cierniak, Guei-Yuan Lueh and James Stichnoth. "Practicing JUDO: Java Under Dynamic Optimizations," In Proceedings of the SIGPLAN '00 Conference on Programming Language Design and Implementation (PLDI), Vancouver B.C., Canada, June, 2000.

Hsien-Hsin Lee, Youfeng Wu, and Gary Tyson, "Quantifying Instruction-Level Parallelism on an EPIC Architecture," International Symposium on Performance Analysis of Systems and Software, Apr. 2000.

Hsien-Hsin Lee, Youfeng Wu, and Gary Tyson, "Quantifying Instruction-Level Parallelism on an EPIC Architecture," International Symposium on Performance Analysis of Systems and Software, Apr. 2000.

Jesse Fang, Tutorial on "IA-64 Architecture and ILP Compiler Research", ISCA'00, 06/12-14/00, Vancouver, Canada, 2000.

1999
Kingsum Chow and Youfeng Wu, "Feedback-Directed Selection and Characterization of Compiler Optimizations," Workshop on Feedback Directed Optimizations with MICRO32, Israel, Nov. 1999.

Tatiana Shpeisman and Mustafa Tikir, "Generating Efficient Stack Code for Java," Technical Report CS-TR-4069, Dept. of Computer Science, University of Maryland, College Park, October 1999.

Youfeng Wu, Yong-fong Lee, and Hong Wang, "An Efficient Software-Hardware Collaborative Profiling Technique for Wide-Issue Processors," Workshop on Binary Translation with PACT99, Newport Beach, California, Oct. 1999.

V.C. Sreedhar, R. D.-C. Ju, D. Gillies, and V. Santhanam, "Translating Out of Static Single Assignment Form," Proc. of 6th Static Analysis Symposium, Venice, Italy, pp. 194-210, September, 1999.

Youfeng Wu and Yong-Fong Lee, "Comprehensive Redundant Load Elimination for the IA-64 Architecture," Proceedings of the 12th International Workshop on Languages and Compilers for Parallel Computing, San Diego, California, August 1999.

James Stichnoth, Guei-Yuan Lueh, and Michal Cierniak. "Support for Garbage Collection at Every Instruction in a Java Compiler," In Proceedings of the SIGPLAN '99 Conference on Programming Language Design and Implementation (PLDI), Atlanta, GA, May, 1999.

Tutorial on "Compiler Technology on IA-64 Architecture", PACT'99, 10/12/99, Newport Beach, CA, 1999.

Jesse Fang, Patel on "Architecture/Compiler Research in Industry and Academia", PACT'99, 10/12-16/99, Newport Beach, CA., 1999.

Jesse Fang, Tutorial on "IA-64 Architecture and IA-64 Compiler Technology", HotChips'99, 09/10/99, Palo Alto, CA., 1999.

Ching-Hong Leung, Kin-Man Cheung, Kin-Ho Yan and Tin-Fook Ngai, "Design and Implementation of a Mobile Application Support System," in Advances in Database Technologies, edited by Y. Kambayashi, D. L. Lee, E-P. Lim, M. K. Mohania and Y. Masunaga, LNCS Vol. 1552, Springer Verlag, 1999.

1998
John Irwin, Jean-Marc Loingtier, John Gilbert, Gregor Kiczales, John Lamping, Anurag Mendhekar and Tatiana Shpeisman, "Aspect-Oriented Programming of Sparse Matrix Code," International Scientific Computing in Object-Oriented Parallel Environments, Marina del Rey, California, December 1997.

Michal Cierniak and Wei Li. "Interprocedural Array Remapping," In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT '97), San Francisco, CA, November, 1997.

G.-H. Hwang, J.K. Lee, and R. D.-C. Ju, "A Function-Composition Approach to Synthesize Fortran 90 Array Operations," J. of Parallel and Distributed Computing, No. 54, pp. 1-47, Oct. 1998.

Adl-Tabatabai, M. Cierniak, G.Y. Lueh, V.M. Parikh and J.M. Stichnoth, Fast, Effective Code Generation in a Just-In-Time Java Compiler, Proc. ACM SIGPLAN Symp. on Programming Language Design and Implementation, pages 280 - 290, June, 1998.

Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Warter, Wen-mei W. Hwu: IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors. 25 Years ISCA: Retrospectives and Reprints 1998: 408-417, 1998.

R. D.-C. Ju, J.-F. Collard, and K. Oukbir, "Probabilistic Memory Disambiguation and its Application to Data Speculation," Proc. of the 3rd Workshop on Interaction between Compilers and Computer Architectures, San Jose, Oct. 1998.

William Pugh and Tatiana Shpeisman, "SIPR: A new framework for generating efficient code for sparse matrix computations," Eleventh International Workshop on Languages and Compilers for Parallel Computing, Chapel Hill, North Carolina, August 1998.

Ali-Reza Adl-Tabatabai, Michal Cierniak, Guei-Yuan Lueh, Vishesh M. Parikh and James Stichnoth. "Fast, Effective Code Generation in a Just-In-Time Java Compiler," In Proceedings of the SIGPLAN '98 Conference on Programming Language Design and Implementation (PLDI), Montreal, Canada, June, 1998.

A.V.S. Sastry and R. D.-C. Ju, "A New Algorithm for Scalar Register Promotion based on SSA Form," Proc. of the ACM SIGPLAN'98 Conference on Programming Language Design and Implementation, pp. 15-25, June 1998. Hudson, R. L., Morrison, R., Moss, J. E. B. & Munro, D. S. "Where have all the pointers gone?" In Computer Science '98, C. McDonald (ed) Springer, Proc. 21st Australasian Computer Science Conference, Perth, Australia, ISBN 981-3083-90-5 pp 107-119, 1998.

Jesse Fang, Patel on "The New Crop of Java Virtual Machine", OOPSLA'98, 10/18-22/98, Vancouver, Canada, 1998.

1997
Jesse Fang et al, "Changing Interaction of Compiler and architecture" pp 51-59, COMPUTER, Dec, 1997.

Jesse Fang et al, "Compilers for Instruction-Level Parallelism" pp 63-70, COMPUTER, Dec, 1997.

Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C. Gyllenhaal, and Wen-mei W. Hwu, "Optimizing NET Compilers for Improved Java Performance", IEEE Computer, June, 1997.

G.-H. Hwang, J.K. Lee, and R. D.-C. Ju, "Integrating Automatic Data Alignment and Array Operation Synthesis to Optimize Data Parallel Programs," Proc. of the 10th Int'l Workshop on Languages and Compilers for Parallel Computing, Aug. 1997.

Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C. Gyllenhaal, and Wen-mei W. Hwu, "A study of the Cache and Branch Performance Issues with Running Java on Current Hardware Platforms", Proceedings of 42nd IEEE Computer Society International Conference, pp.211-216, February 1997.

Jesse Fang, "An If-Conversion Algorithm with Predicate Microarchitecture Support" Proceedings of the International Conference on High Performance Computer Architecture, Feb, 1997.

Hudson, R L, Morrison, R, Moss, J E B, Munro, D S. "Garbage Collecting the World: One Car at a Time." In: Proc. OOPSLA 97, Atlanta, USA. 1997.

Hudson, R. L., Morrison, R., Moss, J. E. B. & Munro, D. S. "Training Distributed Garbage: The DMOS Collector," University of St Andrews Report, 1997.

G.Y. Lueh and T. Gross, Call-cost Directed Register Allocation, Proc. ACM SIGPLAN Symp. on Programming Language Design and Implementation, June 1997, pages 296 - 307, 1997.

T. Z. Yu, E. H.-M. Sha, N. Passos, and R. D.-C. Ju, "Algorithm and Hardware Support for Branch Anticipation," Proc. of the 1997 Great Lakes Symposium on VLSI, 1997.

Yong-fong Lee and Youfeng Wu, "Hot Region Loop Formation for Effective Software Pipelining and ILP Optimization," 1997 Intel Software Development Conference, October 21-23, Portland, Oregon, 1997.

1996
D. Gillies, R. D.-C. Ju, R. Johnson, and M. Schlansker, "Global Predicate Analysis and its Application to Register Allocation," Proc. of the 29th Symp. of Microarchitecture, pp. 114- 125, December 1996.

Cheng-Hsueh A. Hsieh, John C. Gyllenhaal, and Wen-mei W. Hwu, "Java Bytecode to Native Code Translation: The Caffeine Prototype and Preliminary Results," Proceedings of 29th Annual ACM/IEEE International Symposium on Microarchitecture, pp.90-97, December, 1996.

Wayne Kelly, William Pugh, Evan Rosser and Tatiana Shpeisman, "Transitive Closure of Infinite Graphs and its Applications, International Journal of Parallel Programming," v. 24, n. 6, pages 579-598, December 1996.

G.Y. Lueh, Issues in Register Allocation by Graph Coloring, Technical Report CMU-CS-96-171, School of Computer Science, Carnegie Mellon University, November, 1996.

Adl-Tabatabai, T. Gross and G.Y. Lueh, Code Reuse in an Optimizing Compiler, Proc. ACM Conference on Object-Oriented Programming Systems, Languages, and Applications (OOPSLA'96), pages 51 - 68. October, 1996.

P. Chang, D.-Y. Chen, Y.F. Lee, Youfeng Wu, and U. Banerjee, "Bidirectional Scheduling: A New Global Code Scheduling Approach," Proceedings of the 9th International Workshop on Languages and Compilers for Parallel Computing, August, 1996.

P. Chang, D.-Y. Chen, Y.F. Lee, Y. Wu, and U. Banerjee, "Bidirectional Scheduling: A New Global Code Scheduling Approach," Proceedings of the 9th International Workshop on Languages and Compilers for Parallel Computing, August, 1996.

Jesse Fang, "Compiler Algorithms on If-Conversion, Speculative Predicates Assignment and Predicated Code Optimizations" Proceedings of the 9th International Workshop on Languages and Compilers for Parallel Computing, Aug 1996.

G.Y. Lueh, T. Gross and A. Adl-Tabatabai, Global Register Allocation Based on Graph Fusion, Ninth Workshop on Languages and Compilers for Parallel Computers, pages 246 - 265, August 1996.

G.-H. Hwang, J.K. Lee, and R. D.-C. Ju, "Array Operation Synthesis to Optimize HPF Programs," Proc. of the 25th Int'l Conf. on Parallel Processing, pp. III:1-8, August, 1996.

William Pugh, Evan Rosser, and Tatiana Shpeisman, "Exploiting Monotone Convergence Functions in Parallel Programs," Ninth Annual Workshop on Languages and Compilers for Parallel Computing, San Jose, California, August 1996.

G.Y. Lueh, T. Gross and A. Adl-Tabatabai, Global Register Allocation Based on Graph Fusion, Technical Report CMU-CS-96-106, School of Computer Science, Carnegie Mellon University, March 1996.

Youfeng Wu, P. Chang, D.-Y. Chen, Y. Lee, "Speculative Load Motion for Wide Issue Speculative Processors," 1996 Intel Software Development Conference, October 21-23, 1996, Scottsdale, Arizona, 1996.

Moss, J. E. B., Munro, D. S. & Hudson, R. L. "PMOS: A Complete and Coarse-Grained Incremental Garbage Collector for Persistent Object Stores," In Proc. 7th International Workshop on Persistent Object Systems (POS7), Cape May, NJ, USA, 1996.

Tin-Fook Ngai, King-On Yeung and Shing-Chi Cheung, "An Integrated Distributed Multimedia System and Development Environment," in Proceedings 1996 Pacific Workshop on Distributed Multimedia Systems, Hong Kong, June 25-28, 1996, pp. 191-198, 1996.

Youfeng Wu, Pohua Chang, Dong-Yuan Chen, Yong-Fong Lee, "Speculative Load Motion for Wide Issue Speculative Processors," 1996 Intel Software Development Conference, October 21-23, Scottsdale, Arizona, 1996.

1995
Youfeng Wu and James Larus, "Static Prediction of Branch Probability, Branch Frequency and Function Frequency," 27th Annual International Symposium on Microarchitecture, Dec. 1995.

G.-H. Hwang, J.K. Lee, and R. D.-C. Ju, "An Approach to Array Operation Synthesis for Data Parallel Languages," Proc. of 5th ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, pp. 112-122, Santa Barbara, CA, July, 1995.

Michal Cierniak and Wei Li. "Unifying Data and Control Transformations for Distributed Shared-Memory Machines," In Proceedings of the SIGPLAN '95 Conference on Programming Language Design and Implementation (PLDI), La Jolla, California, June, 1995.

Wayne Kelly, Vadim Maslov, William Pugh, Evan Rosser, Tatiana Shpeisman and David Wonnacott, "The Omega Library Interface Guide," Technical Report CS-TR-3445, Dept. of Computer Science, University of Maryland, College Park, March 1995.

Li-Ling Chen, W. Ludwell Harrison, and Kwang-keun Yi. "Efficient computation of fixpoints that arise in complex program analysis," The Journal of Programming Languages, Vol. 3, No. 1, pp.31-68, Chapman & Hall, UK, March, 1995.

Dong-Yuan Chen, "C-PAR: Extensions to C for Adaptive and Irregular Parallel Computations," Ph.D. Dissertation, Dept. of Computer Science, Yale University, 1995.

Pohua P. Chang, Daniel M. Lavery, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu: The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors. IEEE Transactions on Computers 44(3): 353-370, 1995.

Youfeng Wu, "Strength Reduction of Multiplications by Integer Constants," SIGPLAN Notices 30 (2): 42-48 , 1995.

1994
D.F. Bacon, J.-H. Chow, D.-C. Ju, K. Muthukumar, and V. Sarkar, "A Compiler Framework for Restructuring Data Declarations to Enhance Cache and TLB Effectiveness," Proc. of IBM Center for Advanced Studies Conference, pp. 270-282, (Winner of Best Paper Award), Canada, November, 1994.

D.-C. Ju, C.-L. Wu, and P. Carini, "The Classification, Fusion, and Parallelization of Array Language Primitives," IEEE Trans. on Parallel and Distributed Systems, 5(10):1113-1120, October, 1994.

Li-Ling Chen and W. Ludwell Harrison. "Performing abstract interpretation in parallel," The Seventh International Conference on Parallel and Distributed Computing Systems, 1994, pp. 1-7, October, 1994.

Li-Ling Chen and W. Ludwell Harrison. "An efficient approach to computing fixpoints for complex program analysis," The Eighth ACM International Conference on Supercomputing, Manchester, England, pp. 98-106, July 1994.

Li-Ling Chen and W. Ludwell Harrison. "An efficient approach to computing fixpoints for complex program analysis," The Eighth ACM International Conference on Supercomputing, Manchester, England, pp. 98-106, July 1994.

Li-Ling Chen. "Efficient Computation of Fixpoints that Arise in Abstract Interpretation" Ph.D. Thesis, Department of Computer Science, University of Illinois at Urbana-Champaign, Report No. UIUCDCS-R-94-1866, July, 1994.

D.-C. Ju, C.-L. Wu, and P. Carini, "Statement Merge: an Inter-statement Optimization of Array Language Programs," Proc. of the 23rd Int'l Conf. on Parallel Processing, pp. II:126-129, August, 1994.

Dong-Yuan Chen and Marina C. Chen, "CPAR-Language Extensions to C for Irregular and Adaptive Parallel Computation," Proceedings of the Scalable High-Performance Computing Conference '94, pp.501-508, May 1994.

Pohua P. Chang, Nancy J. Warter, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu: Three Architectural Models for Compiler-Controlled Speculative Execution. IEEE Transactions on Computers 44(4): 481-494 (1995) 1994.

Su, Bogong, Stanley Habib, Wei Zhao, Jian Wang, Youfeng Wu, "A Study of Pointer Aliasing for Software Pipelining using Run-time Disambiguation," Micro-27, 1994, pp112-118, 1994.

1993
D.-C. Ju and C.-L. Wu, "Integrated Optimizations and Parallelizations of Array Language Programs," Proc. of Int'l Conf. on Parallel and Distributed Systems, pp. 178-185, December, 1993.

Antony L. Hosking, Richard L. Hudson, "Remembered sets can also play cards ," Position paper for OOPSLA '93 Workshop on Memory Management and Garbage Collection, Washington, DC, October, 1993.

Jesse Fang and Mi Lu, "An Iteration Partition Approach for cache or Local Memory Thrashing on Parallel Processing", pp. 529-547, IEEE Trans. on Computers, May, 1993.

Adl-Tabatabai, T. Gross, G.Y. Lueh and J. Reinders. Modeling Instruction-Level Parallelism for Software Pipelining, Proc. of the IFIP WG10.3 Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism, Orlando, FL, pages 321-330. North Holland, January, 1993.

W.-M. Ching, P. Carini, and D.-C. Ju, "A Primitive-Based Strategy for Producing Efficient Code for Very High Level Programs," Int'l J. of Computer Languages, Vol. 19, No. 1, pp. 41-50, January, 1993.

David M. Gallagher, William Y. Chen, Scott A. Mahlke, John C. Gyllenhaal, Wen-mei W. Hwu: Dynamic Memory Disambiguation Using the Memory Conflict Buffer. ASPLOS 1994: 183-193, 1993.

William Y. Chen, Pohua P. Chung, Thomas M. Conte, Wen-mei W. Hwu: The Effect of Code Expanding Optimizations on Instruction Cache Design. IEEE Transactions on Computers 42(9): 1045-1057, 1993.

Tokuzo Kiyohara, Scott A. Mahlke, William Y. Chen, Roger A. Bringmann, Richard E. Hank, Sadun Anik, Wen-mei W. Hwu: Register Connection: A New Approach to Adding Registers into Instruction Set Architectures. ISCA 1993: 247-256, 1993.

Scott A. Mahlke, William Y. Chen, Roger A. Bringmann, Richard E. Hank, Wen-mei W. Hwu, B. Ramakrishna Rau, Michael S. Schlansker: Sentinel Scheduling for VLIW and Superscalar Processors. TOCS 11(4): 376-408, 1993.

1992
Youfeng Wu, "Ordering Functions for Improving Memory Reference Locality in Shared Memory Multiprocessor Systems," 25th Annual International Symposium on Microarchitecture, Dec. 1992.

T. Gross, S. Hinrichs, G. Lueh, D. O'Hallaron, J.Stichnoth, and J. Subholk. Compiling Task and Data Parallel Programs for iWarp, Workshop on Languages, Compilers, and Run-time Environments for Distributed Memory Machines, Sep. - Oct 1992.

Jesse Fang and Mi Lu, "A Solution of the cache Ping-Pong Problem in Multiprocessor Systems", pp. 158-172, Journal of Parallel and Distributed Computing, Oct. 1992.

D.-C. Ju, C.-L. Wu, and P. Carini, "The Synthesis of Array Functions and Its Use in Parallel Computation," Proc. of the 21st Int'l Conf. on Parallel Processing, pp. II:293-296, August, 1992.

Meng Lee, Partha Tirumalai and Tin-Fook Ngai, "Software Pipelining and Superblock Scheduling: Compilation Techniques for VLIW Machines," in Proc. 26th Hawaii International Conference on System Sciences, January 1993 (Also as Technical Report HPL-92-78, Hewlett-Packard Laboratories, June 1992.

Dong-Yuan Chen and Marina C. Chen, "Parallelizing FORTRAN Programs for Massively Parallel Machines via Crystal," Department of Computer Science, Yale University, Technical Report YALEU/DCS/TR-858, March 1992.

Tin-Fook Ngai, Runtime Resource Management in Concurrent Systems, Tech. Report No. CSL-TR-92-504, Computer Systems Laboratory, Stanford University, January, 1992.

Youfeng Wu, "The Design of a Test Suite for Parallelizing Translators," 25th International Conference on System Sciences, Hawaii, Jan. 1992.

W.-M. Ching and D.-C. Ju, "An APL-to-C Compiler for IBM RS/6000 System: Compilation, Performance and Limitations," Proc. of IBM Programming Technology Forum, June, 1992. (Also appeared in) ACM SIGAPL Quad Quote, 1992.

Tin-Fook Ngai, Stephen F. Lundstrom and Michael J. Flynn, "Automated Runtime Scheduling of Unstructured Scientific Computation on Scalable Multiprocessors," in Unstructured Scientific Computation on Scalable Multiprocessors, edited by P. Mehrotra, J. Saliz and R. Voigt, MIT Press, 1992.

Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu, B. Ramakrishna Rau, Michael S. Schlansker: Sentinel Scheduling for VLIW and Superscalar Processors. ASPLOS 1992: 238-247, 1992.

Richard Hudson and J. Eliot B. Moss. "Incremental Collection of Mature Objects ," International Workshop on Memory Management, St. Malo, France, Sept. 1992, pp. 388-403, available from Springer-Verlag, 1992.

M. Chen, D.-Y. Chen, Y. Hu, M. Jacquemin, C.-Y. Lin, and J.-J. Wu, "Parallelizing Programs for Distributed-Memory Machines using the Crystal System," in Saltz and Mehrotra eds, Languages, Compilers and Run-Time Environments for Distributed Memory Machines, 1992.

Amer Diwan, J. Eliot B. Moss, Richard Hudson. "Compiler Support for Garbage Collection in a Statically Typed Language ," SIGPLAN PLDI '92, pp. 273-282, 1992.

1991
W.-M. Ching and D.-C. Ju, "Execution of Automatically Parallelized APL Programs on RP3," IBM J. of Research and Development special issue on Parallel Processing, Vol. 35, No. 5/6, pp. 767-777, September/November, 1991.

D.-C. Ju, W.-M. Ching, and C.-L. Wu, "On Performance and Space Usage Improvement for Parallelized Compiled APL Code," Proc. of APL'91 Conf., Palo Alto, CA, August, 1991.

D.-C. Ju and W.-M. Ching, "Exploitation of APL Data Parallelism on a Shared-memory MIMD Machine," Proc. of the 3rd ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, pp. 61-72, Williamsburg, VA, April, 1991.

Pohua P. Chang, William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu: Comparing Static and Dynamic Code Scheduling for Multiple-Instruction-Issue Processors. MICRO 1991: 25-33, 1991.

William Y. Chen, Scott A. Mahlke, Pohua P. Chang, Wen-mei W. Hwu: Data Access Microarchitectures for Superscalar Processors with Compiler-Assisted Data Prefetching. MICRO 1991: 69-73,1991.

Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu: Profile-guided Automatic Inline Expansion for C Programs. SP&E 22(5): 349-369 (1992) 1991.

EE Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Warter, Wen-mei W. Hwu: IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors. ISCA 1991: 266-275, 1991.

1990
Youfeng Wu and Ted Lewis, "Parallel Algorithms for Decomposable Linear Programs," Proc. 1990 International Conference on Parallel Processing, Aug. 1990.

Youfeng Wu and Ted Lewis, "Parallelizing While Loops," Proc. 1990 International Conference on Parallel Processing, Aug. 1990.

Youfeng Wu and Ted Lewis, "Parallelism Encapsulation in C++," Proc. 1990 International Conference on Parallel Processing, Aug. 1990.

Jesse Fang, P.Tang and P.Yew, "Dynamic Processor Self-Scheduling for General Parallel Nested Loops", pp. 919-930, IEEE Trans. on Computers, July 1990.

1980 – 1989
Youfeng Wu and Ted Lewis, "Parallel Processor Load Balance Through Loop Spreading," Proc. Supercomputing '89, Nov. 1989.

Jesse Fang and X. Li, "Parallel Clustering Algorithms", pp. 275-290, Journal on Parallel Computing, Nov. 1989.

Jesse Fang, X. Li, and C-Q, Zhu; "A New Parallel Sorting Approach with Sorting Memory Module" pp. 482-502, Journal of Parallel and Distributed Computing, July, 1989.

Jesse Fang, X.Li and L.Ni, "On the Communication Complexity of Generalized 2-D Convolution on Array Processors", pp180-194, IEEE Trans. On Computers, Feb, 1989.

Jesse Fang and J.S.Deogun, "Closure Properties of Certain Classes of Languages under Generalized Morphic Replication", pp. 325-331, The Computer Journal, April, 1988.

Li-Ling Chen. "An Object-Oriented Approach to Knowledge Base Management," Masters thesis, Department of Computer Science and Information Engineering, National Taiwan University, May 1987.

Tin-Fook Ngai, Jerry C. Yan, Victor W.K. Mak, Stephen F. Lundstrom and Michael J. Flynn, "Mapping between Parallel Processor Structures and Programs," in Proc. 20th Hawaii International Conference on System Sciences, Kailua-Kona, Hawaii, Jan. 6-9, 1987.

Jesse Fang, X.Li and L.Ni, "Parallel Algorithms for Image Template Metching on Hypercube SIMD Computers", pp. 835-841, IEEE Trans. on Pattern Analysis and Machine Intelligence, Nov. 1987

Tin-Fook Ngai, Dynamic Resource Allocation in a Hierarchical Multiprocessor System - A Preliminary Study, Tech. Report No. CSL-TR-86-310, Computer Systems Laboratory, Stanford University, Oct. 1986.

Tin-Fook Ngai, Mary J. Irwin and Shishpal Rawat, "Regular, Area-Time Efficient Carry-Lookahead Adders," in Journal of Parallel and Distributed Computing, vol. 3, 1986, pp. 92-105, 1986.

Tin-Fook Ngai and Mary J. Irwin, "Regular, Area-Time Efficient Carry-Lookahead Adders," in Proc. 7th Symposium on Computer Arithmetic, Urbana, Illinois, June 4-6 1985, pp 9-15, 1985.

Tin-Fook Ngai, On the Capacitive Model of Computation in VLSI - Its Limitation and Applicability, Tech. Report No. CS-84-15, Dept. of Computer Science, Pennsylvania State University, University Park, PA, September, 1984.

Richard Hudson, J. Eliot B. Moss, Amer Diwan, and Christopher F. Weight. "A Language-Independent Garbage Collector Toolkit ," University of Massachusetts at Amherst Technical Report TR 91-47.

G.-H. Hwang, J.K. Lee, and R. D.-C. Ju, "Array Operation Synthesis to Optimize HPF Programs on Distributed Memory Machines" to appear in J. of Parallel and Distributed Computing.

All information provided related to future Intel products and plans is preliminary and subject to change at any time, without notice.
Research Focus Areas
Related Links
Back to Top