|
|