fsucas.jpg

Computer Architecture and SysTems Research Lab (CASTL)

Address: 114 Milton Carothers Hall (MCH), Tallahassee, FL 32306; Contact: Dr. Weikuan (Will) Yu (yuw@cs.fsu.edu), 850-644-5442

List of Publications

PASL Publications

Total 57 publications.

2012201120102009200820072006200520042003

2012

  • Gu, B., Carpenter, P., Yu, W. (2012). Implementation of Multicore-Aware Load Balancing on Clusters through Data Distribution in Chapel. Journal of The KIPS Transactions: Part A. 19 , 129 - 138
  • Li, D., Vetter, J.S., Yu, W. (2012). Classifying Soft Error Vulnerabilities in Extreme-Scale Scientific Applications Using a Binary Instrumentation Tool International Conference for High Performance Computing, Networking, Storage and Analysis (SC'12) Salt Lake City, UT , , -
  • Liu, Z., Wang, B., Carpenter, P., Li, D., Vetter, J.S., Yu, W. (2012). PCM-Based Durable Write Cache for Fast Disk I/O. International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS'12) Washington, DC , , -
  • Tian, Y., Klasky, S., Yu, W., Abbasi, H., Wang B., Podhorszki, N., Grout, R., Wolf, M. (2012). A System-Aware Optimized Data Organization for Efficient Scientific Analytics. HPDC 2012. Poster. Delft, The Netherlands , , -
  • Tian, Y., Klasky, S., Yu, W., Abbasi, H., Wang B., Podhorszki, N., Grout, R., Wolf, M. (2012). Smart-IO: SysteM-AwaRe Two-Level Data Organization for Efficient Scientific Analysis. International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS'12) Washington, DC , , -
  • Tian, Y., Yu, W. (2012). Smart-IO: System-Aware Two-Level Data Organization for Efficient Scientific Analytics. 2012 ACM Grand Finals Student Research Competition, San Francisco, CA, , - PDF link
  • Yu, W., Que, X., Tipparaju, V., Vetter, J.S. (2012). HiCOO: Hierarchical Cooperation for Scalable Communication in Global Address Space Programming Models on Cray XT Systems Journal of Parallel and Distributed Computing , -

2011

  • Klasky,S., Abbasi,H., Logan,J., Parashar,M., Schwan,K., Shoshani,A., Wolf,M., Ahern,S., Altintas,I., Bethel,W., Chacon,L., Chang,C.S., Chen,J., Childs,H., Cummings,J., Ethier,S., Grout,R., Lin,Z., Liu,Q., Ma,X., Moreland,K., Pascucci,V., Podhorszki,N., Samatova,N., Schroeder,W., Tchoua,R., Wu,K.J., Yu,W. (2011). In situ data processing for extreme scale computing. SciDAC Conference, Denver, CO , , -
  • Liu, Z., Zhou, J., Yu, W., Wu, F., Qin, X., Xie, C. (2011). MIND: A Black-Box Energy Consumption Model for Disk Arrays. The First International Workshop on Energy Consumption and Reliability of Storage Systems (ERSS'11). Orlando, FL , , -
  • Gu,B., Yu,W., Kwak,Y. (2011). Communication and Computation Overlap through Task Synchronization in Multi-Locale Chapel Environment. The 3rd International Workshop on IT Service and Cloud Computing (ISCC'11) Crete, Greece PDF link
  • Que,X., Yu,W., Tipparaju,V., Vetter,J.S., Wang,B. (2011). Network-Friendly One-Sided Communication Through Multinode Cooperation on Petascale Cray XT5 Systems. IEEE International Symposium on Cluster Computing and the Grid (CCGrid'11) Newport Beach, CA PDF link
  • Shoshani,A., Altintas,I., Chen,J., Chin,G., Choudhary,A., Crawl,D., Critchlow,T., Gao,K., Grimm,B., Iyer,H., Kamath,C., Khan,A., Klasky,S., Koehler,S., Lang,S., Latham,R., Li,J.W., Liao,W., Ligon,W., Liu,Q., Ludaescher,B., Mouallem,P., Nagappan,M., Podhorszki,N., Ross,R., Rotem,D., Samatova,N., Silva,C., Sim,A., Tchoua,R., Thakur,R., Vouk,M., Wu,K., Yu,W. (2011). The Scientific Data Management Center: Available Technologies and Highlights. SciDAC Conference, Denver, CO PDF link
  • Tian,Y. (2011). SRC: Enabling Petascale Data Analysis for Scientific Applications Through Data Reorganization. ACM International Conference on Supercomputing (ICS'11) Tucson, AZ PDF link
  • Tian,Y., Klasky,S., Abbasi,H., Lofstead,J., Grout,R., Podhorszki,N., Liu,Q., Wang,Y., Yu,W. (2011). EDO: Improving Read Performance for Scientific Applications Through Elastic Data Organization. IEEE International Conference on Cluster Computing (Cluster'11) Austin, TX PDF link
  • Wang,Y., Que,X., Yu,W., Goldenberg,D., Sehgal,D. (2011). Hadoop Acceleration through Network Levitated Merge. International Conference for High Performance Computing, Networking, Storage and Analysis (SC'11) Seattle, WA PDF link
  • Yu,W., Tipparaju,V., Que,X., Vetter,J.S. (2011). Virtual Topologies for Scalable Resource Management and Contention Attenuation in a Global Address Space Model on the Cray XT5. IEEE International Conference on Parallel Processing (ICPP'11) Taipei, Taiwan PDF link
  • Yu,W., Wu,K.J., Ku,W.S., Xu,C., Gao,J. (2011). BMF: Bitmapped Mass Fingerprinting for Fast Protein Identification. IEEE International Conference on Cluster Computing (Cluster'11) Austin, TX PDF link

2010

  • Tipparaju,V.,, Apra,E.,, Yu,W.,, Vetter,J. (2010). Enabling a highy-scalable global address space model for petascale computing, Proceedings of International Conference on Computing Frontiers, Bertinoro, Italy PDF link
  • Yu,W.,, Que,X.,, Tipparaju,V.,, Graham,R.,, Vetter,J. (2010). Cooperative Server Clustering for a Scalable GAS Model on Petascale Cray XT5 Systems, International Supercomputing Conference, Computer Science-Research and Development series. Springer Berlin / Heidelberg, 25, 57-64 PDF link
  • Yu,W.,, Tian,Y.,, Vetter,J. (2010). Efficient Zero-Copy Noncontiguous I/O for Globus on InfiniBand, Proceedings of the Third International Workshop on Parralel Programming Models and Systems Software for High-end Computing. Held in Conjunction with ICPP10, San Diego, CA PDF link
  • Yu,W.,, Vetter,J. (2010). Initial Characterization of Parallel NFS Implementations, Workshop on System Management Techniques, Processes, and Services (SMTPS)Held in Conjunction with IPDPS10, Atlanta, GA PDF link

2009

  • Yu,W.,, Drokin,O.,, Vetter,J.S. (2009). Design, Implementation, and Evaluation of Transparent pNFS on Lustre, 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS'09) Rome, Italy PDF link

2008

  • Alam,S.,, Barrett,B.,, Bast,M.,, Fahey,M.R.,, Kuehn,J.,, McCurdy,C.,, Rogers,J.,, Roth,P.,, Sankaran,R.,, Vetter,J.,, Worley,P.,, Yu,W. (2008). Early Evaluation of IBM BlueGene/P, SC08, PDF link
  • Rao,N.S.V.,, Yu,W.,, Poole,S.W.,, Wing,W.R.,, Vetter,J.S. (2008). Wide-Area Performance Profiling of 10GigE and Infiniband Technologies, SC08: International Conference High Performance Computing, Networking, Storage, and Analysis, PDF link
  • Yu,W.,, Oral,H.S.,, Canon,R.S.,, Vetter,J.S.,, Sankaran,R. (2008). Empirical Analysis of a Large-Scale Hierarchical Storage System, The 14th European Conference on Parallel and Distributed Computing (Euro-Par 2008) PDF link
  • Yu,W.,, Rao,N.S.V.,, Wyckoff,P.,, Vetter,J.S. (2008). Performance of RDMA-capable Storage Protocols on Wide-Area Network, Peta-Byte Storage Workshop 2008 (PDSW08) PDF link
  • Yu,W.,, Vetter,J.S. (2008). ParColl: Partitioned Collective IO on the Cray XT, The 37th International Conference on Parallel Processing. PDF link
  • Yu,W.,, Vetter,J.S., (2008). Xen-Based HPC: A Parallel IO Perspective, 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid'08). Lyon, France PDF link
  • Yu,W.,, Vetter,J.S.,, Oral,H.S. (2008). Performance Characterization and Optimization of Parallel I/O on the Cray XT, 22nd IEEE International Parallel and Distributed Processing Symposium (IPDPS'08). Miami, FL PDF link
  • Yu,W., Rao,N., Vetter,J.S. (2008). Experimental Analysis of InfiniBand Transport Services on WAN. International Conference on Network, Architecture, and Storage, Chongqing, China:IEEE PDF link

2007

  • Chen, Feng,, Jiang, Song,, Shi, Weisong,, Yu, Weikuan (2007). FlexFetch: A History-Aware Scheme for I/O Energy Saving in Mobile Computing, International Conference on Parallel Processing (ICPP-07) Xi'an, China PDF link
  • Storaasli, Olaf O.,, Yu, Weikuan,, Strenski, Dave (2007). Accelerating Scientific Applications with FPGAs, Manchester Reconfigurable Supercomputing Conference, Manchester, UK [Online]
  • Storaasli, Olaf O.,, Yu, Weikuan,, Strenski, Dave,, Maltby, Jim (2007). Performance Evaluation of Biological Applications that Use FPGAs, Cray User Group Meeting (CUG 2007) Seattle, Washington [Online]
  • Yu,W.,, Vetter,J.S.,, Canon,R.S.,, Jiang,S. (2007). Exploiting Lustre File Joining for Effective Collective IO, Int'l Conference on Clusters Computing and Grid (CCGrid '07) Rio de Janeiro, Brazil:IEEE Computer Society PDF link
  • Yu, Weikuan,, Oral, Sarp,, Vetter, Jeffrey,, Barrett, Richard (2007). Efficiency Evaluation of Cray XT Parallel IO Stack, Cray User Group Meeting (CUG 2007) Seattle, Washington PDF link
  • Yu, Weikuan,, Vetter, Jeffrey,, Canon, Shane (2007). OPAL: An Open-Source MPI-IO Library over Cray XT, International Workshop on Storage Network Architecture and Parallel I/O (SNAPI'07). Held together with IEEE Conference on Mass Storage Systems and Technologies, San Diego, CA PDF link

2006

  • Dhabaleswar K. Panda, Weikuan Yu (2006). Storage Networks, Protocols and File Systems: Latest Trends?, Hot Interconnects 14, Tutorial series. Stanford University, Palo Alto, CA
  • Qi Gao, Weikuan Yu,, Wei Huang, Dhabaleswar K. Panda (2006). Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand, International Conference on Parallel Processing (ICPP '06) Columbus, OH, USA:IEEE Computer Society, 471 - 478 PDF link
  • Shuang Liang, Weikuan Yu, D.K. Panda (2006). High Performance Block I/O for Global File System (GFS) with InfiniBand RDMA, International Conference on Parallel Processing (ICPP) Columbus, OH:IEEE Computer Society, 391-398 PDF link
  • Weikuan Yu, Qi Gao, D.K. Panda (2006). Adaptive Connection Management for Scalable MPI over InfiniBand, International Parallel and Distributed Processing Symposium (IPDPS '06) Rhodes Island, Greece PDF link
  • Weikuan Yu, Ranjit Noronha, Lei Chai, Shuang Liang, D.K. Panda (2006), Optimizing Open Solaris NFS over RDMA. PDF link
  • Weikuan Yu, Ranjit Noronha,, Shuang Liang, D. K. Panda (2006). Benefits of High Speed Interconnects to Cluster File Sys-tems: A Case Study with Lustre, Workshop on Communication Architecture for Clusters (CAC '06), in Conjunction with (IPDPS '06) Rhodes Island, Greece PDF link

2005

  • Pavan Balaji, Wu-chun Feng,, Qi Gao, Ranjit Noronha,, Weikuan Yu, Dhabaleswar K. Panda (2005). Head-to-TOE Comparison for High Performance Sockets over Protocol Offload Engines, IEEE International Conference on Cluster Computing (Cluster '05) Boston, Massachusetts:IEEE Computer Society PDF link
  • Weikuan Yu, Dhabaleswar K. Panda (2005). Benefits of Quadrics Scatter/Gather to PVFS2 Noncontiguous I/O, International Workshop on Storage Network Architecture and Parallel IO. Held in conjunction with 14th International Conference on Parallel Architecture and Compilation Techniques, St. Louis, Missouri PDF link
  • Weikuan Yu, Sayantan Sur, Dhabaleswar K. Panda, Rob T. Aulwes, Richard Graham (2005). High Performance Broadcast Support in LA-MPI over Quadrics. International Journal of High Performance Computing Applications 19(4), 453-463 [Online]
  • Weikuan Yu, Shuang Liang, Dhabaleswar K. Panda (2005). High performance support of parallel virtual file system (PVFS2) over Quadrics, 19th ACM International Conference on Supercomputing (ICS '05) Cambridge, Massachusetts:ACM Press, 323-331 PDF link
  • Weikuan Yu, Tim S. Woodall,, Rich L. Graham, Dhabaleswar K. Panda (2005). Design and Implementation of Open MPI over Quadrics/Elan4, International Conference on Parallel and Distributed Processing Symposium (IPDPS '05) Colorado, Denver PDF link
  • Yu, Weikuan,, Panda, D.K. (2005). Benefits of Quadrics Scatter/Gather to PVFS2 Noncontiguous I/O, International Workshop on Storage Network Architecture and Parallel IO (SNAPI). Held in conjunction with PACT '05, St. Louis, Missouri PDF link

2004

  • Jiuxing Liu, Bala Chandrasekaran,, Weikuan Yu, Jiesheng Wu,, Darius Buntinas, S. P. Kini,, Pete Wyckoff, D. K. Panda (2004). Micro-Benchmark Performance Comparison of High-Speed Cluster Interconnects. IEEE Micro 24(1), 42-51 PDF link
  • Weikuan Yu, D. K. Panda, (2004). Scalable, high-performance NIC-based all-to-all broadcast over Myrinet/GM, IEEE International Conference on Cluster Computing (Cluster '04) San Diego, CA:IEEE Computer Society, 125-134 PDF link
  • Weikuan Yu, Darius Buntinas,, Rich L. Graham, Dhabaleswar K. Panda (2004). Efficient and Scalable NIC-Based Barrier over Quadrics and Myrinet, Workshop on Communication Architecture for Clusters, in Conjunction with International Parallel and Distributed Processing Symposium (IPDPS '04) Santa Fe, NM PDF link
  • Weikuan Yu, Jiesheng Wu, Dhabaleswar K. Panda (2004). Fast and Scalable Startup of MPI Programs in InfiniBand Clusters, High Performance Computing - HiPC 2004, , 440-449 PDF link

2003

  • Jiuxing Liu, Bala Chandrasekaran,, Jiesheng Wu, Weihang Jiang,, S. P. Kini, Weikuan Yu,, Darius Buntinas, Pete Wyckoff, (2003). Performance Comparison of MPI implementations over Infiniband, Myrinet and Quadrics, Proceedings of Supercomputing '03 (SC '03) PDF link
  • Jiuxing Liu, Bala Chandrasekaran,, Weikuan Yu, Jiesheng Wu,, Darius Buntinas, S. P. Kini,, Pete Wyckoff, D. K. Panda (2003). Micro-Benchmark Level Performance Comparison of High-Speed Cluster Interconnects, Proceedings of Hot Interconnects 11 (HotI XI) PDF link
  • Jiuxing Liu, Jiesheng Wu,, Sushmitha P. Kini, Darius Buntinas,, Weikuan Yu, Balasubraman Chandrasekaran,, Ranjit Noronha, Peter Wyckoff, (2003), MPI over InfiniBand: Early Experiences. PDF link
  • Weikuan Yu, Darius Buntinas, Dhabaleswar K. Panda (2003). High Performance and Reliable NIC-Based Multicast over Myrinet/GM-2, Proceedings of the International Conference on Parallel Processing (ICPP'03) PDF link
  • Weikuan Yu, Sayantan Sur, Dhabaleswar K. Panda, Rob T. Aulwes, Richard Graham (2003). High Performance Broadcast Support in LA-MPI over Quadrics, Los Alamos Computer Science Institute Symposium (LACSI '03) PDF link


Personal Tools