Jiajia Li

Computer Science

NCSU

Website: www.jiajiali.org
Email: jiajia.li@ncsu.edu
Curriculum Vitae: [pdf]
Google Scholar
LinkedIn / Github

Address:
Engineering Building II, 890 Oval Drive
Raleigh, NC
US
27695

About me

Jiajia Li is an Assistant Professor in the Department of Computer Science at North Carolina State University (NCSU), Raleigh, NC. Her research focuses on high-performance computing, in particular the interaction among applications, numerical methods, data structures, algorithms, automatic performance tuning, and computer architectures. She is eager to pursue high-performance sparse (multi-)linear algebra, solvers, and tensor decompositions for large-scale data analytics and domain applications on diverse computer architectures.


Jiajia Li was an Assistant Professor in the Department of Computer Science at the College of William & Mary (W&M), Williamsburg, VA, and a Research Scientist in the High Performance Computing group at Pacific Northwest National Laboratory (PNNL), Richland, WA, from 2018 to 2022. She received her Ph.D. (Aug. 2018) in Computational Science & Engineering from Georgia Institute of Technology, advised by Professor Richard Vuduc. Her honors include Rising Stars in Computational and Data Sciences, a Best Student Paper Award, and an IBM PhD Fellowship. Before that, she was a research intern at IBM Thomas J. Watson Research Center and Intel Parallel Computing Lab in the summers of 2016 and 2015, respectively. She also received an earlier Ph.D. (Jul. 2013) from the Institute of Computing Technology at the Chinese Academy of Sciences, and her B.S. (Jul. 2008) in Computational Mathematics from Dalian University of Technology in the Accelerated Student Program (2/180).


Please feel free to email me at jiajia.li@ncsu.edu if you have questions about the CS PhD program, research collaboration, research/career/international life advice, etc.


For more information, please click here for the Curriculum Vitae.

News

  • May 13-17: My student will present our recent work at SIAM LA'24, Paris, France
  • Apr 27-May 1: Will serve as the Registration Chair and organize the XTensor workshop at ASPLOS'24, San Diego, CA
  • March 2-6: Our work was presented as a poster at PPoPP'24, Edinburgh, UK

Projects

Current Projects

Past Projects

  • SHARWK: Scalable Hypergraph Analysis Via Random Walk Kernels

    PI: Jiajia Li
    DOE EXPRESS project #656071, 11/14/2022 – 12/30/2023, Total amount: $78,382

  • HiParTI: Application-Algorithm-Architecture Co-Design for Large-Scale, Sparse Tensor/Matrix Methods

    PI: Jiajia Li; Team: Ang Li, Ajay Panyala
    DOE PNNL LDRD project

  • Parallel Tensor Infrastructure (ParTI) on multicore CPUs and GPUs

    Code released in Github: [ParTI]

  • SMAT (SpMV Auto-tuner)

CLICK for more past Projects

Awards

  • The 39th IEEE International Conference on Computer Design (ICCD’21) Best Paper Award

  • Rising Stars in Computational and Data Sciences, 2019 [Link]

  • Principles and Practice of Parallel Programming (PPoPP’19) Best Paper Award Finalist

  • ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC'18) Best Student Paper Award
    [PNNL Press] [GaTech Press]

  • SIAM ALA'18 Student Travel Grant

  • GaTech CoC Graduate Student Council Travel Grant

  • IBM PhD Fellowship for 2017-2018 [Link]

  • Travel grant from the Institute for Pure and Applied Mathematics (IPAM) for Big Data Meets Computation Workshop 2017

  • Selected student attendee of the IEEE-WIE Women’s Leadership Summit 2016

  • ZhuLiYueHua Award for the Excellent PhD Students of Chinese Academy of Sciences (Top 0.2%), 2013

  • Merit Student of Institute of Computing Technology, 2013

  • Xia Peisu Scholarship of Institute of Computing Technology (Top 1%), 2011

  • Outstanding Research Assistant of the Computer Architecture Laboratory at the University of Chinese Academy of Sciences, 2011

  • Outstanding Student of the Computer Architecture Laboratory at the University of Chinese Academy of Sciences, 2010

Software

  • HiParTI

    A Hierarchical Parallel Tensor Infrastructure

  • PASTA

    A Parallel Sparse Tensor Algorithm Benchmark Suite

  • ParTI

    A Parallel Tensor Infrastructure for Data Analysis

  • AdaTM

    Adaptive Tensor Memoization algorithm for CP decomposition

  • InTensLi

    Input-adaptive and in-place dense tensor-times-matrix multiply

  • SMAT

    Sparse Matrix-vector multiplication Auto-Tuner

  • HDGEMM

    A Hybrid DGEMM library on a Heterogeneous CPU-AMD GPU Architecture

Publication

  • Sparse Symmetric Format for Tucker Decomposition

    Shruti Shivakumar, Jiajia Li, Ramakrishnan Kannan, Srinivas Aluru.

    IEEE Transactions on Parallel and Distributed Systems (TPDS).

  • Performance Implication of Tensor Irregularity and Optimization for Distributed Tensor Decomposition

    Zheng Miao, Jon C. Calhoun, Rong Ge, Jiajia Li.

    ACM Transactions on Parallel Computing (TOPC).

  • Merchandiser: Data Placement on Heterogeneous Memory for Task-Parallel HPC Applications with Load-Balance Awareness

    Zhen Xie, Jie Liu, Jiajia Li, Dong Li.

    Principles and Practice of Parallel Programming (PPoPP).

  • AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices

    Zhen Du, Jiajia Li, Yinshan Wang, Xueqi Li, Guangming Tan, Ninghui Sun.

    The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC).

  • BALA-CPD: BALanced and Asynchronous Distributed Tensor Decomposition

    Zheng Miao, Jiajia Li, Jon Calhoun, Rong Ge.

    The 24th IEEE Cluster (CLUSTER).

  • DRIPS: Dynamic Rebalancing of Pipelined Streaming Applications on CGRAs

    Cheng Tan, Nicolas Bohm Agostini, Tong Geng, Chenhao Xie, Jiajia Li, Ang Li, Kevin Barker, Antonino Tumeo.

    28th IEEE International Symposium on High-Performance Computer Architecture (HPCA).

  • A High Performance Sparse Tensor Algebra Compiler in MLIR

    Ruiqin Tian, Luanzheng Guo, Jiajia Li, Bin Ren, Gokcen Kestor.

    The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), the Seventh Annual Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC)

  • A Survey: Handling Irregularities in Neural Network Acceleration with FPGAs

    Tong Geng, Chunshu Wu, Cheng Tan, Chenhao Xie, Anqi Guo, Pouya Haghi, Sarah Yuan He, Jiajia Li, Martin Herbordt, Ang Li.

    IEEE High Performance Extreme Computing Conference (HPEC).

  • DynPaC: Coarse-Grained, Dynamic, and Partially Reconfigurable Array for Streaming Applications

    Cheng Tan, Tong Geng, Chenhao Xie, Nicolas Bohm Agostini, Jiajia Li, Ang Li, Kevin Barker, Antonino Tumeo.

    The 39th IEEE International Conference on Computer Design (ICCD). Best Paper Award

  • Athena: High-Performance Sparse Tensor Contraction Sequence on Heterogeneous Memory

    Jiawen Liu, Dong Li, Roberto Gioiosa, Jiajia Li

    International Conference on Supercomputing (ICS).
    [paper] [bib]

  • Efficient Parallel Sparse Symmetric Tucker Decomposition for High-Order Tensors

    Shruti Shivakumar, Jiajia Li, Ramakrishnan Kannan, Srinivas Aluru

    SIAM Conference on Applied and Computational Discrete Algorithms (ACDA).

  • Sparta: High-Performance, Element-Wise Sparse Tensor Contraction on Heterogeneous Memory

    Jiawen Liu, Jie Ren, Roberto Gioiosa, Dong Li, Jiajia Li

    Principles and Practice of Parallel Programming (PPoPP).
    [paper] [bib] [code]

  • A Sparse Tensor Benchmark Suite for CPUs and GPUs

    Jiajia Li, Mahesh Lakshminarasimhan, Xiaolong Wu, Ang Li, Catherine Olschanowsky, Kevin Barker

    IEEE International Symposium on Workload Characterization (IISWC).
    [paper] [bib] [code-PASTA]

  • Sparsity-Aware Distributed Tensor Decomposition.

    Zheng Miao, Jon C. Calhoun, Rong Ge, Jiajia Li

    ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC).

  • High-Performance Sparse Tensor Algebra Compiler.

    Ruiqin Tian, Jiajia Li, Bin Ren, Gokcen Kestor

    Women in High Performance Computing Workshop (WHPC), in conjunction with ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC).

  • Generic, Sparse Tensor Core for Neural Networks.

    Xiaolong Wu, Yang Yi, Dave (Jing) Tian, Jiajia Li

    1st International Workshop on Machine Learning for Software Hardware Co-Design (MLSH), in conjunction with the 29th International Conference on Parallel Architectures and Compilation Techniques (PACT).

  • Programming Strategies for Irregular Algorithms on the Emu Chick.

    Eric Hein, Srinivas Eswar, Abdurrahman Yasar, Jiajia Li, Jeffrey S. Young, Tom Conte, Umit V. Catalyurek, Rich Vuduc, Jason Riedy, Bora Ucar

    Transactions on Parallel Computing.

  • On the Feasibility of Using Reduced-Precision Tensor Core Operations for Graph Analytics

    Jesun Sahariar Firoz, Ang Li, Jiajia Li, Kevin Barker

    The IEEE High Performance Extreme Computing Conference (HPEC).

  • A Parallel Sparse Tensor Benchmark Suite on CPUs and GPUs

    Jiajia Li, Mahesh Lakshminarasimhan, Xiaolong Wu, Ang Li, Catherine Olschanowsky, Kevin Barker

    Principles and Practice of Parallel Programming (PPoPP).
    [paper] [bib] [code-PASTA]

  • An Efficient Mixed-Mode Representation of Sparse Tensors

    Israt Nisa, Jiajia Li, Aravind Sukumaran-Rajam, Prashant Rawat, Sriram Krishnamoorthy, P. (Saday) Sadayappan

    ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC).
    [paper] [bib]

  • Efficient and Effective Sparse Tensor Reordering

    Jiajia Li, Bora Ucar, Umit Catalyurek, Jimeng Sun, Kevin Barker, Richard Vuduc

    International Conference on Supercomputing (ICS).
    [paper] [bib] [code-ParTI v1.1.0]

  • A Microbenchmark Characterization of the Emu Chick

    Jeffrey S. Young, Eric Hein, Srinivas Eswar, Patrick Lavin, Jiajia Li, Jason Riedy, Richard Vuduc, Tom Conte

    Parallel Computing (ParCo).
    [paper] [bib]

  • Load-Balanced Sparse MTTKRP on GPUs

    Israt Nisa, Jiajia Li, Aravind Sukumaran-Rajam, Richard Vuduc, P. Sadayappan

    IEEE International Parallel & Distributed Processing Symposium (IPDPS).
    [paper] [bib] [code]

  • Programming Strategies for Irregular Algorithms on the Emu Chick

    Eric Hein, Srinivas Eswar, Abdurrahman Yaşar, Jiajia Li, Jeffrey S. Young, Thomas M. Conte, Ümit V. Çatalyürek, Rich Vuduc, Jason Riedy, Bora Uçar

    ArXiv.
    [paper] [bib]

  • Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect

    Ang Li, Shuaiwen Leon Song, Jieyang Chen, Jiajia Li, Xu Liu, Nathan Tallent, and Kevin Barker

    IEEE Transactions on Parallel and Distributed Systems (TPDS).
    [paper] [bib]

  • PASTA: A Parallel Sparse Tensor Algorithm Benchmark Suite

    Jiajia Li, Yuchen Ma, Xiaolong Wu, Ang Li, Kevin Barker

    CCF Transactions on High Performance Computing.
    [paper] [bib] [code]

  • A Pattern Based Algorithmic Autotuner for Graph Processing on GPUs

    Ke Meng, Jiajia Li, Guangming Tan, Ninghui Sun

    Principles and Practice of Parallel Programming (PPoPP). Best Paper Award Finalist
    [paper] [bib] [code]

  • An Autotuning Protocol to Rapidly Build Autotuners

    Junhong Liu, Guangming Tan, Yulong Luo, Jiajia Li, Zeyao Mo, Ninghui Sun

    ACM Transactions on Parallel Computing.
    [paper] [bib]

  • Scalable Tensor Decompositions in High Performance Computing Environments

    Jiajia Li

    PhD Dissertation. Georgia Institute of Technology, Atlanta, GA, USA.
    [paper][bib]

  • HiCOO: Hierarchical Storage of Sparse Tensors

    Jiajia Li, Jimeng Sun, Richard Vuduc

    ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC). Best Student Paper Award
    [paper] [slides] [bib] [code]
    News: [PNNL Press] [GaTech Press]

  • Optimizing Sparse Tensor Times Matrix on GPUs

    Yuchen Ma, Jiajia Li, Xiaolong Wu, Chenggang Yan, Jimeng Sun, Richard Vuduc

    Journal of Parallel and Distributed Computing (Special Issue on Systems for Learning, Inferencing, and Discovering)
    [paper] [bib] [code]

  • An Initial Characterization of the Emu Chick

    Eric Hein, Tom Conte, Jeffrey Young, Srinivas Eswar, Jiajia Li, Patrick Lavin, Richard Vuduc, Jason Riedy

    2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).
    [paper][bib]

  • Bridging the Gap between Deep Learning and Sparse Matrix Format Selection

    Yue Zhao, Jiajia Li, Chunhua Liao, Xipeng Shen

    23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP).
    [paper][bib]

  • Design and Implementation of Adaptive SpMV Library for Multicore and Manycore Architecture

    Guangming Tan, Junhong Liu, Jiajia Li

    ACM Transactions on Mathematical Software.
    [paper][bib]

  • Model-Driven Sparse CP Decomposition for Higher-Order Tensors

    Jiajia Li, Jee Choi, Ioakeim Perros, Jimeng Sun, Richard Vuduc

    31st IEEE International Parallel & Distributed Processing Symposium (IPDPS).
    [paper][slides][code][bib]

  • Bridging the Gap between Deep Learning and Sparse Matrix Format Selection

    Yue Zhao, Jiajia Li, Chunhua Liao, Xipeng Shen

    The 26th International Conference on Parallel Architectures and Compilation Techniques (PACT).
    [poster][bib]

  • Non-Negative CP Decomposition on Distributed Clusters

    Jiajia Li, Jee Choi, Xing Liu, Richard Vuduc

    (To be submitted).

  • Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning

    Xiuxia Zhang, Guangming Tan, Shuangbai Xue, Jiajia Li, Keren Zhou, Mingyu Chen

    22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP).
    [paper][bib]

  • ParTI!: a Parallel Tensor Infrastructure for Data Analysis

    Jiajia Li, Yuchen Ma, Chenggang Yan, Jimeng Sun, Richard Vuduc

    Tensor-Learn Workshop, co-located with NIPS’16.
    [poster][code]

  • Optimizing Sparse Tensor Times Matrix on multi-core and many-core architectures

    Jiajia Li, Yuchen Ma, Chenggang Yan, Richard Vuduc

    The sixth Workshop on Irregular Applications: Architectures and Algorithms (IA^3), co-located with SC’16.
    [paper] [code] [bib]

  • Model-driven Sparse CP Decomposition for High-Order Tensors

    Jiajia Li, Ioakeim Perros, Jimeng Sun, Richard Vuduc

    The fifth international Women in HPC workshop, co-located with SC’16.
    [poster]

  • Tensors in Data Analysis: Methods, Applications, and Software

    Casey Battaglino, Jiajia Li, Ioakeim Perros, Jimeng Sun, Richard Vuduc

    (To be submitted).

  • An Input-Adaptive and In-Place Approach to Dense Tensor-Times-Matrix Multiply

    Jiajia Li, Casey Battaglino, Ioakeim Perros, Jimeng Sun, Richard Vuduc

    The International Conference for High Performance Computing, Networking, Storage and Analysis (SC).
    [pdf] [slides] [code] [bib]

  • Introducing high performance computing concepts into engineering undergraduate curriculum: a success story

    B. Neelima, Jiajia Li

    Proceedings of the Workshop on Education for High-Performance Computing (EduHPC), co-located with the International Conference for High Performance Computing, Networking, Storage and Analysis (SC).
    [pdf][bib]

  • SMAT: A Cross-Platform Input Adaptive Auto-Tuner for Sparse Matrix-Vector Multiplication

    Jiajia Li, Zhonghai Zhang, Guangming Tan, David Bader

    Technical Report.

  • The Study of Choosing the Best Storage Format of Sparse Matrix Vector Multiplication

    Jiajia Li, Xiuxia Zhang, Guangming Tan, Mingyu Chen

    Journal of Computer Research and Development. Vol 51. (IN CHINESE).

  • Research on Sparse Matrix Vector Multiplication Auto-tuning Method

    Jiajia Li

    PhD Thesis, The University of Chinese Academy of Sciences, Beijing, China

  • SMAT: An Input Adaptive Auto-Tuner for Sparse Matrix-Vector Multiplication

    Jiajia Li, Guangming Tan, Mingyu Chen, Ninghui Sun

    Programming Language Design and Implementation (PLDI).
    [pdf] [slides] [code] [bib]

  • An Optimized Large-Scale Hybrid DGEMM Design for CPUs and ATI GPUs

    Jiajia Li, Xingjian Li, Guangming Tan, Mingyu Chen, Ninghui Sun

    International Conference on Supercomputing (ICS).
    [pdf] [slides] [web] [bib]

  • Algebraic Multi-grid Optimization Study on GPU

    Jiajia Li, Xiuxia Zhang, Guangming Tan, Mingyu Chen

    HPC China (IN CHINESE).
    [pdf]

  • Memory Accessing Analysis of Sparse Matrix Vector Multiplication and Optimization

    Xiuxia Zhang, Mingyu Chen, Jiajia Li, Guangming Tan

    HPC China (IN CHINESE).
    [pdf]

  • Automatically Tuned Dynamic Programming with an Algorithm-by-Blocks

    Jiajia Li, Guangming Tan, Mingyu Chen

    16th International Conference on Parallel and Distributed Systems (ICPADS).
    [pdf] [bib]

Activities

Organizing Activities

  • Program Co-Chair of the 26th International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS'22).

  • Finance Chair of International Conference on Supercomputing (ICS'21).

  • Artifact Evaluation Chair of the 2021 edition of the International Conference on Languages, Compilers, Tools and Theory of Embedded Systems (LCTES'21).

  • Publicity Chair of International Conference on Parallel Architectures and Compilation Techniques (PACT'20).

  • Finance and Session Chair of Principles and Practice of Parallel Programming (PPoPP'20).

  • Proceedings Chair of Emerging Parallel and Distributed Runtime Systems and Middleware Workshop (IPDRM'19), held in conjunction with IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis (SC'19).

  • Web Chair of International Conference on Parallel Architectures and Compilation Techniques (PACT'19).

  • Co-Chair of the 25th International European Conference on Parallel and Distributed Computing (Euro-Par'19).

  • Co-Organizer of the First International Workshop on the Intersection of High Performance Computing and Machine Learning (HPCaML'19), held in conjunction with International Symposium on Code Generation and Optimization (CGO'19).

  • Co-Organizer of SIAM Conference on Computational Science and Engineering (SIAM CSE'19) Minisymposium "High Performance Sparse Matrix, Tensor, and Graph Kernels".

Peer Review Activities

  • 2022: PC member of SC, IPDPS, ISC, CLUSTER, PPoPP, SIAM PP.

  • 2021: PC member of SC, LCTES, ICPP, CLUSTER, ICS, ICDCS, ISC.

  • PC member of ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC'20).

  • Research Poster Committee member of High Performance Computing, Networking, Storage, Data Analytics, and AI/Machine Learning (ISC'20).

  • PC member of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'20).

  • PC member of the International Conference on Parallel Processing (ICPP'20).

  • Travel Grant PC member of the ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC'19).

  • PC member of the Workshop on Tensor Methods for Emerging Data Science Challenges (TMEDSC), held in conjunction with the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'19).

  • PC member of the 25th International European Conference on Parallel and Distributed Computing (Euro-Par'19).

  • PC member of the First International Workshop on the Intersection of High Performance Computing and Machine Learning (HPCaML'19), held in conjunction with International Symposium on Code Generation and Optimization (CGO'19).

  • PC member of the International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP'17,18,19).

  • External PC member of The 32nd ACM International Conference on Supercomputing (ICS'18).

  • PC member of Experiments and Practice in Parallel and Distributed Computing Track of IEEE International Parallel & Distributed Processing Symposium (IPDPS'18).

  • PC member of Student Research Competition (SRC) of the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'18).

  • PC member of Parallel Algorithm Track of National Annual Conference on High Performance Computing (HPC China).

  • Reviewer of the 32nd ACM International Conference on Supercomputing (ICS'18), the 47th International Conference on Parallel Processing (ICPP'18,19), the 21st IEEE International Conference on Parallel and Distributed Systems (ICPADS'15), the Transactions on Parallel and Distributed Systems (TPDS), the Frontiers of Computer Science, IEEE Transactions on Neural Networks and Learning Systems (TNNLS), Algorithmica Journal, Journal of Parallel and Distributed Computing (JPDC), Parallel Computing Journal (ParCo), Transactions on Knowledge and Data Engineering (TKDE), Journal of Low Power Electronics and Applications, Journal of Visual Communication and Image Representation, IEEE Access, International Journal of High Performance Computing Applications, CCF Transactions on High Performance Computing (THPC).

Other Activities