About me

Jiajia Li is an Assistant Professor in Department of Computer Science at North Carolina State University (NCSU), Raleigh, NC. Her research emphasizes on high performance computing with a focus on the interaction among applications, numerical methods, data structures, algorithms, automatic performance tuning, and computer architectures. She is eager to pursue high performance sparse (multi-)linear algebra, solvers, and tensor decompositions for large-scale data analytics and domain applications on diverse computer architectures.

Jiajia Li was an Assistant Professor in Department of Computer Science at the College of William & Mary (W&M), Williamsburg, VA and a Research Scientist at High Performance Computing group of Pacific Northwest National Laboratory (PNNL), Richland, WA from 2018-2022. She has received her Ph.D. degree (Aug. 2018) in Computational Science & Engineering at Georgia Institute of Technology, advised by Professor Richard Vuduc. She has received Rising Stars in Computational and Data Sciences, Best Student Paper Award, and IBM PhD Fellowship. Before, she was a research intern of IBM Thomas J. Watson Research Center and Intel Parallel Computing Lab in the summers of 2016 and 2015 respectively. In the past, she has received a Ph.D. degree (Jul. 2013) from Institute of Computing Technology at Chinese Academy of Sciences. She received her B.S. (Jul. 2008) in Computational Mathematics from Dalian University of Technology in the Accelerated Student Program (2/180).

Please feel free to drop me an email @ jiajia.li@ncsu.edu if you have questions about CS PhD program, research collaboration, research/career/international life suggestions, etc.

For more information, please click here for the Curriculum Vitae

News

May 13-17: My student will present a recent work at SIAM LA'24, Paris, France
Apr 27-May 1: Will serve as the Registration Chair and organize the XTensor workshop at ASPLOS'24, San Diego, CA
March 2-6: Our work was presented as a poster at PPoPP'24, Edinburgh, UK

Projects

Current Projects

CROSS: Collaborative Research: PPoSS: LARGE: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)

Lead PI: Jiajia Li; PIs: Frank Mueller (NCSU), Dong Li (UC Merced), Lizhong Chen (OSU)
NSF PPoSS project, 09/07/2023 – 08/31/2028, Total amount: $5,000,000
CROSS: Collaborative Research: PPoSS: Planning: Cross-layer Coordination and Optimization for Scalable and Sparse Tenor Networks(CROSS)”

Lead PI: Jiajia Li; PIs: Frank Mueller (NCSU), Dong Li (UC Merced), Lizhong Chen (OSU)
NSF PPoSS project, 10/01/2022 – 09/30/2024, Total amount: $250,000

Past Projects

SHARWK: Scalable Hypergraph Analysis Via Random Walk Kernels

PI: Jiajia Li
DOE EXPRESS project #656071, 11/14/2022 – 12/30/2023, Total amount: $78,382
HiParTI: Application-Algorithm-Architecture Co-Design for Large-Scale, Sparse Tensor/Matrix Methods

PI: Jiajia Li; Team: Ang Li, Ajay Panyala
DOE PNNL LDRD project
Parallel Tensor Infrastructure (ParTI) on multicore CPUs and GPUs

Code released in Github: [ParTI]
SMAT (SpMV Auto-tuner)

CLICK for more past Projects

Awards

The 39th IEEE International Conference on Computer Design (ICCD’21) Best Paper Award
Rising Stars in Computational and Data Sciences, 2019 [Link]
Principles and Practice of Parallel Programming (PPoPP’19) Best Paper Award Finalist
ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC'18) Best Student Paper Award
[PNNL Press] [GaTech Press]
SIAM ALA'18 Student Travel Grant
GaTech CoC Graduate Student Council Travel Grant
IBM PhD Fellowship for 2017-2018 [Link]
Travel grant from the Institute for Pure and Applied Mathematics (IPAM) for Big Data Meets Computation Workshop 2017
Selected students to attend IEEE-WIE Women’s Leadership Summit 2016
ZhuLiYueHua Award for the Excellent PhD Students of Chinese Academy of Sciences (Top 0.2%), 2013
Merit Student of Institute of Computing Technology, 2013
Xia Peisu Scholarship of Institute of Computing Technology (Top 1%), 2011
Outstanding Research Assistant of the Computer Architecture Laboratory at the University of Chinses Academy of Sciences, 2011
Outstanding Student of the Computer Architecture Laboratory at the University of Chinses Academy of Sciences, 2010

Software

HiParTI

A Hierarchical Parallel Tensor Infrastructure
PASTA

A Parallel Sparse Tensor Algorithm Benchmark Suite
ParTI

A Parallel Tensor Infrastructure for Data Analysis
AdaTM

Adaptive Tensor Memoization algorithm for CP decomposition
InTensLi

Input-adaptive and in-place dense tensor-times-matrix multiply
SMAT

Sparse Matrix-vector multiplication Auto-Tuner
HDGEMM

A Hybrid DGEMM library on a Heterogeneous CPU-AMD GPU Architecture

Publication

Sparse Symmetric Format for Tucker Decomposition
Shruti Shivakumar, Jiajia Li, Ramakrishnan Kannan, Srinivas Aluru.

IEEE Transactions on Parallel and Distributed Systems (TPDS). 2023
Performance Implication of Tensor Irregularity and Optimization for Distributed Tensor Decomposition
Zheng Miao, Jon C. Calhoun, Rong Ge, Jiajia Li.

ACM Transactions on Parallel Computing (TOPC). 2023
Merchandiser: Data Placement on Heterogeneous Memory for Task-Parallel HPC Applications with Load-Balance Awareness
Zhen Xie, Jie Liu, Jiajia Li, Dong Li.

Principles and Practice of Parallel Programming (PPoPP). 2023
AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices
Zhen Du, Jiajia Li, Yinshan Wang, Xueqi Li, Guangming Tan, Ninghui Sun.

The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC). 2022
BALA-CPD: BALanced and Asynchronous Distributed Tensor Decomposition
Zheng Miao, Jiajia Li, Jon Calhoun, Rong Ge.

The 24th IEEE Cluster (CLUSTER). 2022
DRIPS: Dynamic Rebalancing of Pipelined Streaming Applications CGRAs
Cheng Tan, Nicolas Bohm Agostini, Tong Geng, Chenhao Xie, Jiajia Li, Ang Li, Kevin Barker, Antonino Tumeo.

28th IEEE International Symposium on High-Performance Computer Architecture (HPCA). 2022
A High Performance Sparse Tensor Algebra Compiler in MLIR
Ruiqin Tian, Luanzheng Guo, Jiajia Li, Bin Ren, Gokcen Kestor.

The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), the Seventh Annual Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC) 2021
A Survey: Handling Irregularities in Neural Network Acceleration with FPGAs
Tong Geng, Chunshu Wu, Cheng Tan, Chenhao Xie, Anqi Guo, Pouya Haghi, Sarah Yuan He, Jiajia Li, Martin Herbordt, Ang Li.

IEEE High Performance Extreme Computing Conference (HPEC). 2021
DynPaC: Coarse-Grained, Dynamic, and Partially Reconfigurable Array for Streaming Applications
Cheng Tan, Tong Geng, Chenhao Xie, Nicolas Bohm Agostini, Jiajia Li, Ang Li, Kevin Barker, Antonino Tumeo.

The 39th IEEE International Conference on Computer Design (ICCD). 2021 Best Paper Award
Athena: High-Performance Sparse Tensor Contraction Sequence on Heterogeneous Memory
Jiawen Liu, Dong Li, Roberto Gioiosa, Jiajia Li

International Conference on Supercomputing (ICS). 2021
[paper] [bib]
Efficient Parallel Sparse Symmetric Tucker Decomposition for High-Order Tensors
Shruti Shivakumar, Jiajia Li, Ramakrishnan Kannan, Srinivas Aluru

SIAM Conference on Applied and Computational Discrete Algorithms (ACDA). 2021
Sparta: High-Performance, Element-Wise Sparse Tensor Contraction on Heterogeneous Memory
Jiawen Liu, Jie Ren, Roberto Gioiosa, Dong Li, Jiajia Li

Principles and Practice of Parallel Programming (PPoPP). 2021
[paper] [bib] [code]
A Sparse Tensor Benchmark Suite for CPUs and GPUs
Jiajia Li, Mahesh Lakshminarasimhan, Xiaolong Wu, Ang Li, Catherine Olschanowsky, Kevin Barker

IEEE International Symposium on Workload Characterization (IISWC). 2020
[paper] [bib] [code-PASTA]
Sparsity-Aware Distributed Tensor Decomposition.
Zheng Miao, Jon C. Calhoun, Rong Ge, Jiajia Li

ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC). 2020 (Poster, Accepted)
High-Performance Sparse Tensor Algebra Compiler.
Ruiqin Tian, Jiajia Li, Bin Ren, Gokcen Kestor

Women in High Performance Computing Workshop (WHPC), in conjunction with ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC). 2020 (Poster, Accepted)
Generic, Sparse Tensor Core for Neural Networks.
Xiaolong Wu, Yang Yi, Dave (Jing) Tian, Jiajia Li

1st International Workshop on Machine Learning for Software Hardware Co-Design (MLSH), in conjunction with the 29th International Conference on Parallel Architectures and Compilation Techniques (PACT). 2020 (Accepted)
Programming Strategies for Irregular Algorithms on the Emu Chick.
Eric Hein, Srinivas Eswar, Abdurrahman Yasar, Jiajia Li, Jeffrey S. Young, Tom Conte, Umit V. Catalyurek, Rich Vuduc, Jason Riedy, Bora Ucar

Transactions on Parallel Computing. 2020 (Accepted)
On the Feasibility of Using Reduced- PrecisionTensor Core Operations for Graph Analytics
Jesun Sahariar Firoz, Ang Li, Jiajia Li, Kevin Barker

The IEEE High Performance Extreme Computing Conference (HPEC). 2020 (Accepted)
A Parallel Sparse Tensor Benchmark Suite on CPUs and GPUs
Jiajia Li, Mahesh Lakshminarasimhan, Xiaolong Wu, Ang Li, Catherine Olschanowsky, Kevin Barker

Principles and Practice of Parallel Programming (PPoPP). 2020 (Poster, Accepted)
[paper] [bib] [code-PASTA]
An Efficient Mixed-Mode Representation of Sparse Tensors
Israt Nisa, Jiajia Li, Aravind Sukumaran-Rajam, Prashant Rawat, Sriram Krishnamoorthy, P. (Saday) Sadayappan

ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC). 2019
[paper] [bib]
Efficient and Effective Sparse Tensor Reordering
Jiajia Li, Bora Ucar, Umit Catalyurek, Jimeng Sun, Kevin Barker, Richard Vuduc

International Conference on Supercomputing (ICS). 2019
[paper] [bib] [code-ParTI v1.1.0]
A Microbenchmark Characterization of the Emu Chick
Jeffrey S.Young, Eric Heinc, Srinivas Eswar, Patrick Lavin, Jiajia Li, Jason Riedy, Richard Vuduc, Tom Conte

Parallel Computing (ParCo). 2019
[paper] [bib]
Load-Balanced Sparse MTTKRP on GPUs
Israt Nisa, Jiajia Li, Aravind Sukumaran-Rajam, Richard Vuduc, P. Sadayappan

IEEE International Parallel & Distributed Processing Symposium (IPDPS). 2019
[paper] [bib] [code]
Programming Strategies for Irregular Algorithms on the Emu Chick
Eric Hein, Srinivas Eswar, Abdurrahman Yaşar, Jiajia Li, Jeffrey S. Young, Thomas M. Conte, Ümit V. Çatalyürek, Rich Vuduc, Jason Riedy, Bora Uçar

ArXiv. 2019
[paper] [bib]
Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect
Ang Li, Shuaiwen Leon Song, Jieyang Chen, Jiajia Li, Xu Liu, Nathan Tallent, and Kevin Barker

IEEE Transactions on Parallel and Distributed Systems (TPDS). 2019
[paper] [bib]
PASTA: A Parallel Sparse Tensor Algorithm Benchmark Suite
Jiajia Li, Yuchen Ma, Xiaolong Wu, Ang Li, Kevin Barker

CCF Transactions on High Performance Computing. 2019
[paper] [bib] [code]
A Pattern Based Algorithmic Autotuner for Graph Processing on GPUs
Ke Meng, Jiajia Li, Guangming Tan, Ninghui Sun

Principles and Practice of Parallel Programming (PPoPP). 2019 Best Paper Award Finalist
[paper] [bib] [code]
An Autotuning Protocol to Rapidly Build Autotuners
Junhong Liu, Guangming Tan, Yulong Luo, Jiajia Li, Zeyao Mo, Ninghui Sun

ACM Transactions on Parallel Computing. 2019
[paper] [bib]
Scalable Tensor Decompositions in High Performance Computing Environments
Jiajia Li

PhD Dissertation. Georgia Institute of Technology, Atlanta, GA, USA. July 2018
[paper][bib]
HiCOO: Hierarchical Storage of Sparse Tensors
Jiajia Li, Jimeng Sun, Richard Vuduc

ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC). 2018 Best Student Paper Award
[paper] [slides] [bib] [code]
News: [PNNL Press] [GaTech Press]
Optimizing Sparse Tensor Times Matrix on GPUs
Yuchen Ma, Jiajia Li, Xiaolong Wu, Chenggang Yan, Jimeng Sun, Richard Vuduc

Journal of Parallel and Distributed Computing (Special Issue on Systems for Learning, Inferencing, and Discovering)
[paper] [bib] [code]
An Initial Characterization of the Emu Chick
Eric Hein, Tom Conte, Jeffrey Young, Srinivas Eswar, Jiajia Li, Patrick Lavin, Richard Vuduc, Jason Riedy

2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 2018
[paper][bib]
Bridging the Gap between Deep Learning and Sparse Matrix Format Selection
Yue Zhao, Jiajia Li, Chunhua Liao, Xipeng Shen

23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). 2018
[paper][bib]
Design and Implementation of Adaptive SpMV Library for Multicore and Manycore Architecture
Guangming Tan, Junhong Liu, Jiajia Li

ACM Transactions on Mathematical Software. 2018
[paper][bib]
Model-Driven Sparse CP Decomposition for Higher-Order Tensors
Jiajia Li, Jee Choi, Ioakeim Perros, Jimeng Sun, Richard Vuduc

31st IEEE International Parallel & Distributed Processing Symposium (IPDPS). 2017
[paper][slides][code][bib]
Bridging the Gap between Deep Learning and Sparse Matrix Format Selection
Yue Zhao, Jiajia Li, Chunhua Liao, Xipeng Shen

The 26th International Conference on Parallel Architectures and Compilation Techniques (PACT). 2017
[poster][bib]
Non-Negative CP Decomposition on Distributed Clusters
Jiajia Li, Jee Choi, Xing Liu, Richard Vuduc

(To be submitted). 2017
Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning
Xiuxia Zhang, Guangming Tan, Shuangbai Xue, Jiajia Li, Keren Zhou, Mingyu Chen

22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). 2017
[paper][bib]
ParTI!: a Parallel Tensor Infrastructure for Data Analysis
Jiajia Li, Yuchen Ma, Chenggang Yan, Jimeng Sun, Richard Vuduc

Tensor-Learn Workshop, co-located with NIPS’16. 2016
[poster][code]
Optimizing Sparse Tensor Times Matrix on multi-core and many-core architectures
Jiajia Li, Yuchen Ma, Chenggang Yan, Richard Vuduc

The sixth Workshop on Irregular Applications: Architectures and Algorithms (IA^3), co-located with SC’16. 2016
[paper] [code] [bib]
Model-driven Sparse CP Decomposition for High-Order Tensors
Jiajia Li, Ioakeim Perros, Jimeng Sun, Richard Vuduc

The fifth international Women in HPC workshop, co-located with SC’16. 2016
[poster]
Tensors in Data Analysis: Methods, Applications, and Software
Casey Battaglino, Jiajia Li, Ioakeim Perros, Jimeng Sun, Richard Vuduc

(To be submitted). 2015
An Input-Adaptive and In-Place Approach to Dense Tensor-Times-Matrix Multiply
Jiajia Li, Casey Battaglino, Ioakeim Perros, Jimeng Sun, Richard Vuduc

The International Conference for High Performance Computing, Networking, Storage and Analysis (SC). 2015
[pdf] [slides] [code] [bib]
Introducing high performance computing concepts into engineering undergraduate curriculum: a success story
B. Neelima, Jiajia Li

Proceedings of the Workshop on Education for High-Performance Computing (EduHPC), co-located with the International Conference for High Performance Computing, Networking, Storage and Analysis (SC). 2015
[pdf][bib]
SMAT: A Cross-Platform Input Adaptive Auto-Tuner for Sparse Matrix-Vector Multiplication
Jiajia Li, Zhonghai Zhang, Guangming Tan, David Bader

Technical Report. 2014
The Study of Choosing the Best Storage Format of Sparse Matrix Vector Multiplication
Jiajia Li, Xiuxia Zhang, Guangming Tan, Mingyu Chen

Journal of Computer Research and Development. Vol 51. (IN CHINESE). 2014
Research on Sparse Matrix Vector Multiplication Auto-tuning Method
Jiajia Li

PhD Thesis, The University of Chinese Academy of Sciences, Beijing, China 2013
SMAT: An Input Adaptive Auto-Tuner for Sparse Matrix-Vector Multiplication
Jiajia Li, Guangming Tan, Mingyu Chen, Ninghui Sun

Programming Language Design and Implementation (PLDI). 2013
[pdf] [slides] [code] [bib]
An Optimized Large-Scale Hybrid DGEMM Design for CPUs and ATI GPUs
Jiajia Li, Xingjian Li, Guangming Tan, Mingyu Chen, Ninghui Sun

International Conference on Supercomputing (ICS). 2012
[pdf] [slides] [web] [bib]
Algebraic Multi-grid Optimization Study on GPU
Jiajia Li, Xiuxia Zhang, Guangming Tan, Mingyu Chen

HPC China (IN CHINESE). 2011
[pdf]
Memory Accessing Analysis of Sparse Matrix Vector Multiplication and Optimization
Xiuxia Zhang, Mingyu Chen, Jiajia Li, Guangming Tan

HPC China (IN CHINESE). 2011
[pdf]
Automatically Tuned Dynamic Programming with an Algorithm-by-Blocks
Jiajia Li, Guangming Tan, Mingyu Chen

16th International Conference on Parallel and Distributed Systems (ICPADS). 2010
[pdf] [bib]

Activities

Organizing Activities

Program Co-Chair of the 26th International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS'22).
Finance Chair of International Conference on Supercomputing (ICS'21).
Artifact Evaluation Chair of the 2021 edition of the International Conference on Languages Compilers, Tools and Theory of Embedded Systems (LCTES'21).
Publicity Chair of International Conference on Parallel Architectures and Compilation Techniques (PACT'20).
Finance and Session Chair of Principles and Practice of Parallel Programming (PPoPP'20).
Proceeding Chair of Emerging Parallel and Distributed Runtime Systems and Middleware Workshop (IPDRM'19), Held in conjunction with IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis (SC'19).
Web Chair of International Conference on Parallel Architectures and Compilation Techniques (PACT'19).
Co-Chair of the 25th International European Conference on Parallel and Distributed Computing (Euro-Par'19).
Co-Organizer of The First International Workshop on the Intersection of High Performance Computing and Machine Learning (HPCaML'19), Held in conjunction with International Symposium on Code Generation and Optimization (CGO'19).
Co-Organizer of SIAM Conference on Computational Science and Engineering (SIAM CSE'19) Minisymposium "High Performance Sparse Matrix, Tensor, and Graph Kernels".

Peer Review Activities

2022: PC member of SC, IPDPS, ISC, CLUSTER, PPoPP, SIAM PP.
2021: PC member of SC, LCTES, ICPP, CLUSTER, ICS, ICDCS, ISC.
PC member of ACM/IEEE International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC'20).
Research Poster Committee member of High Performance Computing, Networking, Storage, Data Analytics, and AI/Machine Learning (ISC'20).
PC member of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'20).
PC member of the International Conference on Parallel Processing (ICPP'20).
Travel Grant PC member of the ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC'19).
PC member of the Workshop on Tensor Methods for Emerging Data Science Challenges (TMEDSC), held in conjunction with the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'19).
PC member of the 25th International European Conference on Parallel and Distributed Computing (Euro-Par'19).
PC member of the First International Workshop on the Intersection of High Performance Computing and Machine Learning (HPCaML'19), held in conjunction with International Symposium on Code Generation and Optimization (CGO'19).
PC member of the International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP'17,18,19).
External PC member of The 32nd ACM International Conference on Supercomputing (ICS'18).
PC member of Experiments and Practice in Parallel and Distributed Computing Track of IEEE International Parallel & Distributed Processing Symposium (IPDPS'18).
PC member of Student Research Competition (SRC) of the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'18).
PC member of Parallel Algorithm Track of National Annual Conference on High Performance Computing (HPC China)”.

2013-2019
Reviewer of The 32nd ACM International Conference on Supercomputing (ICS'18), The 47th International Conference on Parallel Processing (ICPP'18,19), the 21st IEEE International Conference on Parallel and Distributed Systems (ICPADS'15), the Transactions on Parallel and Distributed Systems(TPDS), the Frontiers of Computer Science, IEEE Transactions on Neural Networks and Learning Systems(TNNLS), Algorithmica Journal, Journal of Parallel and Distributed Computing (JPDC), Parallel Computing Journal (ParCo), Transactions on Knowledge and Data Engineering (TKDE), Journal of Low Power Electronics and Applications, Journal of Visual Communication and Image Representation, IEEE Access, International Journal of High Performance Computing Applications, CCF Transactions on High Performance Computing (THPC) .

Other Activities

STEM Ambassador of the Office of STEM Education at PNNL.

2019
Organizer of Hot CSE seminar, a PhD academic seminar in GT CSE.

2014-2018
Volunteer Librarian of Repetitive Stress Injury (RSI) Lending Library of GT College of Computing.

2013-2018
Teach Assistant of Intro to High-Performance Computing (OMSCS) (CSE 6220).

Spring 2017
Reviewer of “President’s Undergraduate Research Awards (PUMA)”.

2013, 2014, 2016
Reviewer of National Center for Women & IT (NCWIT) Award.

2013, 2015
Teach Assistant of High-Performance Computing: Tools and Applications (CSE 6230).

Fall 2014
Instructor of “Parallel Computing on GPUs using CUDA” Training at Sun Yat-sen University

June 2012
Teach Assistant of “Parallel Computer Architecture" class of Dragonstar Project.

May 2012
Vice Minister of Academic Study of Student Union at the University of Chinese Academy of Sciences

2008-2009