Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Posts
Understanding Gradient Compression in Distributed Training
Published:
As deep learning models continue to grow in size, distributed training has become essential for reducing training time. However, the communication overhead between nodes can become a significant bottleneck. This is where gradient compression comes into play.
publications
CGAN-IRB: a novel data augmentation method for apple leaf diseases
Published in the proceedings of the 2021 IEEE 45th Annual Computers, Software, and Applications Conference (COMPSAC), 2021
Recommended citation: Xinbin Yuan, Cong Yu, Bin Liu, Henan Sun, Xianyu Zhu, "CGAN-IRB: a novel data augmentation method for apple leaf diseases." In the proceedings of 2021 IEEE 45th Annual Computers, Software, and Applications Conference (COMPSAC), 2021.
Apple-YOLO: A novel mobile terminal detector based on YOLOv5 for early apple leaf diseases
Published in the proceedings of the 2022 IEEE 46th Annual Computers, Software, and Applications Conference (COMPSAC), 2022
Recommended citation: Jinjiang Li, Xianyu Zhu, Runchang Jia, Bin Liu, Cong Yu, "Apple-YOLO: A novel mobile terminal detector based on YOLOv5 for early apple leaf diseases." In the proceedings of 2022 IEEE 46th Annual Computers, Software, and Applications Conference (COMPSAC), 2022.
Lightweight identification model for apple leaf diseases and pests based on mobile terminals
Published in Transactions of the Chinese Society of Agricultural Engineering (TCSAE), 2022
Recommended citation: Bin Liu, Runchang Jia, Xianyu Zhu, Cong Yu, Zhuohan Yao, Haixi Zhang, Dongjian He, "Lightweight identification model for apple leaf diseases and pests based on mobile terminals." Transactions of the Chinese Society of Agricultural Engineering, 2022.
LAD-Net: A Novel Light Weight Model for Early Apple Leaf Pests and Diseases Classification
Published in IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2023
Recommended citation: Xianyu Zhu, Jinjiang Li, Runchang Jia, Bin Liu, Zhuohan Yao, Aihong Yuan, Yingqiu Huo, Haixi Zhang, "LAD-Net: A Novel Light Weight Model for Early Apple Leaf Pests and Diseases Classification." IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2023.
SWattention: designing fast and memory-efficient attention for a new Sunway Supercomputer
Published in The Journal of Supercomputing, 2024
Recommended citation: Ruohan Wu, Xianyu Zhu, Junshi Chen, Sha Liu, Tianyu Zheng, Xin Liu, Hong An, "SWattention: designing fast and memory-efficient attention for a new Sunway Supercomputer." The Journal of Supercomputing, 2024.
SwFormer: Enabling Faster Foundation Models on new Sunway Supercomputer via Holistic Kernel Tiling and Scheduling
Published in Journal of Computer Science and Technology (JCST), 2025
Recommended citation: Ruohan Wu, Xianyu Zhu, Junshi Chen, Hong An, "SwFormer: Enabling Faster Foundation Models on new Sunway Supercomputer via Holistic Kernel Tiling and Scheduling." Journal of Computer Science and Technology (JCST), 2025.
swPredictor: A Data-Driven Performance Model for Distributed Data Parallelism Training on Large-Scale HPC Clusters
Published in Performance Evaluation: An International Journal (PEVA), 2025
Recommended citation: Xianyu Zhu, Ruohan Wu, Junshi Chen, Hong An, "swPredictor: A Data-Driven Performance Model for Distributed Data Parallelism Training on Large-Scale HPC Clusters." Performance Evaluation: An International Journal (PEVA), 2025.
