Associate Researcher / Senior Engineer

Junmin Xiao (肖俊敏)

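  • Title: Associate Researcher
  • Research interests: parallel algorithm design, neural network kernel optimization, and distributed deep learning
  • Advisor category: Master's supervisor
  • Email: xiaojunmin@ict.ac.cn
  • Homepage: https://people.ucas.edu.cn/~xiaojunmin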

Biography

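Junmin Xiao is an associate researcher and master's supervisor at the Institute of Computing Technology, Chinese Academy of Sciences. He received his Ph.D. in science in July 2012 from the Institute of Computational Mathematics and Scientific/Engineering Computing, Chinese Academy of Sciences. He has long worked on high-performance parallel optimization and has published multiple papers in international conferences and journals, including SC, PPoPP, ICS, SPAA, and ApJ. He has served as principal investigator of a General Program project and a Young Scientists Fund project of the National Natural Science Foundation of China, and has participated as a core member in a Chinese Academy of Sciences Key Deployment Project and several National Key R&D Program projects. His research interests mainly include parallel algorithm design, neural network kernel optimization, and distributed deep learning.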

Awards and Honors:

(1) IEEE ISPA’19 Best Paper Award, other, 2019
(2) “Outstanding Researcher of 2018”, institute (university) level, 2018
(3) “Outstanding Researcher of 2017”, institute (university) level, 2017

Selected Publications:

JOURNAL ARTICLES
(1) Zhongzhe Hu, Junmin Xiao, Ninghui Sun, and Guangming Tan, “Fast and accurate variable batch size convolution neural network training on large scale distributed systems”, Concurrency and Computation: Practice and Experience, Wiley, 2022.
(2) Junmin Xiao, and Jian Peng, “Tradeoffs between Computation, Communication, and Synchronization in Stencil-collective Alternate Update”, CCF Transactions on High Performance Computing, Springer, 2019, 1(2), pp. 144--160.
(3) Junmin Xiao, Guizhao Zhang, Yanan Gao, Xuehai Hong, and Guangming Tan, “Fast Data-obtaining Algorithm for Data Assimilation with Large Data Set”, International Journal of Parallel Programming, Springer, 2019, 47, pp. 1--21.
(4) Junmin Xiao, Jun Zhang, Ting Li, and Shuhong Yang, “Dark Ribbons Propagating and Sweeping Across Extreme Ultraviolet Structures after Filament Eruptions”, The Astrophysical Journal, 2015, 805(1), pp. 25--37.
(5) Huadong Chen, Jun Zhang, Suli Ma, Shuhong Yang, Leping Li, Xin Huang, and Junmin Xiao, “Confined Flares in Solar Active Region 12192 from 2014 October 18 to 29”, The Astrophysical Journal Letters, 2015, 808, pp. L24--L31.
(6) Junmin Xiao, and Qiya Hu, “Multilevel Correction for Collocation Solutions of Volterra Integral Equations with Proportional Delays”, Advances in Computational Mathematics, 2013, 39(3-4), pp. 611--644.
(7) Xingding Chen, Qiya Hu, and Junmin Xiao, “On the Enhanced Strain Finite Element Method for Incompressible Linear Elasticity”, Applied Numerical Mathematics, 2013, 72, pp. 131--142.
CONFERENCE PROCEEDINGS
(1) Zhongzhe Hu, Junmin Xiao, Zheye Deng, Mingyi Li, Kewei Zhang, Xiaoyang Zhang, Ke Meng, Ninghui Sun, and Guangming Tan, “MegTaiChi: Dynamic Tensor-based Memory Management Optimization for DNN Training”, 36th ACM International Conference on Supercomputing (ICS’22), Virtual Event, USA, 2022-06-28~2022-06-30.
(2) Junmin Xiao, Qing Xue, Hui Ma, Xiaoyang Zhang, and Guangming Tan, “POSTER: A W-cycle Algorithm for Efficient Batched SVD on GPUs”, 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’22), Seoul, Republic of Korea, 2022-04-02~2022-04-06.
(3) Xiaoyang Zhang, Junmin Xiao, and Guangming Tan, “I/O Lower Bounds for Auto-tuning of Convolutions in CNNs”, 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’21), Virtual Event, Republic of Korea, 2021-02-27~2021-03-03.
(4) Xiaoyang Zhang, Junmin Xiao, and Guangming Tan, “Brief Announcement: Communication Lower Bounds of Convolutions in CNNs”, 32nd ACM Symposium on Parallelism in Algorithms and Architectures (SPAA’20), Virtual Event, USA, 2020-07-15~2020-07-17.
(5) Junmin Xiao, Shijie Wang, Weiqiang Wan, Xuehai Hong, and Guangming Tan, “S-EnKF: Co-designing for Scalable Ensemble Kalman Filter”, 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’19), Washington, D.C., USA, 2019-02-16~2019-02-20.
(6) Guizhao Zhang, Junmin Xiao, Xuehai Hong, and Guangming Tan, “Fast Data-obtaining Algorithm for Data Assimilation with Large Data Set”, 16th International Conference on Network and Parallel Computing (NPC’19), Hohhot, Inner Mongolia, China, 2019-08-23~2019-08-24.
(7) Zhongzhe Hu, Junmin Xiao, Zhongbo Tian, Xiaoyang Zhang, Chengji Yao, Ninghui Sun, and Guangming Tan, “A Variable Batch Size Strategy for Large Scale Distributed DNN Training”, 17th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA’19), Xiamen, Fujian, China, 2019-12-16~2019-12-19 (BEST PAPER AWARD).
(8) Xiaoyang Zhang, Junmin Xiao, Xiaobin Zhang, Zhongzhe Hu, Hongrui Zhu, Zhongbo Tian, and Guangming Tan, “Tensor Layout Optimization of Convolution for Inference on Digital Signal Processor”, 17th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA’19), Xiamen, Fujian, China, 2019-12-16~2019-12-19.
(9) Junmin Xiao, Shigang Li, Baodong Wu, He Zhang, Kun Li, Erlin Yao, Yunquan Zhang, and Guangming Tan, “Communication-avoiding for Dynamical Core of Atmospheric General Circulation Model”, 47th ACM International Conference on Parallel Processing (ICPP’18), Eugene, Oregon, USA, 2018-08-13~2018-08-16.
(10) Baodong Wu, Shigang Li, Hang Cao, Yunquan Zhang, He Zhang, Junmin Xiao, and Minghua Zhang, “AGCM3D: A Highly Scalable Finite-Difference Dynamical Core of Atmospheric General Circulation Model based on 3D Decomposition”, 24th IEEE International Conference on Parallel and Distributed Systems (ICPADS’18), Sentosa, Singapore, 2018-12-11~2018-12-13.

Research Projects:

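(1) Communication Optimization for High-Resolution Global Atmospheric General Circulation Models, principal investigator, national project, 2019-01--2021-12
(2) Key Standards and Verification Chip for Neural Network Processors, Subproject 2: Algorithm Library and Toolchain for Neural Network Processors, participant, national project, 2019-12--2022-12
(3) Global High-Resolution Ocean Data Assimilation Technology Research and Operational Application Demonstration, Subproject 6: High-Performance Parallel Optimization of the Assimilation Program, participant, national project, 2016-09--2020-12
(4) Large-Scale Distributed Deep Learning Software Development and Parallel Algorithm Research, principal investigator, institute-selected project, 2019-06--2020-06
(5) Parallel Optimization of Convolutional Neural Network Training and Inference, principal investigator, national project, 2022-01--2025-12
(6) Development of Software- and Hardware-Accelerated Hardware Devices for Forgery Detection, principal investigator, institute-selected project, 2021-06--2023-05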