Hi, I am Zhuangbin Chen, currently an Assistant Professor at the School of Software Engineering, Sun Yat-sen University. I work on Software Engineering, Cloud Computing, and Datacenter Networking. More specifically, I am interested in improving the reliability of online services and the troubleshooting process of data center networks. Prior to SYSU, I obtained a Ph.D. at the Computer Science and Engineering Department of The Chinese University of Hong Kong. I was privileged to work with Prof. Michael R. Lyu.


Updates

07/2024    Our paper won the 🏆 Best Paper Award at CLOUD '24! Congrats to the authors!

03/2024    One paper accepted by ISSTA '24! Congrats to the authors!

01/2023    One paper accepted by FSE '24! Congrats to the authors!

12/2023    One paper accepted by ICSE '24! Congrats to the authors!

08/2023    Two papers accepted by ASE '23! Congrats to the authors!

07/2023    One paper accepted by ISSRE '23! Congrats to the authors!

01/2023    Three papers accepted by ICSE '23! Congrats to the authors!

01/2023    I will be joining the School of Software Engineering of Sun Yat-sen University.

07/2022    Our project LogPAI: An Open-Source Project for Automated Log Analysis won The First IEEE Open Software Services Award!

12/2021    One paper accepted by ICSE '22!


Research

Cloud System Reliability
  • Prism: Revealing Hidden Functional Clusters of Massive Instances in Cloud Systems [ASE '23]
  • Maat: Performance Metric Anomaly Anticipation for Cloud Services with Conditional Diffusion [ASE '23]
  • Adaptive Performance Anomaly Detection for Online Service Systems via Pattern Sketching [ICSE '22]
  • Graph-based Incident Aggregation for Large-Scale Online Service Systems [ASE '21]
  • Towards Intelligent Incident Management: Why We Need It and How We Make It [FSE '20]
Log Analysis
  • Experience Report: Deep Learning-based System Log Analysis for Anomaly Detection [arXiv preprint '21]
  • A Survey on Automated Log Analysis for Reliability Engineering [CSUR '21]
  • Characterizing the Natural Language Descriptions in Software Logging Statements [ASE '18]

More...


Selected Publications (* indicates correspondence)

2024

  • [CLOUD '24] TraceMesh: Scalable and Streaming Sampling for Distributed Traces
    Zhuangbin Chen, Zhihan Jiang, Yuxin Su, Michael R. Lyu, Zibin Zheng
    [PDF] [Code]

    🏆 Best Paper Award

  • [ISSTA '24] A Large-Scale Evaluation for Log Parsing Techniques: How Far Are We?
    Zhihan Jiang, Jinyang Liu, Junjie Huang, Yichen Li, Yintong Huo, Jiazhen Gu, Zhuangbin Chen*, et al.
    [PDF]

  • [FSE '24] LILAC: Log Parsing using LLMs with Adaptive Parsing Cache
    Zhihan Jiang, Jinyang Liu, Zhuangbin Chen, et al.
    [PDF]

  • [ICSE '24] FaultProfIT: Hierarchical Fault Profiling of Incident Tickets in Large-scale Cloud Systems
    Junjie Huang, Jinyang Liu, Zhuangbin Chen, et al.
    [PDF]

2023

  • [ASE '23] Prism: Revealing Hidden Functional Clusters of Massive Instances in Cloud Systems
    Jinyang Liu, Zhihan Jiang, Jiazhen Gu, Junjie Huang, Zhuangbin Chen*, et al.
    [PDF]

  • [ASE '23] Maat: Performance Metric Anomaly Anticipation for Cloud Services with Conditional Diffusion
    Cheryl Lee, Tianyi Yang, Zhuangbin Chen*, Yuxin Su, Michael R. Lyu
    [PDF]

  • [ISSRE '23] Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services
    Jinyang Liu, Tianyi Yang, Zhuangbin Chen*, Yuxin Su, Cong Feng, Zengyin Yang, Michael R. Lyu
    [PDF]

  • [ICSE '23] Incident-Aware Duplicate Ticket Aggregation for Cloud Systems
    Jinyang Liu, Shilin He, Zhuangbin Chen, et al.
    [PDF]

  • [ICSE '23] Heterogeneous anomaly detection for software systems via semi-supervised cross-modal attention
    Cheryl Lee, Tianyi Yang, Zhuangbin Chen, Yuxin Su, Yongqiang Yang, Michael R. Lyu
    [PDF]

  • [ICSE '23] Eadro: An End-to-End Troubleshooting Framework for Microservices on Multi-Source Data
    Cheryl Lee, Tianyi Yang, Zhuangbin Chen, Yuxin Su, Michael R. Lyu
    [PDF]

2022

  • [SIGOPS OSR '22] An Intelligent Framework for Timely, Accurate, and Comprehensive Cloud Incident Detection
    Yichen Li, Xu Zhang, Shilin He, Zhuangbin Chen, Yu Kang, et al.
    [PDF]

  • [ICSE '22] Adaptive Performance Anomaly Detection for Online Service Systems via Pattern Sketching
    Zhuangbin Chen, Jinyang Liu, Yuxin Su, Hongyu Zhang, et al.
    [PDF] [Code] [Official News]

2021

  • [ASE '21] Graph-based Incident Aggregation for Large-Scale Online Service Systems
    Zhuangbin Chen, Jinyang Liu, Yuxin Su, Hongyu Zhang, et al.
    [PDF] [Official News]

  • [arXiv '21] Experience Report: Deep Learning-based System Log Analysis for Anomaly Detection
    Zhuangbin Chen, Jinyang Liu, Wenwei Gu, Yuxin Su, and Michael R. Lyu
    [PDF] [Code]

  • [TOSEM '21] Memory-Safety Challenge Considered Solved? An In-Depth Study with All Rust CVEs
    Hui Xu, Zhuangbin Chen, Mingshen Sun, Yangfan Zhou, and Michael R. Lyu
    [PDF]

  • [CSUR '21] A Survey on Automated Log Analysis for Reliability Engineering
    Shilin He, Pinjia He, Zhuangbin Chen, Tianyi Yang, Yuxin Su, and Michael R. Lyu
    [PDF]

2020

  • [FSE '20] Towards Intelligent Incident Management: Why We Need It and How We Make It
    Zhuangbin Chen, Yu Kang, Hongyu Zhang, et al.
    [PDF]

  • [AAAI-W '20] AIOps Innovations of Incident Management for Cloud Services
    Zhuangbin Chen, Yu Kang, et al.
    [PDF]

2018

  • [ASE '18] Characterizing the Natural Language Descriptions in Software Logging Statements
    Pinjia He, Zhuangbin Chen, Shilin He, and Michael R. Lyu
    [PDF] [Dataset]

  • [CIKM '18] Neural relational topic models for scientific article analysis
    Haoli Bai, Zhuangbin Chen, Michael R. Lyu, Irwin King, and Zenglin Xu
    [PDF] [Code]


Teaching

Cloud Computing Technology (SSE316云计算技术)

Undergraduates, School of Software Engineering, Sun Yat-sen University, Spring 2023


Academic Service

Program Committee

2022: ICONIP

2020: ICONIP

Reviewer/Sub-Reviewer

ICSE, DSN, NeurIPS, WWW, WSDM


Awards

2024

2022

2020

  • Stars of Tomorrow (Award of Excellent Intern), Microsoft Research Asia