Book Chapters
- [B1] Qiang Guan, Nathan DeBardeleben, Sean Blanchard, Song Fu, Claude H. Davis IV and William M. Jones
"Analyzing the Robustness of HPC Applications Using a Fine-Grained Soft Error Fault Injection Tool",
Innovative Research and Applications in Next-Generation High Performance Computing.
Peer-Reviewed Journal Publications
- [J5] Li Tan, Nathan DeBardeleben, Qiang Guan, Sean Blanchard, and Michael Lang
"Using Virtualization to Quantify Power Conservation via Near-Threshold Voltage Reduction for Inherently Resilient Applications",
Journal of Parallel Computing (PARCO), 2017.
- [J4] Qiang Guan, Nathan DeBardeleben, Sean Blanchard and Song Fu
"Addressing Statistical Significance of Fault Injection: Empirical Studies of the Soft Error Susceptibility",
International Journal of High Performance Computing and Networking, June 2016.
- [J3] Qiang Guan, Ziming Zhang and Song Fu
"A Failure Detection and Prediction Mechanism for Enhancing Dependability of Data Centers",
International Journal of Computer Theory and Engineering (IJCTE), 2012.
- [J2] Qiang Guan, Ziming Zhang and Song Fu
"Ensemble of Bayesian Predictors and Decision Trees for Proactive Failure Management in Cloud",
Journal of Communication, 2012.
- [J1] Cheng-Ri Piao, Qiang Guan and Seung-soo Han
"Robust Digital Image Watermarking Algorithm Using RBF Neural Network in DWT domain",
International Journal of Fuzzy Logic and Intelligent Systems, 2008.
Peer-Reviewed Conference Publications
2018
- [C42] Xinyu Chen, Qiang Guan, Li-Ta Lo, Simon Su, Zhengyong Ren, James Ahrens, and Trilce Estrada
"In situ TensorView: In situ Visualization of Convolutional Neural Network",
IEEE BigData, 2018.
- [C41] Jieyang Chen, Qiang Guan, Xin Liang, Paul Bryant, Patricia Grubel, Allen McPherson, Li-Ta Lo, Timothy Randles, Zizhong Chen, James Ahrens
"Build and Execution Environment (BEE): an Encapsulated Environment Enabling HPC Applications Running Everywhere",
IEEE BigData, 2018.
- [C40] Jieyang Chen, Hongbo Li, Sihuan Li, Xin Liang, Panruo Wu, Dingwen Tao, Kaiming Ouyang, Yuanlai Liu, Kai Zhao, Qiang Guan, Zizhong Chen
"Fault Tolerant One-sided Matrix Decompositions on Heterogeneous Systems with GPUs",
SC, 2018.
- [C39] Kai Wu, Wenqian Dong, Qiang Guan, Nathan DeBardeleben, Dong Li
"Modeling Application Resilience in Large-scale Parallel Execution",
ICPP, 2018.
- [C38] Jieyang Chen, Qiang Guan, Zhao Zhang, Xin Liang, Louis Vernon, Allen McPherson, Li-Ta Lo, Patricia Grubel, Jim Ahrens, Zizhong Chen
"BeeFlow: a Workflow Management System for In situ Processing Across HPC and Cloud Systems",
ICDCS, 2018.
- [C37] Simon Su, Vincent Perry, Qiang Guan, Andrew Durkee, Alexis R Neigel, and Sue Kase
"Sensor Data Fusion Framework to Improve Holographic Object Registration Accuracy for a Shared Augmented Reality Mission Planning Scenario",
Human-Computer Interaction International Conference (HCI), 2018.
2017
- [C36] Xinyu Chen, Qiang Guan, Xin Liang, Li-Ta Lo, Simon Su, Trilce Estrada, James Ahrens
"TensorView: Visualizing Training of Convolutional Neural Network Using Paraview",
Workshop on Distributed Infrastructures for Deep Learning (DIDL) with Middleware, 2017.
- [C35] Taniya Siddiqua, Vilas Sridharan, Steven E. Raasch, Nathan DeBardeleben, Kurt B. Ferreira, Scott Levy, Elisabeth Baseman, Qiang Guan
"Lifetime Memory Reliability Data from the Field",
Best Paper Finalist, IEEE DFT 2017.
- [C34] Li Tan, Nathan DeBardeleben, Qiang Guan, Sean Blanchard, Michael Lang
"RSVP: Soft Error Resilient Power Saving at Near-Threshold Voltage Using Register Vulnerability",
DSN-Workshop, 2017.
- [C33] Ryan Slechta, Laura Monroe, Nathan DeBardeleben, Qiang Guan, Joanne Wendelberger, Sarah Michalak
"Resilience of Top K Selection Algorithms",
EDCC '17.
- [C32] Bo Fang, Qiang Guan, Nathan DeBardeleben, Karthik Pattabiraman, Matei Ripeanu
"LetGo: A Lightweight Continuous Framework for HPC Applications Under Failures",
HPDC'17. Acceptance Rate: 19%.
- [C31] Panruo Wu, Nathan DeBardeleben, Qiang Guan, Sean Blanchard, Jieyang Chen, Dingwen Tao, Xin Liang, Kaiming Ouyang, Sihuan Li, and Zizhong Chen
"Silent Data Corruption Resilient Two-sided Matrix Factorizations",
PPoPP'17. Acceptance Rate: 21.9% (29/132).
2016
- [C30] Laura Monroe, John Daly, Nathan DeBardeleben, Sarah Michalak, Qiang Guan and Kevin Rudd
"Probabilistic Computing for HPC in the Post-Moore’s Era",
Post-Moore's Era Supercomputing (PMES) Workshop in conjunction with SC'16.
- [C29] Qiang Guan, Nathan DeBardeleben, Panruo Wu, Stephan Eidenbenz, Sean Blanchard, Laura Monroe, Elisabeth Baseman, and Li Tan
"Design, Use, and Evaluation of P-FSEFI: A Parallel Soft Error Fault Injection Framework for Emulating Soft Errors in Parallel Applications",
SIMUTOOLS'16.
- [C28] Elisabeth Baseman, Nathan DeBardeleben, Kurt Ferreira, Scott Levy, Steven Raasch, Vilas Sridharan, Taniya Siddiqua and Qiang Guan
"Improving DRAM Fault Characterization Through Machine Learning",
DSN'16.
- [C27] Panruo Wu, Qiang Guan, Nathan DeBardeleben, Sean Blanchard, Dingwen Tao, Xin Liang, Jieyang Chen, and Zizhong Chen
"Towards Practical Algorithm Based Fault Tolerance in Dense Linear Algebra",
HPDC'16.
- [C26] Bo Fang, Panruo Wu, Qiang Guan, Nathan DeBardeleben, Laura Monroe, Sean Blanchard, Zizhong Chen, Karthik Pattabiraman, Matei Ripeanu
"SDC is in the Eye of the Beholder: A Survey and Preliminary Study",
3rd IEEE International Workshop on Reliability and Security Data Analysis (RSDA), 2016.
- [C25] Qiang Guan, Nathan DeBardeleben, Sean Blanchard, Panruo Wu, Laura Monroe and Zizhong Chen
"P-FSEFI: A Parallel Soft Error Fault Injection Framework for Parallel Applications",
The 12th Workshop on Silicon Errors in Logic - System Effects (SELSE'16), 2016.
- [C24] Laura Monroe, William Jones, Claude Davis, Scott Lavigne, Qiang Guan and Nathan DeBardeleben
"On the Inherent Resilience of Integer Operators",
The 12th Workshop on Silicon Errors in Logic - System Effects (SELSE'16), 2016.
2015
- [C23] Song Huang, Song Fu, Nathan DeBardeleben, Qiang Guan and Cheng-Zhong Xu
"Differentiated Failure Remediation with Action Selection for Resilient Computing",
The 21st IEEE Pacific Rim International Symposium on Dependable Computing (PRDC'15), 2015.
- [C22] Qiang Guan, Nathan DeBardeleben, Brian Atkinson, Robert Robey, and William Jones
"Towards Building Resilient Scientific Applications: Resilience Analysis on the Impact of Soft Error and Transient Error Tolerance with CLAMR Hydrodynamics Mini-App",
IEEE Cluster'15.
- [C21] Qiang Guan, Nathan DeBardeleben, Sean Blanchard and Song Fu
"Empirical Studies of the Soft Error Susceptibility of Sorting Algorithms",
5th Fault Tolerance for HPC at eXtreme Scale (FTXS) Workshop with HPDC 2015.
- [C20] Qiang Guan, Nathan DeBardeleben, Sean Blanchard and Song Fu
"Empirical Studies of the Soft Error Susceptibility of Sorting Algorithms to Statistical Fault Injection",
11th Workshop on Silicon Errors in Logic - System Effects (SELSE'15), 2015.
2014
- [C19] Brian Atkinson, Nathan DeBardeleben, Qiang Guan, Robert Robey, and William Jones
"Fault Injection Experiments with the CLAMR Hydrodynamics Mini-App",
25th IEEE International Symposium on Software Reliability Engineering (ISSRE'14), 2014.
- [C18] Qiang Guan, Nathan DeBardeleben, Sean Blanchard and Song Fu
"Towards Exploring the Soft Error Susceptibility of Heapsort Algorithm",
DSN'14.
- [C17] Qiang Guan, Song Fu, Nathan DeBardeleben and Sean Blanchard
"F-SEFI: A Fine-Grained Soft Error Fault Injection Tool for Profiling Application Vulnerability",
IPDPS'14.
2013
- [C16] Qiang Guan and Song Fu
"Exploring Time and Frequency Domains for Accurate and Automated Anomaly Detection in Cloud Computing Systems",
PRDC'13.
- [C15] Qiang Guan and Song Fu
"Wavelet-Based Multi-scale Anomaly Identification in Cloud Computing Systems",
GLOBECOM'13.
- [C14] Qiang Guan, Song Fu, Nathan DeBardeleben and Sean Blanchard
"Autonomic Failure Identification and Diagnosis for Building Dependable Computing Systems",
Ph.D. Showcase, IEEE/ACM Supercomputing Conference (SC), 2013.
- [C13] Qiang Guan and Song Fu
"Adaptive Anomaly Identification by Exploring Metric Subspace in Cloud Computing Infrastructures",
SRDS'13.
Before 2013
- [C12] Husanbir S Pannu, Jianguo Liu, Qiang Guan and Song Fu
"AFD: Adaptive Failure Detection System for Cloud Computing Infrastructure",
IPCCC'12.
- [C11] Ziming Zhang, Qiang Guan and Song Fu
"An Adaptive Power Management Framework for Autonomic Resource Configuration in Cloud Computing Infrastructures",
IPCCC'12.
- [C10] Qiang Guan, Chi-Chen Chiu and Song Fu
"A Cloud Dependability Analysis Framework for Characterizing System Dependability in Cloud Computing Infrastructures",
PRDC'12.
- [C9] "Efficient and Accurate Anomaly Identification Using Reduced Metric Space in Utility Cloud",
NAS'12.
- [C8] Qiang Guan, Ziming Zhang and Song Fu
"Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems",
IEEE International Conference on Availability, Reliability and Security (ARES'11), Aug 2011.
- [C7] Nathan DeBardeleben, Sean Blanchard, Qiang Guan, Ziming Zhang and Song Fu
"Experimental Framework for Injecting Logic Errors in a Virtual Machine to Profile Applications for Soft Error Resilience",
Resilience, Intl. European Conference on Parallel and Distributed Computing (Euro-Par), 2011.
- [C6] Qiang Guan, Ziming Zhang and Song Fu
"Ensemble of Bayesian Predictors for Autonomic Failure Management in Cloud Computing",
IEEE Intl. Conference on Computer Communications and Networks (ICCCN'11), 2011.
- [C5] Qiang Guan and Song Fu
"auto-AID: A Data Mining Framework for Autonomic Anomaly Identification in Networked Computer Systems",
IEEE Intl. Performance Computing and Communications Conference (IPCCC'11), 2011.
- [C4] Qiang Guan, Derek Smith and Song Fu
"Anomaly Detection in Large-Scale Coalition Clusters for Dependability Assurance",
IEEE Intl. Conference on High Performance Computing (HiPC'10), 2010.
- [C3] Derek Smith, Qiang Guan and Song Fu
"An Anomaly Detection Framework for Autonomic Management of Compute Cloud Systems",
34th IEEE International Conference on Computer Software and Applications (COMPSAC'10), July 2010.
- [C2] Qiang Guan and Seung-soo Han
"Reliability and Dependability Analysis for Agent-Based Reliability Enhancement Technology (ARET) System",
International Conference on Electronic Computer Technology (ICECT), 2009.
- [C1] Yang Liu, Qiang Guan, Seung-Soo Han, Myeon-Song Choi, Seung-Jae Lee
"Research on Optimization of Process Bus in IEC 61850 Based Substation Communication Network",
The International Conference on Electrical Engineering (ICEE), 2009.