- Neural Acceleration of Incomplete Cholesky Preconditioners. Joshua D. Booth, Hongyang Sun, Trevor Garnett. Neural Computing and Applications, 2024 (To appear).
- *A Survey on Checkpointing Strategies: Should We Always Checkpoint à la Young/Daly? Leonardo Bautista-Gomez, Anne Benoit, Sheng Di, Thomas Hérault, Yves Robert, Hongyang Sun. Future Generation Computer Systems, 161: 315-328, 2024. [pdf]
- *Improved Online Scheduling of Moldable Task Graphs under Common Speedup Models. Lucas Perotin, Hongyang Sun. ACM Transactions on Parallel Computing, 11(1): 2:1-2:31, 2024. [pdf]
- Multi-Resource Scheduling of Moldable Workflows. Lucas Perotin, Sandhya Kandaswamy, Hongyang Sun, Padma Raghavan. Journal of Parallel and Distributed Computing, 184:104792, 2024. [pdf]
2023
- INDICES: Applying DDDAS Principles for Performance Interference-aware Cloud-to-Fog Application Migration. Shashank Shekhar, Ajay Dev Chhokra, Anirban Bhattacharjee, Yogesh Barve, Shweta Khare, Guillaume Pallez, Hongyang Sun, Aniruddha Gokhale, Gabor Karsai. In: Darema, F., Blasch, E.P., Ravela, S., Aved, A.J. (eds) Handbook of Dynamic Data Driven Applications Systems. Springer, Cham. 2023. [preprint]
- Toward Automated Algorithm Configuration for Distributed Hybrid Flow Shop Scheduling with Multiprocessor Tasks. Hadi Gholami, Hongyang Sun. Knowledge-Based Systems, 264: 110309, 2023. [pdf]
- Dynamic Resource Management for Cloud-native Bulk Synchronous Parallel Applications. Evan Wang, Yogesh D. Barve, Aniruddha Gokhale, Hongyang Sun. IEEE International Symposium on Real-Time Distributed Computing (ISORC), Nashville, TN, USA, 2023. [pdf]
- Dynamic Selective Protection of Sparse Iterative Solvers via ML Prediction of Soft Error Impacts. Zizhao Chen, Thomas Verrecchia, Hongyang Sun, Joshua D. Booth, Padma Raghavan. Workshop on Fault Tolerance for HPC at eXtreme Scales (FTXS), Denver, CO, USA, 2023. [pdf]
2022
- *Checkpointing Workflows à la Young/Daly Is Not Good Enough. Anne Benoit, Lucas Perotin, Yves Robert, Hongyang Sun. ACM Transactions on Parallel Computing, 9(4):14, 2022. [pdf] [wsm]
- *Resilient Scheduling of Moldable Parallel Jobs to Cope with Silent Errors. Anne Benoit, Valentin Le Fevre, Lucas Perotin, Padma Raghavan, Yves Robert, Hongyang Sun. IEEE Transactions on Computers, 71(7):1696-1710, 2022. [pdf] [wsm]
- *Online Scheduling of Moldable Task Graphs under Common Speed-Up Models. Anne Benoit, Lucas Perotin, Yves Robert, Hongyang Sun. International Conference on Parallel Processing (ICPP), Bordeaux, France, 2022. [pdf] (Best Paper Award)
- *Checkpointing à la Young/Daly: An Overview. Anne Benoit, Yishu Du, Thomas Hérault, Loris Marchal, Guillaume Pallez, Lucas Perotin, Yves Robert, Hongyang Sun, Frédéric Vivien. International Conference on Contemporary Computing (IC3), Noida, India, 2022. [pdf]
2021
- *Resilient Scheduling Heuristics for Rigid Parallel Jobs. Anne Benoit, Valentin Le Fevre, Padma Raghavan, Yves Robert, Hongyang Sun. International Journal of Networking and Computing, 11(1):2–26, 2021. [pdf]
- EXPPO: Execution Performance Profiling and Optimization For Co-Simulation-as-a-Service Platform. Yogesh Barve, Himanshu Neema, Zhuangwei Kang, Harsh Vardhan, Hongyang Sun, Aniruddha Gokhale. Journal of Systems Architecture, 118:102189, 2021. [pdf]
- Multi-Resource Scheduling of Moldable Parallel Jobs with Precedence Constraints. Lucas Perotin, Hongyang Sun, Padma Raghavan. International Conference on Parallel Processing (ICPP), Chicago, IL, USA, 2021. [pdf]
- A Self-Adaptive Load Balancing Approach for Software-Defined Networks in IoT. Ziran Min, Hongyang Sun, Shunxing Bao, Aniruddha S. Gokhale, Swapna S. Gokhale. IEEE International Conference on Autonomic Computing and Self-Organizing Systems (ACSOS), Washington DC, USA, 2021. [pdf]
2020
- Increase in Inter-Network Functional Connectivity in the Human Brain with Attention Capture. Hongyang Sun, Qiuhai Yue, Jocelyn L. Sy, Douglass Godwin, Hana P. Eaton, Padma Raghavan, Rene Marois. Journal of Neurophysiology, 124(6):1885–1899, 2020. [link]
- URMILA: Dynamically Trading-Off Fog and Edge Resources for Performance and Mobility-Aware IoT Services. Shashank Shekhar, Ajay Chhokra, Hongyang Sun, Aniruddha Gokhale, Abhishek Dubey, Xenofon Koutsoukos, Gabor Karsai. Journal of Systems Architecture, 107:101710, 2020. [pdf]
- MILP Formulations for Spatio-Temporal Thermal-Aware Scheduling in Cloud and HPC Datacenters. Jean-Marc Pierson, Patricia Stolf, Hongyang Sun, Henri Casanova. Cluster Computing, 23(2): 421–439, 2020. [pdf]
- Selective Protection for Sparse Iterative Solvers to Reduce the Resilience Overhead. Hongyang Sun, Ana Gainaru, Manu Shantharam, Padma Raghavan. IEEE International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Porto, Portugal, 2020. [pdf]
- *Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms. Anne Benoit, Valentin Le Fevre, Lucas Perotin, Padma Raghavan, Yves Robert, Hongyang Sun. IEEE Cluster Conference, Kobe, Japan, 2020. [pdf]
- EXPPO: Execution Performance Profiling and Optimization For Co-Simulation-as-a-Service Platform. Yogesh Barve, Himanshu Neema, Zhuangwei Kang, Hongyang Sun, Aniruddha Gokhale, Thomas Roth. International Symposium on Real-Time Distributed Computing (ISORC), Nashville, TN, USA, 2020. [pdf]
- Deep-Edge: An Efficient Framework for Deep Learning Model Update on Heterogeneous Edge. Anirban Bhattacharjee, Ajay Dev Chhokra, Hongyang Sun, Shashank Shekhar, Aniruddha Gokhale, Gabor Karsai, Abhishek Dubey. IEEE International Conference on Fog and Edge Computing (ICFEC), Melbourne, Australia, 2020. [pdf]
- Design and Comparison of Resilient Scheduling Heuristics for Parallel Jobs. Anne Benoit, Valentin Le Fevre, Padma Raghavan, Yves Robert, Hongyang Sun. Workshop on Advances in Parallel and Distributed Computational Models (APDCM), New Orleans, LA, USA, 2020. [pdf] (Best Paper Award)
- *Reservation and Checkpointing Strategies for Stochastic Jobs. Ana Gainaru, Brice Goglin, Valentin Honoré, Guillaume Pallez (Aupy), Padma Raghavan, Yves Robert, Hongyang Sun. IEEE International Parallel and Distributed Processing Symposium (IPDPS), New Orleans, LA, USA, 2020. [pdf]
2019
- On-the-fly Scheduling versus Reservation-based Scheduling for Unpredictable Workflows. Ana Gainaru, Hongyang Sun, Guillaume Aupy, Yuankai Huo, Bennett A. Landman, Padma Raghavan. International Journal of High Performance Computing Applications, 33(6):1140–1158, 2019. [pdf]
- Non-clairvoyant Scheduling with Conflicts for Unit-Size Jobs. Hongyang Sun. Information Processing Letters, 144:1–8, 2019. [pdf]
- Linearize, Predict and Place: Minimizing the Makespan for Edge-based Stream Processing of Directed Acyclic Graphs. Shweta Khare, Hongyang Sun, Julien Gascon-Samson, Kaiwen Zhang, Yogesh Barve, Aniruddha Gokhale, Xenofon Koutsoukos. ACM/IEEE Symposium on Edge Computing (SEC), Washington D.C., USA, 2019. [pdf]
- Speculative Scheduling for Stochastic HPC Applications. Ana Gainaru, Guillaume Pallez (Aupy), Hongyang Sun, Padma Raghavan. International Conference on Parallel Processing (ICPP), Kyoto, Japan, 2019. [pdf]
- BARISTA: Efficient and Scalable Serverless Serving System for Deep Learning Prediction Services. Anirban Bhattacharjee, Ajay Dev Chhokra, Zhuangwei Kang, Hongyang Sun, Aniruddha Gokhale. IEEE International Conference on Cloud Engineering (IC2E), Prague, Czech Republic, 2019. [pdf]
- FECBench: A Holistic Interference-aware Approach for Application Performance Modeling. Yogesh Barve, Shashank Shekhar, Ajay Chhokra, Shweta Khare, Anirban Bhattacharjee, Zhuangwei Kang, Hongyang Sun, Aniruddha Gokhale. IEEE International Conference on Cloud Engineering (IC2E), Prague, Czech Republic, 2019. [pdf] (Best Paper Award)
- *Reservation Strategies for Stochastic Jobs. Guillaume Aupy, Ana Gainaru, Valentin Honoré, Padma Raghavan, Yves Robert, Hongyang Sun. IEEE International Parallel and Distributed Processing Symposium (IPDPS), Rio de Janeiro, Brazil, 2019. [pdf]
- URMILA: A Performance and Mobility-Aware Fog and Edge Resource Management Framework. Shashank Shekhar, Ajay Chhokra, Hongyang Sun, Aniruddha Gokhale, Abhishek Dubey, Xenofon Koutsoukos. IEEE International Symposium on Real-Time Distributed Computing (ISORC), Valencia, Spain, 2019. [pdf]
- Supporting Fog/Edge-based Cognitive Assistance IoT Services for the Visually Impaired. Shashank Shekhar, Ajay Chhokra, Hongyang Sun, Aniruddha Gokhale, Abhishek Dubey, Xenofon Koutsoukos. ACM/IEEE International Conference on Internet of Things Design and Implementation (IoTDI), Montreal, Canada, 2019. [pdf]
2018
- *Coping with Silent and Fail-Stop Errors at Scale by Combining Replication and Checkpointing. Anne Benoit, Aurélien Cavelan, Franck Cappello, Padma Raghavan, Yves Robert, Hongyang Sun. Journal of Parallel and Distributed Computing, 122:209-225, 2018. [pdf]
- *Multi-Level Checkpointing and Silent Error Detection for Linear Workflows. Anne Benoit, Aurélien Cavelan, Yves Robert, Hongyang Sun. Journal of Computational Science, 28:398–415, 2018. [pdf]
- Technology Enablers for Big Data Multi-Stage Analysis in Medical Image Processing. Shunxing Bao, Prasanna Parvathaneni, Yuankai Huo, Yogesh Barve, Andrew J. Plassard, Yuang Yao, Hongyang Sun, Ilwoo Lyu, David H. Zald, Bennett A. Landman, Aniruddha Gokhale. IEEE International Conference on Big Data, Seattle, WA, USA, 2018. [pdf]
- Scalable Edge Computing for Low Latency Data Dissemination in Topic-based Publish/Subscribe. Shweta Khare, Hongyang Sun, Kaiwen Zhang, Julien Gascon-Samson, Aniruddha Gokhale, Xenofon Koutsoukos, Hamzah Abdelaziz. ACM/IEEE Symposium on Edge Computing (SEC), Bellevue, WA, USA, 2018. [pdf]
- A Scalability and Sensitivity Study of Parallel Geometric Algorithms for Graph Partitioning. Shad Kirmani, Hongyang Sun, Padma Raghavan. Workshop on Applications for Multi-Core Architectures (WAMCA), Lyon, France, 2018. [pdf]
- Scheduling Parallel Tasks under Multiple Resources: List Scheduling vs. Pack Scheduling. Hongyang Sun, Redouane Elghazi, Ana Gainaru, Guillaume Aupy, Padma Raghavan. IEEE International Parallel and Distributed Processing Symposium (IPDPS), Vancouver, Canada, 2018. [pdf]
- Convergent Functional Network Connectivity Changes in Attention Capture and Awareness. Jocelyn L. Sy, Hongyang Sun, Douglass Godwin, Hana P. Eaton, Padma Raghavan, Rene Marois. Society for Neuroscience Annual Meeting (Neuroscience 2018), San Diego, CA, USA, 2018. [pdf]
- Ensuring Low-Latency and Scalable Data Dissemination for Smart-City Applications. Shweta Khare, Hongyang Sun, Kaiwen Zhang, Julien Gascon-Samson, Aniruddha Gokhale, Xenofon Koutsoukos. ACM/IEEE International Conference on Internet of Things Design and Implementation (IoTDI), Orlando, FL, USA, 2018. [pdf]
2017
- *Coping with Silent Errors in HPC Applications. Guillaume Aupy, Anne Benoit, Aurélien Cavelan, Massimiliano Fasi, Yves Robert, Hongyang Sun, Bora Uçar. In: Adamatzky, A. (ed) Emergent Computation: A Festschrift for Selim G. Akl. 2017. [pdf]
- *Towards Optimal Multi-Level Checkpointing. Anne Benoit, Aurélien Cavelan, Valentin Le Fèvre, Yves Robert, Hongyang Sun. IEEE Transactions on Computers, 66(7): 1212-1226, 2017. [pdf]
- Spatio-Temporal Thermal-Aware Scheduling for Homogeneous High-Performance Computing Datacenters. Hongyang Sun, Patricia Stolf, Jean-Marc Pierson. Future Generation Computer Systems, 71:157-170, 2017. [pdf]
- *Identifying the Right Replication Level to Detect and Correct Silent Errors at Scale. Anne Benoit, Aurélien Cavelan, Franck Cappello, Padma Raghavan, Yves Robert, Hongyang Sun. Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS), Washington D.C., USA, 2017. [pdf]
- When Good Enough Is Better: Energy-Aware Scheduling for Multicore Servers. Xinning Hui, Zhihui Du, Jason Liu, Hongyang Sun, Yuxiong He, David A. Bader. Workshop on High-Performance, Power-Aware Computing (HPPAC), Orlando, FL, USA, 2017. [pdf]
2016
- *Coping with Recall and Precision of Soft Error Detectors. Leonardo Bautista-Gomez, Anne Benoit, Aurélien Cavelan, Saurabh K. Raina, Yves Robert, Hongyang Sun. Journal of Parallel and Distributed Computing, 98:8-24, 2016. [pdf]
- *Assessing General-Purpose Algorithms to Cope with Fail-Stop and Silent Errors. Anne Benoit, Aurélien Cavelan, Yves Robert, Hongyang Sun. ACM Transactions on Parallel Computing, 3(2):13, 2016. [pdf]
- *When Amdahl Meets Young/Daly. Aurélien Cavelan, Jiafan Li, Yves Robert, Hongyang Sun. IEEE International Conference on Cluster Computing (CLUSTER), Taipei, Taiwan, 2016. [pdf]
- *A Different Re-Execution Speed Can Help. Anne Benoit, Aurélien Cavelan, Valentin Le Fèvre, Yves Robert, Hongyang Sun. International Workshop on Power-aware Algorithms, Systems, and Architectures (PASA), Philadelphia, USA, 2016. [pdf]
- *Optimal Resilience Patterns to Cope with Fail-Stop and Silent Errors. Anne Benoit, Aurélien Cavelan, Yves Robert, Hongyang Sun. IEEE International Parallel and Distributed Processing Symposium (IPDPS), Chicago, USA, 2016. [pdf]
- *Two-Level Checkpointing and Verifications for Linear Task Graphs. Anne Benoit, Aurélien Cavelan, Yves Robert, Hongyang Sun. IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC), Chicago, USA, 2016. [pdf]
2015
- *Energy-Efficient, Thermal-Aware Modeling and Simulation of Data Centers: The CoolEmAll Approach and Evaluation Results. Leandro Cupertino, Georges Da Costa, Ariel Oleksiak, Wojciech Piatek, Jean-Marc Pierson, Jaume Salom, Laura Siso, Patricia Stolf, Hongyang Sun, Thomas Zilio. Ad Hoc Networks, 25:535-553, 2015. [pdf]
- *Which Verification for Soft Error Detection? Leonardo Bautista-Gomez, Anne Benoit, Aurélien Cavelan, Saurabh K. Raina, Yves Robert, Hongyang Sun. IEEE International Conference on High Performance Computing (HiPC), Bangalore, India, 2015. [pdf]
- *Scheduling Independent Tasks with Voltage Overscaling. Aurélien Cavelan, Yves Robert, Hongyang Sun, Frédéric Vivien. IEEE Pacific Rim International Symposium on Dependable Computing (PRDC), Zhangjiajie, China, 2015. [pdf]
- *Assessing the Impact of Partial Verifications Against Silent Data Corruptions. Aurélien Cavelan, Saurabh K. Raina, Yves Robert, Hongyang Sun. International Conference on Parallel Processing (ICPP), Beijing, China, 2015. [pdf]
- *Voltage Overscaling Algorithms for Energy-Efficient Workflow Computations With Timing Errors. Aurélien Cavelan, Yves Robert, Hongyang Sun, Frédéric Vivien. Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS), Portland, USA, 2015. [pdf]
2014
- Energy-Efficient and Thermal-Aware Resource Management for Heterogeneous Datacenters. Hongyang Sun, Patricia Stolf, Jean-Marc Pierson, Georges Da Costa. Sustainable Computing: Informatics and Systems, 4(4):292-306, 2014. [pdf]
- Energy-Efficient Multiprocessor Scheduling for Flow Time and Makespan. Hongyang Sun, Yuxiong He, Wen-Jing Hsu, Rui Fan. Theoretical Computer Science, 550:1-20, 2014. [pdf]
- Competitive Online Adaptive Scheduling for Sets of Parallel Jobs with Fairness and Efficiency. Hongyang Sun, Wen-Jing Hsu, Yangjie Cao. Journal of Parallel and Distributed Computing, 74(3):2180-2192, 2014. [pdf]
- Scalable Hierarchical Scheduling for Malleable Parallel Jobs on Multiprocessor-Based Systems. Yangjie Cao, Hongyang Sun, Depei Qian, Weiguo Wu. Computer Systems: Science & Engineering, 29(2):169-181, 2014. [pdf]
- *Assessing General-Purpose Algorithms to Cope with Fail-Stop and Silent Errors. Anne Benoit, Aurélien Cavelan, Yves Robert, Hongyang Sun. International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), New Orleans, USA, 2014. [pdf]
- Multi-objective Scheduling for Heterogeneous Server Systems with Machine Placement. Hongyang Sun, Patricia Stolf, Jean-Marc Pierson, Georges Da Costa. IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Chicago, USA, 2014. [pdf]
2013 and Before
- Improved Semi-Online Makespan Scheduling with a Reordering Buffer. Hongyang Sun, Rui Fan. Information Processing Letters, 113(12):434-439, 2013. [pdf]
- Stable Adaptive Work-Stealing for Concurrent Many-core Runtime Systems. Yangjie Cao, Hongyang Sun, Depei Qian, Weiguo Wu. IEICE Transactions on Information and Systems, 95-D(5):1407-1416, 2012. [pdf]
- Efficient Adaptive Scheduling of Multiprocessors with Stable Parallelism Feedback. Hongyang Sun, Yangjie Cao, Wen-Jing Hsu. IEEE Transactions on Parallel and Distributed Systems, 22(4):594-607, 2011. [pdf]
- Improved Results for Scheduling Batched Parallel Jobs by Using a Generalized Analysis Framework. Yuxiong He, Hongyang Sun, Wen-Jing Hsu. Journal of Parallel and Distributed Computing, 70(2):173-182, 2010. [pdf]
- Energy-Efficient Scheduling for Best-Effort Interactive Services to Achieve High Response Quality. Zhihui Du, Hongyang Sun, Yuxiong He, Yu He, David Bader, Huazhe Zhang. IEEE International Parallel and Distributed Processing Symposium (IPDPS), Boston, USA, 2013. [pdf]
- Fair and Efficient Online Adaptive Scheduling for Multiple Sets of Parallel Applications. Hongyang Sun, Yangjie Cao, Wen-Jing Hsu. IEEE International Conference on Parallel and Distributed Systems (ICPADS), Tainan, Taiwan, 2011. [pdf]
- Tians Scheduling: Using Partial Processing in Best-Effort Applications. Yuxiong He, Sameh Elnikety, Hongyang Sun. International Conference on Distributed Computing Systems (ICDCS), Minneapolis, USA, 2011. [pdf]
- Scheduling Functional Heterogeneous Systems with Utilization Balancing. Yuxiong He, Jie Liu, Hongyang Sun. IEEE International Parallel and Distributed Processing Symposium (IPDPS), Anchorage, USA, 2011. [pdf]
- Stable Adaptive Work-Stealing for Concurrent Multi-core Runtime Systems. Yangjie Cao, Hongyang Sun, Depei Qian, Weiguo Wu. IEEE International Conference on High Performance Computing and Communications (HPCC), Banff, Canada, 2011. [pdf]
- Speed Scaling for Energy and Performance with Instantaneous Parallelism. Hongyang Sun, Yuxiong He, Wen-Jing Hsu. International ICST Conference on Theory and Practice of Algorithms in Computer Systems (TAPAS), Rome, Italy, 2011. [pdf]
- Scalable Hierarchical Scheduling for Multiprocessor Systems Using Adaptive Feedback-Driven Policies. Yangjie Cao, Hongyang Sun, Depei Qian, Weiguo Wu. IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA), Taipei, Taiwan, 2010. [pdf]
- Malleable-Lab: A Tool for Evaluating Adaptive Online Schedulers on Malleable Jobs. Yangjie Cao, Hongyang Sun, Wen-Jing Hsu, Depei Qian. Euromicro International Conference on Parallel, Distributed and Network-Based Computing (PDP), Pisa, Italy, 2010. [pdf]
- Competitive Two-Level Adaptive Scheduling using Resource Augmentation. Hongyang Sun, Yangjie Cao, Wen-Jing Hsu. Workshop on Job Scheduling Strategy for Parallel Processing (JSSPP), Rome, Italy, 2009. [pdf]
- Non-clairvoyant Speed Scaling for Batched Parallel Jobs on Multiprocessors. Hongyang Sun, Yangjie Cao, Wen-Jing Hsu. ACM International Conference on Computing Frontiers, Ischia, Italy, 2009. [pdf]
- Adaptive B-Greedy (ABG): A Simple yet Efficient Scheduling Algorithm. Hongyang Sun, Wen-Jing Hsu. IEEE International Symposium on Parallel and Distributed Processing (IPDPS), Miami, USA, 2008. [pdf]
- Adaptive Scheduling of Parallel Jobs on Functionally Heterogeneous Resources. Yuxiong He, Hongyang Sun, Wen-Jing Hsu. International Conference on Parallel Processing (ICPP), Xi'an, China, 2007. [pdf]
2024