IEEE Transactions on Parallel and Distributed Systems, Volume 35
Volume 35, Number 1, January 2024
- Qiufen Xia, Zhiwei Jiao, Zichuan Xu:
Online Learning Algorithms for Context-Aware Video Caching in D2D Edge Networks. 1-19 - Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian:
Adaptive Auto-Tuning Framework for Global Exploration of Stencil Optimization on GPUs. 20-33 - Qixiang Chen, Zhijun Chen, Kai Zhang, X. Sean Wang:
CLIC: An Extensible and Efficient Cross-Platform Data Analytics System. 34-45 - Yuan Li, Ahmed Louri, Avinash Karanth:
A High-Performance and Energy-Efficient Photonic Architecture for Multi-DNN Acceleration. 46-58 - Fangming Liu, Yipei Niu:
Demystifying the Cost of Serverless Computing: Towards a Win-Win Deal. 59-72 - Isam Mashhour Al Jawarneh, Paolo Bellavista, Antonio Corradi, Luca Foschini, Rebecca Montanari:
SpatialSSJP: QoS-Aware Adaptive Approximate Stream-Static Spatial Join Processor. 73-88 - Zhe Jiang, Kecheng Yang, Nathan Fisher, Nan Guan, Neil C. Audsley, Zheng Dong:
Hopscotch: A Hardware-Software Co-Design for Efficient Cache Resizing on Multi-Core SoCs. 89-104 - Zichuan Xu, Guangyuan Xu, Hao Wang, Weifa Liang, Qiufen Xia, Shangguang Wang:
Enabling Streaming Analytics in Satellite Edge Computing via Timely Evaluation of Big Data Queries. 105-122 - Yunqi Gao, Bing Hu, Mahdi Boloursaz Mashhadi, A-Long Jin, Pei Xiao, Chunming Wu:
US-Byte: An Efficient Communication Framework for Scheduling Unequal-Sized Tensor Blocks in Distributed Deep Learning. 123-139 - Changlong Li, Yu Liang, Liang Shi, Chao Wang, Chun Jason Xue, Xuehai Zhou:
Flexible and Efficient Memory Swapping Across Mobile Devices With LegoSwap. 140-153 - Qiliang Li, Liangliang Xu, Yongkun Li, Min Lyu, Wei Wang, Pengfei Zuo, Yinlong Xu:
Enabling Efficient Erasure Coding in Disaggregated Memory Systems. 154-168 - Tiangang Li, Shi Ying, Yishi Zhao, Jianga Shang:
Batch Jobs Load Balancing Scheduling in Cloud Computing Using Distributional Reinforcement Learning. 169-185 - Yaozheng Fang, Zhiyuan Zhou, Surong Dai, Jinni Yang, Hui Zhang, Ye Lu:
PaVM: A Parallel Virtual Machine for Smart Contract Execution and Validation. 186-202
Volume 35, Number 2, February 2024
- Ajay Singh, Trevor Alexander Brown, Ali José Mashtizadeh:
Simple, Fast and Widely Applicable Concurrent Memory Reclamation via Neutralization. 203-220 - Zhiyuan Wang, Hongli Xu, Yang Xu, Zhida Jiang, Jianchun Liu, Suo Chen:
FAST: Enhancing Federated Learning Through Adaptive Data Sampling and Local Training. 221-236 - Gang Zeng, Jianfeng Zhu, Yichi Zhang, Ganhui Chen, Zhenhai Yuan, Shaojun Wei, Leibo Liu:
A High-Performance Genomic Accelerator for Accurate Sequence-to-Graph Alignment Using Dynamic Programming Algorithm. 237-249 - Junyan Qian, Kunzhu Qiu, Hao Ding, Huimin Zhang, Zhongyi Zhai:
An Efficient Bottleneck Planes Exclusion Method for Reconfiguring 3D VLSI Arrays. 250-263 - Yong Dong, Yiqin Dai, Min Xie, Kai Lu, Ruibo Wang, Juan Chen, Mingtian Shao, Zheng Wang:
Faster and Scalable MPI Applications Launching. 264-279 - Jing Wu, Lin Wang, Qirui Jin, Fangming Liu:
Graft: Efficient Inference Serving for Hybrid Deep Learning With SLO Guarantees via DNN Re-Alignment. 280-296 - Ning Li, Jianmei Guo, Bo Huang, Yuyang Li, Yilei Zhang, Chengdong Li, Wenxin Huang:
TCSA: Efficient Localization of Busy-Wait Synchronization Bugs for Latency-Critical Applications. 297-309 - Hao-Rui Chen, Lei Yang, Xinglin Zhang, Jiaxing Shen, Jiannong Cao:
Distributed Semi-Supervised Learning With Consensus Consistency on Edge Devices. 310-323 - Zhao Liu, Xuesen Chu, Xiaojing Lv, Hongsong Meng, Hanyue Liu, Guanghui Zhu, Haohuan Fu, Guangwen Yang:
SunwayLB: Enabling Extreme-Scale Lattice Boltzmann Method Based Computing Fluid Dynamics Simulations on Advanced Heterogeneous Supercomputers. 324-337 - Jiaqi Yang, Hao Zheng, Ahmed Louri:
Versa-DNN: A Versatile Architecture Enabling High-Performance and Energy-Efficient Multi-DNN Acceleration. 349-361 - Dongyu Zheng, Lei Liu, Guoming Tang, Yi Wang, Weichao Li:
Power Demand Reshaping Using Energy Storage for Distributed Edge Clouds. 362-376
Volume 35, Number 3, March 2024
- Di Wu, Rehmat Ullah, Philip Rodgers, Peter Kilpatrick, Ivor T. A. Spence, Blesson Varghese:
EcoFed: Efficient Communication for DNN Partitioning-Based Federated Learning. 377-390 - Xing Chen, Shengxi Hu, Chujia Yu, Zheyi Chen, Geyong Min:
Real-Time Offloading for Dependent and Parallel Tasks in Cloud-Edge Environments Using Deep Reinforcement Learning. 391-404 - Linpeng Jia, Yanxiu Liu, Keyuan Wang, Yi Sun:
Estuary: A Low Cross-Shard Blockchain Sharding Protocol Based on State Splitting. 405-420 - Daoce Wang, Jesus Pulido, Pascal Grosset, Sian Jin, Jiannan Tian, Kai Zhao, James P. Ahrens, Dingwen Tao:
TAC+: Optimizing Error-Bounded Lossy Compression for 3D AMR Simulations. 421-438 - Weiling Yang, Jianbin Fang, Dezun Dong, Xing Su, Zheng Wang:
Optimizing Full-Spectrum Matrix Multiplications on ARMv8 Multi-Core CPUs. 439-454 - Guangjing Huang, Qiong Wu, Peng Sun, Qian Ma, Xu Chen:
Collaboration in Federated Learning With Differential Privacy: A Stackelberg Game Analysis. 455-469 - Fatemeh Elahi, Mahmood Fazlali, Hadi Tabatabaee Malazi, Mehdi Elahi:
Parallel Fractional Stochastic Gradient Descent With Adaptive Learning for Recommender Systems. 470-483 - Vinicius S. da Silva, Everton Camargo de Lima, Janaina Schwarzrock, Fábio D. Rossi, Marcelo Caggiani Luizelli, Antonio Carlos Schneider Beck, Arthur Francisco Lorenzon:
Synergistically Rebalancing the EDP of Container-Based Parallel Applications. 484-498 - Jialun Li, Jieqian Yao, Danyang Xiao, Diying Yang, Weigang Wu:
EvoGWP: Predicting Long-Term Changes in Cloud Workloads Using Deep Graph-Evolution Learning. 499-516
Volume 35, Number 4, April 2024
- Jian Yang, Jiantong Jiang, Zeyi Wen, Ajmal Mian:
Parallel and Distributed Bayesian Network Structure Learning. 517-530 - Jie Song, Peimeng Zhu, Yanfeng Zhang, Ge Yu:
CloudSimPer: Simulating Geo-Distributed Datacenters Powered by Renewable Energy Mix. 531-547 - Jie Xu, Yulong Ming, Zihan Wu, Cong Wang, Xiaohua Jia:
X-Shard: Optimistic Cross-Shard Transaction Processing for Sharding-Based Blockchains. 548-559 - Zhaojie Wen, Qiong Chen, Yipei Niu, Zhen Song, Quanfeng Deng, Fangming Liu:
Joint Optimization of Parallelism and Resource Configuration for Serverless Function Steps. 560-576 - Dongsheng Li, Shengwei Li, Zhiquan Lai, Yongquan Fu, Xiangyu Ye, Lei Cai, Linbo Qiao:
A Memory-Efficient Hybrid Parallel Framework for Deep Neural Network Training. 577-591 - Meilin Yang, Jian Xu, Wenbo Ding, Yang Liu:
FedHAP: Federated Hashing With Global Prototypes for Cross-Silo Retrieval. 592-603 - Amanda Jayanetti, Saman K. Halgamuge, Rajkumar Buyya:
Multi-Agent Deep Reinforcement Learning Framework for Renewable Energy-Aware Workflow Scheduling on Distributed Cloud Data Centers. 604-615 - Jianyuan Lu, Tian Pan, Shan He, Mao Miao, Guangzhe Zhou, Yining Qi, Shize Zhang, Enge Song, Xiaoqing Sun, Huaiyi Zhao, Biao Lyu, Shunmin Zhu:
CloudSentry: Two-Stage Heavy Hitter Detection for Cloud-Scale Gateway Overload Protection. 616-633 - Subhadeep Karan, Zainul Abideen Sayed, Jaroslaw Zola:
End-to-End Bayesian Networks Exact Learning in Shared Memory. 634-645 - Ke Cheng, Sheng Zhang, Meizhao Liu, Yingcheng Gu, Liu Wei, Huanyu Cheng, Kai Liu, Yu Song, Xiaohang Shi, Andong Zhu, Lei Tang:
GeoScale: Microservice Autoscaling With Cost Budget in Geo-Distributed Edge Clouds. 646-662 - Zhe Wang, Jia Hu, Geyong Min, Zhiwei Zhao, Zi Wang:
Agile Cache Replacement in Edge Computing via Offline-Online Deep Reinforcement Learning. 663-674 - Wai-Kong Lee, Raymond K. Zhao, Ron Steinfeld, Amin Sakzad, Seong Oun Hwang:
High Throughput Lattice-Based Signatures on GPUs: Comparing Falcon and Mitaka. 675-692 - Burak Aksar, Efe Sencan, Benjamin Schwaller, Omar Aaziz, Vitus J. Leung, Jim M. Brandt, Brian Kulis, Manuel Egele, Ayse K. Coskun:
Runtime Performance Anomaly Diagnosis in Production HPC Systems Using Active Learning. 693-706
Volume 35, Number 5, May 2024
- Yi-Chien Lin, Bingyi Zhang, Viktor K. Prasanna:
HitGNN: High-Throughput GNN Training Framework on CPU+Multi-FPGA Heterogeneous Platform. 707-719 - Tianyu Zeng, Xiaoxi Zhang, Jingpu Duan, Chao Yu, Chuan Wu, Xu Chen:
An Offline-Transfer-Online Framework for Cloud-Edge Collaborative Distributed Reinforcement Learning. 720-731 - Yuhang Liu, Xin Deng, Jiapeng Zhou, Mingyu Chen, Yungang Bao:
Suppressing the Interference Within a Datacenter: Theorems, Metric and Strategy. 732-750 - Enge Song, Tian Pan, Haoyu Song, Qiang Fu, Yingjiang Liu, Chenhao Jia, Chuanying Yuan, Minglan Gao, Jiao Zhang, Tao Huang, Yunjie Liu:
INT-Label: Lightweight In-Band Network-Wide Telemetry via Distributed Labeling. 751-767 - Fan Yuan, Xiaojian Yang, Shengguo Li, Dezun Dong, Chun Huang, Zheng Wang:
Optimizing Multi-Grid Preconditioned Conjugate Gradient Method on Multi-Cores. 768-779 - Yuanhong Zhang, Weizhan Zhang, Haipeng Du, Caixia Yan, Li Liu, Qinghua Zheng:
FHVAC: Feature-Level Hybrid Video Adaptive Configuration for Machine-Centric Live Streaming. 780-795 - Bowen Zhang, Shengan Zheng, Liangxu Nie, Zhenlin Qi, Hongyi Chen, Linpeng Huang, Hong Mei:
Revisiting PM-Based B+-Tree With Persistent CPU Cache. 796-813 - Anshuman Misra, Ajay D. Kshemkalyani:
Byzantine-Tolerant Causal Ordering for Unicasts, Multicasts, and Broadcasts. 814-828 - Dingding Li, Weijie Zhang, Mianxiong Dong, Kaoru Ota:
DMA-Assisted I/O for Persistent Memory. 829-843 - Runzhou Han, Mai Zheng, Suren Byna, Houjun Tang, Bin Dong, Dong Dai, Yong Chen, Dongkyun Kim, Joseph Hassoun, David Thorsley:
PROV-IO+: A Cross-Platform Provenance Framework for Scientific Data on HPC Systems. 844-861
Volume 35, Number 6, June 2024
- Jiamin Fan, Kui Wu, Guoming Tang, Yang Zhou, Shengqiang Huang:
Taking Advantage of the Mistakes: Rethinking Clustered Federated Learning for IoT Anomaly Detection. 707-721 - Xinyi Ji, Jiankuo Dong, Tonggui Deng, Pinchang Zhang, Jiafeng Hua, Fu Xiao:
HI-Kyber: A Novel High-Performance Implementation Scheme of Kyber Based on GPU. 722-736 - Chiranjeb Mondal, Sanjay V. Rajopadhye:
Taking RNA-RNA Interaction to Machine Peak. 737-749 - Siqi Wang, Tianyu Feng, Hailong Yang, Xin You, Bangduo Chen, Tongxuan Liu, Zhongzhi Luan, Depei Qian:
AtRec: Accelerating Recommendation Model Training on CPUs. 750-763 - Wei-Mei Chen, Hsin-Hung Tsai, Joon Fong Ling:
Parallel Computation of Dominance Scores for Multidimensional Datasets on GPUs. 764-776 - Zirui Liu, Yikai Zhao, Zhuochen Fan, Tong Yang, Xiaodong Li, Ruwen Zhang, Kaicheng Yang, Zihan Jiang, Zheng Zhong, Yi Huang, Cong Liu, Jing Hu, Gaogang Xie, Bin Cui:
BurstBalancer: Do Less, Better Balance for Large-Scale Data Center Traffic. 777-794 - Jiesong Liu, Feng Zhang, Lv Lu, Chang Qi, Xiaoguang Guo, Dong Deng, Guoliang Li, Huanchen Zhang, Jidong Zhai, Hechen Zhang, Yuxing Chen, Anqun Pan, Xiaoyong Du:
G-Learned Index: Enabling Efficient Learned Index on GPU. 795-812 - Amirhossein Taherpour, Xiaodong Wang:
HybridChain: Fast, Accurate, and Secure Transaction Processing With Distributed Learning. 813-827 - Pourya Soltani, Farid Ashtiani:
Analytical Modeling and Throughput Computation of Blockchain Sharding. 828-842 - Zheng Zhang, Yaqi Xia, Hulin Wang, Donglin Yang, Chuang Hu, Xiaobo Zhou, Dazhao Cheng:
MPMoE: Memory Efficient MoE for Pre-Trained Models With Adaptive Pipeline Parallelism. 843-856 - Cheng Wang, Kun Xie, Jiazheng Tian, Jigang Wen, Xiaocan Li, Gaogang Xie, Kenli Li:
HPETC: History Priority Enhanced Tensor Completion for Network Distance Measurement. 857-873 - Kaiyang Liu, Jingrong Wang, Zhiming Huang, Jianping Pan:
Sampling-Based Multi-Job Placement for Heterogeneous Deep Learning Clusters. 874-888 - Guoqing Xiao, Chuanghui Yin, Yuedan Chen, Mingxing Duan, Kenli Li:
Efficient Utilization of Multi-Threading Parallelism on Heterogeneous Systems for Sparse Tensor Contraction. 889-900 - Xin Du, Minglong Wang, Zhihui Lu, Qiang Duan, Yuhao Liu, Jianfeng Feng, Huarui Wang:
HRCM: A Hierarchical Regularizing Mechanism for Sparse and Imbalanced Communication in Whole Human Brain Simulations. 901-918 - Dazhao Cheng, Kai Yan, Xinquan Cai, Yili Gong, Chuang Hu:
SLO-Aware Function Placement for Serverless Workflows With Layer-Wise Memory Sharing. 919-936 - Chen Wang, Kathryn M. Mohror, Marc Snir:
Formal Definitions and Performance Comparison of Consistency Models for Parallel File Systems. 937-951 - Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Quyang Pan, Xuefeng Jiang, Bo Gao:
FedICT: Federated Multi-Task Distillation for Multi-Access Edge Computing. 952-966
Volume 35, Number 7, July 2024
- Runzhen Xue, Dengke Han, Mingyu Yan, Mo Zou, Xiaocheng Yang, Duo Wang, Wenming Li, Zhimin Tang, John Kim, Xiaochun Ye, Dongrui Fan:
HiHGNN: Accelerating HGNNs Through Parallelism and Data Reusability Exploitation. 1122-1138 - Bowen Zhang, Huaxi Gu, Grace Li Zhang, Yintang Yang, Ziteng Ma, Ulf Schlichtmann:
A 3D Hybrid Optical-Electrical NoC Using Novel Mapping Strategy Based DCNN Dataflow Acceleration. 1139-1154 - Chen Chen, Hong Xu, Wei Wang, Baochun Li, Bo Li, Li Chen, Gong Zhang:
Synchronize Only the Immature Parameters: Communication-Efficient Federated Learning By Freezing Parameters Adaptively. 1155-1173 - Xiaqing Li, Qi Guo, Guangyan Zhang, Siwei Ye, Guanhua He, Yiheng Yao, Rui Zhang, Yifan Hao, Zidong Du, Weimin Zheng:
FastTuning: Enabling Fast and Efficient Hyper-Parameter Tuning With Partitioning and Parallelism of Search Space. 1174-1188 - Linsi Lan, Junbo Wang, Zhi Li, Krishna Kant, Wanquan Liu:
FedREM: Guided Federated Learning in the Presence of Dynamic Device Unpredictability. 1189-1206 - Rahul Mishra, Hari Prabhat Gupta, Garvit Banga, Sajal K. Das:
Fed-RAC: Resource-Aware Clustering for Tackling Heterogeneity of Participants in Federated Learning. 1207-1220 - Yuyang Jin, Haojie Wang, Runxin Zhong, Chen Zhang, Xia Liao, Feng Zhang, Jidong Zhai:
Graph-Centric Performance Analysis for Large-Scale Parallel Applications. 1221-1238 - Yuzhen Zhao, Xiyu Liu:
Spiking Neural P Systems With Microglia. 1239-1250 - Liang Zhang, Wenli Zheng, Kuangyu Zheng, Hongzi Zhu, Chao Li, Minyi Guo:
Bayesian-Driven Automated Scaling in Stream Computing With Multiple QoS Targets. 1251-1267 - Lu Zhao, Fu Xiao, Bo Li, Jian Zhou, Xiaolong Xu, Yun Yang:
Availability-Aware Revenue-Effective Application Deployment in Multi-Access Edge Computing. 1268-1280 - Kai Zhang, Jiahui Hong, Zhengying He, Yinan Jing, X. Sean Wang:
AdaptChain: Adaptive Data Sharing and Synchronization for NFV Systems on Heterogeneous Architectures. 1281-1292 - Chilankamol Sunny, Satyajit Das, Kevin J. M. Martin, Philippe Coussy:
CREPE: Concurrent Reverse-Modulo-Scheduling and Placement for CGRAs. 1293-1306 - Daniela Loreti, Marcello Artioli, Anna Ciampolini:
Rollback-Free Recovery for a High Performance Dense Linear Solver With Reduced Memory Footprint. 1307-1319 - Sai Zhang, Li Tang, Yan-Jun Liu:
Adaptive Neural Control for a Network of Parabolic PDEs With Event-Triggered Mechanism. 1320-1330
Volume 35, Number 8, August 2024
- Jinfan Chen, Shigang Li, Ran Guo, Jinhui Yuan, Torsten Hoefler:
AutoDDL: Automatic Distributed Deep Learning With Near-Optimal Bandwidth Cost. 1331-1344 - Isra Mohamed Ali, Mohamed M. Abdallah:
On Off-Chaining Smart Contract Runtime Protection: A Queuing Model Approach. 1345-1359 - Yanxi Zhang, Muyu Mei, Dongqi Yan, Xu Zhang, Qinghai Yang, Mingwu Yao:
Age-of-Event Aware: Sampling Period Optimization in a Three-Stage Wireless Cyber-Physical System With Diverse Parallelisms. 1360-1372 - Yang Zhou, Fang Wang, Zhan Shi, Dan Feng:
The Static Allocation is Not a Static: Optimizing SSD Address Allocation Through Boosting Static Policy. 1373-1386 - Changmao Wu, Zhengwei Xu, Xiaoming He, Qi Lou, Yuanyuan Xia, Shuman Huang:
Proactive Caching With Distributed Deep Reinforcement Learning in 6G Cloud-Edge Collaboration Computing. 1387-1399 - Xiaofeng Hou, Xuehan Tang, Jiacheng Liu, Chao Li, Luhong Liang, Kwang-Ting Cheng:
WASP: Efficient Power Management Enabling Workload-Aware, Self-Powered AIoT Devices. 1400-1414 - Shengwei Li, Kai Lu, Zhiquan Lai, Weijie Liu, Keshi Ge, Dong Sheng Li:
A Multidimensional Communication Scheduling Method for Hybrid Parallel DNN Training. 1415-1428 - Jingwen Zhou, Feifei Chen, Guangming Cui, Yong Xiang, Qiang He:
FEUAGame: Fairness-Aware Edge User Allocation for App Vendors. 1429-1443 - Jiantong Jiang, Zeyi Wen, Atif Bin Mansoor, Ajmal Mian:
Faster-BNI: Fast Parallel Exact Inference on Bayesian Networks. 1444-1455 - Xinliang Wei, Kejiang Ye, Xinghua Shi, Cheng-Zhong Xu, Yu Wang:
Joint Participant and Learning Topology Selection for Federated Learning in Edge Clouds. 1456-1468 - Chengying Huan, Yongchao Liu, Heng Zhang, Hang Liu, Shiyang Chen, Shuaiwen Leon Song, Yanjun Wu:
TeGraph+: Scalable Temporal Graph Processing Enabling Flexible Edge Modifications. 1469-1487 - Liang Geng, Hao Wang, Jingsong Meng, Dayi Fan, Sami Ben-Romdhane, Hari Kadayam Pichumani, Vinay Phegade, Xiaodong Zhang:
RR-Compound: RDMA-Fused gRPC for Low Latency, High Throughput, and Easy Interface. 1488-1505 - Junxue Zhang, Xiaodian Cheng, Liu Yang, Jinbin Hu, Han Tian, Kai Chen:
High-Performance Hardware Acceleration Architecture for Cross-Silo Federated Learning. 1506-1523
Volume 35, Number 9, September 2024
- Yi-Wei Ci, Michael R. Lyu, Zhan Zhang, De-Cheng Zuo, Xiao-Zong Yang:
KLNK: Expanding Page Boundaries in a Distributed Shared Memory System. 1524-1535 - Sheng Qi, Chao Jin, Mosharaf Chowdhury, Zhenming Liu, Xuanzhe Liu, Xin Jin:
Pyxis: Scheduling Mixed Tasks in Disaggregated Datacenters. 1536-1550 - Ahmad Tarraf, Martin Schreiber, Alberto Cascajo, Jean-Baptiste Besnard, Marc-André Vef, Dominik Huber, Sonja Happ, André Brinkmann, David E. Singh, Hans-Christian Hoppe, Alberto Miranda, Antonio J. Peña, Rui Machado, Marta Garcia-Gasulla, Martin Schulz, Paul M. Carpenter, Simon Pickartz, Tiberiu Rotaru, Sergio Iserte, Víctor López, Jorge Ejarque, Heena Sirwani, Jesús Carretero, Felix Wolf:
Malleability in Modern HPC Systems: Current Experiences, Challenges, and Future Opportunities. 1551-1564 - Jiuchen Shi, Kaihua Fu, Jiawen Wang, Quan Chen, Deze Zeng, Minyi Guo:
Adaptive QoS-Aware Microservice Deployment With Excessive Loads via Intra- and Inter-Datacenter Scheduling. 1565-1582 - Dhruv Gajaria, Kevin Antony Gomez, Tosiron Adegbija:
STT-RAM-Based Hierarchical in-Memory Computing. 1615-1629 - Rong Cong, Zhiwei Zhao, Linyuanqi Zhang, Geyong Min:
Cost-Effective Server Deployment for Multi-Access Edge Networks: A Cooperative Scheme. 1583-1597 - Yifan Hua, Shengan Zheng, Weihan Kong, Cong Zhou, Kaixin Huang, Ruoyan Ma, Linpeng Huang:
RADAR: A Skew-Resistant and Hotness-Aware Ordered Index Design for Processing-in-Memory Systems. 1598-1614 - Ran Wang, Cheng Xu, Xiaotong Zhang:
Toward Materials Genome Big-Data: A Blockchain-Based Secure Storage and Efficient Retrieval Method. 1630-1643 - Yuchen Zhong, Guangming Sheng, Juncheng Liu, Jinhui Yuan, Chuan Wu:
Swift: Expedited Failure Recovery for Large-Scale DNN Training. 1644-1656 - Gabriele Mencagli, Patrizio Dazzi, Massimo Coppola:
Springald: GPU-Accelerated Window-Based Aggregates Over Out-of-Order Data Streams. 1657-1671 - Cunyang Wei, Haipeng Jia, Yunquan Zhang, Jianyu Yao, Chendi Li, Wenxuan Cao:
IrGEMM: An Input-Aware Tuning Framework for Irregular GEMM on ARM and X86 CPUs. 1672-1689
Volume 35, Number 10, October 2024
- Jiaxing Qi, Wencong Xiao, Mingzhen Li, Chaojie Yang, Yong Li, Wei Lin, Hailong Yang, Zhongzhi Luan, Depei Qian:
ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG. 1708-1720 - Rong Chen, Xingda Wei, Xiating Xie, Haibo Chen:
Locality-Preserving Graph Traversal With Split Live Migration. 1810-1825 - Mi Zhang, Qihan Kang, Patrick P. C. Lee:
FlexRaft: Exploiting Flexible Erasure Coding for Minimum-Cost Consensus and Fast Recovery. 1826-1840 - Peixuan Li, Ping Xie, Qiang Cao:
SSRAID: A Stripe-Queued and Stripe-Threaded Merging I/O Strategy to Improve Write Performance of Serial Interface SSD RAID. 1841-1853 - Fatemeh Keshavarz-Kohjerdi:
Paired Many-to-Many 2-Disjoint Path Covers in Meshes. 1854-1866 - Jiangfei Duan, Xiuhong Li, Ping Xu, Xingcheng Zhang, Shengen Yan, Yun Liang, Dahua Lin:
Proteus: Simulating the Performance of Distributed DNN Training. 1867-1878
Volume 35, Number 11, November 2024
- Yichen Li, Wenchao Xu, Yining Qi, Haozhao Wang, Ruixuan Li, Song Guo:
SR-FDIL: Synergistic Replay for Federated Domain-Incremental Learning. 1879-1890 - Yuezhi Che, Dazhao Cheng, Xiao Wang, Rujia Wang:
Opca: Enabling Optimistic Concurrent Access for Multiple Users in Oblivious Data Storage. 1891-1903 - Yaqi Xia, Zheng Zhang, Donglin Yang, Chuang Hu, Xiaobo Zhou, Hongyang Chen, Qianlong Sang, Dazhao Cheng:
Redundancy-Free and Load-Balanced TGNN Training With Hierarchical Pipeline Parallelism. 1904-1919 - Haoran Zhou, Wei Rang, Hongyang Chen, Xiaobo Zhou, Dazhao Cheng:
DeepTM: Efficient Tensor Management in Heterogeneous Memory for DNN Training. 1920-1935 - Sanjay Lall, Calin Cascaval, Martin Izzard, Tammo Spalink:
Logical Synchrony and the Bittide Mechanism. 1936-1948 - Peng Wang, Hong Jiang, Yu Liu, Zhelong Zhao, Ke Zhou, Zhihai Huang:
Beyond Belady to Attain a Seemingly Unattainable Byte Miss Ratio for Content Delivery Networks. 1949-1963 - Shiyu Shen, Hao Yang, Wangchen Dai, Hong Zhang, Zhe Liu, Yunlei Zhao:
High-Throughput GPU Implementation of Dilithium Post-Quantum Digital Signature. 1964-1976 - Hua Huang, Edmond Chow:
Exploring the Design Space of Distributed Parallel Sparse Matrix-Multiple Vector Multiplication. 1977-1988 - Zhaojie Wen, Qiong Chen, Quanfeng Deng, Yipei Niu, Zhen Song, Fangming Liu:
ComboFunc: Joint Resource Combination and Container Placement for Serverless Function Scaling With Heterogeneous Container. 1989-2005 - Huali Lu, Feng Lyu, Ju Ren, Huaqing Wu, Conghao Zhou, Zhongyuan Liu, Yaoxue Zhang, Xuemin Shen:
CODE+: Fast and Accurate Inference for Compact Distributed IoT Data Collection. 2006-2022 - Di Mou, Bo Wang, Dajiang Liu:
SC-CGRA: An Energy-Efficient CGRA Using Stochastic Computing. 2023-2038 - Yin Xu, Mingjun Xiao, Jie Wu, He Sun:
Privacy Preserving Task Push in Spatial Crowdsourcing With Unknown Popularity. 2039-2053 - Lan Zhang, Anran Li, Hongyi Peng, Feng Han, Fan Huang, Xiang-Yang Li:
Privacy-Preserving Data Selection for Horizontal and Vertical Federated Learning. 2054-2068 - Kai Chen, Qingjun Qu, Feng Zhu, Zhengming Yi, Wenjie Tang:
CPLNS: Cooperative Parallel Large Neighborhood Search for Large-Scale Multi-Agent Path Finding. 2069-2086 - Qiqi Duan, Chang Shao, Guochen Zhou, Minghan Zhang, Qi Zhao, Yuhui Shi:
Distributed Evolution Strategies With Multi-Level Learning for Large-Scale Black-Box Optimization. 2087-2101 - Ping Luo, Jieren Cheng, Neal Xiong, Zhenhao Liu, Jie Wu:
FedVeca: Federated Vectorized Averaging on Non-IID Data With Adaptive Bi-Directional Global Objective. 2102-2113 - Hui Dou, Yilun Wang, Yiwen Zhang, Pengfei Chen, Zibin Zheng:
DeepCAT+: A Low-Cost and Transferrable Online Configuration Auto-Tuning Approach for Big Data Frameworks. 2114-2131 - Biao Hou, Song Yang, Fan Li, Liehuang Zhu, Lei Jiao, Xu Chen, Xiaoming Fu:
Gamora: Learning-Based Buffer-Aware Preloading for Adaptive Short Video Streaming. 2132-2146 - Feng Yao, Qian Tao, Shengyuan Lin, Yanfeng Zhang, Wenyuan Yu, Shufeng Gong, Qiange Wang, Ge Yu, Jingren Zhou:
Towards Efficient Graph Processing in Geo-Distributed Data Centers. 2147-2160 - Darong Huang, Luis Costero, David Atienza:
An Evaluation Framework for Dynamic Thermal Management Strategies in 3D MultiProcessor System-on-Chip Co-Design. 2161-2176 - Rui Tian, Jiazhi Jiang, Jiangsu Du, Dan Huang, Yutong Lu:
Sophisticated Orchestrating Concurrent DLRM Training on CPU/GPU Platform. 2177-2192 - Donglei Wu, Weihao Yang, Xiangyu Zou, Hao Feng, Dingwen Tao, Shiyi Li, Wen Xia, Binxing Fang:
BIRD+: Design of a Lightweight Communication Compressor for Resource-Constrained Distribution Learning Platforms. 2193-2207 - Yuyang Jin, Runxin Zhong, Saiqin Long, Jidong Zhai:
Efficient Inference for Pruned CNN Models on Mobile Devices With Holistic Sparsity Alignment. 2208-2223 - Shouxi Luo, Renyi Wang, Ke Li, Huanlai Xing:
Efficient Cross-Cloud Partial Reduce With CREW. 2224-2238 - Renyou Xie, Chaojie Li, Xiaojun Zhou, Zhaoyang Dong:
Accelerating Communication-Efficient Federated Multi-Task Learning With Personalization and Fairness. 2239-2253 - Hanfei Yu, Hao Wang, Jian Li, Xu Yuan, Seung-Jong Park:
Freyr+: Harvesting Idle Resources in Serverless Computing via Deep Reinforcement Learning. 2254-2269 - Jiandong Liu, Lan Zhang, Fengxiang He, Chi Zhang, Shanyang Jiang, Xiang-Yang Li:
Communication-Efficient Regret-Optimal Distributed Online Convex Optimization. 2270-2283 - Renwen Ma, Kai Hwang, Mo Li, Yiming Miao:
Trusted Model Aggregation With Zero-Knowledge Proofs in Federated Learning. 2284-2296
Volume 35, Number 12, December 2024
- Quan Deng, Qiang Liu, Ming Yuan, Xiaohui Duan, Lin Gan, Jinzhe Yang, Wenlai Zhao, Zhenxiang Zhang, Guiming Wu, Wayne Luk, Haohuan Fu, Guangwen Yang:
Acceleration of Multi-Body Molecular Dynamics With Customized Parallel Dataflow. 2297-2314 - Liang Wang, Jinzhe Yang, Jidong Zhai, Guangwen Yang:
Optimizing I/O Performance Through Effective vCPU Scheduling Interference Management. 2315-2330 - Shuangwu Chen, Jiangming Li, Qifeng Yuan, Huasen He, Sen Li, Jian Yang:
Two-Timescale Joint Optimization of Task Scheduling and Resource Scaling in Multi-Data Center System Based on Multi-Agent Deep Reinforcement Learning. 2331-2346 - Francesco De Pellegrini, Vaibhav Kumar Gupta, Rachid El Azouzi, Serigne Gueye, Cédric Richier, Jeremie Leguay:
Fair Coflow Scheduling via Controlled Slowdown. 2347-2360 - Devki Nandan Jha, Yinhao Li, Zhenyu Wen, Graham Morgan, Prem Prakash Jayaraman, Maciej Koutny, Omer F. Rana, Rajiv Ranjan:
GeoDeploy: Geo-Distributed Application Deployment Using Benchmarking. 2361-2374 - Zhiqi Lin, Youshan Miao, Guanbin Xu, Cheng Li, Olli Saarikivi, Saeed Maleki, Fan Yang:
Efficient Schedule Construction for Distributed Execution of Large DNN Models. 2375-2391 - Qiushi Zheng, Jiong Jin, Zhishu Shen, Libing Wu, Iftekhar Ahmad, Yong Xiang:
Distributed Task Processing Platform for Infrastructure-Less IoT Networks: A Multi-Dimensional Optimization Approach. 2392-2404 - Bingyi Zhang, Rajgopal Kannan, Carl E. Busart, Viktor K. Prasanna:
VisionAGILE: A Versatile Domain-Specific Accelerator for Computer Vision Tasks. 2405-2422 - Jinyu Hu, Huizhang Luo, Hong Jiang, Guoqing Xiao, Kenli Li:
FastLoad: Speeding Up Data Loading of Both Sparse Matrix and Vector for SpMV on GPUs. 2423-2434 - Rong Hu, Haotian Wang, Wangdong Yang, Renqiu Ouyang, Keqin Li, Kenli Li:
BCB-SpTC: An Efficient Sparse High-Dimensional Tensor Contraction Employing Tensor Core Acceleration. 2435-2448 - Binghan Wu, Wei Bao, Bing Bing Zhou:
Competitive Analysis of Online Elastic Caching of Transient Data in Multi-Tiered Content Delivery Network. 2449-2462 - Zhenhua Guo, Yinan Tang, Jidong Zhai, Tongtong Yuan, Jian Jin, Li Wang, Yaqian Zhao, Rengang Li:
A Survey on Performance Modeling and Prediction for Distributed DNN Training. 2463-2478 - Hui Sun, Deyan Kong, Song Jiang, Yinliang Yue, Xiao Qin:
TrieKV: A High-Performance Key-Value Store Design With Memory as Its First-Class Citizen. 2479-2496 - Keyuan Wang, Linpeng Jia, Zhaoxiong Song, Yi Sun:
Mitosis: A Scalable Sharding System Featuring Multiple Dynamic Relay Chains. 2497-2512 - Chunlin Tian, Li Li, Kahou Tam, Yebo Wu, Cheng-Zhong Xu:
Breaking the Memory Wall for Heterogeneous Federated Learning via Model Splitting. 2513-2526 - Ruchi Bhoot, Suved Sanjay Ghanmode, Yogesh Simmhan:
TARIS: Scalable Incremental Processing of Time-Respecting Algorithms on Streaming Graphs. 2527-2544