Cloud-Edge Inference under Communication Constraints: Data Quantization and Early Exit
编号:15 访问权限:仅限参会人 更新:2022-10-11 14:39:20 浏览:83次 口头报告

报告开始:2022年10月20日 10:15(Asia/Shanghai)

报告时间:15min

所在会场:[RS] Regular Session [RS2] RS2: Resource Allocation in Wireless Networks

视频 无权播放 演示文件

提示:该报告下的文件权限为仅限参会人,您尚未登录,暂时无法查看。

摘要
The inference delay of deep neural networks (DNN) cannot always fulfill the application requirements due to the data transmission to the cloud, which can be effectively alleviated by cloud-edge collaboration via DNN partitioning. However, the communication capability between cloud and edge is usually limited. In this paper, we propose a threshold-based data quantization and exit (TDQE) framework, where the classification thresholds divide the data to different parts and determine to either quantize the data for transmitting under the communication constraints or early exit the DNN at the partition point. To solve the optimal solutions of the thresholds, we model an accuracy optimization problem under communication constraints, and solve it through the linear programming. In order to reduce the impact of quantization on accuracy due to difference parts of data, we further adjust the quantization ranges for each part of data to refine the quantization performance. Based on the optimization results, TDQE algorithm is proposed to construct the DNN partitioning with classified data processing. Finally, to evaluate the proposed method, we compare it with two traditional DNN partitioning algorithms via the simulation results. The results show that the proposed algorithm outperforms the other two baselines with respect to the accuracy and meets the real-time requirements.
关键词
Cloud, fog, and edge network architectures
报告人
Gao Yu
Zhejiang University

发表评论
验证码 看不清楚,更换一张
全部评论
重要日期
  • 会议日期

    10月19日

    2022

    10月22日

    2022

主办单位
Zhejiang University
承办单位
Zhejiang University
联系方式
移动端
在手机上打开
小程序
打开微信小程序
客服
扫码或点此咨询