科學(xué)研究

學(xué)術(shù)講座:Cost-aware Cascading Bandits

發(fā)布時(shí)間:2018-11-21

報(bào)告題目:Cost-aware Cascading Bandits

主講嘉賓:Cong Shen 教授 \r
中國(guó)科學(xué)技術(shù)大學(xué)

邀請(qǐng)人:全智教授

時(shí)間:2018年 11月 23日(周 \r
五)上午11:00

地點(diǎn):深圳大學(xué)南校區(qū)基礎(chǔ)實(shí)驗(yàn)樓北座信息工程學(xué)院N710會(huì)議室

報(bào)告摘要:

We will discuss a \r
cost-aware cascading bandits mode that is motivated by many practical \r
applications. This is a new variant of the multi-armed bandit model but \r
incorporating the random cost of pulling arms and cascading feedback. In each \r
step, the learning agent chooses an ordered list of items and examines them \r
sequentially, until certain stopping condition is satisfied. Our objective is \r
then to maximize the expected net reward in each step, i.e., the reward obtained \r
in each step minus the total cost incurred in examining the items, by deciding \r
the ordered list of items, as well as when to stop examination.

We study both the offline and online settings, depending on whether the state \r
and cost statistics of the items are known beforehand. For the offline setting, \r
we show that the Unit Cost Ranking with Threshold 1 (UCR-T1) policy is optimal. \r
For the online setting, we propose a Cost-aware Cascading Upper Confidence Bound \r
(CC-UCB) algorithm, and show that the cumulative regret scales in O(log T). We \r
also provide a lower bound for all α-consistent policies, which scales in Ω(log \r
T) and matches our upper bound. The performance of the CC-UCB algorithm is \r
evaluated with real-world datasets.Joint work with R. Zhou (University of \r
Science and Technology of China), C. Gan and J. Yang (Pennsylvania State \r
University)

嘉賓簡(jiǎn)介:

Cong Shen received his B.S. and M.S. degrees, in 2002 and 2004 \r
respectively, from the Department of Electronic Engineering, Tsinghua \r
University, China. He obtained the Ph.D. degree from the Electrical Engineering \r
Department, UCLA, in 2009. From 2009 to 2014, He worked for Qualcomm Research in \r
San Diego, CA. In 2015, he joined University of Science and Technology of China \r
(USTC) as Professor in the School of Information Science and Technology. His \r
research interests include machine learning, information theory, and wireless \r
communications. He currently serves as an editor for the IEEE Transactions on \r
Wireless Communications and an editor for the IEEE Wireless Communications

歡迎各位老師和同學(xué)參加。

最新動(dòng)態(tài)

玉树县| 花莲市| 获嘉县| 甘洛县| 腾冲县| 清远市| 清水县| 双鸭山市| 山东省| 包头市| 阿图什市| 弋阳县| 松桃| 子长县| 海丰县| 尉犁县| 分宜县| 即墨市| 秀山| 京山县| 体育| 阳西县| 合江县| 时尚| 商城县| 阿坝县| 交口县| 公安县| 紫云| 措勤县| 和平县| 永德县| 蒙城县| 星子县| 铁岭县| 定陶县| 富锦市| 宜昌市| 讷河市| 扶余县| 清水河县|