麻豆精品无码av,欧美1区2区,久久中文字幕乱码人妻,亚洲欧美另类少妇精品,在线看黄射,69pao高清,九九九久久久国产精品,子操大逼1234区,九九爱99热精品

2
點贊
0
評論
0
轉(zhuǎn)載
收藏

Our paper got accepted by NIPS'16

Our paper "Double Thompson Sampling for Dueling Bandits" got accepted by NIPS'16, one of the top conferences in machine learning. 

In this paper, we propose a Double Thompson Sampling (D-TS) algorithm for dueling bandit problems. As indicated by its name, D-TS selects both the first and the second candidates according to Thompson Sampling. Specifically, D-TS maintains a posterior distribution for the preference matrix, and chooses the pair of arms for comparison by sampling twice from the posterior distribution. This simple algorithm applies to general Copeland dueling bandits, including Condorcet dueling bandits as its special case. For general Copeland dueling bandits, we show that D-TS achieves O(K^2 log T) regret. For Condorcet dueling bandits, we further simplify the D-TS algorithm and show that the simplified D-TS algorithm achieves O(Klog T + K^2 log log T) regret. Simulation results based on both synthetic and real-world data demonstrate the efficiency of the proposed D-TS algorithm.


A preliminary version can be found at https://arxiv.org/abs/1604.07101.

聲明:本內(nèi)容系學者網(wǎng)用戶個人學術動態(tài)分享,不代表平臺立場。

加州大學戴維斯分校 計算機系
近期熱門動態(tài)
Our paper got accepted by NIPS'15
1631 2015-09-09 05:06:23
#
809 2015-08-19 06:20:04
SCHOLAT.com 學者網(wǎng)
免責聲明 | 關于我們 | 聯(lián)系我們
聯(lián)系我們:
返回頂部
平凉市| 逊克县| 枣阳市| 明溪县| 嫩江县| 镇雄县| 得荣县| 泗洪县| 德令哈市| 都匀市| 东台市| 白玉县| 虎林市| 航空| 八宿县| 东海县| 澄迈县| 阳谷县| 四子王旗| 信阳市| 太湖县| 上思县| 富阳市| 鹿邑县| 永寿县| 安新县| 浦东新区| 凤冈县| 和平区| 田东县| 思南县| 蒲城县| 镇沅| 辽源市| 金湖县| 汝南县| 绵阳市| 隆子县| 湟源县| 永靖县| 沙田区|