• 两分布间距离的度量:MMD、KL散度、Wasserstein 对比


    MMD:最大均值差异

    Wasserstein距离[1]

    实验

    数据来源

    Amazon review benchmark dataset. The Amazon review dataset is one of the most widely used benchmarks for domain adaptation and sentiment analysis. It is collected from product reviews from Amazon.com and contains four types (domains), namely books (B), DVDs (D), electronics (E) and kitchen appliances (K). For each domain, there are 2,000 labeled reviews and approximately 4,000 unlabeled reviews (varying slightly across domains) and the classes are balanced. In our experiments, for easy computation, we follow (Chen et al. 2012) to use the 5,000 most frequent terms of unigrams and bigrams as the input and totally A24 = 12 adaptation tasks are constructed.

    Office-Caltech object recognition dataset. The Office-Caltech dataset released by (Gong et al. 2012) is comprised of 10 common categories shared by the Office-31 and Caltech-256 datasets. In our experiments, we construct 12 tasks across 4 domains: Amazon (A), Webcam (W), DSLR (D) and Caltech (C), with 958, 295, 157 and 1,123 image samples respectively. In our experiments, Decaf features are used as the input. Decaf features (Donahue et al. 2014) are the 4096-dimensional FC7-layer hidden activations extracted by the deep convolutional neural network AlexNet.

    实验结果

    从上面结果可知,Wasserstein距离基本比MMD的效果好,但MMD也在个别情况下优于Wasserstein距离。

    参考文献

    [1] Shen J, Qu Y, Zhang W, et al. Wasserstein Distance Guided Representation Learning for Domain Adaptation[J]. AAAI-2017.

    转自:http://www.pianshen.com/article/5980165769/

  • 相关阅读:
    利用书签栏作插入时失败告终
    组以逗号分隔的子串及跨平update join
    ms_sql:drop and create a job
    why dicePlayer cannot player with defy mb526
    好像国庆三天是可以加班工资计了
    msssql 用numberic(38)替代int去解决int不够的问题
    C#的switch与二维数组.....
    某牛人所留的联系方式
    封装对象类
    数据库访问小列题
  • 原文地址:https://www.cnblogs.com/jiangkejie/p/10581073.html
Copyright © 2020-2023  润新知