[paper note] Human-In-The-Loop Person Re-Identification

来源:互联网 发布:马蓉出轨内裤淘宝 编辑:程序博客网 时间:2024/05/07 17:17
  • ECCV 2016
  • Paper
  • Poster
  • Author: Hanxiao Wang, Shaogang Gong, Xiatian Zhu, Tao (Tony) Xiang

Professor Shaogang Gong from Queen Mary, University of London, who works closely with Dr. Tony Xiang, is an expert in person re-id field. He wrote a book naming Person Re-Identification. Since this book has been published, supervised learning method with CNN feature extractor has gradually dominate this field (person re-id). However, prof. Gong and his group are seeking for novel ways to resolve re-id problem. They have two papers about person re-id in ECCV 2016, together with Person Re-identification by Unsupervised L1 Graph Learning, both do not follow the supervised learning scheme.

model pipe line

Highlight

  • Propose Human Verification Incremental Learning (HVIL), an online learning approach for person re-id.
  • Do not need labeled data. Human participates in the training procedure, to give a pair of probe-gallery image a feedback as true, similar(but not true), dissimilar.
  • Small training set, large test set.
  • They show some other works attempting to relax the need of labeling, with semi-supervised, unsupervised and transfer learning approches, in Related Work part.

Modeling human feedback as a loss function

  • Incrementally optimised ranking function
    err(fxp(xg),y)=y(rank(fxp(xg)))

    where fxp(xg) is the distanse of pair {xp,xg}, which is defined as negative Mahalanobis Distance. y denotes the feedback, that is, y {true-match, strong-negative, week-negative} ({m,s,w}). rank is just an int number denotes the rank of a gallery image.
  • Re-id ranking loss y is defined as
    y(k)or=i=1kαiify{m,w}=i=k+1ngαiify{s}

    with α1α20

Real-time Model Update for Instant Feedback Reward

  • Negative Mahalanobis Distance:
    fxp(xg)=[(xpxg)TM(xpxg)],MSd+

    Sd+ represents semi-definate matrix.
  • Knowledge cumulation by online learning
    Mt=argminMSd+ΔF(M,Mt1)+η(t)

    This equation with t indicate that the matrix M in M-distance is learned in stages (knowledge cumulation). (t) is the loss of human feed back in t stage. ΔF is Burg matrix divergence??
    ΔF(M,Mt1)=tr(MM1t1)logdet(MM1t1)

Metric ensemble learning

  • When no human feedback is avilable.
  • Idea: re-using pairs already verified by human
    fensij=fensxpi(xgj)=dTijWdij
  • Ideal ranking: fij=0 for ci=cj and fij=1 for cicj.

Experiment

  • Settings
    • For human feedback, 300 people/image probe; 1000 people/image gallery. Return top-50 in the rank list for feedback.
    • Max 3 rounds for each probe, result in 300-900 indicative verification.
  • Claim that suer input will be 10-fold less.
  • Better than other human-in-the-loop methods. Less feedback and search time.
  • 56.1% on CUHK-03, 78% on Market-1501.
  • Evaluate automated person re-id
    • 168 pairs on CUHK-03, 234 pairs on Market-1501; supervised model trained with 300 ground truth data for comparison.
    • Also compared with unensembled matrix after τ (Mτ) and average matrix M for all time 1τ (Mavg)
    • Ensembled performs best.
0 0
原创粉丝点击