PDF
Abstract
Traditional hand-crafted features for representing local image patches are evolving into current data-driven and learning-based image feature, but learning a robust and discriminative descriptor which is capable of controlling various patch-level computer vision tasks is still an open problem. In this work, we propose a novel deep convolutional neural network (CNN) to learn local feature descriptors. We utilize the quadruplets with positive and negative training samples, together with a constraint to restrict the intra-class variance, to learn good discriminative CNN representations. Compared with previous works, our model reduces the overlap in feature space between corresponding and non-corresponding patch pairs, and mitigates margin varying problem caused by commonly used triplet loss. We demonstrate that our method achieves better embedding result than some latest works, like PN-Net and TN-TG, on benchmark dataset.
Cite this article
Download citation ▾
Da-long Zhang, Lei Zhao, Duan-qing Xu, Dong-ming Lu.
Discriminatively learning for representing local image features with quadruplet model.
Optoelectronics Letters 462-465 DOI:10.1007/s11801-017-7198-z
| [1] |
MoltonN., DavisonA. J., ReidI.. Locally Planar Patch Features for Real-Time Structure from Motion, 2004, 1
|
| [2] |
SeitzS. M., CurlessB., DiebelJ., ScharsteinD., SzeliskiR.. A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms, 2006, 519
|
| [3] |
SzeliskiR.. Foundations and Trends in Computer Graphics and Vision, 2006, 2: 1
|
| [4] |
LoweD. G.. International Journal of Computer Vision, 2004, 60: 91
|
| [5] |
BayH., EssA., TuytelaarsT., Van GoolL.. Computer Vision and Image Understanding, 2008, 110: 346
|
| [6] |
Simo-SerraE., TrullsE., FerrazL., KokkinosI., FnaP., Moreno-NoguerF.. Discriminative Learning of Deep Convolutional Feature Point Descriptors, 2015, 118
|
| [7] |
ZagoruykoS., KomodakisN.. Learning to Compare Image Patches via Convolutional Neural Networks, 2015, 4353
|
| [8] |
BalntasV., JohnsE., TangL., MikolajczykK.. PN-Net: Conjoined Triple Deep Network for Learning Local Image Descriptors, 2016,
|
| [9] |
KumarB. G. V., CarneiroG., ReidI.. Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by minimising global loss Functions, 2016, 5385
|
| [10] |
JiaY., ShelhamerE., DonahueJ., KarayevS., LongJ., GirshickR., GuadarramaS., DarrellT.. Caffe: Convolutional Architecture for Fast Feature Embedding, 2014,
|
| [11] |
BrownM., HuaG., WinderS.. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33: 43
|
| [12] |
SimonyanK., VedaldiA., ZissermanA.. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36: 1573
|
Just Accepted
This article has successfully passed peer review and final editorial review, and will soon enter typesetting, proofreading and other publishing processes. The currently displayed version is the accepted final manuscript. The officially published version will be updated with format, DOI and citation information upon launch. We recommend that you pay attention to subsequent journal notifications and preferentially cite the officially published version. Thank you for your support and cooperation.