PTF-SimCM: A Simple Contrastive Model with Polysemous Text Fusion for Visual Similarity Metric

Complexity 2022:1-14 (2022)

Abstract

Measuring image similarity, known in computer vision as metric learning, is a key step in many advanced image tasks. However, existing well-performing approaches to image similarity measurement focus only on the image itself and do not use information from other modalities, even though images often appear alongside descriptive text. Furthermore, those methods need human supervision, yet most real-world images are unlabeled. To address both problems, we present a novel visual similarity metric model named PTF-SimCM. It adopts a self-supervised contrastive structure like SimSiam and incorporates a multimodal fusion module to exploit the text associated with each image. To make late fusion more effective, we encode the text with a cross-modal model rather than a standard unimodal text encoder. In addition, the proposed model employs Sentence PIE-Net to handle the ambiguity caused by polysemous sentences. For simplicity and efficiency, our model learns an embedding space in which distances correspond directly to similarity. Experimental results on the MSCOCO, Flickr 30k, and Pascal Sentence datasets show that our model overall outperforms all the methods compared in this work, which indicates that it effectively addresses these issues and improves performance on unsupervised visual similarity measurement.
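
The abstract names three ingredients: a SimSiam-like contrastive objective, late fusion of image and text embeddings, and an embedding space where distance stands in for similarity. The following is a minimal sketch of those ideas in plain Python, not the authors' implementation. The function names, the choice of concatenation as the fusion step, and the use of Euclidean distance are all assumptions made for illustration; the abstract does not specify them. The stop-gradient that SimSiam applies to the target branch is only noted in a comment, since this sketch does no training.

```python
import math
from typing import List, Sequence


def l2_normalize(v: Sequence[float]) -> List[float]:
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def simsiam_style_loss(p1, z1, p2, z2) -> float:
    """Symmetric negative cosine similarity, as in SimSiam.

    p1, p2: predictor outputs for the two views.
    z1, z2: projector outputs for the two views; in real training these
    would be wrapped in a stop-gradient, which is omitted here.
    """
    return -0.5 * cosine_similarity(p1, z2) - 0.5 * cosine_similarity(p2, z1)


def late_fuse(image_emb: Sequence[float], text_emb: Sequence[float]) -> List[float]:
    """Hypothetical late fusion: concatenate the two embeddings, then normalize.

    Concatenation is an assumption; the abstract does not describe the
    fusion operator.
    """
    return l2_normalize(list(image_emb) + list(text_emb))


def embedding_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance in the embedding space (smaller means more similar)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
```

With identical inputs for both views, `simsiam_style_loss` reaches its minimum of approximately -1, and two items whose fused embeddings coincide have an `embedding_distance` of 0, which is the sense in which distance corresponds to similarity.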

Similar books and articles

Two Conceptions of Similarity. Ben Blumson - 2018 - Philosophical Quarterly 68 (270):21-37.
Bounds on Scott ranks of some Polish metric spaces. William Chan - 2020 - Journal of Mathematical Logic 21 (1):2150001.
Non-metric Propositional Similarity. A. C. Paseau - 2022 - Erkenntnis 87 (5):2307-2328.
Explaining Contrastive Facts. David-Hillel Ruben - 1987 - Analysis 47 (1):35-37.
Contrastive Explanations as Social Accounts. Kareem Khalifa - 2010 - Social Epistemology 24 (4):263-284.
A Linked Aggregate Code For Processing Faces. M. Lyons, K. Morikawa & S. Akamatsu - 2000 - Pragmatics and Cognition 8 (1):63-81.

Analytics

Added to PP: 2022-09-17
Downloads: 5 (#1,514,558)
Downloads, 6 months: 3 (#1,002,413)
