PhrasIS: Phrase Inference and Similarity benchmark

Logic Journal of the IGPL 32 (6):1088-1101 (2024)
  Copy   BIBTEX

Abstract

We present PhrasIS, a benchmark dataset composed of natural occurring Phrase pairs with Inference and Similarity annotations for the evaluation of semantic representations. The described dataset fills the gap between word and sentence-level datasets, allowing to evaluate compositional models at a finer granularity than sentences. Contrary to other datasets, the phrase pairs are extracted from naturally occurring text in image captions and news headlines. All the text fragments have been annotated by experts following a rigorous process also described in the manuscript achieving high inter annotator agreement. In this work we analyse the dataset, showing the relation between inference labels and similarity scores. With 10K phrase pairs split in development and test, the dataset is an excellent benchmark for testing meaning representation systems.

Other Versions

No versions found

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 107,589

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Benchmarking Scientific Image Forgery Detectors.João P. Cardenuto & Anderson Rocha - 2022 - Science and Engineering Ethics 28 (4):1-38.

Analytics

Added to PP
2024-04-07

Downloads
35 (#782,213)

6 months
15 (#301,529)

Historical graph of downloads
How can I increase my downloads?

Author Profiles

Isaí López
National Autonomous University of Mexico
Paula Garcia
University of Canterbury
1 more

Citations of this work

No citations found.

Add more citations

References found in this work

Add more references