Evaluates the degree to which two English sentences are semantically equivalent. Similarity scores range from 0 (no overlap in meaning) to 5 (equivalence of meaning), with intermediate values reflecting interpretable levels of partial overlap, for example two sentences that share a topic but differ in important details.
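For illustration only, a minimal sketch of how a system might produce such a graded score, assuming the sentence-transformers library and the all-MiniLM-L6-v2 checkpoint; both are assumptions for the sketch, not part of this task page, and no system in the results below is documented as working exactly this way:

```python
from sentence_transformers import SentenceTransformer, util

# Hypothetical choice of checkpoint; any sentence-embedding model could be used.
model = SentenceTransformer("all-MiniLM-L6-v2")

sent1 = "A man is playing a guitar."
sent2 = "A person is playing an instrument."

# Encode both sentences and compute cosine similarity (in [-1, 1]).
emb1 = model.encode(sent1)
emb2 = model.encode(sent2)
cosine = util.cos_sim(emb1, emb2).item()

# Clamp negatives and rescale to the task's 0-5 similarity range.
score = 5.0 * max(cosine, 0.0)
print(f"predicted similarity: {score:.2f}")
```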
Publication
Daniel Cer, Mona Diab, Eneko Agirre, Iñigo Lopez-Gazpio, and Lucia Specia. 2017. SemEval-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pages 1–14, Vancouver, Canada. Association for Computational Linguistics.
Language
English
NLP topic
Abstract task
Year
2017
Publication link
https://aclanthology.org/S17-2001/
Ranking metric
Pearson correlation
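Systems are ranked by the Pearson correlation between their predicted similarity scores and the gold annotations over all test pairs. A small illustration using scipy.stats.pearsonr; the gold and pred values below are made up for the example, not taken from this task:

```python
from scipy.stats import pearsonr

# Hypothetical gold annotations and system predictions for five sentence pairs.
gold = [5.0, 3.8, 2.5, 1.0, 0.2]
pred = [4.6, 3.9, 2.1, 1.4, 0.5]

r, _p_value = pearsonr(gold, pred)
print(f"Pearson r = {r:.4f}")  # 1.0 would mean perfect linear agreement with gold
```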
Task results
System | Pearson correlation | ICM |
---|---|---|
roberta-large | 0.8656 | |
roberta-base | 0.8572 | |
xlm-roberta-large | 0.8450 | |
bert-base-cased | 0.8434 | |
distilbert-base-uncased | 0.8360 | |
ixa-ehu/ixambert-base-cased | 0.8170 | 0.7872 |
bert-base-multilingual-cased | 0.8112 | |
xlm-roberta-base | 0.8097 | |
distilbert-base-multilingual-cased | 0.7872 | |
Llama-3.1-8B | 0.7699 | |