Kno.e.sis Publications

A Semantics-Based Measure of Emoji Similarity

Sanjaya Wijeratne, Wright State University - Main CampusFollow
Lakshika Balasuriya, Wright State University - Main CampusFollow
Amit Sheth, Wright State University - Main CampusFollow
Derek Doran, Wright State University - Main CampusFollow

Document Type

Conference Proceeding

Publication Date

2017

Abstract

Emoji have grown to become one of the most important forms of communication on the web. With its widespread use, measuring the similarity of emoji has become an important problem for contemporary text processing since it lies at the heart of sentiment analysis, search, and interface design tasks. This paper presents a comprehensive analysis of the semantic similarity of emoji through embedding models that are learned over machine-readable emoji meanings in the EmojiNet knowledge base. Using emoji descriptions, emoji sense labels and emoji sense definitions, and with different training corpora obtained from Twitter and Google News, we develop and test multiple embedding models to measure emoji similarity. To evaluate our work, we create a new dataset called EmoSim508, which assigns human-annotated semantic similarity scores to a set of 508 carefully selected emoji pairs. After validation with EmoSim508, we present a real-world use-case of our emoji embedding models using a sentiment analysis task and show that our models outperform the previous best-performing emoji embedding model on this task. The EmoSim508 dataset and our emoji embedding models are publicly released with this paper and can be downloaded from http://emojinet.knoesis.org/.

Comments

Repository Citation

Sanjaya Wijeratne, Lakshika Balasuriya, Amit Sheth, and Derek Doran.2017. A Semantics-Based Measure of Emoji Similarity. In Proceedings of WI’17, Leipzig, Germany, August 23-26, 2017, 8 pages.

DOI

10.1145/3106426.3106490

Download

Included in

Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, Science and Technology Studies Commons

COinS

Kno.e.sis Publications

A Semantics-Based Measure of Emoji Similarity

Document Type

Publication Date

Abstract

Comments

Repository Citation

DOI

Included in

Search

Browse

About

SelectedWorks Sites

Kno.e.sis Publications

A Semantics-Based Measure of Emoji Similarity

Authors

Document Type

Publication Date

Abstract

Comments

Repository Citation

DOI

Included in

Share

Search

Browse

About

SelectedWorks Sites