Summary: A newly developed artificial intelligence algorithm can accurately detect sarcasm in comments written on social media platforms.
Source: University of Central Florida
Computer science researchers at the University of Central Florida have developed a sarcasm detector.
Social media has become a dominant form of communication for individuals, and for companies looking to market and sell their products and services. Properly understanding and responding to customer feedback on Twitter, Facebook and other social media platforms is critical for success, but it is incredibly labor intensive.
That’s where sentiment analysis comes in. The term refers to the automated process of identifying the emotion — either positive, negative or neutral — associated with text. While artificial intelligence refers to logical data analysis and response, sentiment analysis is akin to correctly identifying emotional communication. A UCF team developed a technique that accurately detects sarcasm in social media text.
The team’s findings were recently published in the journal Entropy.
Effectively the team taught the computer model to find patterns that often indicate sarcasm and combined that with teaching the program to correctly pick out cue words in sequences that were more likely to indicate sarcasm. They taught the model to do this by feeding it large data sets and then checked its accuracy.
“The presence of sarcasm in text is the main hindrance in the performance of sentiment analysis,” says Assistant Professor of engineering Ivan Garibay.
“Sarcasm isn’t always easy to identify in conversation, so you can imagine it’s pretty challenging for a computer program to do it and do it well. We developed an interpretable deep learning model using multi-head self-attention and gated recurrent units. The multi-head self-attention module aids in identifying crucial sarcastic cue-words from the input, and the recurrent units learn long-range dependencies between these cue-words to better classify the input text.”
The team, which includes computer science doctoral student Ramya Akula, began working on this problem under a DARPA grant that supports the organization’s Computational Simulation of Online Social Behavior program.
“Sarcasm has been a major hurdle to increasing the accuracy of sentiment analysis, especially on social media, since sarcasm relies heavily on vocal tones, facial expressions and gestures that cannot be represented in text,” says Brian Kettler, a program manager in DARPA’s Information Innovation Office (I2O).
“Recognizing sarcasm in textual online communication is no easy task as none of these cues are readily available.”
This is one of the challenges Garibay’s Complex Adaptive Systems Lab (CASL) is studying. CASL is an interdisciplinary research group dedicated to the study of complex phenomena such as the global economy, the global information environment, innovation ecosystems, sustainability, and social and cultural dynamics and evolution. CASL scientists study these problems using data science, network science, complexity science, cognitive science, machine learning, deep learning, social sciences, team cognition, among other approaches.
“In face-to-face conversation, sarcasm can be identified effortlessly using facial expressions, gestures, and tone of the speaker,” Akula says. “Detecting sarcasm in textual communication is not a trivial task as none of these cues are readily available. Especially with the explosion of internet usage, sarcasm detection in online communications from social networking platforms is much more challenging.”
Garibay is an assistant professor in Industrial Engineering and Management Systems. He has several degrees including a Ph.D. in computer science from UCF. Garibay is the director of UCF’s Artificial Intelligence and Big Data Initiative of CASL and of the master’s program in data analytics.
His research areas include complex systems, agent-based models, information and misinformation dynamics on social media, artificial intelligence and machine learning. He has more than 75 peer-reviewed papers and more than $9.5 million in funding from various national agencies.
Akula is a doctoral scholar and graduate research assistant at CASL. She has a master’s degree in computer science from Technical University of Kaiserslautern in Germany and a bachelor’s degree in computer science engineering from Jawaharlal Nehru Technological University, India.
About this natural language processing research news
Source: University of Central Florida
Contact: Zenaida Gonzalez Kotala – University of Central Florida
Image: The image is in the public domain
Original Research: Open access.
“Interpretable Multi-Head Self-Attention Architecture for Sarcasm Detection in Social Media” by Ivan Garibay & Ramya Akula. Entropy
Abstract
Interpretable Multi-Head Self-Attention Architecture for Sarcasm Detection in Social Media
With the online presence of more than half the world population, social media plays a very important role in the lives of individuals as well as businesses alike. Social media enables businesses to advertise their products, build brand value, and reach out to their customers.
To leverage these social media platforms, it is important for businesses to process customer feedback in the form of posts and tweets. Sentiment analysis is the process of identifying the emotion, either positive, negative or neutral, associated with these social media texts.
The presence of sarcasm in texts is the main hindrance in the performance of sentiment analysis.
Sarcasm is a linguistic expression often used to communicate the opposite of what is said, usually something that is very unpleasant, with an intention to insult or ridicule. Inherent ambiguity in sarcastic expressions make sarcasm detection very difficult. In this work, we focus on detecting sarcasm in textual conversations from various social networking platforms and online media.
To this end, we develop an interpretable deep learning model using multi-head self-attention and gated recurrent units.
The multi-head self-attention module aids in identifying crucial sarcastic cue-words from the input, and the recurrent units learn long-range dependencies between these cue-words to better classify the input text.
We show the effectiveness of our approach by achieving state-of-the-art results on multiple datasets from social networking platforms and online media. Models trained using our proposed approach are easily interpretable and enable identifying sarcastic cues in the input text which contribute to the final classification score.
We visualize the learned attention weights on a few sample input texts to showcase the effectiveness and interpretability of our model.