The increasing popularity of social media platforms has made hate speech a major concern. Hate speech refers to abusive language that targets specific group characteristics, such as gender, religion or ethnicity, with the intention of inciting violence. As technology has expanded, some people deliberately use social media platforms to spread hate by posting, sharing and commenting on hateful content. This has been observed in incidents such as the Christchurch mosque shootings and hate crimes against Asians in the West, where the perpetrators were influenced by online hate text.
AI systems have been developed to flag such text, but a key challenge is reducing false positives (marking non-hate text as hate) so that these systems can detect hate speech without undermining freedom of expression. In this paper, Rajput et al. use the ETHOS hate speech detection dataset to analyze the performance of a hate speech detection classifier when word embeddings (fastText (FT), GloVe (GV) or FT + GV) are replaced by, or integrated with, static BERT embeddings (BE). Through extensive experimental trials, they observe that neural networks perform better with static BE than with FT, GV or FT + GV as word embeddings. Compared with fine-tuned BERT, the metric that improves significantly is specificity, the share of non-hate text that is correctly left unflagged.
The paper also provides a literature review on recent developments in deep learning technology for various applications across domains such as healthcare, image processing and natural language processing. The research community has shown keen interest in developing AI-assisted applications for detecting hate speech on social media platforms. For instance, Badjatiya et al. proposed a deep learning approach using convolutional neural networks and fastText, validated on a dataset of 16,000 tweets labeled as either racist or sexist. Rizos et al., on the other hand, experimented with short-text data augmentation techniques in deep learning for hate speech classification: substitution-based augmentation (ThreshAug), word position augmentation (PosAug) and neural generative augmentation (GenAug). They achieved their best results with GloVe + CNN + LSTM + BestAug, where BestAug is a combination of PosAug and ThreshAug. Faris et al. proposed a deep learning approach to detect hate speech in Arabic using CNN and LSTM with AraVec. The paper also introduces Bidirectional Encoder Representations from Transformers (BERT), a transformer-based ML technique pre-trained on unlabeled data taken from Wikipedia and BookCorpus. BERT uses two major strategies for training: masked language modeling (MLM) and next sentence prediction (NSP). MLM involves randomly masking 15% of the input tokens, which the model must then predict, while NSP involves predicting whether the second sentence in a pair follows the first in the original document.
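As a rough illustration of the NSP objective described above, the sketch below builds labeled sentence pairs from a single toy document. The function name `make_nsp_pairs` and the sampling details are assumptions made for illustration; in particular, drawing negative examples from the same document is a simplification, since BERT pre-training draws them from the wider corpus.

```python
import random


def make_nsp_pairs(sentences, rng):
    """Build (sentence_a, sentence_b, is_next) examples from one document.

    About half of the pairs use the true next sentence (is_next=True); the
    rest pair sentence_a with a different sentence chosen at random
    (is_next=False). Requires at least three sentences.
    """
    if len(sentences) < 3:
        raise ValueError("need at least three sentences to sample negatives")
    pairs = []
    for i in range(len(sentences) - 1):
        if rng.random() < 0.5:
            pairs.append((sentences[i], sentences[i + 1], True))
        else:
            # Any sentence except sentence_a itself and its true successor.
            candidates = [s for j, s in enumerate(sentences) if j not in (i, i + 1)]
            pairs.append((sentences[i], rng.choice(candidates), False))
    return pairs


document = [
    "The platform added a reporting tool.",
    "Users can flag abusive posts.",
    "Moderators review each flag.",
    "Repeat offenders are suspended.",
]
for a, b, is_next in make_nsp_pairs(document, random.Random(0)):
    print(is_next, "|", a, "->", b)
```

A classifier trained on such pairs only has to output the `is_next` label, which is what makes NSP a binary prediction task.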
- Social media platforms have made hate speech a major concern
- Hate speech refers to abusive language that targets specific group characteristics with the intention of inciting violence
- Some people deliberately use social media platforms to spread hate by posting, sharing and commenting on hateful content
- AI systems have been developed to flag such text, but reducing false positives is a key challenge
- Rajput et al. use the ETHOS hate speech detection dataset to analyze the performance of a hate speech detection classifier when word embeddings are replaced by, or integrated with, static BERT embeddings
- Neural networks perform better with static BERT embeddings than with the other word embeddings
- The paper provides a literature review on recent developments in deep learning technology for various applications across domains such as healthcare, image processing and natural language processing
- The research community has shown keen interest in developing AI-assisted applications for detecting hate speech on social media platforms
- Various deep learning approaches have been proposed for detecting hate speech, including CNNs with fastText, GloVe with CNN and LSTM, and CNN and LSTM with AraVec
- Bidirectional Encoder Representations from Transformers (BERT) is a transformer-based ML technique pre-trained on unlabeled data taken from Wikipedia and BookCorpus
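The difference between replacing word embeddings and integrating them, as summarized in the list above, can be sketched with toy lookup tables. Everything here is hypothetical: the vectors are made up, the helper `embed` is illustrative, and treating "integration" as per-token concatenation is an assumption, since this summary does not state how the paper combines the embeddings or how the static BERT vectors are obtained.

```python
# Toy static embedding tables: one fixed vector per word, independent of context.
# All values are invented for illustration only.
FASTTEXT = {"hate": [0.1, 0.3], "speech": [0.2, 0.0]}
GLOVE = {"hate": [0.5, 0.4], "speech": [0.1, 0.9]}
STATIC_BERT = {"hate": [0.7, 0.2, 0.6], "speech": [0.3, 0.8, 0.1]}


def embed(tokens, *tables):
    """Look up each token in every table and concatenate the vectors.

    Words missing from a table fall back to a zero vector of that
    table's dimension.
    """
    rows = []
    for tok in tokens:
        vec = []
        for table in tables:
            dim = len(next(iter(table.values())))
            vec.extend(table.get(tok, [0.0] * dim))
        rows.append(vec)
    return rows


tokens = ["hate", "speech"]
replaced = embed(tokens, STATIC_BERT)              # BE used instead of FT or GV
integrated = embed(tokens, FASTTEXT, STATIC_BERT)  # FT + BE, concatenated per token
print(len(replaced[0]), len(integrated[0]))        # vector sizes: 3 and 5
```

Either result is a sequence of fixed-size vectors, so the same downstream neural network can consume it after only its input dimension is adjusted.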
1. Social media makes it easy for people to say mean things about others.
2. When someone says mean things about a group of people with the intention of causing harm, it's called hate speech.
3. Some people use social media to spread hate by posting or commenting on mean content.
4. Computers can help find and flag hateful words, but sometimes they make mistakes.
5. Scientists are trying to use a special computer program called BERT to better detect hate speech.
Definitions
- Hate speech: saying hurtful things about a group of people with the intention of causing harm
- Social media: websites or apps where people can share information and communicate with each other online
- AI systems: computers that can learn and make decisions like humans do
- False positives: when a computer program thinks something is bad (like hate speech) but it's actually okay
- Word embeddings: a way for computers to understand how different words relate to each other in language
- BERT: a special type of computer program that helps computers understand language better
Hate Speech Detection on Social Media Platforms Using AI
The increasing popularity of social media platforms has made hate speech a major concern. Hate speech refers to abusive language that targets specific group characteristics, such as gender, religion or ethnicity, with the intention of inciting violence. As technology has expanded, some people deliberately use social media platforms to spread hate by posting, sharing and commenting on hateful content. This has been observed in incidents such as the Christchurch mosque shootings and hate crimes against Asians in the West, where the perpetrators were influenced by online hate text. As a result, there is an urgent need for automated systems that can detect this kind of malicious content before it causes further harm.
AI Systems for Detecting Hate Speech
AI systems have been developed to flag such text, but a key challenge is reducing false positives (marking non-hate text as hate) so that these systems can detect hate speech without undermining freedom of expression. In this paper, Rajput et al. use the ETHOS hate speech detection dataset to analyze the performance of a hate speech detection classifier when word embeddings (fastText (FT), GloVe (GV) or FT + GV) are replaced by, or integrated with, static BERT embeddings (BE). Through extensive experimental trials, they observe that neural networks perform better with static BE than with FT, GV or FT + GV as word embeddings. Compared with fine-tuned BERT, the metric that improves significantly is specificity, the share of non-hate text that is correctly left unflagged.
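Because specificity is the metric tied most directly to false positives, a small worked example may help. Specificity is the true-negative rate, TN / (TN + FP). The sketch below assumes labels of 1 for hate and 0 for non-hate; the function name and the toy labels are illustrative and are not taken from the paper.

```python
def specificity(y_true, y_pred):
    """True-negative rate TN / (TN + FP), with 1 = hate and 0 = non-hate."""
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    if tn + fp == 0:
        raise ValueError("no non-hate examples in y_true")
    return tn / (tn + fp)


# Four non-hate posts, one of them wrongly flagged, plus two hate posts.
y_true = [0, 0, 0, 0, 1, 1]
y_pred = [0, 0, 0, 1, 1, 0]
print(specificity(y_true, y_pred))  # 3 TN and 1 FP give 0.75
```

Note that the missed hate post (the last example) does not affect specificity at all; it lowers recall instead, which is why the two metrics are reported separately.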
Recent Developments in Deep Learning Technology
The research community has shown keen interest in developing AI-assisted applications for detecting hate speech on social media platforms. For instance, Badjatiya et al. proposed a deep learning approach using convolutional neural networks and fastText, validated on a dataset of 16,000 tweets labeled as either racist or sexist. Rizos et al. experimented with short-text data augmentation techniques in deep learning for hate speech classification: substitution-based augmentation (ThreshAug), word position augmentation (PosAug) and neural generative augmentation (GenAug). They achieved their best results with GloVe + CNN + LSTM + BestAug, where BestAug is a combination of PosAug and ThreshAug. Faris et al. proposed a deep learning approach to detect hate speech in Arabic using CNN and LSTM with AraVec. The paper also introduces Bidirectional Encoder Representations from Transformers (BERT), a transformer-based ML technique pre-trained on unlabeled data taken from Wikipedia and BookCorpus, which uses two major strategies for training: masked language modeling (MLM) and next sentence prediction (NSP). MLM involves randomly masking 15% of the input tokens, which the model must then predict, while NSP involves predicting whether the second sentence in a pair follows the first in the original document.
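The MLM step described above can be sketched in a few lines. This is a simplified illustration, not BERT's actual preprocessing code: the function name `mask_tokens` is hypothetical, tokens are plain whitespace-split words rather than WordPiece tokens, and every selected position is replaced with `[MASK]`. In the original BERT recipe a selected token is replaced with `[MASK]` only 80% of the time (10% become a random token and 10% stay unchanged); that refinement is omitted here.

```python
import random

MASK = "[MASK]"


def mask_tokens(tokens, rng, mask_prob=0.15):
    """Return (masked_tokens, labels) for masked language modeling.

    labels[i] holds the original token at each masked position and None
    elsewhere, so a model is only scored on the positions it cannot see.
    """
    if not tokens:
        return [], []
    # Mask roughly mask_prob of the positions, and always at least one.
    n_to_mask = max(1, round(len(tokens) * mask_prob))
    positions = set(rng.sample(range(len(tokens)), n_to_mask))
    masked = [MASK if i in positions else tok for i, tok in enumerate(tokens)]
    labels = [tok if i in positions else None for i, tok in enumerate(tokens)]
    return masked, labels


sentence = ("online platforms need automated tools that can flag abusive "
            "posts quickly without silencing ordinary users who merely "
            "disagree with each other").split()
masked, labels = mask_tokens(sentence, random.Random(0))
print(" ".join(masked))
```

The model's training target is the list of hidden originals in `labels`; all other positions contribute nothing to the MLM loss.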
Conclusion
In conclusion, recent years have seen various developments towards AI models that can detect hateful content on social media platforms without compromising freedom of expression. Rajput et al.'s research shows promising results when static BERT embeddings are used instead of other word embedding methods such as fastText and GloVe. However, more work is needed to improve metrics such as precision, recall and F1 score while minimizing the false positive rate, and to explore approaches such as data augmentation and transfer learning that could improve performance further.