COVID-19 Anti-Asian Racism Hate Speech Detection using Transfer Learning

Since the start of the COVID-19 pandemic, there has been a widespread increase in the amount of hate-speech being propagated online against the Asian community. This project builds upon and explores the work of He et al. Their COVID-HATE dataset contains 206 million tweets focused around anti-Asian hate speech. Using tweet data from before the COVID-19 pandemic, as well as the COVID-HATE dataset from He et al, we performed transfer learning. We tested several different models, including BERT, RoBERTa, LSTM, and BERT-CNN.
Some of these models hindered the performance of He et al’s model, while others improved it.

You can access the original proposal here: Proposal

You can access the full report here: Report

We also explored the geographical extent of the ani-Asain tweets from the COVID-HATE dataset.

Since a large majority of the tweets originated from U.S.A., here is a map showing the distribution within this country.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
docs		docs
images		images
notebooks		notebooks
License		License
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

COVID-19 Anti-Asian Racism Hate Speech Detection using Transfer Learning

About

Releases

Packages

Languages

License

nihaldsouza/covid-hatespeech-detection

Folders and files

Latest commit

History

Repository files navigation

COVID-19 Anti-Asian Racism Hate Speech Detection using Transfer Learning

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages