Updated Dec 19, 2019 - Python
corpora
Here are 154 public repositories matching this topic...
Reference bio-translations and post-edited automatic translations of systematic reviews published by the Cochrane Collaboration
Updated Aug 12, 2020
Handy frequency lists from corpora and a few related utilities.
Updated Apr 10, 2021 - R
Corpora of Thomas Moore texts and results of stylometric analysis
Updated Feb 28, 2020
Estonian TIMEX Annotated Corpora \ Eesti keele ajaväljendimärgendustega korpused
Updated May 2, 2022 - Python
A script for preprocessing a frequency list from the Norwegian Web as Corpus (NoWaC) in R
Updated Feb 4, 2024 - R
Turkish English Parallel Corpus Generator
Updated Nov 19, 2015 - Python
A collection of small corpuses of interesting data for the creation of bots and similar stuff.
Updated Apr 4, 2021 - JavaScript
Compares the writing styles of two authors with different personalities and designations using NLTK
Updated Dec 24, 2017 - Jupyter Notebook
St. Petersburg corpus of hagiographic texts
Updated Apr 24, 2016 - Python
The Serbian Semantic Textual Similarity News Corpus
Updated Feb 22, 2021
Named Entity Recognition data for Biblioteca Virtual Miguel de Cervantes
Updated Jul 31, 2017
A split-corpus package that divides text corpora into meaningful parts as close to a specified size as possible.
Updated Feb 8, 2022 - Python