An R 📦 of H. P. Lovecraft's works for textual analysis
-
Updated
Jul 16, 2024 - R
An R 📦 of H. P. Lovecraft's works for textual analysis
Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.
Deduce: de-identification method for Dutch medical text
Add-on to the Orange3 data mining toolkit with text processing widgets from the project Navigating Stories
A high performance matcher designed to solve AND OR NOT logical word matching and TEXT VARIATIONS problems.
Python & command-line tool to gather text on the Web: Crawling & scraping, content extraction, metadata. TXT, Markdown, CSV & XML output.
Analyze text stored as *.txt in chosen file or directory. Doesn't read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words).
This repository contains code and analysis for exploring the transcripts of the various "Against The Clock" videos featured on the FACT Magazine YouTube channel. The goal is to uncover insights, patterns, and trends across the different artists and their creative process under time constraints.
This repository houses a script that can download PDFs from a specified URL, convert them to text, and perform text analysis. This analysis includes identifying the language, eliminating stopwords, and counting word and phrase frequency. It's worth noting that the script is capable of analyzing texts in multiple languages.
extract text from any document. no muss. no fuss.
Learning or Cheating? Reddit Insights on ChatGPT in Academia
Various Algorithms for Short Text Mining
Web scraping and text mining on patents
We unified some latent block models by proposing a flexible ELBM that is extended to SELBM to address the sparse problem by revealing a diagonal structure from sparse datasets. This leads to obtain more homogeneous co-clusters and therefore produce useful, ready-to-use and easy-to-interpret results.
Extract substrings matching a lexical pattern
Ein Parser für die Generation eines XML-TEI-Korpus der 20. Legislaturperiode des hessischen Landtags und die Berechnung eines Topic Models.
Compilation of texts from WoW alphas and betas. Used by https://github.com/The-Alpha-Project/Text-Crawler-Website
Language, Knowledge, Cognition
Text mining tasks to extract information using regular expression
Add a description, image, and links to the text-mining topic page so that developers can more easily learn about it.
To associate your repository with the text-mining topic, visit your repo's landing page and select "manage topics."