#

text-mining

Here are 2,190 public repositories matching this topic...

jrdnbradford / lovecraftr

An R 📦 of H. P. Lovecraft's works for textual analysis

nlp text-mining r text-analysis horror digital-humanities text-processing lovecraft horror-fiction

Updated Jul 16, 2024
R

palladian / palladian

Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.

text-mining retrieval information-extraction classification

Updated Jul 16, 2024
Java

vmenger / deduce

Deduce: de-identification method for Dutch medical text

python nlp text-mining python-library information-extraction text-processing dutch deidentification dutch-clinical-nlp

Updated Jul 16, 2024
Python

navigating-stories / orange-story-navigator

Add-on to the Orange3 data mining toolkit with text processing widgets from the project Navigating Stories

stories text-mining storytelling data-analysis orange3

Updated Jul 16, 2024
Python

Lips7 / Matcher

A high performance matcher designed to solve AND OR NOT logical word matching and TEXT VARIATIONS problems.

python java rust text-mining text-classification text pattern-matching word text-analysis matcher text-processing aho-corasick string-matching matching-engine content-moderation sensitive-word

Updated Jul 16, 2024
Rust

adbar / trafilatura

Python & command-line tool to gather text on the Web: Crawling & scraping, content extraction, metadata. TXT, Markdown, CSV & XML output.

Updated Jul 16, 2024
Python

ArdentEmpiricist / text_analysis

Analyze text stored as *.txt in chosen file or directory. Doesn't read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words).

rust science text-mining data-mining statistics

Updated Jul 16, 2024
Rust

JesusSalinas / master_upb

Text Analysis

text-mining sraping

Updated Jul 16, 2024
Python

otonomee / against-the-clock-transcript-analysis

This repository contains code and analysis for exploring the transcripts of the various "Against The Clock" videos featured on the FACT Magazine YouTube channel. The goal is to uncover insights, patterns, and trends across the different artists and their creative process under time constraints.

nlp machine-learning natural-language-processing text-mining data-analysis music-production audio-processing creative-ai yt-dlp ai-analysis creative-process fact-magazine against-the-clock

Updated Jul 15, 2024
Python

cortega26 / PDF-Text-Analizer

This repository houses a script that can download PDFs from a specified URL, convert them to text, and perform text analysis. This analysis includes identifying the language, eliminating stopwords, and counting word and phrase frequency. It's worth noting that the script is capable of analyzing texts in multiple languages.

nlp pdf text-mining ocr pdf-converter text-analysis text-summarization

Updated Jul 15, 2024
Python

deanmalmgren / textract

extract text from any document. no muss. no fuss.

python natural-language-processing text-mining data-mining

Updated Jul 15, 2024
HTML

strdubtseva / CSS_paper

Learning or Cheating? Reddit Insights on ChatGPT in Academia

text-mining sentiment-analysis reddit-api topic-modeling

Updated Jul 14, 2024
Python

stephenhky / PyShortTextCategorization

Various Algorithms for Short Text Mining

python package machine-learning natural-language-processing text-mining algorithm neural-network python-library topic-modeling

Updated Jul 13, 2024
Python

AidenJiang01 / WebScraping_TextMining

Web scraping and text mining on patents

text-mining webscraping text-graph patents-analysis

Updated Jul 13, 2024
Jupyter Notebook

ELBMcoclust

Saeidhoseinipour / ELBMcoclust

We unified some latent block models by proposing a flexible ELBM that is extended to SELBM to address the sparse problem by revealing a diagonal structure from sparse datasets. This leads to obtain more homogeneous co-clusters and therefore produce useful, ready-to-use and easy-to-interpret results.

text-mining word-cloud exponential text-summarization sparse-matrix co-clustering latent-block-model coclust

Updated Jul 13, 2024
Python

dishmint / LexicalCases

Extract substrings matching a lexical pattern

text-mining text pattern-matching linguistics wolfram-language text-search wolfram-mathematica text-analaysis

Updated Jul 12, 2024
Mathematica

Nolram567 / PolMinePyHesse

Ein Parser für die Generation eines XML-TEI-Korpus der 20. Legislaturperiode des hessischen Landtags und die Berechnung eines Topic Models.

text-mining xml-tei

Updated Jul 11, 2024
Python

geo-tp / Alpha-Project-Text-Archive

Compilation of texts from WoW alphas and betas. Used by https://github.com/The-Alpha-Project/Text-Crawler-Website

text-mining text-datasets

Updated Jul 11, 2024
HTML

graphbrain / graphbrain

Language, Knowledge, Cognition

python nlp natural-language-processing text-mining knowledge philosophy text-analysis artificial-intelligence knowledge-graph cognitive-science knowledge-base knowledge-representation computational-social-science natural-language-understanding hypergraphs

Updated Jul 11, 2024
Python

RutingF / nlp_textmining

Text mining tasks to extract information using regular expression

text-mining regular-expression

Updated Jul 10, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the text-mining topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the text-mining topic, visit your repo's landing page and select "manage topics."