💫 Industrial-strength Natural Language Processing (NLP) in Python
-
Updated
Jul 12, 2024 - Python
💫 Industrial-strength Natural Language Processing (NLP) in Python
Ravencoin Core integration/staging tree
All the slides, accompanying code and exercises all stored in this repo. 🎈
This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.
LunaSec - Dependency Security Scanner that automatically notifies you about vulnerabilities like Log4Shell or node-ipc in your Pull Requests and Builds. Protect yourself in 30 seconds with the LunaTrace GitHub App: https://github.com/marketplace/lunatrace-by-lunasec/
👑 spaCy building blocks and visualizers for Streamlit apps
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Unsupervised text tokenizer focused on computational efficiency
Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Rosette API Client Library for Python
PHP Text Analysis is a library for performing Information Retrieval (IR) and Natural Language Processing (NLP) tasks using the PHP language
Secure SDK/vault for personal records/PII built to comply with GDPR
TokenScript schema, specs and paper
Fast and customizable text tokenization library with BPE and SentencePiece support
NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
CodeChain's official implementation in Rust.
Use Python and NLTK to build out your own text classifiers and solve common NLP problems
ClangKit provides an Objective-C frontend to LibClang. Source tokenization, diagnostics and fix-its are actually implemented.
Add a description, image, and links to the tokenization topic page so that developers can more easily learn about it.
To associate your repository with the tokenization topic, visit your repo's landing page and select "manage topics."