-
-
Notifications
You must be signed in to change notification settings - Fork 4.3k
explosion spaCy Discussions
Sort by:
Latest activity
Label
Categories, most helpful, and community links
Categories
Community links
Discussions
-
You must be logged in to vote 🎁 GloVe Integration
third-partyThird-party packages and services feat / vectorsFeature: Word vectors and similarity -
You must be logged in to vote 💻 backend choice with joblib multiprocessing
usageGeneral spaCy usage -
🎁 Support for XML wordnet
third-partyThird-party packages and services feat / vectorsFeature: Word vectors and similarity -
🎁 Idea: Date and Time Parsing
enhancementFeature requests and improvements -
🤔 Differences between small, medium and large models
usageGeneral spaCy usage modelsIssues related to the statistical models -
🎩 Extendable language detection for spaCy on Doc and Span objects
enhancementFeature requests and improvements -
💻 Tokenizing named entities as a single token
feat / nerFeature: Named Entity Recognizer feat / tokenizerFeature: Tokenizer -
📚 Using Spacy to extract pharmaceutical active ingredients from madical notes
usageGeneral spaCy usage feat / matcherFeature: Token, phrase and dependency matcher -
🎩 I wrote code that generates numbers as Dutch text strings and vice versa
enhancementFeature requests and improvements lang / nlDutch language data and models -
You must be logged in to vote 🎁 Parallelize Pretraining
enhancementFeature requests and improvements 🌙 nightlyDiscussion and contributions related to nightly builds trainingTraining and updating models perf / speedPerformance: speed scalingScaling, serving and parallelizing spaCy -
💻 Several questions when trying to get the start/end token index of a span given the character offset of it?
feat / docFeature: Doc, Span and Token objects -
🌍 Training a Turkish model
lang / trTurkish language data and models -
🎩 CoNLL-U output
enhancementFeature requests and improvements feat / trainingFeature: Training utils, Example, Corpus and converters -
🎩 I created a raspberry pi image
enhancementFeature requests and improvements plat / aarch64aarch64 architecture support -
You must be logged in to vote 🎁 RNN Doc similarity, example available?
feat / vectorsFeature: Word vectors and similarity feat / docFeature: Doc, Span and Token objects -
You must be logged in to vote 🎁 Pipelines with duplicate models
enhancementFeature requests and improvements -
💻 How to make tokenizer handle double-word countries name ?
feat / tokenizerFeature: Tokenizer -
🤔 Unable to retrieve "ent_type" for named entity when word is lowercase
feat / nerFeature: Named Entity Recognizer -
You must be logged in to vote 🌍 tokenizer_exceptions problem with Persian
lang / faPersian language data and models feat / tokenizerFeature: Tokenizer -
You must be logged in to vote 🎁 Add lexical constraints for POS tags (like a tag dictionary)
feat / taggerFeature: Part-of-speech tagger -
💻 How to mark identifiers like productid (AS12314_11) as entity ?
feat / nerFeature: Named Entity Recognizer -
📚 Extracting Consumer information from 10-k text
feat / nerFeature: Named Entity Recognizer ✨ prodigyIssues related to using spaCy with the Prodigy annotation tool resolvedThe issue was addressed / answered -
💻 Adding already tokenized document to spaCy pipeline
usageGeneral spaCy usage feat / tokenizerFeature: Tokenizer -
🎁 Distinguish between sister languages
proposalProposal specs for new features resolvedThe issue was addressed / answered