Skip to content

bertini36/SpanishCorpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spanish Corpus

The Python dependencies are described in requirements.txt. To install I recommend install virtualenv and virtualenvwrapper (http://rukbottoland.com/blog/tutorial-de-python-virtualenvwrapper/). Then you can install the spanishCorpus requirements with the command: pip install -r requirements.txt

This little library needs some packages of NLTK (Natural Language Toolkit). To install these packages is required init a Python console in the virtualenv in which the requirements are installed.

Now enter the command: nltk.download()

In the nltk manager you have to ensure that Corpora has installed:

  • cess_esp
  • omw
  • stopwords
  • wordnet
  • words

And in Models you have to ensure that Snowball_data is installed.

Install German and French dictionaries to more accurately:

  • sudo apt-get install myspell-de-de
  • sudo apt-get install myspell-fr-fr

In main.py you have an example of the library's use.

About

Library to ease spanish text mining in Python language

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages