Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Verbnet identifier in index.xml mismatch #124

Open
alvations opened this issue Oct 19, 2018 · 1 comment
Open

Verbnet identifier in index.xml mismatch #124

alvations opened this issue Oct 19, 2018 · 1 comment

Comments

@alvations
Copy link
Contributor

When recompiling the nltk_data, it throws this error:

nltk_data$ make 
python tools/build_pkg_index.py . https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages index.xml
Traceback (most recent call last):
  File "tools/build_pkg_index.py", line 24, in <module>
    index = build_index(ROOT, BASE_URL)
  File "/Users/liling.tan/Library/Python/2.7/lib/python/site-packages/nltk/downloader.py", line 2088, in build_index
    for pkg_xml, zf, subdir in _find_packages(os.path.join(root, 'packages')):
  File "/Users/liling.tan/Library/Python/2.7/lib/python/site-packages/nltk/downloader.py", line 2216, in _find_packages
    'vs %s)' % (pkg_xml.get('id'), uid))
ValueError: package identifier mismatch (verbnet vs verbnet3)
make: *** [pkg_index] Error 1

@alvations
Copy link
Contributor Author

alvations commented Oct 19, 2018

This is because both verbnet and verbnet3 has the same id:

nltk_data/packages/corpora$ cat verbnet.xml 
<package id="verbnet"
         name="VerbNet Lexicon, Version 2.1"
         version="2.1"
         author="Karin Kipper-Schuler"
         webpage="https://verbs.colorado.edu/verbnet/"
         license="Distributed with permission of the author."
         unzip="1"
         />

nltk_data/packages/corpora$ cat verbnet3.xml 
<package id="verbnet"
         name="VerbNet Lexicon, Version 3.3"
         version="3.3"
         author="Karin Kipper-Schuler"
         webpage="https://verbs.colorado.edu/verbnet/"
         license="Distributed with permission of the author."
         unzip="1"
         />

The same identifier is causing the mismatch in the nltk code too, c.f. nltk/nltk#2015

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant