Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Link to download the training text in docs/source/quicktour.rst is broken #1526

Open
14jdelap opened this issue May 9, 2024 · 5 comments
Open

Comments

@14jdelap
Copy link

14jdelap commented May 9, 2024

In Quicktour the link to download the wikitext-103 file is broken because the response is a 403.

➜  tokenizer-training wget https://s3.amazonaws.com/research.metamind.io/wikitext/wikitext-103-raw-v1.zip
--2024-05-09 12:39:46--  https://s3.amazonaws.com/research.metamind.io/wikitext/wikitext-103-raw-v1.zip
Resolving s3.amazonaws.com (s3.amazonaws.com)... 3.5.12.14, 52.216.89.134, 52.217.197.112, ...
Connecting to s3.amazonaws.com (s3.amazonaws.com)|3.5.12.14|:443... connected.
HTTP request sent, awaiting response... 403 Forbidden
2024-05-09 12:39:46 ERROR 403: Forbidden.
Copy link

github-actions bot commented Jun 9, 2024

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

@github-actions github-actions bot added the Stale label Jun 9, 2024
@ArthurZucker
Copy link
Collaborator

Hey! WOuld you like to open a PR for a fix?

@github-actions github-actions bot removed the Stale label Jun 12, 2024
@14jdelap
Copy link
Author

Hey! I tried to find another link for the same dataset online but couldn't find one — otherwise I would've done a PR with the fix :)

@14jdelap
Copy link
Author

But if you point me to where I can find the dataset I'm happy to send the PR

@ArthurZucker
Copy link
Collaborator

Maybe using this one: https://huggingface.co/datasets/Salesforce/wikitext or one that is on the hub would be nice!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants