-
Notifications
You must be signed in to change notification settings - Fork 884
Issues: Lightning-AI/litgpt
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
LitGPT Python API seems to use more memory than Something isn't working
chat
bug
#1588
opened Jul 16, 2024 by
rasbt
Chat: doesn't work with enabled Something isn't working
compilation
bug
#1584
opened Jul 15, 2024 by
Andrei-Aksionov
ModuleNotFoundError: No module named 'extensions'
bug
Something isn't working
#1560
opened Jul 7, 2024 by
ZeguanXiao
how to change dataset path or download url when evaluating
question
Further information is requested
#1556
opened Jul 6, 2024 by
lzd-1230
Mistral v0.1 sliding window attention
enhancement
New feature or request
#1552
opened Jul 5, 2024 by
rasbt
processing the dataset.
question
Further information is requested
#1549
opened Jul 3, 2024 by
Esmail-ibraheem
Add Gemma 2 Checkpoints
checkpoints
enhancement
New feature or request
#1535
opened Jun 27, 2024 by
rasbt
LIMA multiturn dialogues not working correctly?
question
Further information is requested
#1504
opened Jun 19, 2024 by
Nanayeb34
Could we pass number of litdata workers in litgpt pretrain?
enhancement
New feature or request
#1500
opened Jun 18, 2024 by
ebektas
numpy 2.0.0 support
3rd party
enhancement
New feature or request
#1494
opened Jun 16, 2024 by
cardoprimo
serving with multi-GPU
enhancement
New feature or request
#1482
opened Jun 12, 2024 by
richardzhuang0412
Compiled inference failed: "Global state changed while dynamo tracing"
3rd party
#1479
opened Jun 12, 2024 by
antareson
Gradient Accumulation Step under Multi-node Pretaining
enhancement
New feature or request
#1474
opened Jun 10, 2024 by
SHUMKASHUN
Batched inference on a single node with multiple GPUs
enhancement
New feature or request
#1473
opened Jun 9, 2024 by
antareson
The difference between FSDPStrategy and DeepSpeedStrategy during pre-training
#1452
opened May 30, 2024 by
wen020
Training lasts just 150 seconds for TinyLlama OpenWebtext dataset
#1447
opened May 26, 2024 by
srivassid
Previous Next
ProTip!
Adding no:label will show everything without a label.