-
Notifications
You must be signed in to change notification settings - Fork 190
Issues: awslabs/data-on-eks
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Observability for rayserve-vllm pattern using Prom and Grafana dashboards.
#586
opened Jul 15, 2024 by
shivam-dubey-1
RAG pattern with LangChain or LlamaIndex
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#567
opened Jul 3, 2024 by
vara-bonthu
[jupyterhub] Jupyternoebook is not launching
question
Further information is requested
#566
opened Jul 1, 2024 by
purnasanyal
Enhance pull speed for Large ML container Images with Bottlerocket
documentation
Improvements or additions to documentation
enhancement
New feature or request
#559
opened Jun 19, 2024 by
ratnopamc
Error: failed to create containerd task: failed to create shim task: OCI runtime create failed
#557
opened Jun 13, 2024 by
pythonking6
Ray Logging and Dashboard Metrics Export to S3 with Custom Dashboard for Historical Clusters
enhancement
New feature or request
#552
opened Jun 5, 2024 by
vara-bonthu
Ray Observability with Prometheus and AMP
enhancement
New feature or request
#551
opened Jun 5, 2024 by
vara-bonthu
Llama-3 on Inferentia generate infinite and meaningless output
stale
#544
opened May 29, 2024 by
yubingjiaocn
1 task done
JARK Stack - Error while launching training step in the dogbooth Jupyter notebook
#537
opened May 20, 2024 by
rivasdam
1 task done
Incorrect command to provide Linux permission on the AWS Trainium on EKS Blueprint
bug
Something isn't working
documentation
Improvements or additions to documentation
#533
opened May 17, 2024 by
AbrahamArellano
1 task done
Re-introduce plan-examples.yml with a proper fix
bug
Something isn't working
#525
opened May 10, 2024 by
askulkarni2
Update documentation for JupyterHub on EKS solution
bug
Something isn't working
documentation
Improvements or additions to documentation
#515
opened May 2, 2024 by
petrokashlikov
1 task done
[Inference]: RayServe with NVIDIA Triton server pattern
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#509
opened Apr 25, 2024 by
vara-bonthu
[Inference]: Mistral7b on GPUs with JARK stack with Ray Serve
enhancement
New feature or request
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#497
opened Apr 8, 2024 by
vara-bonthu
deploy gradio app for llama2 on inf2/ray to k8s
enhancement
New feature or request
#495
opened Apr 8, 2024 by
harishvs
The inf2/ray gradio app does not format new lines in the output
enhancement
New feature or request
#494
opened Apr 8, 2024 by
harishvs
1 task done
Add temprature, topk, topk and other input params to UI for llama2 gradio application on inf2/ray cluster
enhancement
New feature or request
#493
opened Apr 8, 2024 by
harishvs
Move Trainium on EKS from under Blueprints to Gen AI -> Training -> BERT-Large on Trainium section
documentation
Improvements or additions to documentation
enhancement
New feature or request
#488
opened Apr 5, 2024 by
sheetaljoshi
taxi-trip-execute.sh has poor performance and is duplicated six times
enhancement
New feature or request
#486
opened Apr 4, 2024 by
raykrueger
Previous Next
ProTip!
Follow long discussions with comments:>50.