large-vision-models

Here are 3 public repositories matching this topic...

Paranioar / Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

tutorial awesome-list vision-and-language video-text-recognition cross-modal-retrieval visual-semantic-embedding image-text-matching video-text-retrieval image-text-retrieval multimodal-pretraining large-language-models large-vision-language-models memory-efficient-tuning parameter-efficient-fine-tuning large-vision-models

Updated Jul 11, 2024

PKU-Alignment / safe-sora

Star

SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models (LVMs).

alignment human-preferences text-to-video-generation large-vision-models

Updated Jul 5, 2024
Python

Rnamrata / image_enhancement_for_social_robots

Star

Image enhancement using CNN and LVM

tensorflow keras jupyter-notebook python3 convolutional-neural-networks large-vision-models

Updated Jun 7, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the large-vision-models topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the large-vision-models topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly