AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification.
-
Updated
Jul 16, 2024
AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification.
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust)
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
EVE: Encoder-Free Vision-Language Models from BAAI
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
This is the official implementation (code, data) of the paper "MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?""
MOSSBench: A webpage for an oversensitivity benchmark
Add a description, image, and links to the mllm topic page so that developers can more easily learn about it.
To associate your repository with the mllm topic, visit your repo's landing page and select "manage topics."