Yi-VL-6B|34B
by Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
OSS
Yi-VL-6B|34B
Added 1 June 2026
Overview
Yi-VL-6B|34B is an open-source vision-language model from the Yi series by 01-ai. It comes in two sizes (6B and 34B parameters) and is designed for multimodal tasks such as image understanding and visual question answering. The model is available on Hugging Face under a community-driven framework.
Best for
Best for
Developers who need a free, open-source vision-language model for research or production with flexible size options.
Use cases
- Building image captioning or visual question answering applications
- Integrating vision-language capabilities into open-source projects
- Fine-tuning on custom multimodal datasets for specialized tasks
Notes
Yi-VL-6B|34B is an open-source vision-language model from the Yi series by 01-ai. It comes in two sizes (6B and 34B parameters) and is designed for multimodal tasks such as image understanding and visual question answering. The model is available on Hugging Face under a community-driven framework.
Use cases
- Building image captioning or visual question answering applications
- Integrating vision-language capabilities into open-source projects
- Fine-tuning on custom multimodal datasets for specialized tasks
Pros
- Open-source and freely available for modification and deployment
- Two model sizes allow trade-offs between performance and resource usage
- Backed by the Yi model family with strong community support
Cons
- 34B variant requires substantial GPU memory and compute
- May lag behind proprietary models in benchmark accuracy
- Documentation and examples are less extensive than commercial alternatives
Indexed from awesome-llm and enriched against its public facts.
Pros
- Open-source and freely available for modification and deployment
- Two model sizes allow trade-offs between performance and resource usage
- Backed by the Yi model family with strong community support
Cons
- 34B variant requires substantial GPU memory and compute
- May lag behind proprietary models in benchmark accuracy
- Documentation and examples are less extensive than commercial alternatives
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.