Tag: Qwen
4 articles
AI Research #130: Qwen2.5-Omni Practical Applications
Use cases covered: office assistants, education and training, programming and operations, search-enhanced RAG, device-control/plugin agents, and companion entertainment.
AI Research #129: Qwen2.5-Omni-7B Key Specs - VRAM, Context and Deployment
Runs stably in FP16 with about 14GB of VRAM; INT8/INT4 quantization (down to <4GB) enables deployment on consumer GPUs or edge devices.
AI Research #128: Qwen2.5-Omni Training Pipeline - Three-stage Multi-modal Training
Complete breakdown of the Qwen2.5-Omni training pipeline: the Thinker is based on Qwen2.5, the vision encoder is initialized from Qwen2.5-VL, and the audio encoder from Whisper-large-v3.
AI Research #127: Qwen2.5-Omni Deep Dive - Thinker-Talker Dual-core Architecture
Engineering breakdown of the Qwen2.5-Omni (2024-2025) Thinker-Talker dual-core architecture: a unified Transformer decoder for text/image/video/audio fusion, plus TMRoPE (Time-aligned Multimodal RoPE).