Gleam Lab · Tag Archive

Tag: Qwen

4 articles on this topic, covering tutorials, case studies, engineering practice, and research notes.

AI Research #130: Qwen2.5-Omni Practical Applications

Application scenarios: office assistants, education and training, programming and operations, search-enhanced RAG, device-control and plugin agents, and companionship and entertainment.

11/19/2025

AI Research #129: Qwen2.5-Omni-7B Key Specs - VRAM, Context and Deployment

Runs stably in FP16 with ~14GB of VRAM; INT8/INT4 quantization (down to <4GB) enables deployment on consumer GPUs or edge devices.

11/18/2025

AI Research #128: Qwen2.5-Omni Training Pipeline - Three-stage Multi-modal Training

Complete breakdown of the Qwen2.5-Omni training pipeline: the Thinker is initialized from Qwen2.5, the vision encoder from Qwen2.5-VL, and the audio encoder from Whisper-large-v3.

11/17/2025

AI Research #127: Qwen2.5-Omni Deep Dive - Thinker-Talker Dual-core Architecture

Engineering breakdown of the Thinker-Talker dual-core architecture in Qwen2.5-Omni (2024-2025): a unified Transformer decoder for text/image/video/audio fusion, plus TMRoPE (Time-aligned Multimodal RoPE).

11/16/2025