Posted 2026-06-29 | Updated 2026-07-01 | Study | 8-minute read (about 1128 words)
2026-06-29-QWenVL
Qwen-VL Paper Notes: Visual Adapter, Cross-Attention, 2D Position Encoding, and RoPE