Image and video analysis via z.ai GLM-4.6V vision model. Use for analyzing screenshots, extracting text/code from images (OCR), diagnosing errors from screenshots, understanding technical diagrams, reading charts/dashboards, comparing UI screenshots, and analyzing videos. Powered by MCP.