# aria-model-switcher > Switch between Ollama models at runtime - GLM for smart text, Qwen3-VL for vision. - Author: Najia-afk - Repository: Najia-afk/Aria_moltbot - Version: 20260202015217 - Stars: 0 - Forks: 0 - Last Updated: 2026-02-06 - Source: https://github.com/Najia-afk/Aria_moltbot - Web: https://mule.run/skillshub/@@Najia-afk/Aria_moltbot~aria-model-switcher:20260202015217 --- --- name: aria-model-switcher description: Switch between Ollama models at runtime - GLM for smart text, Qwen3-VL for vision. metadata: {"openclaw": {"emoji": "🔄", "requires": {"env": []}, "primaryEnv": "OLLAMA_URL"}} --- # aria-model-switcher Switch between Ollama models at runtime without restarting containers or reconfiguring LiteLLM/OpenClaw. ## Why? - **GLM-4.7-Flash-REAP** - Smarter for complex text reasoning - **Qwen3-VL** - Has vision capabilities for image tasks - Both run locally on Mac Metal GPU via Ollama ## Model Aliases | Alias | Full Model Name | Use Case | |-------|----------------|----------| | `glm` | `hf.co/unsloth/GLM-4.7-Flash-REAP-23B-A3B-GGUF:Q3_K_S` | Default. Smart text reasoning | | `qwen3-vl` | `qwen3-vl:8b` | Vision/image tasks | | `qwen2.5` | `qwen2.5:7b` | Backup text model | ## Usage ```bash # List available models exec python3 /root/.openclaw/workspace/skills/run_skill.py model_switcher list_models '{}' # Switch to GLM for text tasks (default) exec python3 /root/.openclaw/workspace/skills/run_skill.py model_switcher switch_model '{"model": "glm"}' # Switch to Qwen3-VL for vision/image analysis exec python3 /root/.openclaw/workspace/skills/run_skill.py model_switcher switch_model '{"model": "qwen3-vl"}' # Check current model exec python3 /root/.openclaw/workspace/skills/run_skill.py model_switcher get_current_model '{}' # Pull a model if not available exec python3 /root/.openclaw/workspace/skills/run_skill.py model_switcher pull_model '{"model": "qwen3-vl"}' ``` ## How It Works 1. Model preference is stored in `/root/.openclaw/workspace/memory/model_preference.json` 2. `OllamaSkill` reads this file on each request to determine which model to use 3. No container restart needed - changes take effect immediately 4. LiteLLM/OpenClaw are unaware - they just see "Ollama" as the provider ## Recommended Workflow 1. Use **GLM** as default for most tasks (smarter, better reasoning) 2. Switch to **Qwen3-VL** when you need to analyze images 3. Switch back to **GLM** after vision tasks complete ## Troubleshooting ### Model not found ```bash # Pull the model first exec python3 /root/.openclaw/workspace/skills/run_skill.py model_switcher pull_model '{"model": "glm"}' ``` ### Ollama not reachable Check that Ollama is running natively on Mac (not in Docker) for Metal GPU acceleration: ```bash OLLAMA_HOST=0.0.0.0:11434 ollama serve ```