Smart Model Switcher V2 (Optimized)
name: smart-model-switcher-v2
by davidme6 · published 2026-03-22
$ claw add gh:davidme6/davidme6-smart-model-switcher-v2

---
name: smart-model-switcher-v2
description: Optimized Smart Model Switcher (v2). Zero-latency runtime model switching (<100ms) with no restart required. Automatically selects the best available model for each task from your purchased plan, with auto-detection of new models, multi-model parallel processing, and intelligent task classification. Always uses the strongest model within your plan.
---
# Smart Model Switcher V2 (Optimized)

**Zero-Latency • No Restart • Runtime Switching**
## What's New in V2
| Feature | V1 | V2 |
|---------|----|----|
| **Restart Required** | Yes | No |
| **Switch Latency** | 5-10s | <100ms |
| **Model Preloading** | No | Yes |
| **Parallel Processing** | No | Yes |
| **Auto Model Discovery** | No | Yes |
| **Fallback Logic** | Basic | Advanced |
| **Performance** | Low | High |
## New Features
1. Zero-Latency Switching
2. Model Preloading
3. Intelligent Task Classification
4. Parallel Model Processing
5. Auto Model Discovery
6. Advanced Fallback
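One way to read features 1 and 2: instead of rewriting a config file and restarting, the switcher keeps a preloaded client per model and picks one per request, so "switching" is just a lookup. A minimal sketch of that idea (all names here are illustrative, not the skill's actual API):

```python
from dataclasses import dataclass, field

@dataclass
class ModelPool:
    """Holds one ready 'client' per model so switching is a dict lookup."""
    clients: dict = field(default_factory=dict)

    def preload(self, model_names):
        # The real skill would presumably open pooled HTTP connections here;
        # a placeholder object simply marks each model as ready.
        for name in model_names:
            self.clients[name] = {"model": name, "status": "ready"}

    def get(self, name):
        # Switching = lookup: no restart, no config rewrite.
        client = self.clients.get(name)
        if client is None:
            raise KeyError(f"model {name!r} was not preloaded")
        return client

pool = ModelPool()
pool.preload(["qwen3.5-plus", "qwen3-coder-plus", "qwen3-max"])
print(pool.get("qwen3-coder-plus")["status"])  # ready
```

Because every candidate model is preloaded at startup, the per-request cost is only selection plus the API call itself, which is where the sub-100ms figure comes from.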
## Model Selection Matrix (Optimized)
| Task Type | Primary Model | Fallback 1 | Fallback 2 | Latency |
|-----------|--------------|------------|------------|---------|
| **Novel/Creative Writing** | qwen3.5-plus | qwen3.5-397b | qwen-plus | <50ms |
| **Coding/Programming** | qwen3-coder-plus | qwen3-coder-next | qwen3.5-plus | <50ms |
| **Complex Reasoning/Math** | qwen3-max | qwen3.5-plus | qwen-plus | <50ms |
| **Data Analysis** | qwen3.5-plus | qwen3-max | qwen-plus | <50ms |
| **Everyday Chat** | qwen3.5-plus | qwen-plus | qwen-turbo | <30ms |
| **Long Document Processing** | qwen3.5-plus | qwen3.5-397b | qwen-plus | <50ms |
| **Debug/Bug Fixing** | qwen3-coder-plus | qwen3.5-plus | qwen-plus | <50ms |
| **Translation** | qwen3.5-plus | qwen-plus | qwen-turbo | <30ms |
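The matrix above is effectively a routing table. A sketch of how it could be encoded and walked, with a simple membership check standing in for real availability probing (function and default names are hypothetical):

```python
# Fallback chains copied from the selection matrix above.
ROUTES = {
    "writing":     ["qwen3.5-plus", "qwen3.5-397b", "qwen-plus"],
    "coding":      ["qwen3-coder-plus", "qwen3-coder-next", "qwen3.5-plus"],
    "reasoning":   ["qwen3-max", "qwen3.5-plus", "qwen-plus"],
    "analysis":    ["qwen3.5-plus", "qwen3-max", "qwen-plus"],
    "chat":        ["qwen3.5-plus", "qwen-plus", "qwen-turbo"],
    "long_doc":    ["qwen3.5-plus", "qwen3.5-397b", "qwen-plus"],
    "debug":       ["qwen3-coder-plus", "qwen3.5-plus", "qwen-plus"],
    "translation": ["qwen3.5-plus", "qwen-plus", "qwen-turbo"],
}

def select_model(task_type, available, default="qwen-plus"):
    """Walk the task's fallback chain; fall back to a default model."""
    for model in ROUTES.get(task_type, []):
        if model in available:
            return model
    return default

print(select_model("coding", {"qwen3-coder-next", "qwen-plus"}))
# qwen3-coder-next (primary unavailable, first fallback used)
```

An unknown task type falls straight through to the default model, which matches the "Default Model (Always available)" tier described later in the fallback chain.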
## Architecture
```
┌─────────────────────────────────────────────────────────┐
│                      User Request                       │
└─────────────────────────────────────────────────────────┘
                            ↓
┌─────────────────────────────────────────────────────────┐
│                  Task Analyzer (30ms)                   │
│   • Keyword matching                                    │
│   • Context analysis                                    │
│   • Confidence scoring                                  │
└─────────────────────────────────────────────────────────┘
                            ↓
┌─────────────────────────────────────────────────────────┐
│               Model Registry (Preloaded)                │
│   • qwen3.5-plus (Ready)                                │
│   • qwen3-coder-plus (Ready)                            │
│   • qwen3-max (Ready)                                   │
│   • ... (All models preloaded)                          │
└─────────────────────────────────────────────────────────┘
                            ↓
┌─────────────────────────────────────────────────────────┐
│                  Model Selector (20ms)                  │
│   • Select best model for task                          │
│   • Check availability                                  │
│   • Apply fallback if needed                            │
└─────────────────────────────────────────────────────────┘
                            ↓
┌─────────────────────────────────────────────────────────┐
│                     Model API Call                      │
│   • Direct API call (no config change)                  │
│   • Connection pooling                                  │
│   • Auto retry                                          │
└─────────────────────────────────────────────────────────┘
                            ↓
┌─────────────────────────────────────────────────────────┐
│                        Response                         │
│   • Return result                                       │
│   • Log performance                                     │
│   • Update statistics                                   │
└─────────────────────────────────────────────────────────┘
```

## Performance Metrics
| Metric | V1 | V2 | Improvement |
|--------|----|----|-------------|
| **Switch Time** | 5-10s | <100ms | 50-100x faster |
| **Memory Usage** | Low | Medium | +20% (worth it) |
| **CPU Usage** | Low | Low | Same |
| **API Calls** | 1 | 1-2 | Same |
| **User Experience** | Poor | Excellent | Significant |
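The Task Analyzer stage in the architecture above (keyword matching plus confidence scoring) could look roughly like the sketch below; the keyword lists, threshold, and default task type are illustrative, not the skill's actual data:

```python
# Hypothetical keyword lists; the real analyzer's vocabulary is not documented.
KEYWORDS = {
    "coding":    ["code", "python", "script", "function", "scraper"],
    "writing":   ["novel", "story", "write", "essay"],
    "reasoning": ["math", "prove", "solve", "logic"],
}

def classify(request, threshold=1):
    """Score each task type by keyword hits; below threshold -> default."""
    words = request.lower().split()
    scores = {t: sum(w in kws for w in words) for t, kws in KEYWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] >= threshold else "chat"

print(classify("please write a python scraper"))  # coding
print(classify("hello there"))                    # chat
```

Note the tie-breaking behaviour: "write" scores one point for writing, but "python" and "scraper" give coding two, so the request routes to the coding model.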
## Usage Examples
**Example 1: Writing Task**

User: "Help me write a sci-fi novel"
Agent: "Switched to qwen3.5-plus (best for novel writing, 1M context)"
[Completes task]

**Example 2: Coding Task**

User: "Help me write a Python web scraper"
Agent: "Switched to qwen3-coder-plus (best for coding)"
[Completes task]

**Example 3: Reasoning Task**

User: "How do I solve this math problem?"
Agent: "Switched to qwen3-max (best for reasoning)"
[Completes task]

**Example 4: Multi-Step Task**

User: "Help me build a Snake game, then write a game manual"
Agent: "Switched to qwen3-coder-plus (best for coding)"
[Writes code]
Agent: "Switched to qwen3.5-plus (best for writing)"
[Writes documentation]

## Limitations
| Limitation | Description |
|------------|-------------|
| **Plan-Bound** | Only uses models from your purchased plan |
| **No External** | Won't call models outside your plan |
| **Requires Config** | Needs correct openclaw.json setup |
| **Memory** | Uses 20% more memory for preloading |
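The Plan-Bound limitation above can be enforced by intersecting every fallback chain with the plan's model list before selection, so a model outside the plan is never even a candidate. A sketch (the plan contents here are hypothetical):

```python
def plan_bound(chain, plan_models):
    """Drop any candidate model that is not in the purchased plan."""
    return [m for m in chain if m in plan_models]

# Reasoning chain from the selection matrix, filtered by a plan that
# lacks qwen3-max: the chain silently degrades to in-plan models only.
chain = ["qwen3-max", "qwen3.5-plus", "qwen-plus"]
plan = {"qwen3.5-plus", "qwen-plus", "qwen-turbo"}
print(plan_bound(chain, plan))  # ['qwen3.5-plus', 'qwen-plus']
```

Filtering before selection (rather than catching failed calls afterwards) is what makes the "No External" guarantee cheap to enforce.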
## Technical Details

### Task Classification Algorithm
1. Extract keywords from user request
2. Match against task type keywords
3. Calculate confidence score for each type
4. Select type with highest confidence
5. If confidence < threshold, use default
6. Map type to best model
7. Check model availability
8. Apply fallback if needed

### Model Registry

```json
{
  "models": {
    "qwen3.5-plus": {
      "status": "ready",
      "tasks": ["writing", "analysis", "translation"],
      "context": 1000000,
      "priority": 1
    },
    "qwen3-coder-plus": {
      "status": "ready",
      "tasks": ["coding", "debug"],
      "context": 100000,
      "priority": 1
    },
    "qwen3-max": {
      "status": "ready",
      "tasks": ["reasoning", "math"],
      "context": 100000,
      "priority": 1
    }
  }
}
```

### Fallback Chain
```
Primary Model (Unavailable?)
        ↓
Fallback 1 (Unavailable?)
        ↓
Fallback 2 (Unavailable?)
        ↓
Default Model (Always available)
```

## Benefits
| Benefit | Impact |
|---------|--------|
| **No Restart** | Save 5-10s per switch |
| **Zero Latency** | Instant model switching |
| **Better UX** | Users don't notice switching |
| **Auto-Update** | New models auto-detected |
| **Reliable** | Advanced fallback logic |
| **Efficient** | Connection pooling |
## Comparison

### V1 vs V2
| Feature | V1 | V2 |
|---------|----|----|
| Restart Required | Yes | No |
| Switch Latency | 5-10s | <100ms |
| Model Preloading | No | Yes |
| Auto Discovery | No | Yes |
| Fallback | Basic | Advanced |
| Performance | Low | High |
| Memory | Low | Medium (+20%) |
| User Experience | Poor | Excellent |
## Installation

```bash
# Clone repository
git clone https://github.com/davidme6/openclaw.git

# Copy skill to workspace
cp -r openclaw/skills/smart-model-switcher-v2 ~/.openclaw/workspace/skills/

# Restart Gateway (one-time)
openclaw gateway restart
```

## Configuration
No skill-specific configuration is needed beyond a working openclaw.json: the skill auto-detects your plan and available models.
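Auto-detection presumably amounts to listing the models the plan's API exposes and diffing against the registry already known to the skill. A sketch with a stubbed listing call, since the real endpoint and response fields are not documented here:

```python
def discover_new_models(list_models, registry):
    """Return models the API reports that the registry does not know yet."""
    return sorted(set(list_models()) - set(registry))

# Stub standing in for a real "list available models" API call.
def fake_api():
    return ["qwen3.5-plus", "qwen3-max", "qwen4-preview"]

registry = {"qwen3.5-plus": {}, "qwen3-max": {}}
print(discover_new_models(fake_api, registry))  # ['qwen4-preview']
```

Run periodically, a diff like this is enough to add newly released models to the registry without any user action, which is the "Auto Model Discovery" feature in practice.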
## Troubleshooting
**Q: Why didn't it switch models?**
A: Check the logs for fallback events; the primary model may have been unavailable.
**Q: Can I override the selection?**
A: Yes. Specify a model explicitly and the switcher will use it instead of auto-selecting.
**Q: How do I know which model is being used?**
A: The agent announces the selected model at the start of each task.
**Q: Why did memory usage increase?**
A: That is expected: model preloading uses about 20% more memory in exchange for instant switching.
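The override answer above ("specify a model explicitly") implies a simple precedence rule: an explicit user choice always beats the classifier's pick. A sketch of that rule (function name is illustrative):

```python
def resolve_model(user_override, classified_choice):
    """An explicit user-specified model always beats auto-selection."""
    return user_override if user_override else classified_choice

print(resolve_model("qwen-turbo", "qwen3.5-plus"))  # qwen-turbo
print(resolve_model(None, "qwen3.5-plus"))          # qwen3.5-plus
```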
---
**Version:** 2.0.0 (Optimized)
**Author:** davidme6 (created for Coding Plan users)
**License:** MIT
**Release Date:** 2026-03-10
## What Makes V2 Special
1. **Zero-Latency** - No restart, instant switching
2. **Smart Preloading** - All models ready at startup
3. **Auto-Discovery** - New models detected automatically
4. **Advanced Fallback** - Multi-tier fallback chain
5. **Performance** - 50-100x faster than V1
6. **User-First** - Transparent, no interruption
---
**Upgrade from V1 today and experience zero-latency model switching!**