· April 28, 2026

OpenClaw Self-Upgrade Roadmap

From manual YouTuber to AI-orchestrated expert. What capabilities exist now, what a private server unlocks, and the exact sequence to get there.

Deep Research Private Server Security Alert
Contents

Executive Summary

You have 4 OpenClaw instances (Kimi cloud, HER home, HIM school, Mini school Mini PC) but you're using roughly 30% of what OpenClaw can do. The biggest gaps:

A private server (VPS or your home machine running 24/7) unlocks: 24/7 operation, persistent sub-agents, ComfyUI workflows, lower API costs via local LLMs, and full control over model routing.

Part 1: What You Have Now But Aren't Using

1. Sub-Agent Orchestration

What it is: You can spawn isolated child agents to do parallel work — research, coding, writing — then collect results.

Why you need it: Research agent digs into EdTech trends while main agent drafts script. Coding agent builds a website feature while writing agent creates social copy.

Pattern you should use:

Orchestrator (you → Kimi)
├── Research Agent → Gathers EdTech news, papers, tools
├── Script Writer → Drafts YouTube script from research
├── Code Agent (ACP) → Builds demo tools/widgets for classroom
├── Fact-Checker → Verifies claims, checks sources
└── Social Agent → Writes thumbnails, descriptions, tags

Cost optimization: Set agents.defaults.subagents.model to a cheaper model (e.g., kimi/k2 instead of kimi/k2p5) for parallel tasks. Five parallel cheap runs ≈ one expensive sequential run.

2. Cron / Scheduled Tasks

What it is: Exact-time scheduled jobs that run in isolated sessions.

Workflows you should automate:

JobSchedulePurpose
Morning EdTech DigestDaily 07:00Scrape 5 EdTech sources, summarize, post to WeCom
Weekly Content AuditSundays 18:00Review unpublished scripts, prioritize backlog
Social Post Scheduler3x/week 12:00Auto-write and queue LinkedIn/Twitter posts
Git Sync MonitorEvery 2 hoursCheck if all 4 machines have synced; alert if not

3. Task Flow (Durable Multi-Step Orchestration)

What it is: Long-running workflows with state persistence, revision tracking, and recovery. Not just "run this" — "run this 5-step pipeline and survive restarts."

Use case: "Create AI Classroom Guide Episode" — Step 1: Research → Step 2: Draft script → Step 3: Generate demo code → Step 4: Fact-check → Step 5: Produce thumbnail → Step 6: Queue for review. If the gateway restarts, Task Flow resumes from the last completed step.

4. Canvas / Presentations

What it is: Visual canvas you can present, navigate, snapshot. Not just text — actual visual slides.

How you can use it: Generate teaching slides for "AI in the Classroom" presentations. Create visual YouTube thumbnails. Build interactive demos of AI tools for students.

5. Video & Music Generation

What it is: Built-in video_generate and music_generate tools. 15 video providers, multiple music providers.

UseProviders
Music (intro/outro)Google Lyria, MiniMax
Video B-rollGoogle Veo, Runway, Qwen Wan
Animated explanationsComfyUI workflows

6. Browser Automation

What it is: Full browser control — open tabs, click, type, scroll, screenshot, evaluate JS.

How you can use it: Auto-research EdTech tools: visit website, screenshot features, extract pricing. Monitor competitors' channels. Test your own website across devices.

7. ACP Coding Harnesses

What it is: Agent Control Protocol — spawn external coding agents (Claude Code, Codex, Gemini CLI) as sub-agents.

Why this is huge for you: You don't code. But you need tools. ACP agents code FOR you while you sleep. They can build classroom widgets, website features, data visualizations, video processing pipelines.

Pattern: You say "Build me a webpage that demos 5 AI classroom tools with interactive cards" → Kimi spawns ACP → ACP writes HTML/CSS/JS → Kimi reviews, tests, pushes to portfolio repo.

8. TTS (Text-to-Speech)

Generate voiceover drafts for YouTube scripts. Test script flow by listening. Create audio versions of blog posts.

9. Dreaming / Memory Optimization

OpenClaw 2026.4.5+ has experimental memory dreaming — background process that reviews daily notes, promotes important fragments to long-term memory, weights recent vs. old memories. You should enable this so I don't forget important decisions between sessions.

{
  "agents": {
    "defaults": {
      "dreaming": {
        "enabled": true,
        "mode": "core",
        "recallAging": true
      }
    }
  }
}

Part 2: Skills to Install from ClawHub

High-Value Skills for Your Vision

SkillWhat It DoesWhy You Need It
browser-automationAdvanced browser controlDeep research, competitor monitoring
adobe-automatorAutomate Adobe apps via ExtendScriptAutomate Premiere Pro
gan-style-harnessGenerator-Evaluator coding patternAutonomous app building
acp-routerRoute "build me X" to right agentAuto-picks Codex/Claude/etc.
councilMulti-voice decision makingGo/no-go on content decisions
capability-evolverSelf-evolution engineAgent improves itself
seo-auditSEO analysisOptimize 0604.ai
proseMulti-agent prose workflowsLong-form writing
ad-creativeGenerate creative assetsThumbnails, banners

Install command: openclaw skills install <name> or browse clawhub.ai

Part 3: Memory & Dreaming Upgrades

Already covered in Section 1.9 above. Enable dreaming in config so agents remember between sessions. Use Memory Bridge to push your vault into ACP harnesses before they start.

Part 4: Private Server — What Changes

Current Limitations (Cloud/Kimi)

Private Server Options

OptionProsConsCost
VPS (Singapore/Tokyo)24/7, global access, easy scalingMonthly cost, GPU expensive$40-120/mo
HER/Home (7950X3D, 96GB)Already powerful, no monthly costMust leave on 24/7Electricity only
Thailand PCYou own itOld platform, limited GPU~$500-800 upgrade

What Private Server Unlocks

FeatureCloudPrivate
24/7 operation Session resets Persistent
Local LLMs (Llama, Mistral) No GPU Free inference
ComfyUI workflows Not practical Local video/image gen
Persistent subagents Die on reset Background tasks survive
Database storage File-only Postgres, vector DB
Full web access Some blocked No restrictions

Architecture Vision

┌─────────────────────────────────────────────────┐
│              YOUR PRIVATE SERVER                │
│  (HER at home OR VPS Singapore/Tokyo)          │
├─────────────────────────────────────────────────┤
│  OpenClaw Gateway (24/7)                       │
│  ├── Local LLM (Llama 3.3 70B)               │
│  ├── ComfyUI (video/image generation)          │
│  ├── Postgres (structured data)                │
│  ├── Vector DB (semantic search)             │
│  └── FFmpeg (automated video processing)       │
├─────────────────────────────────────────────────┤
│  Agent Teams:                                   │
│  ├── Research Agent (24/7 EdTech monitor)     │
│  ├── Content Agent (scripts, scheduling)       │
│  ├── Code Agent (ACP - builds tools/demos)   │
│  ├── Social Agent (posts, metrics)           │
│  └── Video Agent (thumbnails, B-roll, music)   │
└─────────────────────────────────────────────────┘
         │
         ├── Kimi (cloud) → Orchestrator, strategy
         ├── HER (home) → Heavy compute, video
         ├── HIM (school) → Classroom demos
         └── Mini (school) → Quick queries

Part 5: Security Hardening (CRITICAL)

⚠️ Security Alert — April 2026

Recent audit found 512 vulnerabilities in OpenClaw ecosystem:

What you must do:

  1. Never install random skills — only verified authors
  2. Keep OpenClaw updated: openclaw update
  3. Enable exec approvals for destructive commands
  4. Empty URL allowlists = deny-all
  5. Verify IDENTITY.md is in .gitignore on all machines
  6. Audit installed skills: openclaw skills list

Part 6: Recommended Upgrade Sequence

Phase 1 This Week
  1. Enable heartbeat checks in HEARTBEAT.md
  2. Install browser-automation skill
  3. Install adobe-automator skill
  4. Configure video/music generation providers
  5. Set up first cron job (EdTech morning digest)
  6. Test subagent spawn for parallel research
  7. Enable dreaming in config
Phase 2 Next 2 Weeks
  1. Install gan-style-harness or acp-router
  2. Build first Task Flow (content pipeline)
  3. Test ACP harness (Claude Code or Codex)
  4. Set up canvas presentations for teaching
  5. Create social agent for automated posting
  6. Install seo-audit skill
Phase 3 Next Month
  1. Deploy private server (VPS or HER 24/7)
  2. Install ComfyUI for video/image workflows
  3. Build "AI Classroom Tools" widget library
  4. Create automated video pipeline
  5. Multi-agent team for 24/7 EdTech monitoring

Part 7: What This Means for Your Vision

Before: You manually research, write scripts, edit videos, post social.

After: A team of agents handles execution while you focus on strategy and teaching.

TaskNowAfter Upgrade
Research EdTech trends2 hours/day browsingAgent monitors 24/7, delivers digest
Write scripts4 hours per videoAgent drafts, you refine
Build classroom demosYou don't codeACP agent builds interactive widgets
Generate thumbnailsManual designAI generates 5 options, you pick
Music for videosCopyright riskAI generates royalty-free tracks
Social postingSporadic, manualScheduled, consistent, automated
Video editingLearning PremiereAdobe automator + AI assistance
Memory/contextFragile, session-boundPersistent, searchable, promoted

Your role shifts from "doer" to "director."

The "AI in Classroom" Content Pipeline

  1. Research Agent monitors EdTech → detects new AI tool
  2. Research Agent tests tool → screenshots, features, pricing
  3. Script Writer drafts "How to use X in your classroom"
  4. Code Agent builds interactive demo of the tool
  5. Fact-Checker verifies claims, sources
  6. Video Agent generates thumbnail + B-roll
  7. Music Agent generates intro/outro
  8. Social Agent writes LinkedIn/Twitter thread
  9. YOU review, record voiceover, approve
  10. Agent schedules posts, updates website, logs to memory