OpenClaw Self-Upgrade Research

Contents

Executive Summary
Capabilities You Have Now
Skills to Install
Memory & Dreaming
Private Server
Security Hardening
Upgrade Sequence
What This Means for Your Vision

Executive Summary

You have 4 OpenClaw instances (Kimi cloud, HER home, HIM school, Mini school Mini PC) but you're using roughly 30% of what OpenClaw can do. The biggest gaps:

No sub-agent orchestration — everything happens in single sessions
No automated content workflows — no cron jobs for research, scripting, or social posting
No video/music generation — you have the tools but haven't configured providers
No canvas/presentations — you could generate teaching slides and thumbnails
No ACP coding harnesses — Claude Code/Codex could build tools while you sleep
No dreaming/memory optimization — context gets wiped instead of promoted

A private server (a VPS or your home machine left running) unlocks 24/7 operation, persistent sub-agents, ComfyUI workflows, lower API costs via local LLMs, and full control over model routing.

Part 1: What You Have Now But Aren't Using

1. Sub-Agent Orchestration

What it is: You can spawn isolated child agents to do parallel work — research, coding, writing — then collect results.

Why you need it: A research agent digs into EdTech trends while the main agent drafts the script. A coding agent builds a website feature while a writing agent creates social copy.

Pattern you should use:

Orchestrator (you → Kimi)
├── Research Agent → Gathers EdTech news, papers, tools
├── Script Writer → Drafts YouTube script from research
├── Code Agent (ACP) → Builds demo tools/widgets for classroom
├── Fact-Checker → Verifies claims, checks sources
└── Social Agent → Writes thumbnails, descriptions, tags

Cost optimization: Set agents.defaults.subagents.model to a cheaper model (e.g., kimi/k2 instead of kimi/k2p5) for parallel tasks. Five parallel runs on the cheaper model can cost about the same as one sequential run on the expensive one.
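The orchestrator pattern above is a fan-out/fan-in shape: independent jobs start together, and dependent jobs wait for their inputs. Below is a minimal Python sketch of that shape. It does not use any OpenClaw API; `run_agent` is a hypothetical stub that only simulates work, and the role names and task strings are illustrative.

```python
import asyncio

# Hypothetical stand-in for spawning a sub-agent. A real setup would call
# whatever spawn mechanism the OpenClaw install exposes; this stub only
# simulates work so the fan-out/fan-in shape is visible.
async def run_agent(role: str, task: str) -> dict:
    await asyncio.sleep(0.01)  # pretend the agent is working
    return {"role": role, "task": task, "result": f"{role} finished: {task}"}

async def orchestrate(topic: str) -> dict:
    # Independent jobs start together instead of one after another.
    jobs = {
        "research": f"Gather EdTech news on {topic}",
        "code": f"Build a demo widget for {topic}",
        "social": f"Draft descriptions and tags for {topic}",
    }
    results = await asyncio.gather(
        *(run_agent(role, task) for role, task in jobs.items())
    )
    collected = {r["role"]: r["result"] for r in results}
    # The script depends on the research output, so it runs after the fan-in.
    script = await run_agent("script", f"Draft script using: {collected['research']}")
    collected["script"] = script["result"]
    return collected

if __name__ == "__main__":
    for role, result in asyncio.run(orchestrate("AI tutors")).items():
        print(f"{role}: {result}")
```

The three independent jobs take roughly as long as the slowest one rather than the sum of all three, which is where the parallel pattern saves time.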

2. Cron / Scheduled Tasks

What it is: Exact-time scheduled jobs that run in isolated sessions.

Workflows you should automate:

Job	Schedule	Purpose
Morning EdTech Digest	Daily 07:00	Scrape 5 EdTech sources, summarize, post to WeCom
Weekly Content Audit	Sundays 18:00	Review unpublished scripts, prioritize backlog
Social Post Scheduler	3x/week 12:00	Auto-write and queue LinkedIn/Twitter posts
Git Sync Monitor	Every 2 hours	Check if all 4 machines have synced; alert if not
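The schedules in the table map onto standard five-field cron expressions. The Python sketch below writes them out and checks which jobs are due at a given minute. It is an illustration, not OpenClaw's scheduler: the Monday/Wednesday/Friday days for the "3x/week" job are an assumption (the table names no days), and the matcher handles only the small subset of cron syntax used here.

```python
from datetime import datetime

# Cron fields: minute hour day-of-month month day-of-week.
# Mon/Wed/Fri for "3x/week" is an assumption; the table names no days.
JOBS = {
    "Morning EdTech Digest": "0 7 * * *",
    "Weekly Content Audit": "0 18 * * 0",      # 0 = Sunday
    "Social Post Scheduler": "0 12 * * 1,3,5",
    "Git Sync Monitor": "0 */2 * * *",
}

def field_matches(field: str, value: int) -> bool:
    """Match one cron field; supports '*', '*/n', and comma-separated numbers.

    The '*/n' handling is only correct for zero-based fields (minute, hour),
    which is all this sketch needs.
    """
    if field == "*":
        return True
    if field.startswith("*/"):
        return value % int(field[2:]) == 0
    return value in {int(part) for part in field.split(",")}

def is_due(expr: str, when: datetime) -> bool:
    """Return True if `when` falls on a minute selected by `expr`."""
    minute, hour, dom, month, dow = expr.split()
    cron_dow = (when.weekday() + 1) % 7  # Python: Monday=0; cron: Sunday=0
    return (
        field_matches(minute, when.minute)
        and field_matches(hour, when.hour)
        and field_matches(dom, when.day)
        and field_matches(month, when.month)
        and field_matches(dow, cron_dow)
    )

def due_jobs(when: datetime) -> list[str]:
    """List the job names whose schedule selects the given minute."""
    return [name for name, expr in JOBS.items() if is_due(expr, when)]
```

For example, `due_jobs(datetime(2026, 4, 5, 18, 0))` checks a Sunday at 18:00, when both the weekly audit and the two-hourly sync check fall due.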

3. Task Flow (Durable Multi-Step Orchestration)

What it is: Long-running workflows with state persistence, revision tracking, and recovery. Not just "run this" but "run this multi-step pipeline and survive restarts."

Use case: "Create AI Classroom Guide Episode" — Step 1: Research → Step 2: Draft script → Step 3: Generate demo code → Step 4: Fact-check → Step 5: Produce thumbnail → Step 6: Queue for review. If the gateway restarts, Task Flow resumes from the last completed step.
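The resume-after-restart behaviour can be pictured as a checkpoint written after every completed step. The Python sketch below shows that idea under stated assumptions: the step names are taken from the episode example, while the JSON state file, `load_done`, and `run_flow` are hypothetical names for illustration and not how Task Flow is actually implemented.

```python
import json
from pathlib import Path
from typing import Callable

# Step names follow the episode example above; the handlers are supplied by the caller.
STEPS = ["research", "draft_script", "generate_demo_code",
         "fact_check", "produce_thumbnail", "queue_for_review"]

def load_done(state_file: Path) -> list[str]:
    """Read the list of completed steps, or start fresh if no checkpoint exists."""
    if state_file.exists():
        return json.loads(state_file.read_text())["done"]
    return []

def run_flow(state_file: Path, handlers: dict[str, Callable[[], None]]) -> list[str]:
    """Run the remaining steps in order, checkpointing after each one.

    Returns the steps executed in this call. If a handler raises, the
    checkpoint still records everything that finished before it.
    """
    done = load_done(state_file)
    executed = []
    for step in STEPS:
        if step in done:
            continue  # finished in an earlier run; skipped after a restart
        handlers[step]()
        done.append(step)
        state_file.write_text(json.dumps({"done": done}))
        executed.append(step)
    return executed
```

If the process dies during the fact-check, the next call to `run_flow` with the same state file starts at the fact-check instead of repeating research, drafting, and code generation.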

4. Canvas / Presentations

What it is: A visual canvas you can present, navigate, and snapshot. Not just text but actual visual slides.

How you can use it: Generate teaching slides for "AI in the Classroom" presentations. Create visual YouTube thumbnails. Build interactive demos of AI tools for students.

5. Video & Music Generation

What it is: Built-in video_generate and music_generate tools. 15 video providers, multiple music providers.

Use	Providers
Music (intro/outro)	Google Lyria, MiniMax
Video B-roll	Google Veo, Runway, Qwen Wan
Animated explanations	ComfyUI workflows

6. Browser Automation

What it is: Full browser control — open tabs, click, type, scroll, screenshot, evaluate JS.

How you can use it: Auto-research EdTech tools: visit website, screenshot features, extract pricing. Monitor competitors' channels. Test your own website across devices.

7. ACP Coding Harnesses

What it is: Agent Client Protocol (ACP), which lets you spawn external coding agents (Claude Code, Codex, Gemini CLI) as sub-agents.

Why this is huge for you: You don't code. But you need tools. ACP agents code FOR you while you sleep. They can build classroom widgets, website features, data visualizations, video processing pipelines.

Pattern: You say "Build me a webpage that demos 5 AI classroom tools with interactive cards" → Kimi spawns ACP → ACP writes HTML/CSS/JS → Kimi reviews, tests, pushes to portfolio repo.

8. TTS (Text-to-Speech)

Generate voiceover drafts for YouTube scripts. Test script flow by listening. Create audio versions of blog posts.

9. Dreaming / Memory Optimization

OpenClaw 2026.4.5+ has experimental memory dreaming: a background process that reviews daily notes, promotes important fragments to long-term memory, and weights recent memories against old ones. You should enable this so I don't forget important decisions between sessions.

{
  "agents": {
    "defaults": {
      "dreaming": {
        "enabled": true,
        "mode": "core",
        "recallAging": true
      }
    }
  }
}
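One way to picture "weights recent vs. old memories" is exponential decay: a memory's weight halves after a fixed number of days, and ranking multiplies that weight by importance. The Python sketch below is only an illustration of that idea. The 30-day half-life, the `importance` and `age_days` fields, and both function names are assumptions, and nothing here is taken from OpenClaw's actual dreaming implementation.

```python
def recall_weight(age_days: float, half_life_days: float = 30.0) -> float:
    """Exponential decay: the weight halves every `half_life_days`."""
    return 0.5 ** (age_days / half_life_days)

def rank_memories(memories: list[dict], half_life_days: float = 30.0) -> list[dict]:
    """Sort memories by importance times recency weight, highest first."""
    return sorted(
        memories,
        key=lambda m: m["importance"] * recall_weight(m["age_days"], half_life_days),
        reverse=True,
    )

if __name__ == "__main__":
    notes = [
        {"id": "old-decision", "importance": 0.9, "age_days": 90},
        {"id": "todays-note", "importance": 0.3, "age_days": 0},
        {"id": "last-month", "importance": 0.8, "age_days": 30},
    ]
    for note in rank_memories(notes):
        print(note["id"])
```

Under these assumed numbers, a very important decision from three months ago ranks below a moderately important note from last month, which is the trade-off an aging setting controls.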

Part 2: Skills to Install from ClawHub

High-Value Skills for Your Vision

Skill	What It Does	Why You Need It
`browser-automation`	Advanced browser control	Deep research, competitor monitoring
`adobe-automator`	Automate Adobe apps via ExtendScript	Automate Premiere Pro
`gan-style-harness`	Generator-Evaluator coding pattern	Autonomous app building
`acp-router`	Route "build me X" to right agent	Auto-picks Codex/Claude/etc.
`council`	Multi-voice decision making	Go/no-go on content decisions
`capability-evolver`	Self-evolution engine	Agent improves itself
`seo-audit`	SEO analysis	Optimize 0604.ai
`prose`	Multi-agent prose workflows	Long-form writing
`ad-creative`	Generate creative assets	Thumbnails, banners

Install command: openclaw skills install <name> or browse clawhub.ai

Part 3: Memory & Dreaming Upgrades

Already covered in Part 1, item 9 above. Enable dreaming in config so agents remember between sessions. Use Memory Bridge to push your vault into ACP harnesses before they start.

Part 4: Private Server — What Changes

Current Limitations (Cloud/Kimi)

Session resets kill context
No persistent background agents
Pay per token, no batch discount
No GPU access for local models
Gateway restarts wipe subagent state

Private Server Options

Option	Pros	Cons	Cost
VPS (Singapore/Tokyo)	24/7, global access, easy scaling	Monthly cost, GPU expensive	$40-120/mo
HER/Home (7950X3D, 96GB)	Already powerful, no monthly cost	Must leave on 24/7	Electricity only
Thailand PC	You own it	Old platform, limited GPU	~$500-800 upgrade

What Private Server Unlocks

Feature	Cloud	Private
24/7 operation	✗ Session resets	✓ Persistent
Local LLMs (Llama, Mistral)	✗ No GPU	✓ No per-token cost
ComfyUI workflows	✗ Not practical	✓ Local video/image gen
Persistent subagents	✗ Die on reset	✓ Background tasks survive
Database storage	✗ File-only	✓ Postgres, vector DB
Full web access	✗ Some blocked	✓ No restrictions

Architecture Vision

┌─────────────────────────────────────────────────┐
│              YOUR PRIVATE SERVER                │
│  (HER at home OR VPS Singapore/Tokyo)          │
├─────────────────────────────────────────────────┤
│  OpenClaw Gateway (24/7)                       │
│  ├── Local LLM (Llama 3.3 70B)               │
│  ├── ComfyUI (video/image generation)          │
│  ├── Postgres (structured data)                │
│  ├── Vector DB (semantic search)             │
│  └── FFmpeg (automated video processing)       │
├─────────────────────────────────────────────────┤
│  Agent Teams:                                   │
│  ├── Research Agent (24/7 EdTech monitor)     │
│  ├── Content Agent (scripts, scheduling)       │
│  ├── Code Agent (ACP - builds tools/demos)   │
│  ├── Social Agent (posts, metrics)           │
│  └── Video Agent (thumbnails, B-roll, music)   │
└─────────────────────────────────────────────────┘
         │
         ├── Kimi (cloud) → Orchestrator, strategy
         ├── HER (home) → Heavy compute, video
         ├── HIM (school) → Classroom demos
         └── Mini (school) → Quick queries
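The division of labour across the four machines can be expressed as a small routing table. The Python sketch below is one possible reading of the diagram: the machine roles come from it, while the task-type labels, the `route` function, and the rule of falling back to Kimi when the preferred machine is offline are assumptions made for illustration.

```python
# Machine roles come from the diagram above. The task-type labels and the
# "fall back to Kimi" rule are assumptions made for this sketch.
ROUTES = {
    "strategy": "Kimi",
    "heavy_compute": "HER",
    "video": "HER",
    "classroom_demo": "HIM",
    "quick_query": "Mini",
}
FALLBACK = "Kimi"

def route(task_type: str, online: set[str]) -> str:
    """Return the machine that should take a task, given which machines are up."""
    preferred = ROUTES.get(task_type, FALLBACK)
    if preferred in online:
        return preferred
    if FALLBACK in online:
        return FALLBACK
    raise RuntimeError(f"no machine available for {task_type!r}")

if __name__ == "__main__":
    print(route("video", {"Kimi", "HER", "HIM", "Mini"}))
    print(route("video", {"Kimi", "Mini"}))
```

Keeping the table in one place makes it easy to change which machine owns a job once the private server is running.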

Part 5: Security Hardening (CRITICAL)

⚠️ Security Alert — April 2026

A recent audit found 512 vulnerabilities in the OpenClaw ecosystem:

CVE-2026-25253: 1-click RCE via a crafted link that leaks the gateway auth token
WebSocket shared-auth escalation: CVSS 9.9
39 skills distributing macOS infostealer

What you must do:

Never install random skills — only verified authors
Keep OpenClaw updated: openclaw update
Enable exec approvals for destructive commands
Treat an empty URL allowlist as deny-all
Verify IDENTITY.md is in .gitignore on all machines
Audit installed skills: openclaw skills list
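Two items in the checklist above are easy to get backwards, so here is a small Python sketch of the intended behaviour: an empty allowlist denies every URL, and IDENTITY.md should appear in .gitignore. Both functions are hypothetical helpers for illustration, not OpenClaw code. Allowing subdomains of a listed host is a design assumption, and the .gitignore check is a naive exact-line match that ignores glob patterns.

```python
from pathlib import Path
from urllib.parse import urlparse

def is_url_allowed(url: str, allowlist: list[str]) -> bool:
    """Allow only listed hosts and their subdomains; an empty list denies everything."""
    if not allowlist:
        return False  # deny-all: an empty allowlist must never mean "allow any"
    host = urlparse(url).hostname or ""
    return any(host == allowed or host.endswith("." + allowed) for allowed in allowlist)

def gitignore_covers(repo: Path, filename: str = "IDENTITY.md") -> bool:
    """Naive check: does .gitignore contain a line naming the file exactly?"""
    gitignore = repo / ".gitignore"
    if not gitignore.exists():
        return False
    lines = [line.strip() for line in gitignore.read_text().splitlines()]
    return filename in lines or "/" + filename in lines
```

The suffix test uses a leading dot so that a look-alike host such as evil-clawhub.ai does not pass as clawhub.ai.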

Part 6: Recommended Upgrade Sequence

Phase 1: This Week

Enable heartbeat checks in HEARTBEAT.md
Install browser-automation skill
Install adobe-automator skill
Configure video/music generation providers
Set up first cron job (EdTech morning digest)
Test subagent spawn for parallel research
Enable dreaming in config

Phase 2: Next 2 Weeks

Install gan-style-harness or acp-router
Build first Task Flow (content pipeline)
Test ACP harness (Claude Code or Codex)
Set up canvas presentations for teaching
Create social agent for automated posting
Install seo-audit skill

Phase 3: Next Month

Deploy private server (VPS or HER 24/7)
Install ComfyUI for video/image workflows
Build "AI Classroom Tools" widget library
Create automated video pipeline
Stand up a multi-agent team for 24/7 EdTech monitoring

Part 7: What This Means for Your Vision

Before: You manually research, write scripts, edit videos, post social.

After: A team of agents handles execution while you focus on strategy and teaching.

Task	Now	After Upgrade
Research EdTech trends	2 hours/day browsing	Agent monitors 24/7, delivers digest
Write scripts	4 hours per video	Agent drafts, you refine
Build classroom demos	You don't code	ACP agent builds interactive widgets
Generate thumbnails	Manual design	AI generates 5 options, you pick
Music for videos	Copyright risk	AI generates royalty-free tracks
Social posting	Sporadic, manual	Scheduled, consistent, automated
Video editing	Learning Premiere	Adobe automator + AI assistance
Memory/context	Fragile, session-bound	Persistent, searchable, promoted

Your role shifts from "doer" to "director."

The "AI in Classroom" Content Pipeline

Research Agent monitors EdTech → detects new AI tool
Research Agent tests tool → screenshots, features, pricing
Script Writer drafts "How to use X in your classroom"
Code Agent builds interactive demo of the tool
Fact-Checker verifies claims, sources
Video Agent generates thumbnail + B-roll
Music Agent generates intro/outro
Social Agent writes LinkedIn/Twitter thread
YOU review, record voiceover, approve
Agent schedules posts, updates website, logs to memory
