AI Companion Build Log

🖥️

System Status

GPU

AMD RX 7900 XTX (gfx1100)

ROCm 6.2.4 Ready

Host

HER — 7950X3D, 96GB RAM

Online

Disk

140GB / 295GB used

50% — Healthy

Status

Core stack installed

Awaiting Character

✅

What Worked

🔧

ROCm 6.2.4

Already installed. Verified gfx1100 target. No action needed.

rocm-smi ✓ | rocminfo ✓

🦙

Ollama v0.24.0

Installed from the GitHub release asset. Runs natively on the GPU.

ollama-linux-amd64-rocm.tar.zst | ~/ai-stack/start-ollama.sh
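The asset lookup could be sketched roughly as below. This is a minimal sketch, not the exact commands used in this build: the function name `find_asset_url` is hypothetical, and it assumes the GitHub releases API response lists each asset with a `browser_download_url` field.

```shell
#!/usr/bin/env bash
# Hypothetical helper: read GitHub releases API JSON on stdin and print the
# first asset download URL that contains "/<asset name>".
find_asset_url() {
    local asset="$1"
    grep -o '"browser_download_url": *"[^"]*"' \
        | cut -d'"' -f4 \
        | grep -F -- "/${asset}" \
        | head -n 1
}

# Usage (needs network access, so it is left commented out):
# curl -fsSL https://api.github.com/repos/ollama/ollama/releases/latest \
#     | find_asset_url ollama-linux-amd64-rocm.tar.zst
```

Matching on the asset name rather than hard-coding a URL means a renamed or moved download shows up as an empty result instead of a 404.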

🧠

Model Stack

Three models pulled and tested. The 70B model runs with partial CPU offload, which is expected because its weights do not fit in 24GB of VRAM.

dolphin-llama3:8b (100% GPU) | qwen2.5:14b | dolphin-llama3:70b (46% GPU / 54% CPU)
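The 70B split is roughly what back-of-envelope arithmetic predicts. As a sketch, assuming the quantized 70B weights are about 40 GB (an assumed figure, not measured in this build), at most about 60% of them could sit in 24 GB of VRAM. The observed 46% is lower, presumably because the context cache and runtime buffers also need VRAM. The function name `max_gpu_share_pct` is hypothetical.

```shell
# Upper bound on the percentage of model weights that fit in VRAM.
# Arguments: VRAM in GB, model size in GB (integers; 40 is an assumption).
max_gpu_share_pct() {
    local vram_gb="$1" model_gb="$2"
    if [ "$vram_gb" -ge "$model_gb" ]; then
        echo 100
    else
        echo $(( vram_gb * 100 / model_gb ))
    fi
}

max_gpu_share_pct 24 40   # prints 60
```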

🎭

SillyTavern

Frontend for character-based AI interaction. npm dependencies resolved (829 packages).

~/ai-stack/SillyTavern | npm install ✓

🎨

ComfyUI

Image generation pipeline. PyTorch ROCm 6.2 confirmed working.

--index-url https://download.pytorch.org/whl/rocm6.2
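One way to confirm which PyTorch build is installed is to look at the local version suffix of `torch.__version__`, which is `+rocm…` for ROCm wheels and `+cu…` for CUDA wheels. The helper below is a sketch with a hypothetical name (`torch_backend`); the version strings in the comment are examples, not values recorded in this build.

```shell
# Hypothetical helper: classify a PyTorch version string by its local
# version suffix, e.g. "2.5.1+rocm6.2" or "2.5.1+cu124".
torch_backend() {
    case "$1" in
        *+rocm*) echo rocm ;;
        *+cu*)   echo cuda ;;
        *+cpu*)  echo cpu ;;
        *)       echo unknown ;;
    esac
}

# Usage, inside the environment where ComfyUI runs:
# torch_backend "$(python -c 'import torch; print(torch.__version__)')"
```

Anything other than `rocm` here suggests pip picked the wrong wheel and the `--index-url` reinstall is needed.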

🎙️

Voice Pipeline

Speech-to-text (Whisper) and text-to-speech (Piper) installed and ready.

openai-whisper | piper-tts

💾

Memory Layer

Vector database for long-term memory and context retention.

chromadb

🚨 Critical Pattern Discovered

Everything GPU-related must run under the render group.

Without this, ROCm sees the GPU but cannot open /dev/kfd read-write. This applies to Ollama, PyTorch, and ComfyUI.

# Start Ollama with render group
sg render -c "ollama serve"

# Run PyTorch scripts
sg render -c "python script.py"

# Start ComfyUI
sg render -c "HSA_OVERRIDE_GFX_VERSION=11.0.0 python main.py"

The start-ollama.sh script handles this automatically.
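The contents of start-ollama.sh are not shown here, so the following is only a sketch of the pattern, not that script. It checks whether the current process can open a device node read-write and falls back to `sg render` when the render group is not already active. The function names (`can_open_rw`, `has_group`, `with_render`) are hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helpers; names are illustrative, not taken from the build.

# Succeeds if the current process can read and write the given path.
can_open_rw() {
    [ -r "$1" ] && [ -w "$1" ]
}

# Succeeds if the current process already carries the named group.
has_group() {
    id -nG | tr ' ' '\n' | grep -qx -- "$1"
}

# Run a command string with the render group, skipping sg when not needed.
with_render() {
    if has_group render; then
        sh -c "$1"
    else
        sg render -c "$1"
    fi
}

# Usage:
# can_open_rw /dev/kfd || echo "no read-write access to /dev/kfd yet"
# with_render "ollama serve"
```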

🔧

What Failed & How It Was Fixed

Problem	Impact	Cause	Fix
Ollama binary 404	Blocking	Direct download URL changed	Queried the GitHub releases API and found the correct asset name, `ollama-linux-amd64-rocm.tar.zst`
GPU not detected by Ollama	Blocking	Ollama daemon didn't inherit `render` group permissions	Started it with `sg render -c "ollama serve"` so it runs with the proper group
`nous-hermes3:8b` not found	Workaround	Model doesn't exist in Ollama registry	Skipped; `dolphin-llama3:8b` covers the fast/roleplay use case
PyTorch installed CUDA instead of ROCm	Blocking	pip defaulted to CUDA wheel	Uninstalled, then reinstalled with `--index-url https://download.pytorch.org/whl/rocm6.2`
`python3-venv` unavailable	Workaround	`ensurepip` module missing; installing it needs sudo	Used pip's `--break-system-packages` flag for user-local installs
No passwordless sudo	Workaround	Not configured	Worked around with user-local installs (`~/.local/bin`, `~/ai-stack/`)
Long downloads "failed"	Visual Bug	Exec session wrapper sends SIGKILL after timeout	Downloads survived via `nohup`; only the progress output was lost
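The `nohup` workaround for long downloads can be sketched as below. `nohup` only makes a command ignore the hangup signal, so this assumes the timeout takes down the session wrapper rather than the download process itself. The function name `start_detached` and the log path in the usage comment are hypothetical.

```shell
# Hypothetical helper: run a command string in the background, immune to
# SIGHUP, with all output appended to a log file. Prints the child's PID.
start_detached() {
    local log="$1" cmd="$2"
    nohup sh -c "$cmd" >>"$log" 2>&1 </dev/null &
    echo "$!"
}

# Usage (the log path is an example):
# start_detached ~/ai-stack/pull.log "ollama pull dolphin-llama3:70b"
# tail -f ~/ai-stack/pull.log
```

Writing to a log file keeps the progress output available even after the session that started the download is gone.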

🎯

Current Status & Next Steps

Ready to Use Now

~/ai-stack/start-ollama.sh — Start the model server
cd ~/ai-stack/SillyTavern && ./start.sh — Launch the frontend
In SillyTavern, connect to Ollama at http://localhost:11434 and select dolphin-llama3:70b
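Before connecting the frontend, it may help to confirm that the server answers and the model is present. The sketch below relies on Ollama's `/api/tags` endpoint, which lists locally available models as JSON. The function name `model_listed` is hypothetical, and the plain-text match is a shortcut rather than real JSON parsing.

```shell
# Hypothetical helper: succeed if the /api/tags JSON on stdin lists the model.
# Whitespace is stripped first so the match works for compact or spaced JSON.
model_listed() {
    tr -d ' \n\t' | grep -q -F -- "\"name\":\"$1\""
}

# Usage (requires the server started by start-ollama.sh):
# curl -fsS http://localhost:11434/api/tags \
#     | model_listed dolphin-llama3:70b && echo "model available"
```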

Next (When You're Ready)

Character cards (you said you'd provide)
ComfyUI checkpoint downloads (RealVisXL, JuggernautXL, Flux.1)
Mem0/Chroma configuration with SillyTavern Summary extension