🖥️

System Status

GPU

AMD RX 7900 XTX (gfx1100)

ROCm 6.2.4 Ready

Host

HER - 7950X3D, 96GB RAM

Online

Disk

140GB / 295GB used

50% - Healthy

Status

Core stack installed

Awaiting Character
✅

What Worked

🔧

ROCm 6.2.4

Already installed. Verified gfx1100 target. No action needed.

rocm-smi ✓ | rocminfo ✓
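
The gfx1100 verification can be repeated with a short check. This is a minimal sketch, assuming `rocminfo` is on PATH and prints the gfx target in its agent names; the helper name `gfx_targets` is hypothetical and not part of ROCm.

```shell
#!/usr/bin/env bash
# Hypothetical helper: list the unique gfx targets found in
# rocminfo-style text read from stdin.
gfx_targets() {
    grep -oE 'gfx[0-9a-f]+' | sort -u
}

# Usage on the real host (not run here):
#   rocminfo | gfx_targets | grep -qx 'gfx1100' && echo "gfx1100 present"
```

On a host with an integrated GPU as well, more than one target may be listed, so the usage line matches the exact name rather than assuming a single result.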
🦙

Ollama v0.24.0

Installed via GitHub release asset. Runs natively on GPU.

ollama-linux-amd64-rocm.tar.zst | ~/ai-stack/start-ollama.sh
🧠

Model Stack

Three models pulled and tested. 70B runs with partial CPU offload (expected on 24GB VRAM).

dolphin-llama3:8b (100% GPU) | qwen2.5:14b | dolphin-llama3:70b (46% GPU / 54% CPU)
🎭

SillyTavern

Frontend for character-based AI interaction. npm dependencies resolved (829 packages).

~/ai-stack/SillyTavern | npm install ✓
🎨

ComfyUI

Image generation pipeline. PyTorch ROCm 6.2 confirmed working.

--index-url https://download.pytorch.org/whl/rocm6.2
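
One way to confirm that the installed wheel is the ROCm build rather than a CUDA one is to inspect `torch.version.hip`, which is `None` on CUDA and CPU-only builds. The sketch below assumes `python3` is the interpreter that has torch installed; the helper name `torch_is_rocm` is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed only if the installed torch is a ROCm (HIP) build.
# torch.version.hip is None on CUDA and CPU-only wheels.
torch_is_rocm() {
    python3 -c 'import sys, torch; sys.exit(0 if torch.version.hip else 1)'
}

# Usage on the real host, inside the render group (not run here):
#   torch_is_rocm && echo "ROCm build"
#   sg render -c 'python3 -c "import torch; print(torch.version.hip, torch.cuda.is_available())"'
```

On ROCm builds the GPU is still exposed through the `torch.cuda` namespace, so `torch.cuda.is_available()` is the right availability check here too.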
🎙️

Voice Pipeline

Speech-to-text and text-to-speech ready.

openai-whisper | piper-tts
💾

Memory Layer

Vector database for long-term memory and context retention.

chromadb

🚨 Critical Pattern Discovered

Everything GPU-related must run with the render group.

Without this, ROCm sees the GPU but cannot open /dev/kfd read-write. This applies to Ollama, PyTorch, and ComfyUI.

# Start Ollama with render group
sg render -c "ollama serve"

# Run PyTorch scripts
sg render -c "python script.py"

# Start ComfyUI
sg render -c "HSA_OVERRIDE_GFX_VERSION=11.0.0 python main.py"

The start-ollama.sh script handles this automatically.
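
The contents of start-ollama.sh are not shown here, so the following is only a sketch of how such a launcher could apply the pattern: run the command directly when the render group is already active, and wrap it in `sg render -c` otherwise. The function names `in_render_group` and `run_with_render` are hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; the real start-ollama.sh may work differently.

# Succeed if the current process already has the render group active.
in_render_group() {
    id -nG | tr ' ' '\n' | grep -qx 'render'
}

# Run a command directly if the group is active, otherwise through sg.
# Arguments are joined with spaces for sg, so this suits simple commands
# without embedded quoting.
run_with_render() {
    if in_render_group; then
        "$@"
    else
        sg render -c "$*"
    fi
}

# Usage (not run here):
#   run_with_render ollama serve
```

This assumes the user is already a member of the render group, so that `sg` can switch to it without prompting for a password.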

🔧

What Failed & How It Was Fixed

| Problem | Severity | Cause | Fix |
|---|---|---|---|
| Ollama binary 404 | Blocking | Direct download URL changed | Queried the GitHub releases API and found the correct asset name, ollama-linux-amd64-rocm.tar.zst |
| GPU not detected by Ollama | Blocking | Ollama daemon didn't inherit render group permissions | Started it with sg render -c "ollama serve" so it runs with the proper group |
| nous-hermes3:8b not found | Workaround | Model doesn't exist in Ollama registry | Skipped; dolphin-llama3:8b covers the fast/roleplay use case |
| PyTorch installed CUDA instead of ROCm | Blocking | pip defaulted to CUDA wheel | Uninstalled, then reinstalled with --index-url https://download.pytorch.org/whl/rocm6.2 |
| python3-venv unavailable | Workaround | ensurepip module missing, needs sudo | Used --break-system-packages flag for user-local installs |
| No passwordless sudo | Workaround | Not configured | Worked around with user-local installs (~/.local/bin, ~/ai-stack/) |
| Long downloads "failed" | Visual Bug | Exec session wrapper sends SIGKILL after timeout | Downloads kept running under nohup; only the progress display was lost |
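
The asset lookup from the first row can be sketched as a small filter over the GitHub releases API response. This assumes the response contains `browser_download_url` fields, as GitHub release JSON does; the helper name `rocm_asset_urls` is hypothetical, and a JSON tool such as jq would be more robust if it is installed.

```shell
#!/usr/bin/env bash
# Hypothetical helper: read GitHub release JSON on stdin and print the
# download URLs of assets whose URL mentions rocm.
rocm_asset_urls() {
    grep -oE '"browser_download_url": *"[^"]+"' \
        | sed -E 's/^"browser_download_url": *"//; s/"$//' \
        | grep -i 'rocm'
}

# Usage (not run here):
#   curl -fsSL https://api.github.com/repos/ollama/ollama/releases/latest | rocm_asset_urls
# Resumable background download that outlives the session wrapper:
#   nohup curl -fL -C - -O "<url printed above>" > download.log 2>&1 &
```

Writing the download output to a log file gives a way to check progress later with `tail`, since the session itself cannot show it.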
🎯

Current Status & Next Steps

Ready to Use Now

  • ~/ai-stack/start-ollama.sh - Start the model server
  • cd ~/ai-stack/SillyTavern && ./start.sh - Launch the frontend
  • Connect to http://localhost:11434, select dolphin-llama3:70b
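
Before connecting the frontend, it may help to confirm that the server is answering and that the model is listed. A minimal sketch, assuming Ollama's `/api/tags` endpoint returns a JSON object with a `models` array of entries that carry a `name` field; the helper name `has_model` is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: read the JSON from Ollama's /api/tags on stdin and
# succeed if a model with exactly the given name is listed.
has_model() {
    # Strip whitespace so both compact and pretty-printed JSON match.
    tr -d ' \n' | grep -qF "\"name\":\"$1\""
}

# Usage (not run here):
#   curl -fsS http://localhost:11434/api/tags | has_model 'dolphin-llama3:70b' \
#       && echo "server up, model present"
```

If curl itself fails, the server is not reachable on port 11434, which points back to the launcher rather than to a missing model.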

Next (When You're Ready)

  • Character cards (you said you'd provide)
  • ComfyUI checkpoint downloads (RealVisXL, JuggernautXL, Flux.1)
  • Mem0/Chroma configuration with SillyTavern Summary extension