videochat-withme: Real-Time AI Video Chat Through Your OpenClaw Agent

videochat-withme adds live voice and camera to your OpenClaw agent — Groq Whisper STT, edge-tts output, and full conversational context in one skill.

TL;DR

Real-time AI video chat that routes through your OpenClaw agent. Voice in, voice out, full agent context — like a video call with a knowledgeable assistant.

10-Second Pitch

What it does: Adds live camera + voice to your OpenClaw agent using Groq Whisper (STT) and edge-tts (TTS)
Key win: Your OpenClaw agent sees your camera and hears your voice in real time
Best for: Hands-free AI assistance, accessibility use cases, ambient computing

How It Works

1. STT — Your voice goes to Groq Whisper for fast, accurate speech-to-text 2. Agent — The text is sent to your OpenClaw agent via chatCompletions API 3. TTS — The agent's response is synthesized with edge-tts (Microsoft's cloud TTS) 4. Loop — You hear the response and continue the conversation naturally

Setup

bash

# Install dependencies
pip install groq-whisper edge-tts openai
# Configure OpenClaw agent endpoint
export OPENCLAW_ENDPOINT="http://localhost:18789"
export GROQ_API_KEY="your-groq-key"
# Launch video chat
videochat-withme start --agent your-agent-name

What You Need

OpenClaw agent running and accessible
Groq API key (free tier available at console.groq.com)
Microphone and camera

Pros / Cons

Pros	Cons
Fully hands-free interaction	Requires stable internet for real-time TTS
Agent has full conversation history	Groq Whisper needs explicit audio permission
Supports multiple voice options	Latency depends on Groq response time

Verdict

videochat-withme is what ambient AI computing looks like. Your OpenClaw agent is already a powerful reasoning engine — now you can talk to it like a colleague. Worth installing if you want your agent to be genuinely accessible.