Skip to content

在 Hermes 中使用语音模式

本文档的完整中文翻译正在进行中。

概述

使用语音模式。

快速开始

bash
hermes help
hermes config
hermes skills

相关链接

获取帮助

如需帮助,请运行 hermes doctor 或访问 GitHub Issues


原文档内容:

Use Voice Mode with Hermes

This guide is the practical companion to the Voice Mode feature reference.

If the feature page explains what voice mode can do, this guide shows how to actually use it well.

What voice mode is good for

Voice mode is especially useful when:

  • you want a hands-free CLI workflow
  • you want spoken responses in Telegram or Discord
  • you want Hermes sitting in a Discord voice channel for live conversation
  • you want quick idea capture, debugging, or back-and-forth while walking around instead of typing

Choose your voice mode setup

There are really three different voice experiences in Hermes.

ModeBest forPlatform
Interactive microphone loopPersonal hands-free use while coding or researchingCLI
Voice replies in chatSpoken responses alongside normal messagingTelegram, Discord
Live voice channel botGroup or personal live conversation in a VCDiscord voice channels

A good path is:

  1. get text working first
  2. enable voice replies second
  3. move to Discord voice channels last if you want the full experience

Step 1: make sure normal Hermes works first

Before touching voice mode, verify that:

  • Hermes starts
  • your provider is configured
  • the agent can answer text prompts normally
bash
hermes

Ask something simple:

text
What tools do you have available?

If that is not solid yet, fix text mode first.

Step 2: install the right extras

CLI microphone + playback

bash
pip install "hermes-agent[voice]"

Messaging platforms

bash
pip install "hermes-agent[messaging]"

Premium ElevenLabs TTS

bash
pip install "hermes-agent[tts-premium]"

Local NeuTTS (optional)

bash
python -m pip install -U neutts[all]

Everything

bash
pip install "hermes-agent[all]"

Step 3: install system dependencies

macOS

bash
brew install portaudio ffmpeg opus
brew install espeak-ng
``...

*[完整翻译即将推出]*