Asosiy tarkibga o'tish

5. Talk Rejimi

  1. Talk rejimi — uzluksiz ovozli suhbat aylanasidir:
    1. Nutqni tinglash
    1. Transkriptni modelga yuborish (asosiy sessiya, chat.send)
    1. Javobni kutish
    1. Uni ElevenLabs orqali aytish (oqimli ijro)

11) Xatti-harakat (macOS)

    1. Talk rejimi yoqilgan paytda har doim ko‘rinadigan overlay.
    1. Tinglash → O‘ylash → Gapirish bosqichlari o‘tishlari.
    1. Qisqa pauza (sukut oynasi) bo‘lganda, joriy transkript yuboriladi.
    1. Javoblar WebChat’ga yoziladi (matn terish bilan bir xil).
    1. Nutqda to‘xtatish (standart yoqilgan): foydalanuvchi assistent gapirayotganda gapira boshlasa, ijro to‘xtatiladi va keyingi prompt uchun to‘xtatish vaqti qayd etiladi.

17. Javoblardagi ovoz ko‘rsatmalari

  1. Assistent javobini ovozni boshqarish uchun bitta JSON qatori bilan boshlashi mumkin:
19. { "voice": "<voice-id>", "once": true }
  1. Qoidalar:
  • Qo‘llab-quvvatlanadigan kalitlar:
    1. Noma’lum kalitlar e’tiborsiz qoldiriladi.
    1. once: true faqat joriy javobga qo‘llanadi.
    1. once bo‘lmasa, ovoz Talk rejimi uchun yangi standartga aylanadi.
    1. JSON qatori TTS ijrosidan oldin olib tashlanadi.
  1. Qo‘llab-quvvatlanadigan kalitlar:
  • voice / voice_id / voiceId
  • model / model_id / modelId
  • speed, rate (WPM), stability, similarity, style, speakerBoost
  • seed, normalize, lang, output_format, latency_tier
  • once

32. Sozlamalar (~/.openclaw/openclaw.json)

33. {
  talk: {
    voiceId: "elevenlabs_voice_id",
    modelId: "eleven_v3",
    outputFormat: "mp3_44100_128",
    apiKey: "elevenlabs_api_key",
    interruptOnSpeech: true,
  },
}
  1. Standartlar:
  • interruptOnSpeech: true
    1. voiceId: ELEVENLABS_VOICE_ID / SAG_VOICE_ID ga qaytadi (yoki API kaliti mavjud bo‘lsa, birinchi ElevenLabs ovozi)
    1. modelId: o‘rnatilmagan bo‘lsa, eleven_v3 ga standartlanadi
    1. apiKey: ELEVENLABS_API_KEY ga qaytadi (yoki mavjud bo‘lsa, gateway shell profili)
    1. outputFormat: macOS/iOS’da pcm_44100, Android’da pcm_24000 (MP3 oqimini majburlash uchun mp3_* ni o‘rnating)

40. macOS UI

    1. Menyu paneli tugmasi: Talk
    1. Sozlamalar yorlig‘i: Talk Mode guruhi (voice id + to‘xtatish tumchog‘i)
  • Overlay:
    • 44. Tinglash: mikrofon darajasi bilan bulut pulsatsiyalari
      1. O‘ylash: cho‘kib boruvchi animatsiya
      1. Gapirish: tarqaluvchi halqalar
      1. Bulutni bosish: gapirishni to‘xtatish
      1. X ni bosish: Talk rejimidan chiqish

49. Eslatmalar

    1. Nutq + Mikrofon ruxsatlari talab etiladi.
  • Uses chat.send against session key main.
  • TTS uses ElevenLabs streaming API with ELEVENLABS_API_KEY and incremental playback on macOS/iOS/Android for lower latency.
  • stability for eleven_v3 is validated to 0.0, 0.5, or 1.0; other models accept 0..1.
  • latency_tier is validated to 0..4 when set.
  • Android supports pcm_16000, pcm_22050, pcm_24000, and pcm_44100 output formats for low-latency AudioTrack streaming.