Merge pull request 'TTS: Re-enable voice accessbility if /.tangara-tts/ contains audio samples.' (#247) from tursiae/tangara-fw:tts into main

Reviewed-on: https://codeberg.org/cool-tech-zone/tangara-fw/pulls/247 Reviewed-by: cooljqln <cooljqln@noreply.codeberg.org>
7 months ago · f0116a3c85
parent 3bad60d3a3 ffc62ee5e0
commit f0116a3c85
2 changed files with 54 additions and 1 deletions
--- a/src/tangara/tts/README.md
+++ b/src/tangara/tts/README.md
@ -0,0 +1,49 @@
 # Text-to-speech on Tangara
 The `tangara/tts/` module implements an audio accessibility layer for the
 UI, providing the ability to play back text-to-speech recordings for each
 UI element focused when using Tangara.
 The code is structured in three pieces:
 - `events.hpp`, providing the on-selection-changed and on-TTS-enabled events
  for the UI bindings.
 - `player.cpp`, which supports TTS playback via low-memory audio decoders
  (currently, only WAV files), and
 - `provider.cpp`, which is responsible for finding the TTS sample on the SD
  card for the focused UI element.
 ## End-user Configuration
 Text-to-speech will automatically be enabled if you have loaded TTS phrases
 onto the SD card, under `/.tangara-tts/`. These samples must be formatted
 and named as per the instructions below.
 To disable TTS, rename or delete the `/.tangara-tts/` directory on your SD
 card.
 ## Supported Codecs
 Currently, the TTS library only supports a WAV decoder. Natively, the player
 expects 48 kHz audio, mono or stereo, and will (if required) resample the
 audio to 48kHz for playback.
 ## Creating and enabling TTS Samples
 TTS samples should be stored on your SD card, under `/.tangara-tts/`. The
 `provider` expects that the TTS samples are stored in this directory as WAV
 files, with a `.wav` extension, named as the hexadecimal version of the
 [KOMIHASH](https://github.com/avaneev/komihash)ed TTS string.
 For example, `Settings` hashes to `1e3e816187453bf8`. If you recorded a
 short sample as a 48kHz (mono or stereo) WAV file, and stored it on the SD
 card as `/.tangara-tts/1e3e816187453bf8.wav`, it would be played back when
 the settings icon is highlighted.
 ## Finding the KOMIHASH of UI strings
 If you connect to your Tangara via the serial console, the TTS provider
 logs a `WARN`ing each time it cannot find a TTS sample. You can enable
 these log messages on the console by using the command `loglevel warn`,
 and then manipulating the click wheel to move through the UI to discover
 other missing TTS samples.
--- a/src/tangara/tts/provider.cpp
+++ b/src/tangara/tts/provider.cpp
@ -28,7 +28,11 @@ static const char* kTtsPath = "/.tangara-tts/";
 static auto textToFile(const std::string& text) -> std::optional<std::string> {
  uint64_t hash = komihash(text.data(), text.size(), 0);
  std::stringstream stream;
-  stream << kTtsPath << std::hex << hash;
+  // Assume the TTS sample is a .wav file; since we only support one low-RAM
  // overhead codec, we can presume the suffix. The suffix is needed, else we
  // fail to open the stream when it fails to autodetect the format when looking
  // up tags.
  stream << kTtsPath << std::hex << hash << ".wav";
  return stream.str();
 }