Please add an option to skip unnecessary punctuation marks such as ~~~ and *** during text-to-speech playback, as they can be distracting and irritating while reading novels.
Best posts made by wkhn66
-
Add skip or change wordposted in Feedback
-
RE: Add skip or change wordposted in Feedback
@river I am currently reading this novel on the following website:
https://novelfull.com/martial-god-asura/chapter-4159-enemies-on-a-narrow-road.htmlThe only feature missing is an auto-scroll option. Other than that and the two features I mentioned earlier, I am completely satisfied with the TTS reader.
-
RE: Add skip or change wordposted in Feedback
@river Hello,
I would like to suggest a feature that could improve the realism and expressiveness of TTS-based narration in this application.
Currently, most text-to-speech systems read dialogue exactly as written, relying mainly on punctuation for tone. This often results in speech that sounds flat, especially when reading novels or dialogue-heavy content where the writer intends specific emotions such as anger, sadness, affection, or sarcasm.
To improve this, I suggest introducing a simple annotation-based dialogue instruction system that allows writers or users to specify the intended emotion or delivery style directly in the text. The TTS engine could parse these annotations and adjust voice parameters such as pitch, speed, pauses, and emphasis accordingly.
Proposed Method
The idea is to use lightweight tags placed before dialogue lines to indicate emotional intent.Basic Format:
[emotion:intensity|style] Dialogue textWhere:
- emotion = emotional tone of the line
- intensity = level from 1–5
- style (optional) = delivery modifier such as slow, whisper, pause, etc.
Examples:
- Angry dialogue
[anger:3] Get out of my sight!
Meaning: moderate anger with stronger emphasis.
- Soft or calm speech
[soft:1] It's alright. Everything will be fine.
Meaning: gentle tone with relaxed delivery.
- Romantic or affectionate dialogue
[love:2] I missed you.
Meaning: warm and soft voice tone.
- Fearful dialogue with trembling
[fear:4|tremble] Did you hear that?
Meaning: high fear intensity with a shaking voice effect.
- Sad dialogue with slower pacing
[sad:3|slow] I thought you would stay.
Meaning: medium sadness with slower speech.
Optional Pause Markers
Additional inline markers could control pacing inside a sentence.{p} = short pause
{pp} = medium pause
{ppp} = long pauseExample:
[shock:3] What… {p} what happened here?How It Could Work Internally
-
The TTS reader scans text for annotation tags.
-
Tags are parsed and removed from the spoken text.
-
Each tag is mapped to voice parameters such as:
- pitch adjustment
- speaking rate
- pause duration
- emphasis
-
The processed dialogue is then passed to the TTS engine with these parameters applied.
Benefits
- More natural audiobook narration
- Better dialogue acting in novels
- Greater emotional realism
- Minimal complexity for users
This method would allow AI voices to deliver dialogue more like a human narrator or actor rather than simply reading text line by line.
Thank you for considering this suggestion.
Latest posts made by wkhn66
-
RE: Add skip or change wordposted in Feedback
@river Hello,
I would like to suggest a feature that could improve the realism and expressiveness of TTS-based narration in this application.
Currently, most text-to-speech systems read dialogue exactly as written, relying mainly on punctuation for tone. This often results in speech that sounds flat, especially when reading novels or dialogue-heavy content where the writer intends specific emotions such as anger, sadness, affection, or sarcasm.
To improve this, I suggest introducing a simple annotation-based dialogue instruction system that allows writers or users to specify the intended emotion or delivery style directly in the text. The TTS engine could parse these annotations and adjust voice parameters such as pitch, speed, pauses, and emphasis accordingly.
Proposed Method
The idea is to use lightweight tags placed before dialogue lines to indicate emotional intent.Basic Format:
[emotion:intensity|style] Dialogue textWhere:
- emotion = emotional tone of the line
- intensity = level from 1–5
- style (optional) = delivery modifier such as slow, whisper, pause, etc.
Examples:
- Angry dialogue
[anger:3] Get out of my sight!
Meaning: moderate anger with stronger emphasis.
- Soft or calm speech
[soft:1] It's alright. Everything will be fine.
Meaning: gentle tone with relaxed delivery.
- Romantic or affectionate dialogue
[love:2] I missed you.
Meaning: warm and soft voice tone.
- Fearful dialogue with trembling
[fear:4|tremble] Did you hear that?
Meaning: high fear intensity with a shaking voice effect.
- Sad dialogue with slower pacing
[sad:3|slow] I thought you would stay.
Meaning: medium sadness with slower speech.
Optional Pause Markers
Additional inline markers could control pacing inside a sentence.{p} = short pause
{pp} = medium pause
{ppp} = long pauseExample:
[shock:3] What… {p} what happened here?How It Could Work Internally
-
The TTS reader scans text for annotation tags.
-
Tags are parsed and removed from the spoken text.
-
Each tag is mapped to voice parameters such as:
- pitch adjustment
- speaking rate
- pause duration
- emphasis
-
The processed dialogue is then passed to the TTS engine with these parameters applied.
Benefits
- More natural audiobook narration
- Better dialogue acting in novels
- Greater emotional realism
- Minimal complexity for users
This method would allow AI voices to deliver dialogue more like a human narrator or actor rather than simply reading text line by line.
Thank you for considering this suggestion.
-
RE: Add skip or change wordposted in Feedback
@river I am currently reading this novel on the following website:
https://novelfull.com/martial-god-asura/chapter-4159-enemies-on-a-narrow-road.htmlThe only feature missing is an auto-scroll option. Other than that and the two features I mentioned earlier, I am completely satisfied with the TTS reader.
-
Add skip or change wordposted in Feedback
Please add an option to skip unnecessary punctuation marks such as ~~~ and *** during text-to-speech playback, as they can be distracting and irritating while reading novels.