Edge TTS Web

A web-based Text-to-Speech converter using Microsoft Edge's TTS service. Converts text files (TXT, FB2, EPUB, ZIP) to audio files.

Example

I like to use 1.0 speed generation then 1.20..1.35 speed in audioplayer

How

The app assigns speakers per sentence, not paragraph. Here's the breakdown:

Text Processing Hierarchy

TextBlockSplitter (C:\projects\EdgeTTS\src\services\TextBlockSplitter.ts):

Speaker Assignment Step

SpeakerAssignmentStep (C:\projects\EdgeTTS\src\services\pipeline\steps\SpeakerAssignmentStep.ts):

Creates assign blocks (8K token limit)
Calls LLM with 0-based numbered sentences: [0] First sentence\n[1] Second sentence\n...
LLM returns speaker code for each sentence
Output: SpeakerAssignment[] with: { sentenceIndex: number; // Global sentence number text: string; // Sentence text speaker: string; // Character code (A-Z, 0-9, a-z) voiceId: string; // Assigned voice }

Speech Detection

Non-dialogue sentences (no quotes/apostrophes) → Narrator voice automatically

Voice Optimization

VoiceRemappingStep then optimizes voice assignments by frequency:

So: Sentence-level assignment, paragraph+block-level grouping for LLM efficiency.

Name		Name	Last commit message	Last commit date
Latest commit History 395 Commits
.github/workflows		.github/workflows
docs		docs
public		public
scripts		scripts
src		src
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
test.config.local.ts.example		test.config.local.ts.example
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts
vitest.real.config.ts		vitest.real.config.ts
webpack.config.js		webpack.config.js