Mintplex-Labs/piper-tts-web

Fork 0

mirror of https://github.com/Mintplex-Labs/piper-tts-web.git synced 2026-07-01 20:04:04 -04:00

T

konstantin-paulus 5b558c114b v1.0.0 release

2024-07-06 15:54:25 +02:00

src

v1.0.0 release

2024-07-06 15:54:25 +02:00

.gitignore

v1.0.0 release

2024-07-06 15:54:25 +02:00

index.html

initial commit

2024-07-05 17:54:08 +02:00

package-lock.json

v1.0.0 release

2024-07-06 15:54:25 +02:00

package.json

v1.0.0 release

2024-07-06 15:54:25 +02:00

playwright.config.ts

v1.0.0 release

2024-07-06 15:54:25 +02:00

README.md

v1.0.0 release

2024-07-06 15:54:25 +02:00

tsconfig.json

initial commit

2024-07-05 17:54:08 +02:00

vite.config.ts

v1.0.0 release

2024-07-06 15:54:25 +02:00

README.md

Use VITS models in the browser powered by the ONNX Runtime

A big shout-out goes to Rhasspy Piper, who open-sourced all the currently available models (MIT License) and to @jozefchutka who came up with the wasm build steps.

Usage

First of all, you need to install the library:

npm i --save @diffusionstudio/vits-web

Then you're able to import the library like this (ES only)

import * as tts from '@diffusionstudio/vits-web';

// Hint: onnxruntime-web is a peer dependency

Now you can start synthesizing speech!

const wav = await tts.predict({
  text: "Text to speech in the browser is amazing!",
  voiceId: 'en_US-hfc_female-medium',
});

// available in Web Worker

const audio = new Audio();
audio.src = URL.createObjectURL(wav);
audio.play();

With the initial run of the predict function you will download the model which will then be stored in your Origin private file system. You can also do this manually in advance (recommended), as follows:

await tts.download('en_US-hfc_female-medium', (progress) => {
  console.log(`Downloading ${progress.url} - ${Math.round(progress.loaded * 100 / progress.total)}%`);
});

The predict function also accepts a download progress callback as the second argument (tts.predict(..., console.log)).

If you want to know which models have already been stored, do the following

console.log(await tts.stored());

// will log ['en_US-hfc_female-medium']

You can remove models from opfs by calling

await tts.remove('en_US-hfc_female-medium');

// alternatively delete all

await tts.flush();

And last but not least use this snippet if you would like to retrieve all available voices:

console.log(await tts.voices());

// Hint: the key can be used as voiceId

README.md

Use VITS models in the browser powered by the ONNX Runtime

Usage

That's it! Happy coding :)