react-native-tts vs @react-native-voice/voice vs react-native-sound
Audio Input, Playback, and Speech Synthesis in React Native
react-native-tts@react-native-voice/voicereact-native-soundSimilar Packages:

Audio Input, Playback, and Speech Synthesis in React Native

@react-native-voice/voice, react-native-sound, and react-native-tts are three distinct libraries that handle different aspects of audio in React Native applications. @react-native-voice/voice focuses on speech-to-text capabilities, allowing apps to capture and transcribe user voice input. react-native-sound is a dedicated audio player for playing sound effects, music, and recorded audio files with control over volume, looping, and mixing. react-native-tts (Text-to-Speech) enables apps to convert text strings into spoken audio output using the device's native speech synthesis engines. Together, they cover the full spectrum of audio interaction: listening, playing, and speaking.

Npm Package Weekly Downloads Trend

3 Years

Github Stars Ranking

Stat Detail

Package
Downloads
Stars
Size
Issues
Publish
License
react-native-tts33,209691162 kB1342 years agoMIT
@react-native-voice/voice02,154608 kB0-MIT
react-native-sound02,90994.4 kB2485 months agoMIT

Audio Input, Playback, and Speech Synthesis in React Native: A Technical Comparison

Building audio-enabled mobile apps requires choosing the right tool for the specific type of audio interaction. @react-native-voice/voice, react-native-sound, and react-native-tts solve three different problems: listening to the user, playing audio files, and speaking text to the user. Let's break down how they work and when to use each.

๐ŸŽค Speech Recognition: Capturing User Voice

@react-native-voice/voice bridges the native speech recognition APIs on iOS and Android.

  • It listens to microphone input and returns transcribed text.
  • You must handle permissions and lifecycle events manually.
// @react-native-voice/voice: Start listening
import Voice from '@react-native-voice/voice';

Voice.start('en-US');

Voice.onSpeechResults = (e) => {
  console.log(e.value); // Transcribed text array
};

react-native-sound does not support speech recognition.

  • It is strictly for playback.
  • Attempting to use it for input will not work.
// react-native-sound: No speech recognition API
// This library cannot capture or transcribe voice input.

react-native-tts does not support speech recognition.

  • It is strictly for output (speaking).
  • It cannot listen to the user.
// react-native-tts: No speech recognition API
// This library cannot capture or transcribe voice input.

๐Ÿ”Š Audio Playback: Playing Files and Effects

@react-native-voice/voice does not support audio playback.

  • It is input-only.
  • You cannot use it to play sounds or music.
// @react-native-voice/voice: No playback API
// This library cannot play audio files.

react-native-sound is built for high-fidelity playback.

  • Supports local assets and remote URLs.
  • Offers controls for volume, rate, looping, and mixing.
// react-native-sound: Play a sound effect
import Sound from 'react-native-sound';

const whoosh = new Sound('whoosh.mp3', Sound.MAIN_BUNDLE, (error) => {
  if (!error) {
    whoosh.play();
  }
});

react-native-tts plays audio, but only generated speech.

  • You cannot play MP3 or WAV files.
  • It converts text strings into audio on the fly.
// react-native-tts: Speak text
import Tts from 'react-native-tts';

Tts.speak('Hello, this is a test.');

๐Ÿ—ฃ๏ธ Text-to-Speech: Reading Content Aloud

@react-native-voice/voice does not support text-to-speech.

  • It is strictly for speech-to-text.
  • It cannot generate voice output.
// @react-native-voice/voice: No TTS API
// This library cannot synthesize speech from text.

react-native-sound does not support text-to-speech.

  • It plays pre-recorded files only.
  • It cannot generate speech from dynamic text strings.
// react-native-sound: No TTS API
// This library cannot synthesize speech from text.

react-native-tts is dedicated to speech synthesis.

  • Allows setting pitch, rate, and language.
  • Uses the device's native engine (Siri/Google).
// react-native-tts: Configure and speak
import Tts from 'react-native-tts';

Tts.setDefaultRate(0.5);
Tts.setDefaultPitch(1.0);
Tts.speak('Welcome to the app.');

โš™๏ธ Configuration and Control

Each library exposes different levels of control based on its purpose.

  • @react-native-voice/voice focuses on recognition events.
    You subscribe to onSpeechStart, onSpeechEnd, and onSpeechResults. It handles the heavy lifting of native audio processing but requires careful permission management in Info.plist and AndroidManifest.xml.
// @react-native-voice/voice: Event subscription
Voice.onSpeechStart = () => console.log('Started');
Voice.onSpeechEnd = () => console.log('Ended');
  • react-native-sound focuses on player state.
    You control play, pause, stop, and release. It manages audio sessions, allowing background playback if configured correctly in native settings.
// react-native-sound: Player control
whoosh.pause();
whoosh.stop();
whoosh.release();
  • react-native-tts focuses on voice settings.
    You configure setDefaultLanguage, setDefaultRate, and setDefaultPitch. It queues speech requests, so rapid calls may result in queued output.
// react-native-tts: Voice settings
Tts.setDefaultLanguage('en-US');
Tts.getAvailableEngines().then(console.log);

๐Ÿ“Š Summary Table

Feature@react-native-voice/voicereact-native-soundreact-native-tts
Primary UseSpeech-to-Text (Input)Audio Playback (Files)Text-to-Speech (Output)
Play MP3/WAVโŒ Noโœ… YesโŒ No
Capture Micโœ… YesโŒ NoโŒ No
Speak TextโŒ NoโŒ Noโœ… Yes
Volume ControlโŒ N/Aโœ… Yesโš ๏ธ System Dependent
LoopingโŒ Noโœ… YesโŒ No

๐Ÿ’ก The Big Picture

These libraries are not competitors โ€” they are companions that handle different parts of the audio stack.

@react-native-voice/voice is your microphone handler. Use it when the user needs to talk to the app. It is essential for voice search, dictation, or accessibility commands. Keep in mind that accuracy depends on the device's OS and internet connection.

react-native-sound is your media player. Use it for sound effects, background music, or playing recorded voice notes. It gives you the most control over audio timing and mixing, making it the go-to for interactive audio experiences.

react-native-tts is your narrator. Use it when the app needs to talk to the user dynamically. It is perfect for reading notifications, teaching pronunciation, or aiding visually impaired users.

Final Thought: In a complex app, you might use all three. A language learning app, for example, could use @react-native-voice/voice to grade pronunciation, react-native-sound to play example dialogues, and react-native-tts to read vocabulary words. Choose based on the direction of the audio flow: Input, Playback, or Synthesis.

How to Choose: react-native-tts vs @react-native-voice/voice vs react-native-sound

  • react-native-tts:

    Choose react-native-tts when your application needs to read text aloud to the user, such as for accessibility features, language learning tools, or notification announcements. It provides a simple interface to the device's built-in speech synthesis engine. If your UX requires auditory feedback from text content without playing pre-recorded files, this package is the direct solution.

  • @react-native-voice/voice:

    Choose @react-native-voice/voice when your app needs to capture user speech and convert it into text, such as for voice commands, dictation features, or accessibility inputs. It is the standard choice for speech recognition on both iOS and Android, but be aware that it relies heavily on native permissions and OS-level speech services. If your core feature involves hands-free text entry or voice control, this is the necessary dependency.

  • react-native-sound:

    Choose react-native-sound if you need to play local or remote audio files with precise control over playback, looping, and volume mixing. It is ideal for games, media players, or apps requiring sound effects that trigger on specific events. Unlike web-based audio tags, this library bridges directly to native audio players, ensuring better performance and background playback support on mobile devices.

README for react-native-tts

React Native TTS

React Native TTS is a text-to-speech library for React Native on iOS, Android and Windows.

Documentation

Install

npm install --save react-native-tts
react-native link react-native-tts

Usage

Imports

import Tts from 'react-native-tts';

Windows

  1. In windows/myapp.sln add the RNTTS project to your solution:

    • Open the solution in Visual Studio 2019
    • Right-click Solution icon in Solution Explorer > Add > Existing Project
    • Select node_modules\react-native-tts\windows\RNTTS\RNTTS.vcxproj
  2. In windows/myapp/myapp.vcxproj add a reference to RNTTS to your main application project. From Visual Studio 2019:

    • Right-click main application project > Add > Reference...
    • Check RNTTS from Solution Projects.
  3. In pch.h add #include "winrt/RNTTS.h".

  4. In app.cpp add PackageProviders().Append(winrt::RNTTS::ReactPackageProvider()); before InitializeComponent();.

Speaking

Add utterance to TTS queue and start speaking. Returns promise with utteranceId.

Tts.speak('Hello, world!');

Additionally, speak() allows to pass platform-specific options.

// IOS
Tts.speak('Hello, world!', {
  iosVoiceId: 'com.apple.ttsbundle.Moira-compact',
  rate: 0.5,
});
// Android
Tts.speak('Hello, world!', {
  androidParams: {
    KEY_PARAM_PAN: -1,
    KEY_PARAM_VOLUME: 0.5,
    KEY_PARAM_STREAM: 'STREAM_MUSIC',
  },
});

For more detail on androidParams properties, please take a look at official android documentation. Please note that there are still unsupported key with this wrapper library such as KEY_PARAM_SESSION_ID. The following are brief summarization of currently implemented keys:

  • KEY_PARAM_PAN ranges from -1 to +1.

  • KEY_PARAM_VOLUME ranges from 0 to 1, where 0 means silence. Note that 1 is a default value for Android.

  • For KEY_PARAM_STREAM property, you can currently use one of STREAM_ALARM, STREAM_DTMF, STREAM_MUSIC, STREAM_NOTIFICATION, STREAM_RING, STREAM_SYSTEM, STREAM_VOICE_CALL,

The supported options for IOS are:

  • iosVoiceId which voice to use, check voices() for available values
  • rate which speech rate this line should be spoken with. Will override default rate if set for this utterance.

Stop speaking and flush the TTS queue.

Tts.stop();

Waiting for initialization

On some platforms it could take some time to initialize TTS engine, and Tts.speak() will fail to speak until the engine is ready.

To wait for successfull initialization you could use getInitStatus() call.

Tts.getInitStatus().then(() => {
  Tts.speak('Hello, world!');
});

Ducking

Enable lowering other applications output level while speaking (also referred to as "ducking").

(not supported on Windows)

Tts.setDucking(true);

List Voices

Returns list of available voices

(not supported on Android API Level < 21, returns empty list)

Tts.voices().then(voices => console.log(voices));

// Prints:
//
// [ { id: 'com.apple.ttsbundle.Moira-compact', name: 'Moira', language: 'en-IE', quality: 300 },
// ...
// { id: 'com.apple.ttsbundle.Samantha-compact', name: 'Samantha', language: 'en-US' } ]
Voice fieldDescription
idUnique voice identifier (e.g. com.apple.ttsbundle.Moira-compact)
nameName of the voice (iOS only)
languageBCP-47 language code (e.g. 'en-US')
qualityVoice quality (300 = normal, 500 = enhanced/very high)
latencyExpected synthesizer latency (100 = very low, 500 = very high) (Android only)
networkConnectionRequiredTrue when the voice requires an active network connection (Android only)
notInstalledTrue when the voice may need to download additional data to be fully functional (Android only)

Set default Language

Tts.setDefaultLanguage('en-IE');

Set default Voice

Sets default voice, pass one of the voiceId as reported by a call to Tts.voices()

(not available on Android API Level < 21)

Tts.setDefaultVoice('com.apple.ttsbundle.Moira-compact');

Set default Speech Rate

Sets default speech rate. The rate parameter is a float where where 0.01 is a slowest rate and 0.99 is the fastest rate.

Tts.setDefaultRate(0.6);

There is a significant difference to how the rate value is interpreted by iOS, Android and Windows native TTS APIs. To provide unified cross-platform behaviour, translation is applied to the rate value. However, if you want to turn off the translation, you can provide optional skipTransform parameter to Tts.setDefaultRate() to pass rate value unmodified.

Do not translate rate parameter:

Tts.setDefaultRate(0.6, true);

Set default Pitch

Sets default pitch. The pitch parameter is a float where where 1.0 is a normal pitch. On iOS min pitch is 0.5 and max pitch is 2.0. On Windows, min pitch is 0.0 and max pitch is 2.0.

Tts.setDefaultPitch(1.5);

Controls the iOS silent switch behavior

Platforms: iOS

  • "inherit" (default) - Use the default behavior
  • "ignore" - Play audio even if the silent switch is set
  • "obey" - Don't play audio if the silent switch is set
Tts.setIgnoreSilentSwitch("ignore");

Events

Subscribe to TTS events

Tts.addEventListener('tts-start', (event) => console.log("start", event));
Tts.addEventListener('tts-progress', (event) => console.log("progress", event));
Tts.addEventListener('tts-finish', (event) => console.log("finish", event));
Tts.addEventListener('tts-cancel', (event) => console.log("cancel", event));

Support for multiple TTS engines

Platforms: Android

Functions to list available TTS engines and set an engine to use.

Tts.engines().then(engines => console.log(engines));
Tts.setDefaultEngine('engineName');

Install (additional) language data

Shows the Android Activity to install additional language/voice data.

Tts.requestInstallData();

Troubleshooting

No text to speech engine installed on Android

On Android, it may happen that the Text-to-Speech engine is not (yet) installed on the phone. When this is the case, Tts.getInitStatus() returns an error with code no_engine. You can use the following code to request the installation of the default Google Text to Speech App. The app will need to be restarted afterwards before the changes take affect.

Tts.getInitStatus().then(() => {
  // ...
}, (err) => {
  if (err.code === 'no_engine') {
    Tts.requestInstallEngine();
  }
});

Example

There is an example project which shows use of react-native-tts on Android/iOS/Windows: https://github.com/themostaza/react-native-tts-example

License

The MIT License (MIT)

Copyright ยฉ 2016 Anton Krasovsky

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the โ€œSoftwareโ€), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED โ€œAS ISโ€, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.