mrtmrt PLUS
Verified purchaser

Member since: Nov 2022
Deals bought: 228
4 stars
Posted: Sep 12, 2025

Good - but I'm still hoping for many improvements.

I've already used a few TTS solutions (and yes, I miss SpeechKI). Overall, VoiSpark makes a solid impression, and the operation is simple and intuitive so far.

Voice cloning: Currently, this is certainly one of the most important features in TTS solutions. MiniMax's quality is good. I generally don't have a problem with the fact that the initial setup for a new cloned voice costs 100,000 credits. Cartesia and FishAudio didn't really convince me in my first test; those voices still sounded very unnatural.

Con:
What's a real challenge for me (and probably every other user) is that there's no preview option. There are many setting options, and you really have to test them all to achieve the best possible result. From the respective provider's model selection, to emotions, language boost, speed, volume, pitch, and even the "Text Normalization" function, you use up a lot of credits each time to eventually find the best setting.
Then comes the next challenge: even though the AI models (e.g., MiniMax) are now really good, the pronunciation of certain words can sometimes be off (I use it for German voiceovers). Currently, you can't preview or correct them. Here, too, you have to generate the final voiceover each time (which again costs a lot of credits), only to discover that it can't be used. This can quickly lead to frustration, especially with the lower tiers.

Overall, VoiSpark offers a solid foundation with good models. I sincerely hope that it will soon become significantly more user-friendly (preview and edit functions are a must).

Therefore, my rating fluctuates between 3 and 4 Tacos, but – with hope for future development – I give it 4. However, using the voice cloning function can quickly become frustrating with Tiers 1 and 2.

Founder Team
Bingchen_VoiSpark

Sep 12, 2025

Thanks so much for taking the time to share this detailed review. We really appreciate your feedback!

Regarding the preview feature, one thing worth clarifying is that AI voice generation works a bit like gacha - even with the exact same settings and identical text, the output can vary each time. This isn’t unique to VoiSpark — it’s the same with other AI models too. For example, if you ask ChatGPT the same question three times, you’ll likely get slightly different answers each time.

That’s why a “preview” isn’t truly possible right now — even if you heard a preview, the final generated result could still be different once you hit “Generate.” We’ve checked across all the providers we integrate (MiniMax, Cartesia, FishAudio, etc.), and currently none of them offer a true preview option.

However, if you know of a TTS solution that does have this capability, we’d love to look into it. Please feel free to reach out at contact@voispark.com — we’re always open to learning and improving.

Thanks again for your thoughtful review and for giving us 4 tacos with hope for future improvements. Your input really helps us prioritize what to build next.
