More Customization, Accuracy Gains & Insights: Explore VoiceAI’s New Features

Ayushman Dash

We've released a new version of VoiceAI and it’s packed with fresh features!

With generative AI, enhanced multichannel diarization, custom vocabulary, subtitle guidelines, and improved language detection, it's more than an upgrade – it's a whole new experience.

Try VoiceAI for free: your solution for accurate transcription, in-depth analysis, and interactive conversations.

Ask me Anything: Turn Audio into Insights

What happens when generative AI meets VoiceAI? Introducing Ask me Anything. 

Extracting key insights from transcriptions has always been a challenge. "Ask me Anything" changes that. Simply type a question and instantly receive answers. 

No more sifting through data — just direct, instant access to the information you need. Rest assured that when you use Ask me Anything, you can only search and retrieve information from your own audio transcripts. 

Here’s some prompt inspiration to help you get started:

  • Fraud detection: "Did this caller fail multiple attempts to access their account?"
  • Sentiment by topic: "Summarize how this caller felt about their purchase and why."
  • Interview: "What are the candidate's strengths and weaknesses?"
  • Content creation: "Create a description of this podcast episode in fewer than 500 words."

Custom Vocabulary: Tailor Your Model

Unique language and terminology can often go unrecognized or wrongly transcribed by standard Speech-to-Text (STT) systems. Bad transcriptions can disrupt your workflow, leading to distorted speech analytics and faltering CX systems.

With VoiceAI's Custom Vocabulary feature, you can add any word – from product names to specialist industry lingo – and see instant improvements in transcription accuracy. Achieve precise results without the need for complex model training. 

Multichannel Diarization: Clarity in Audio Segmentation

Distinguishing between different speakers in a single audio stream can be challenging, especially if the speakers' voices are similar or if there's background noise. Multichannel diarization solves this by processing each speaker's audio from separate channels. This clear separation allows the system to more accurately identify who said what and when. 

VoiceAI now offers two diarization modes:

Speaker Mode: Identify different voices within one audio channel. 

Channel Mode: Use when each speaker's audio is separated into different channels.
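To illustrate why Channel Mode makes attribution simpler, here is a minimal Python sketch. It assumes each channel has already been transcribed separately into timestamped segments; the `Segment` class, the `merge_channels` function, and the sample data are hypothetical and are not part of VoiceAI's API. Because every segment already belongs to a known channel, labelling speakers reduces to sorting by start time.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds from the beginning of the recording
    end: float
    text: str


def merge_channels(channels: dict[str, list[Segment]]) -> list[tuple[str, Segment]]:
    """Interleave per-channel segments into one speaker-labelled timeline."""
    labelled = [
        (speaker, seg)
        for speaker, segments in channels.items()
        for seg in segments
    ]
    # The channel name doubles as the speaker label, so no voice
    # comparison is needed: ordering by start time is enough.
    return sorted(labelled, key=lambda pair: pair[1].start)


# Hypothetical result of transcribing each channel on its own
channels = {
    "agent": [
        Segment(0.0, 2.1, "Thanks for calling, how can I help?"),
        Segment(5.0, 6.4, "Sure, let me check that."),
    ],
    "caller": [
        Segment(2.3, 4.8, "I have a question about my order."),
    ],
}

timeline = merge_channels(channels)
for speaker, seg in timeline:
    print(f"[{seg.start:05.1f}s] {speaker}: {seg.text}")
```

In Speaker Mode there is no such channel label to rely on, so the system has to tell voices apart within a single stream.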

Advanced Configuration for Diarization

Improve speaker identification accuracy with advanced diarization settings. Choose a preset configuration or adjust the sensitivity slider to fine-tune speaker separation.

High sensitivity detects more speakers; low sensitivity, fewer. Know your speaker count? Input it directly to ensure the most accurate transcription results.

In this release, we also introduce subtitle guidelines, a feature that lets you tailor text length, line count, and duration. Apply your preferences and download an SRT file that reflects your settings, ideal for content creators. We’ve also upgraded our language detection model, improving the accuracy of your transcriptions.

Subtitle Guidelines
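As a rough illustration of what limits on text length, line count, and duration mean for an SRT file, here is a minimal Python sketch. It is not VoiceAI's implementation: the function names and the default limits (42 characters per line, 2 lines per cue, 6 seconds per cue) are assumptions chosen for the example.

```python
import textwrap


def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def to_srt(segments, max_chars=42, max_lines=2, max_duration=6.0):
    """Build SRT text from (start, end, text) tuples.

    The default limits are illustrative assumptions, not VoiceAI settings.
    """
    cues = []
    for start, end, text in segments:
        lines = textwrap.wrap(text, width=max_chars)
        if not lines:
            continue  # skip segments with no text
        # Group wrapped lines into cues of at most max_lines lines each.
        blocks = [lines[i:i + max_lines] for i in range(0, len(lines), max_lines)]
        # Share the segment's time evenly between its cues and cap
        # how long any single cue stays on screen.
        span = (end - start) / len(blocks)
        for i, block in enumerate(blocks):
            cue_start = start + i * span
            cue_end = cue_start + min(span, max_duration)
            cues.append((cue_start, cue_end, "\n".join(block)))

    entries = [
        f"{n}\n{srt_timestamp(s)} --> {srt_timestamp(e)}\n{body}\n"
        for n, (s, e, body) in enumerate(cues, start=1)
    ]
    return "\n".join(entries)


print(to_srt([(0.0, 2.0, "Hello world")]))
```

Tightening `max_chars` or `max_lines` produces more, shorter cues from the same transcript, which is the kind of trade-off these settings let you control.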

Sign up for the VoiceAI platform to try these features for free.

Contact our sales team with any questions about our enterprise pricing and bespoke solutions. We’re here to help.
