Verbit has launched a speaker ID feature for its Captivate ASR captioning product.
The update is designed to improve the quality and clarity of automated captions during live broadcasts by identifying not only when the speaker changes but also who is speaking.
“For the first time in automated captioning, viewers will see captions clearly identifying the speaker versus only generic chevrons (>>) traditionally used to indicate speaker transitions,” Verbit says.
The new feature is launching with Verbit’s media customers across news, weather and live sports.
“Live ASR caption viewers deserve the same clarity and context that human captioning has long provided,” said Verbit General Manager Doug Karlovits. “Our new speaker identification solution leverages the most advanced and innovative speaker models — far surpassing traditional ASR outputs — to achieve the highest accuracy for speaker IDs.”
Verbit’s professional Global Prep Team captures voice profiles, or “voice signatures,” from designated speakers, such as anchors, reporters or sportscasters, before a program goes to air. These signatures are labeled, added to Verbit’s trained acoustic and language models and activated during live broadcasts to accurately and clearly tag who’s speaking in real time.
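Verbit has not published how its models match live audio to stored signatures. As a rough illustration of the general enroll-then-match idea, the Python sketch below assumes each voice signature is a numeric embedding vector and picks the closest enrolled speaker by cosine similarity, falling back to generic chevrons when no match is strong enough. The names `SpeakerRegistry`, `enroll`, `identify` and `format_caption`, the 0.8 threshold and the toy vectors are all hypothetical and are not part of any Verbit API.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class SpeakerRegistry:
    """Hypothetical store of labeled voice signatures (assumed to be embeddings)."""

    def __init__(self, threshold=0.8):
        # Assumed cutoff: below this similarity the speaker is treated as unknown.
        self.threshold = threshold
        self.signatures = {}

    def enroll(self, label, embedding):
        """Register a designated speaker before the program goes to air."""
        self.signatures[label] = list(embedding)

    def identify(self, embedding):
        """Return the best-matching label, or None if no match clears the threshold."""
        best_label, best_score = None, -1.0
        for label, signature in self.signatures.items():
            score = cosine_similarity(embedding, signature)
            if score > best_score:
                best_label, best_score = label, score
        return best_label if best_score >= self.threshold else None


def format_caption(registry, embedding, text):
    """Prefix a caption with the speaker's name, or with '>>' when unidentified."""
    label = registry.identify(embedding)
    prefix = f"{label.upper()}:" if label else ">>"
    return f"{prefix} {text}"


# Toy three-dimensional vectors stand in for real speaker embeddings.
registry = SpeakerRegistry(threshold=0.8)
registry.enroll("Anchor A", [0.9, 0.1, 0.0])
registry.enroll("Reporter B", [0.1, 0.9, 0.2])

print(format_caption(registry, [0.88, 0.15, 0.02], "Good evening."))
print(format_caption(registry, [0.0, 0.0, 1.0], "Thanks for having me."))
```

In this sketch the first caption is tagged with the enrolled anchor's name, while the second, from a voice that matches no enrolled signature, falls back to the traditional chevrons. A production system would derive embeddings from a trained speaker model and tune the threshold on real audio.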
“We work with customers to determine which speakers they want to identify,” said Karlovits. “And as with all our services, we offer a range of customization options for speaker IDs and can tailor formatting and styles to specific customer requests and preferences.”
The new feature also enhances customers’ analytics capabilities, enabling broadcasters to track and analyze who said what, which can help with compliance, editorial decisions and future AI-powered workflows.