Whisper Large-v3 Release

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

The large-v3 model shows improved performance across a wide variety of languages. The plot below includes all languages where Whisper large-v3 achieves an error rate below 60% on Common Voice 15 and Fleurs, and shows a 10% to 20% reduction in errors compared to large-v2:

[Image: plot of per-language error rates for Whisper large-v3 compared to large-v2 on Common Voice 15 and Fleurs]
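To make the "reduction of errors" figure concrete, here is a minimal sketch of how an error rate and a relative reduction between two models could be computed. It assumes a word-level error rate (the post does not say whether word or character error rate is used for each language) and reads the 10% to 20% figure as a relative reduction, which is also an assumption. The function names and the numbers in the usage lines are hypothetical and are not results from the release.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance between hypothesis and reference,
    divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_error_reduction(old_rate: float, new_rate: float) -> float:
    """Fraction by which new_rate improves on old_rate."""
    if old_rate <= 0:
        raise ValueError("old_rate must be positive")
    return (old_rate - new_rate) / old_rate


# Hypothetical transcripts, for illustration only.
wer = word_error_rate("the cat sat on the mat", "the cat sat on mat")

# Hypothetical error rates: going from 0.20 to 0.17 is a 15% relative reduction.
reduction = relative_error_reduction(0.20, 0.17)
```

Under this reading, a language whose error rate drops from 20% to 17% counts as a 15% reduction, even though the absolute change is only 3 percentage points.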

Posted by: Even_Adder
Added: 7 months ago
