Announcing Voice Annotation Tool, a utility to annotate voice samples

Feliks · May 30, 2022, 6:53am

Dear members of the CommonVoice community,

I am happy to announce a tool to annotate a collection of short audio recordings and save them in the format of the CommonVoice project. Small existing datasets can also be imported and modified. The tool is written in Python, uses the QT framework and is available on Linux and Windows.

Installation:

pip install voice-annotation-tool
voice-annotation-tool

To get started, select a folder containing samples and optionally a TSV file with existing annotations. You can then begin labeling your audio files.

Features:

Import and export CommonVoice TSV files
Play back the audio of the samples
View and edit metadata, including age, gender, accent and user-id
Support for multiple audio formats (based on QTMultimedia)
Configurable shortcuts
Export and import to and from TSV, JSON or CSV files
Multiselect: Modify multiple samples at once
Import profile exported from the CommonVoice website
Modified samples are highlighted

Please note that opening a complete CommonVoice dataset is possible but may be very slow.

For more information, see the documentation, which is hosted on ReadTheDocs.

The source code is available on Github.

We are happy about any feedback. If you spot any issues, please open a ticket.

Screenshot:

Topic		Replies	Views
Common Voice Toolbox: Updated with CV v22.0 data Common Voice feedback , tooling	16	3081	June 26, 2025
I've created a fully annotated version of Common Voice 7.0 Common Voice	3	388	March 29, 2022
Common Voice Sentence Collection Tool launch Common Voice sentence-collection , announcements	15	4230	April 2, 2019
Common Voice mid-year release - more data, more languages! Common Voice announcements , dataset	20	2490	August 12, 2019
Dataset 17 Release Common Voice	8	2029	March 22, 2024

Announcing Voice Annotation Tool, a utility to annotate voice samples

Related topics