Table of Contents

Shortcuts

COMMONVOICE¶

class torchaudio.datasets.COMMONVOICE(root: Union[str, Path], tsv: str = 'train.tsv')[source]¶

CommonVoice [Ardila et al., 2020] dataset.

Parameters

root (str or Path) – Path to the directory where the dataset is located. (Where the tsv file is present.)
tsv (str, optional) – The name of the tsv file used to construct the metadata, such as "train.tsv", "test.tsv", "dev.tsv", "invalidated.tsv", "validated.tsv" and "other.tsv". (default: "train.tsv")

getitem¶

COMMONVOICE.__getitem__(n: int) → Tuple[Tensor, int, Dict[str, str]][source]¶

Load the n-th sample from the dataset.

Parameters

n (int) – The index of the sample to be loaded

Returns

Tuple of the following items;

Tensor:

Waveform

int:

Sample rate

Dict[str, str]:

Dictionary containing the following items from the corresponding TSV file;

"client_id"
"path"
"sentence"
"up_votes"
"down_votes"
"age"
"gender"
"accent"

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources