TEDLIUM¶

class torchaudio.datasets.TEDLIUM(root: Union[str, Path], release: str = 'release1', subset: str = 'train', download: bool = False, audio_ext: str = '.sph')[source]¶

Tedlium [Rousseau et al., 2012] dataset (releases 1,2 and 3).

Parameters:

root (str or Path) – Path to the directory where the dataset is found or downloaded.
release (str, optional) – Release version. Allowed values are "release1", "release2" or "release3". (default: "release1").
subset (str, optional) – The subset of dataset to use. Valid options are "train", "dev", and "test". Defaults to "train".
download (bool, optional) – Whether to download the dataset if it is not found at root path. (default: False).
audio_ext (str, optional) – extension for audio file (default: ".sph")

Properties¶

phoneme_dict¶

property TEDLIUM.phoneme_dict¶

Phonemes. Mapping from word to tuple of phonemes. Note that some words have empty phonemes.

Type:: dict[str, tuple[str]]

Methods¶

getitem¶

TEDLIUM.__getitem__(n: int) → Tuple[Tensor, int, str, int, int, int][source]¶

Load the n-th sample from the dataset.

Parameters:

n (int) – The index of the sample to be loaded

Returns:

Tuple of the following items;

Tensor:: Waveform
int:: Sample rate
str:: Transcript
int:: Talk ID
int:: Speaker ID
int:: Identifier

TEDLIUM¶

Properties¶

phoneme_dict¶

Methods¶

getitem¶

Docs

Tutorials

Resources

TEDLIUM¶

Properties¶

phoneme_dict¶

Methods¶

__getitem__¶

Docs

Tutorials

Resources

getitem¶