TEDLIUM¶
- class torchaudio.datasets.TEDLIUM(root: Union[str, Path], release: str = 'release1', subset: str = 'train', download: bool = False, audio_ext: str = '.sph')[source]¶
Tedlium [Rousseau et al., 2012] dataset (releases 1,2 and 3).
- Parameters:
root (str or Path) – Path to the directory where the dataset is found or downloaded.
release (str, optional) – Release version. Allowed values are
"release1","release2"or"release3". (default:"release1").subset (str, optional) – The subset of dataset to use. Valid options are
"train","dev", and"test". Defaults to"train".download (bool, optional) – Whether to download the dataset if it is not found at root path. (default:
False).audio_ext (str, optional) – extension for audio file (default:
".sph")
Properties¶
phoneme_dict¶
Methods¶
__getitem__¶
- TEDLIUM.__getitem__(n: int) Tuple[Tensor, int, str, int, int, int][source]¶
Load the n-th sample from the dataset.
- Parameters:
n (int) – The index of the sample to be loaded
- Returns:
Tuple of the following items;
- Tensor:
Waveform
- int:
Sample rate
- str:
Transcript
- int:
Talk ID
- int:
Speaker ID
- int:
Identifier