Reader for the TIMIT corpus (or any other corpus with the same file
layout and use of file formats). The corpus root directory should
contain the following files:
In addition, the root directory should contain one subdirectory for
each speaker, containing three files for each utterance:
|
|
__init__(self,
root,
encoding=None)
Construct a new TIMIT corpus reader in the given directory. |
source code
|
|
|
|
files(self,
filetype=None)
Return a list of file identifiers for the files that make up this
corpus. |
source code
|
|
|
|
utterances(self,
dialect=None,
sex=None,
spkrid=None,
sent_type=None,
sentid=None)
Returns:
A list of the utterance identifiers for all utterances in this
corpus, or for the given speaker, dialect region, gender, sentence
type, or sentence number, if specified. |
source code
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
phone_times(self,
utterances=None)
offset is represented as a number of 16kHz samples! |
source code
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| audiodata(self,
utterance,
start=0,
end=None) |
source code
|
|
|
|
| _utterance_files(self,
utterances,
extension) |
source code
|
|
|
|
play(self,
utterance,
start=0,
end=None)
Play the given audio sample. |
source code
|
|
|
Inherited from api.CorpusReader:
__repr__,
abspath,
abspaths,
encoding,
open
Inherited from object:
__delattr__,
__getattribute__,
__hash__,
__new__,
__reduce__,
__reduce_ex__,
__setattr__,
__str__
|
|
|
|
|
Inherited from api.CorpusReader:
filenames
|
|
|
|
|
|
|