Reader for the TIMIT corpus (or any other corpus with the same file
layout and use of file formats). The corpus root directory should
contain the following files:
In addition, the root directory should contain one subdirectory for
each speaker, containing three files for each utterance:
|
__init__(self,
root,
encoding=None)
Construct a new TIMIT corpus reader in the given directory. |
source code
|
|
|
files(self,
filetype=None)
Return a list of file identifiers for the files that make up this
corpus. |
source code
|
|
|
utterances(self,
dialect=None,
sex=None,
spkrid=None,
sent_type=None,
sentid=None)
Returns:
A list of the utterance identifiers for all utterances in this
corpus, or for the given speaker, dialect region, gender, sentence
type, or sentence number, if specified. |
source code
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
phone_times(self,
utterances=None)
offset is represented as a number of 16kHz samples! |
source code
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
audiodata(self,
utterance,
start=0,
end=None) |
source code
|
|
|
_utterance_files(self,
utterances,
extension) |
source code
|
|
|
play(self,
utterance,
start=0,
end=None)
Play the given audio sample. |
source code
|
|
Inherited from api.CorpusReader :
__repr__ ,
abspath ,
abspaths ,
encoding ,
open
Inherited from object :
__delattr__ ,
__getattribute__ ,
__hash__ ,
__new__ ,
__reduce__ ,
__reduce_ex__ ,
__setattr__ ,
__str__
|
|
|
Inherited from api.CorpusReader :
filenames
|
|
|
|
|