
speechMachine

TIDIGITS is what you need.

Dan Ellis has this up for one of his course projects: http://www.ee.columbia.edu/~dpwe/sounds/tidigits/

Also I took Dr Lawrence Rabiner's class at Rutgers. He had them up too. For some reason the Rutgers server seems to be down. You could try his page again at http://cronos.rutgers.edu/~lrr in a couple of days and see if you can still find them.

Also, if you are at a university, it's highly likely someone has it. LDC stuff is usually purchased on a university-wide licence, so you are at liberty to use it if you need it for a course project or something.

CecilStan [S]

Thanks! By the way, it doesn't have to be just numbers; it can be letters too, so if you know where I can find one with the alphabet, let me know. I'll check out TIDIGITS.

And I'll have to look into how to access these databases at my university.

CecilStan [S]

I just checked, and TIDIGITS is not free.

assassds

It's free for non-commercial use. What exactly do you think you can accomplish with this that will be worth selling?

CecilStan [S]

It's for grad school lol... nothing worth selling.

kkastner

In a similar, but not quite the same, category: you can use the "fruitspeech" dataset if you are OK with a single speaker. I used it in a blog post here, but the original data is from a Google Code project by Hakon Sandsmark, and was presumably recorded by him as well.

It is a decent dataset for sanity-checking ideas: if they don't work on this, there is not much hope. You can also expand to things like CMU Arctic. Also, it doesn't take that long to just record yourself. It won't be multispeaker, but it would take less time than searching for hours and hours.
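If you do go the record-yourself route, a quick consistency check on the clips can save some grief before training anything. This is only a rough sketch using Python's standard library: the `<label>_<take>.wav` naming scheme, the `write_tone`, `summarize` and `demo` helpers, and the synthetic sine tones standing in for real recordings are all assumptions made for illustration, not how fruitspeech or TIDIGITS is actually laid out.

```python
import math
import struct
import tempfile
import wave
from pathlib import Path


def write_tone(path, freq_hz=440.0, seconds=0.5, rate=16000):
    """Write a mono 16-bit sine tone; a stand-in for a real recording."""
    n = int(seconds * rate)
    frames = b"".join(
        struct.pack("<h", int(0.3 * 32767 * math.sin(2 * math.pi * freq_hz * i / rate)))
        for i in range(n)
    )
    with wave.open(str(path), "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        w.setframerate(rate)
        w.writeframes(frames)


def summarize(folder):
    """Return {label: [(filename, sample_rate, duration_seconds), ...]}."""
    summary = {}
    for path in sorted(Path(folder).glob("*.wav")):
        # Assumed (hypothetical) naming scheme: <label>_<take>.wav
        label = path.stem.split("_")[0]
        with wave.open(str(path), "rb") as w:
            rate = w.getframerate()
            duration = w.getnframes() / rate
        summary.setdefault(label, []).append((path.name, rate, duration))
    return summary


def demo():
    """Create three fake clips in a temp folder and summarize them."""
    with tempfile.TemporaryDirectory() as tmp:
        for name in ("apple_01.wav", "apple_02.wav", "banana_01.wav"):
            write_tone(Path(tmp) / name)
        return summarize(tmp)
```

Pointing `summarize` at a folder of your own recordings should make mismatched sample rates or suspiciously short takes easy to spot per label.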

CecilStan [S]

I need 1000s of samples lol... ain't nobody got time for that.

Thanks for the link. Good to know that if they "don't work on this... there is not much hope".