lingz/cmudict-ipa

CMUDict encoded in IPA

The dictionary is a tab-separated file, cmudict.ipa.
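As a rough illustration of reading a tab-separated pronunciation file, the sketch below assumes each line holds a word, a tab, and its IPA transcription, and that a word with several pronunciations simply appears on several lines. The column layout, the `load_ipa_dict` helper, and the sample rows are assumptions made for this example, not taken from the repository.

```python
import io
from collections import defaultdict


def load_ipa_dict(lines):
    """Parse tab-separated lines into {word: [ipa, ...]}.

    Assumed layout (not confirmed by the README): word<TAB>ipa.
    """
    entries = defaultdict(list)
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        parts = line.split("\t")
        if len(parts) < 2:
            continue  # skip lines without a transcription column
        word, ipa = parts[0], parts[1]
        entries[word].append(ipa)
    return dict(entries)


# Invented sample rows standing in for the real file contents.
sample = io.StringIO("HELLO\thʌloʊ\nHELLO\thɛloʊ\nWORLD\twɝld\n")
lexicon = load_ipa_dict(sample)
```

To read the real file, the same function could be called as `load_ipa_dict(open("cmudict.ipa", encoding="utf-8"))`, provided the layout assumption holds.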

Notes:

  • Parentheses deleted for words with multiple pronunciations
  • Stress markers (emphasis) deleted
  • Split into 10% dev, 10% test, and 80% train data sets in datasets/
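The notes above can be sketched in a few lines: drop the parenthesized variant suffix that CMUDict attaches to alternate pronunciations, drop the emphasis (stress) digits on vowels, and divide the entries into three portions. This is a minimal sketch, not the repository's actual script; the `normalize` and `split_entries` helpers, the fixed seed, and the sample entry are all hypothetical.

```python
import random
import re

VARIANT_SUFFIX = re.compile(r"\(\d+\)$")  # e.g. the "(1)" in "READ(1)"
STRESS_DIGIT = re.compile(r"\d")          # e.g. the "1" in "IY1"


def normalize(word, phones):
    """Remove the variant suffix from the word and stress digits from phones."""
    word = VARIANT_SUFFIX.sub("", word)
    phones = [STRESS_DIGIT.sub("", p) for p in phones]
    return word, phones


def split_entries(entries, seed=0):
    """Shuffle deterministically, then cut into 10% dev, 10% test, rest train.

    The seed and the shuffle are assumptions; the README does not say how
    the split in datasets/ was produced.
    """
    shuffled = list(entries)
    random.Random(seed).shuffle(shuffled)
    n_dev = len(shuffled) // 10
    n_test = len(shuffled) // 10
    dev = shuffled[:n_dev]
    test = shuffled[n_dev:n_dev + n_test]
    train = shuffled[n_dev + n_test:]
    return train, dev, test


# Illustrative entry in the style of CMUDict.
word, phones = normalize("READ(1)", ["R", "IY1", "D"])
train, dev, test = split_entries(range(100))
```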

The ARPABET-to-IPA mappings are in arpa-ipa.map and can be modified. The mappings are taken from Wikipedia: https://en.wikipedia.org/wiki/Arpabet
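To show how such a mapping is applied, the sketch below converts a list of ARPABET phones to an IPA string with a small inline table. The README does not describe the format of arpa-ipa.map, so the dictionary here is only a hand-picked subset standing in for that file, and `arpa_to_ipa` is a hypothetical helper.

```python
# A few ARPABET-to-IPA pairs (subset only, for illustration).
ARPA_TO_IPA = {
    "HH": "h",
    "AH": "ʌ",
    "EH": "ɛ",
    "L": "l",
    "OW": "oʊ",
    "K": "k",
    "AE": "æ",
    "T": "t",
}


def arpa_to_ipa(phones, mapping=ARPA_TO_IPA):
    """Join the IPA symbols for a sequence of stress-free ARPABET phones.

    Raises KeyError for a phone missing from the mapping, so gaps in a
    modified table surface immediately.
    """
    return "".join(mapping[p] for p in phones)


hello = arpa_to_ipa(["HH", "AH", "L", "OW"])
cat = arpa_to_ipa(["K", "AE", "T"])
```

Passing a different `mapping` argument is one way to experiment with alternative symbol choices without touching the conversion logic.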
