These are Korean speech and associated data available from LDC. There are duplications. I included the finite state morphology and morphologically annotated text.
- LDC2006S42 Korean Broadcast News Speech
/projects/ldc/ldc-standard-license/2006/LDC2006S42 - LDC2006T14 Korean Broadcast News Transcripts
/projects/ldc/ldc-standard-license/2006/LDC2006T14 - LDC2006S36 West Point Korean Speech
/projects/ldc/ldc-standard-license/2006/LDC2006S36 - LDC2004L01 Klex: Finite-State Lexical Transducer for Korean
/projects/ldc/ldc-standard-license/2004/LDC2004L01 - LDC2004T03 Morphologically Annotated Korean Text
/projects/ldc/ldc-standard-license/2004/LDC2004T03 - LDC2003S07 Korean Telephone Conversations Complete Set
/projects/ldc/ldc-standard-license/2003/LDC2003S07 - LDC2003L02 Korean Telephone Conversations Lexicon
/projects/ldc/ldc-standard-license/2003/LDC2003L02 - LDC2003S03 Korean Telephone Conversations Speech
/projects/ldc/ldc-standard-license/2003/LDC2003S03 - LDC2003T08 Korean Telephone Conversations Transcripts
/projects/ldc/ldc-standard-license/2003/LDC2003T08 - LDC96S54 CALLFRIEND Korean
are these datasets already downloaded somewhere in our server?
We probably need to order them, because they are not from subscription years. I’m sending Bruce an email about it now.