Data Repositories
Corpora on CS Network
- DEFT related LDC corpora 2002-2013
/scratch/cluster/yinon/DEFT/LDC
- Other DEFT related LDC corpora 2010-2012 and American National Corpus, 2nd Release
/scratch/cluster/yinon/DEFT/LDC-DVDs
- U. Wisconsin extractions from KBP
/scratch/cluster/yinon/DEFT/UWISC_KBP
- RTE 4-7
/scratch/cluster/yinon/DEFT/RTE
- Distributional semantic models by Marco Baroni
/scratch/cluster/yinon/DEFT/dist-sem-models
- Dependency Parses for the following text corpora:
- BNC (British National Corpus)
/scratch/cluster/girish/dep_parsed_stuff/BNC/parsed_BNC/
- Ukwac
/scratch/cluster/girish/dep_parsed_stuff/ukwac/parsed_ukwac/
- Wackypedia
/scratch/cluster/girish/dep_parsed_stuff/wackypedia/parsed_wackypedia/
- Gigaword (Collected by Joe Reisinger)
/scratch/cluster/girish/dep_parsed_stuff/gigaword_parsed/parsed/
- Minipar parses of BNC, UkWaC, and Wackypedia
/scratch/cluster/gboleda/corpora/
- Paraphrasing Corpus by Chris Callison-Burch (Pre-Release Version)
/scratch/cluster/beltagy/deft/paraphrases-v0.2-xl
- DEFT corpora (Available on DVD from Katrin Erk)
- Deep NLU exploration - DEFT pilot source audio
- Deep NLU exploration - DEFT pilot source text and annotations
- DEFT phase 1 sample narrative text creation