Index of /joint/data/corpora/
2015_news_parsed_105M/ 04-Nov-2017 11:29 -
c4corpus_en/ 04-Nov-2017 00:11 -
c4corpus_ru/ 04-Nov-2017 11:27 -
common-crawl-2016/ 05-Nov-2017 23:14 -
culwg/ 04-Nov-2017 04:10 -
en59g/ 05-Nov-2017 08:38 -
news-2015-105m-sentences-parsed/ 04-Nov-2017 10:56 -
2015_news_105M_text_and_pos.csv.gz 04-Nov-2017 06:14 11399510540
cc16-conll-copp-222.csv.gz 04-Nov-2017 00:18 58345837
cc16-conll-copp-sample-newlines-no-enhanced.csv.gz 04-Nov-2017 00:48 93380
cc16-conll-copp-sample-newlines.csv.gz 05-Nov-2017 09:55 18613142
cc16-conll-copp-sample.csv.gz 04-Nov-2017 00:17 18622940
corpus-ijcai.zip 04-Nov-2017 00:18 59915159
mwe-conll-sample.csv 04-Nov-2017 12:59 1506
usages-wiki-ddt-mwe-313k.csv.gz 05-Nov-2017 09:55 88695709
wiki-2011-35m-sentences-parsed.csv.gz 04-Nov-2017 12:59 14151116131
wiki-sentences.txt.gz 04-Nov-2017 00:48 5013261722
wikipedia-titles.csv 05-Nov-2017 09:57 225188706
wikipedia-titles.csv-out.csv 05-Nov-2017 09:58 143458620
wikipedia-titles.csv-out.csv-mwe.csv 04-Nov-2017 00:17 128464232
wikipedia_complete__parsed.gz 06-Nov-2017 10:58 14151116131