|
|
Data Set of English-Spanish Term Vectors from Wikipedia |
This data set consists of the term vectors extracted from 60,730 Wikipedia English articles and their comparable Spanish articles, sampled in 2009. Last published: August 8, 2011.
This data set consists of the term vectors extracted from 60,730 Wikipedia English articles and their comparable Spanish articles, sampled in 2009. We used this data set to test various models for creating translingual document representations, work published in [Platt et al. EMNLP-2010] and [Yih et al. CoNLL-2011]. More detail of this data set can be found in the ReadMe file.
Files
|
|
Status: LiveThis download is still available on microsoft.com. The downloads below will come directly from the Microsoft Download Center. |
| Files |
|---|
|
|
System Requirements
Operating Systems: Windows 10, Windows 7, Windows 8
- Windows 7, Windows 8, or Windows 10
Installation Instructions
- Click Download and follow the instructions.