Eigenword Resource Page

An Eigenword is an real-valued vector "embedding" associated with a word that captures its meaning in the sense that distributionally similar words have similar eigenwords. This page contains links to several sets of eigenwords They are computed as the singular vectors of the matrix of co-occurrence of words and their contexts, and used in a variety of spectral NLP methods and applications.

Eigenword Collections

Note that the "words" include punctuation, and so will confuse some software. Also note that we use the special symbol "" for out of vocabulary.

Coming soon: eigencontexts

Software for plotting eigenwords

Software for computing eigenwords


home: ungar@cis.upenn.edu