Column: 贺小三

[Notes] From Frequency to Meaning: Vector Space Models of Semantics

贺小三 · 简书 (Jianshu) · 2018-02-02 00:38

The distributional hypothesis in linguistics is that words that occur in similar contexts tend to have similar meanings (Harris, 1954). This hypothesis is the justification for applying the VSM to measuring word similarity. A word may be represented by a vector in which the elements are derived from the occurrences of the word in various contexts, such as windows of words (Lund & Burgess, 1996), grammatical dependencies (Lin, 1998; Padó & Lapata, 2007), and richer contexts consisting of dependency links and selectional preferences on the argument positions (Erk & Padó, 2008); see Sahlgren’s (2006) thesis for a comprehensive study of various contexts. Similar row vectors in the word–context matrix indicate similar word meanings.
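
To make the word–context matrix concrete, here is a minimal sketch in Python. It is not taken from the paper: the toy corpus, the window size of 2, the use of raw co-occurrence counts, and the helper names `cooccurrence_vectors` and `cosine` are all assumptions chosen for illustration. Each word gets a sparse row vector of window-based context counts, and rows are compared with cosine similarity.

```python
from collections import Counter, defaultdict
import math


def cooccurrence_vectors(sentences, window=2):
    """Build sparse rows of a word-context matrix.

    Each word maps to a Counter of the words seen within `window`
    tokens on either side of it (hypothetical helper, raw counts only).
    """
    vectors = defaultdict(Counter)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:  # a word is not its own context
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Toy corpus (an assumption for illustration only).
corpus = [
    "the cat drinks milk".split(),
    "the dog drinks water".split(),
    "the cat chases the mouse".split(),
    "the dog chases the ball".split(),
    "a car needs fuel".split(),
]

vectors = cooccurrence_vectors(corpus, window=2)
sim_cat_dog = cosine(vectors["cat"], vectors["dog"])
sim_cat_car = cosine(vectors["cat"], vectors["car"])
print(f"cat~dog: {sim_cat_dog:.3f}  cat~car: {sim_cat_car:.3f}")
```

In this toy corpus "cat" and "dog" appear next to the same words ("the", "drinks", "chases"), so their rows are close, while "cat" and "car" share no context words and score zero. Raw counts are the simplest possible choice; in practice the counts are usually reweighted before rows are compared.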

The idea that word usage can reveal semantics was implicit in some of the things that Wittgenstein (1953) said about language-games and family resemblance. Wittgenstein was primarily interested in the physical activities that form the context of word usage (e.g., the word brick, spoken in the context of the physical activity of building a house), but the main context for a word is often other words.

http://www.jair.org/media/2934/live-2934-4846-jair.pdf
