'classifier' 태그의 글 목록

classifier

welcometosorapark 2024. 5. 27. 18:47

2024. 5. 27. 18:47

Words with similar contexts have similar meanings

Zellig Harris (1954): 'If A and B have almost identical environments, we say that they are synonyms' (e.g. doctor|surgeon (patient, hospital, treatment, etc)
→ This notion is referred to as the distributional hypothesis

Distributional hypothesis

Distributional models are based on a co-occurrence matrix

Overall matrix is |V| by |D|

Overall matrix is |V| by |V|

Problems with term-term matrices:

Term-term matrices are sparse
- Term vectors are long |V|
- Most entries are zero

Doesn't reflect underlying linguistic structure: 'food is bad' and 'meal was awful'

Word embeddings

Let's represent words using low-dimensional vectors

Benefits

Word2Vec software package: Static embeddings (unlike BERT or ELMo)

Key idea
- Predict rather than count
- Binary prediction task: 'Is word x likely to co-occur with word y?'
- Keep classifier weights
- Running text is the training data
Basic algorithm (skip-gram with negative sampling)
- Treat neighbouring context words as positive samples
- Treat other random words in V as negative samples
- Train a logistic regression classifier to distinguish these classes
- Use learned weights as embeddings

The benefits of using word embeddings compared to traditional vector representations:

(w07) Lexical semantics (0)	2024.05.22
(w06) N-gram Language Models (0)	2024.05.14
(w04) Regular expression (0)	2024.04.30
(w03) Text processing fundamentals (0)	2024.04.24
(w02) NLP evaluation -basic (0)	2024.04.17