9. Margin Maximization
w
w · x+ − b = 1
x+
1
|x+ − x∗ | = w · (x+ − x∗ ) x∗
|w|
1 w·x=b
=
|w|
|w|2
9
10. SVM Hard Margin SVM
y (w · x
(i) (i)
− b) ≥ 1.
1
min. |w|2
2
s.t. y (w · x − b) ≥ 1 ; ∀i.
(i) (i)
10
11. SVM Hard Margin SVM
1
min. |w|2
2
s.t. y (w · x − b) ≥ 1 ; ∀i.
(i) (i)
11
12. Dual Problem
αi (≥ 0)
1
L(w, b, α) = |w|2 − αi y (i) (w · x(i) − b) − 1 .
2 i
∇w L = w − (i)
αi y x (i)
= 0. ∴ w = ∗ (i)
αi y x . (i)
i i
∂L
= αi y (i) = 0.
∂b i
12
13. Dual Problem
2 2
1 ∗
1
(i) (i) (i) (i)
L(w , b, α) = w −
∗
αi y x − αi y x
2 i
2 i
+b αi y (i) + αi
i i
2
1
(i) (i)
=− αi y x + αi
2 i
i
1
=− αi αj y y x · x +
(i) (j) (i) (j)
αi .
2 i,j i 13
14. Dual Problem
1
max. − αi αj y y x · x +
(i) (j) (i) (j)
αi
2 i,j i
s.t. αi y = 0,
(i)
i
αi ≥ 0 ; ∀i.
14
18. SVM Soft Margin SVM
ξi (≥ 0)
y (w · x
(i) (i)
− b) ≥ 1 − ξi .
1
min. |w| + C
2
ξi
2 i
s.t. y (w · x
(i) (i)
− b) ≥ 1 − ξi ; ∀i,
ξi ≥ 0 ; ∀i.
C 18
19. SVM Soft Margin SVM
αi βi
1
L(w, b, ξ, α, β) = |w|2 + C ξi
2 i
− αi y (w · x − b) − 1 + ξi −
(i) (i)
βi ξi .
i i
w b ξi
∂L
= C − αi − βi
∂ξi
C = αi + βi .
19
20. SVM Soft Margin SVM
L
1
L=− (i) (j)
αi αj y y x (i)
·x (j)
2 i,j
+ αi (1 − ξi ) + C ξi − βi ξi
i i i
1
=− αi αj y (i) y (j) x(i) · x(j) + αi .
2 i,j i
βi ξi βi = C − αi ≥ 0
20
21. SVM Soft Margin SVM
1
max. − αi αj y y x · x +
(i) (j) (i) (j)
αi
2 i,j i
s.t. αi y (i) = 0,
i
0 ≤ αi ≤ C ; ∀i.
21
42. Pointwise Mutual Information
w c
P (Xw = 1, C = c)
PMI(w, c) = log
P (Xw = 1)P (C = c)
Iaverage (w) = P (c)PMI(w, c),
c
Imax (w) = max P (c)PMI(w, c).
c
42
43. Pointwise Mutual Information
W
P (W = w, C = c)
PMI(w, c) = log
P (W = w)P (C = c)
c c
Xc = 1 Xc = 0 Xc
P (Xw = 1, Xc = 1)
PMI(w, c) = log
P (Xw = 1)P (Xc = 1)
43
44. Pointwise Mutual Information
w
c
PMI(w, c) = log P (C = c|Xw = 1) − log P (C = c)
= log 1 − log P (C = c).
c w
PMI(w , c) = log P (C = c|Xw = 1) − log P (C = c)
= log 0.99 − log P (C = c).
44