Class 5
February 26th, 2020
Typically, data collection/integration activities yield high-dimensional data.
Hypergeometry (the geometry of high-dimensional spaces) is counter-intuitive and invalidates distance-based analysis: pairwise distances tend to concentrate, so "near" and "far" points become almost equally distant.
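A minimal sketch (not from the notes) of why distance-based analysis degrades: sample points uniformly at random and compare the closest and farthest pairwise distances as the dimensionality grows (the sample size and the uniform synthetic data are illustrative assumptions).

    import numpy as np

    rng = np.random.default_rng(0)

    # For increasing dimensionality d, draw points uniformly in the unit hypercube
    # and compare the smallest and largest pairwise Euclidean distances.
    for d in (2, 10, 100, 1000):
        X = rng.uniform(size=(200, d))
        # pairwise squared distances via ||a - b||^2 = ||a||^2 - 2 a.b + ||b||^2
        sq = (X**2).sum(1)[:, None] - 2 * X @ X.T + (X**2).sum(1)[None, :]
        dist = np.sqrt(np.maximum(sq, 0))
        dist[np.diag_indices_from(dist)] = np.nan   # ignore self-distances
        print(f"d={d:5d}  min/max distance ratio = {np.nanmin(dist) / np.nanmax(dist):.3f}")

As d grows, the ratio approaches 1: the nearest and the farthest neighbour become almost equally far away.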
Several methods seek to reduce dimensionality: SVD, PCA, LDA.
These methods leverage linear dependencies in the data; two examples (a PCA sketch follows them):
In bears: chest girth vs. length.
In deer: size of fetuses.
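A minimal PCA sketch on synthetic, nearly linearly dependent "bear" measurements (the numbers, the variable names, and the use of NumPy are illustrative assumptions, not part of the notes).

    import numpy as np

    rng = np.random.default_rng(1)

    # Synthetic bear data: chest girth is an almost linear function of length.
    length = rng.normal(150, 20, size=100)               # cm
    girth = 0.8 * length + rng.normal(0, 3, size=100)    # cm, small noise
    X = np.column_stack([length, girth])

    # PCA: center the data, then eigendecompose the covariance matrix.
    Xc = X - X.mean(axis=0)
    cov = Xc.T @ Xc / (len(Xc) - 1)
    eigvals, eigvecs = np.linalg.eigh(cov)               # eigenvalues in ascending order

    # Almost all variance lies along one direction: one dimension suffices.
    print("explained variance ratios:", eigvals[::-1] / eigvals.sum())
    scores = Xc @ eigvecs[:, -1]                         # projection onto the top component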
n data points, each having (up to) d dimensions: the data space.
A function φ maps each point of the data space into a (typically higher-dimensional) feature space.
Points in the feature space gain new, extra dimensions that reveal non-linear dependencies, so classifiers are better able to separate the data points.
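A minimal sketch of an explicit feature map; the quadratic map (x1, x2) -> (x1^2, x2^2, sqrt(2)*x1*x2) is a standard textbook choice, assumed here purely for illustration.

    import numpy as np

    def phi(X):
        # map 2-D points into a 3-D feature space: (x1, x2) -> (x1^2, x2^2, sqrt(2)*x1*x2)
        x1, x2 = X[:, 0], X[:, 1]
        return np.column_stack([x1**2, x2**2, np.sqrt(2) * x1 * x2])

    # Two concentric circles are not linearly separable in 2-D, but in feature space
    # x1^2 + x2^2 (the sum of the first two new coordinates) is constant per circle,
    # so a single plane separates them.
    theta = np.linspace(0, 2 * np.pi, 50)
    inner = np.column_stack([np.cos(theta), np.sin(theta)])        # radius 1
    outer = 3 * np.column_stack([np.cos(theta), np.sin(theta)])    # radius 3
    print(phi(inner)[:, :2].sum(axis=1))   # all ~1
    print(phi(outer)[:, :2].sum(axis=1))   # all ~9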
However, operating with the feature space explicitly can be expensive: it may have a very large (even infinite) number of dimensions.
The new distances in the feature space are accounted for by the Kernel Matrix K, whose entry K[i, j] is the inner product of the images of points i and j in feature space.
The kernel trick
Sometimes all the needed operations in feature space, e.g., distance and average, can be mapped to simple look-ups into the (n x n) Kernel matrix! (A sketch of such look-ups follows below.)
Thanks to the Kernel trick, we can now forget the original dataset and work with the kernel matrix.
(Figure: once the points are mapped onto a third dimension, the distances between them get amplified.)
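A minimal sketch of such look-ups, assuming an RBF kernel chosen purely for illustration: the squared distance between two points in feature space, and the squared norm of the feature-space average, reduce to combinations of entries of K.

    import numpy as np

    def rbf_kernel(X, gamma=0.5):
        # K[i, j] = exp(-gamma * ||x_i - x_j||^2): inner products in an implicit feature space
        sq = (X**2).sum(1)[:, None] - 2 * X @ X.T + (X**2).sum(1)[None, :]
        return np.exp(-gamma * sq)

    rng = np.random.default_rng(2)
    X = rng.normal(size=(6, 2))
    K = rbf_kernel(X)

    # Squared feature-space distance between points i and j, via K look-ups only:
    # ||phi(x_i) - phi(x_j)||^2 = K[i, i] - 2 K[i, j] + K[j, j]
    i, j = 0, 3
    dist_sq = K[i, i] - 2 * K[i, j] + K[j, j]

    # Squared norm of the feature-space average of all points:
    # ||(1/n) sum_i phi(x_i)||^2 = (1/n^2) sum_{i,j} K[i, j]
    n = len(X)
    mean_sq_norm = K.sum() / n**2
    print(dist_sq, mean_sq_norm)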
The procedure:
1. Input: a dataset of n points.
2. Compute the Kernel matrix K.
3. Is K a positive semidefinite matrix?
4. Apply eigenvalue decomposition: K = U Λ U^T, with eigenvalues λ_1, λ_2, ... and eigenvectors u_1, u_2, ...
5. How many non-zero eigenvalues? That is the new dimensionality.
6. Define the Mercer mapping: φ(x_i) = (sqrt(λ_1) u_1[i], sqrt(λ_2) u_2[i], ...), one coordinate per non-zero eigenvalue.
7. Verify that in Mercer space inner products can be mapped to look-ups to the K matrix (see point 2),
where, for instance, φ(x_1) · φ(x_2) = λ_1 u_1[1] u_1[2] + λ_2 u_2[1] u_2[2] + ... = K[1, 2], and so on for every other pair of points. Yes, the verification succeeds: inner products in Mercer space are exactly look-ups into K. (A sketch of the whole procedure follows.)
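A minimal end-to-end sketch of the procedure above, assuming a tiny random dataset and a linear kernel (both illustrative choices, not from the notes).

    import numpy as np

    rng = np.random.default_rng(3)
    X = rng.normal(size=(5, 3))              # 1. input: a small dataset (n = 5, d = 3)

    K = X @ X.T                              # 2. kernel matrix (linear kernel, for illustration)

    eigvals, U = np.linalg.eigh(K)           # 4. eigenvalue decomposition K = U diag(eigvals) U^T
    eigvals, U = eigvals[::-1], U[:, ::-1]   # sort eigenvalues in descending order

    # 3./5. K is positive semidefinite, so eigenvalues are >= 0 (up to round-off);
    # the number of non-zero eigenvalues is the new dimensionality.
    nonzero = eigvals > 1e-10
    r = nonzero.sum()

    # 6. Mercer mapping: row i of Phi is phi(x_i) = (sqrt(lambda_k) * u_k[i] for the r kept k)
    Phi = U[:, nonzero] * np.sqrt(eigvals[nonzero])

    # 7. verification: inner products in Mercer space equal look-ups into K
    print(np.allclose(Phi @ Phi.T, K))       # True

Rows of Phi are the new coordinates of the points: once K is known, the original dataset is no longer needed.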