\chapter{Lecture 1 - 09-03-2020}
\section{Introduction}
\chapter{Lecture 1 - 09-03-2020}
\section{Lecture 1 - 09-03-2020}
Now is the time for all good men to come to the aid of their party!
\section{Lecture 10 - 07-04-2020}
\subsection{TO BE DEFINED}
\chapter{Lecture 10 - 07-04-2020}
\section{TO BE DEFINED}
$\mathbb{E}[z] = \mathbb{E}\big[\mathbb{E}[z \mid x]\big]$
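
As a small check of the identity above, here is a worked example. The joint distribution is hypothetical, chosen only for illustration: $x \in \{0,1\}$ with $P(x=0) = P(x=1) = \frac{1}{2}$; given $x=0$, $z$ is $0$ or $2$ with equal probability; given $x=1$, $z$ is $2$ or $4$ with equal probability.\\
$$ \mathbb{E}[z \mid x=0] = \frac{0+2}{2} = 1, \qquad \mathbb{E}[z \mid x=1] = \frac{2+4}{2} = 3 $$
$$ \mathbb{E}\big[\mathbb{E}[z \mid x]\big] = \frac{1}{2}\cdot 1 + \frac{1}{2}\cdot 3 = 2 $$
Computing directly, $z$ takes the values $0, 2, 2, 4$ with probability $\frac{1}{4}$ each, so $\mathbb{E}[z] = \frac{0+2+2+4}{4} = 2$, which agrees.\\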
\chapter{Lecture 2 - 07-04-2020}
\section{Topic}
\section{Loss}
\subsection{Absolute Loss}
\subsection{Square Loss}
\subsection{Example of information of square loss}
\subsection{labels and losses}
\subsection{Example TF(idf) documents encoding}
\section{Lecture 2 - 07-04-2020}
\chapter{Lecture 2 - 07-04-2020}
Classification tasks\\
Semantic label space $Y$\\
Categorization: $Y$ finite and small
@ -27,8 +27,8 @@ $
Losses for regression?\\
$y$ and $\hat{y} \in \barra{R}$, so they are numbers!\\
One example of a loss is the absolute loss: the absolute difference between the two numbers.\\
\subsection{Absolute Loss}
$$\ell(y,\hat{y}) = | y - \hat{y} | \Rightarrow \textit{absolute loss}$$
--- DRAWING ---\\\\
Some inconvenient properties:
@ -38,7 +38,7 @@ Some inconvenient properties:
\item The derivative takes only two values ($-1$ or $+1$), so it carries little information
\subsection{Square Loss}
$$ \ell(y,\hat{y}) = ( y - \hat{y} )^2 \Rightarrow \textit{square loss}$$
--- DRAWING ---\\
Derivative:
@ -50,7 +50,7 @@ Real numbers as label $\rightarrow$ regression.\\
Whenever taking the difference between two predictions makes sense (the values are numbers), we are talking about a regression problem.\\
Classification is categorization when we have a small finite set of labels.\\\\
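To make the two regression losses concrete, here is a small worked example. The values $y = 3$ and $\hat{y} = 1$ are hypothetical, chosen only for illustration.\\
$$ | y - \hat{y} | = | 3 - 1 | = 2 \qquad ( y - \hat{y} )^2 = ( 3 - 1 )^2 = 4 $$
The derivatives with respect to the prediction $\hat{y}$ are:
$$ \frac{\partial}{\partial \hat{y}} | y - \hat{y} | = \mathrm{sign}(\hat{y} - y) = -1 \qquad \frac{\partial}{\partial \hat{y}} ( y - \hat{y} )^2 = 2(\hat{y} - y) = -4 $$
Both derivatives say that $\hat{y}$ should increase, but only the derivative of the square loss also says how far the prediction is from the label.\\\\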
\subsection{Example of information of square loss}
$\ell(y,\hat{y}) = ( y - \hat{y} )^2 = F(y)
@ -106,7 +106,7 @@ $
The algorithm is punished more heavily the further its prediction is from the truth. The algorithm will not reach exactly 0 or 1 because, for example, a perfect prediction is impossible.\\
This loss is useful for conveying this information to the algorithm.\\\\
Now we talk about labels and losses.\\
\subsection{labels and losses}
Data points: they have semantic labels that denote some truth about the data points, and we want to predict these labels.\\
We need to define what data points are: numbers? Strings? Files? Typically they are stored in database records.\\
They can have a very precise structure or be more homogeneously structured.\\
@ -154,7 +154,7 @@ There is a way to encode a set of documents in point in a fixed dimensional
space in such a way that the coordinates are comparable.\\
For example, I can represent fields with values in $[0,1]$ for a neural network, but they have no geometrical meaning.\\
\subsection{Example TF(idf) documents encoding}
TF encoding of docs.
\item Extract all the words from the docs
\section{Lecture 3 - 07-04-2020}
\chapter{Lecture 3 - 07-04-2020}
A data point $x$ is represented as a sequence of measurements, and we call these
measurements features or attributes.\\
$$ x = (x_1,..., x_d) \qquad x_1 \quad \textit{feature value}
@ -74,7 +74,7 @@ which we can avoid this to happen by design:
When we run ERM, we want it to choose a good predictor with ...... PD\\\\
We call this overfitting: a specific situation in which ‘A’ (where A is the
learning algorithm) overfits if the $f$ output by A tends to have a training error much
smaller than its test error.\\
@ -84,7 +84,7 @@ Minimising training error doesn’t mean minimising test error. Overfitting is b
Why does this happen?\\
This happens because we have \textbf{noise in the data}.\\
\subsection{Noise in the data}
Noise in the data: $y_t$ is not deterministically associated with $x_t$.\\\\
It could be that the same data point appears several times in the same test set.
@ -160,7 +160,7 @@ define which predictor is good.\\
‘A’ underfits when the $f$ output by A has a training error close to the test error, but
both are large.\\
Having the test error close to the training error is good, but here they are both large.
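
One way to write the two notions above in symbols is sketched below. The notation is an assumption introduced here for illustration: $\hat{\ell}_{\mathrm{train}}(f)$ and $\hat{\ell}_{\mathrm{test}}(f)$ denote the average loss of $f$ on the training set and on the test set, and $(x_t, y_t)$ for $t = 1, \dots, m$ are the training examples.\\
$$ \hat{\ell}_{\mathrm{train}}(f) = \frac{1}{m} \sum_{t=1}^{m} \ell\big(y_t, f(x_t)\big) $$
Overfitting: $\hat{\ell}_{\mathrm{train}}(f) \ll \hat{\ell}_{\mathrm{test}}(f)$.\\
Underfitting: $\hat{\ell}_{\mathrm{train}}(f) \approx \hat{\ell}_{\mathrm{test}}(f)$, with both values large.\\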
@ -183,7 +183,7 @@ $$
m \gg \ln |F| \qquad \textit{when} \quad |F| < \infty \qquad \textit{where } m \textit{ is the size of the training set}
\subsection{Nearest neighbour}
\section{Nearest neighbour}
This is completely different from ERM and is one of the first learning
algorithms. It exploits the geometry of the data.
Assume that our data space X is:
\section{Lecture 4 - 07-04-2020}
\chapter{Lecture 4 - 07-04-2020}
We spoke about the $k$-NN classifier and its Voronoi diagram.
@ -11,7 +11,7 @@ $$
The $\hnn$ predictor needs to store the entire dataset.
\subsection{Computing $\hnn$}
\section{Computing $\hnn$}
Computing $\hnn(x)$ requires computing the distances between $x$ and the points in the training set.
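
As an illustration of the distance computation described above, here is a small worked example. The training set, the query point, the labels in $\{-1,+1\}$ and the use of the Euclidean distance are all assumptions made here for illustration.\\
Training set: $(x_1,y_1) = ((0,0),-1)$, $(x_2,y_2) = ((1,1),+1)$, $(x_3,y_3) = ((3,0),+1)$. Query point: $x = (2,1)$.\\
$$ \| x - x_1 \| = \sqrt{2^2 + 1^2} = \sqrt{5} \approx 2.24 \qquad \| x - x_2 \| = \sqrt{1^2 + 0^2} = 1 \qquad \| x - x_3 \| = \sqrt{(-1)^2 + 1^2} = \sqrt{2} \approx 1.41 $$
The closest training point is $x_2$, so $\hnn(x) = y_2 = +1$.\\
With $m$ training points in $d$ dimensions, this brute-force search computes $m$ distances, each taking a number of operations proportional to $d$.\\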
@ -69,7 +69,7 @@ There are some heuristic to run NN algorithm without value of $k$.
\subsection{Tree Predictor}
\section{Tree Predictor}
Suppose I give you data that is not well defined in a Euclidean space.
$X = X_1 \times \dots \times X_d$ \qquad Medical Record
\chapter{Lecture 5 - 07-04-2020}
\section{Tree Classifier}
\section{Jensen's inequality}
\section{Tree Predictor}
\section{Statistical model for Machine Learning}
\chapter{6 - 07-04-2020}
\section{Lecture 6 - 07-04-2020}
\chapter{Lecture 6 - 07-04-2020}
\section{Lecture 7 - 07-04-2020}
\chapter{Lecture 7 - 07-04-2020}
\section{Lecture 8 - 07-04-2020}
\chapter{Lecture 8 - 07-04-2020}
\section{Lecture 9 - 07-04-2020}
\chapter{Lecture 9 - 07-04-2020}
@ -11,6 +11,8 @@
%Options: Sonny, Lenny, Glenn, Conny, Rejne, Bjarne, Bjornstrup
\graphicspath{ {./img/} }
\definecolor{mypink}{cmyk}{0, 0.7808, 0.4429, 0.1412}
@ -28,14 +30,17 @@
% {\normalfont\bfseries}{}{0pt}{\Large}
% {\normalfont\normalsize\bfseries}{\thesection.}{1em}{}
\babel@toc {english}{}
\contentsline {chapter}{\numberline {1}Lecture 1 - 09-03-2020}{2}%
\contentsline {section}{\numberline {1.1}Introduction}{2}%
\contentsline {chapter}{\numberline {2}Lecture 2 - 07-04-2020}{5}%
\contentsline {section}{\numberline {2.1}Topic}{5}%
\contentsline {section}{\numberline {2.2}Loss}{5}%
\contentsline {subsection}{\numberline {2.2.1}Absolute Loss}{5}%
\contentsline {subsection}{\numberline {2.2.2}Square Loss}{6}%
\contentsline {subsection}{\numberline {2.2.3}Example of information of square loss}{6}%
\contentsline {subsection}{\numberline {2.2.4}labels and losses}{7}%
\contentsline {subsection}{\numberline {2.2.5}Example TF(idf) documents encoding}{9}%
\contentsline {chapter}{\numberline {3}Lecture 3 - 07-04-2020}{11}%
\contentsline {section}{\numberline {3.1}Overfitting}{13}%
\contentsline {subsection}{\numberline {3.1.1}Noise in the data}{13}%
\contentsline {section}{\numberline {3.2}Underfitting}{14}%
\contentsline {section}{\numberline {3.3}Nearest neighbour}{15}%
\contentsline {chapter}{\numberline {4}Lecture 4 - 07-04-2020}{17}%
\contentsline {section}{\numberline {4.1}Computing $h_{NN}$}{17}%
\contentsline {section}{\numberline {4.2}Tree Predictor}{18}%
\contentsline {chapter}{\numberline {5}Lecture 5 - 07-04-2020}{21}%
\contentsline {section}{\numberline {5.1}Tree Classifier}{21}%
\contentsline {section}{\numberline {5.2}Jensen’s inequality}{22}%
\contentsline {section}{\numberline {5.3}Tree Predictor}{24}%
\contentsline {section}{\numberline {5.4}Statistical model for Machine Learning}{25}%
\contentsline {chapter}{\numberline {6}Lecture 6 - 07-04-2020}{27}%
\contentsline {chapter}{\numberline {7}Lecture 7 - 07-04-2020}{28}%
\contentsline {chapter}{\numberline {8}Lecture 8 - 07-04-2020}{29}%
\contentsline {chapter}{\numberline {9}Lecture 9 - 07-04-2020}{30}%
\contentsline {chapter}{\numberline {10}Lecture 10 - 07-04-2020}{31}%
\contentsline {section}{\numberline {10.1}TO BE DEFINED}{31}%
