Machine Learning and Association rules
117 pages
English

Machine Learning and Association rules

-

Le téléchargement nécessite un accès à la bibliothèque YouScribe
Tout savoir sur nos offres
117 pages
English
Le téléchargement nécessite un accès à la bibliothèque YouScribe
Tout savoir sur nos offres

Description

Machine Learning and
Association rules
Petr Berka, Jan Rauch
University of Economics, Prague
{berka|rauch}@vse.cz Tutorial Outline
 Statistics, machine learning and data
mining – basic concepts, similarities and
differences (P. Berka)
 Machine Learning Methods and
Algorithms – general overview and selected
methods (P. Berka)
 Break
 GUHA Method and LISp-Miner System
(J.Rauch)
Tutorial @ COMPSTAT 2010 2 Part 1
Statistics, machine learning and
data mining Statistics
 A formal science that deals with collection,
analysis, interpretation, explanation and
presentation of (usually numerical) data.
 The science of making effective use of
numerical data relating to groups of
individuals or experiments
(wikipedia)
Tutorial @ COMPSTAT 2010 4 Machine Learning
 „The field of machine learning is concerned
with the question of how to construct computer
programs that automatically improve with
experience.―
(Mitchell, 1997)
 „Things learn when they change their behavior
in a way that makes them perform better in a
future.―
(Witten, Frank, 1999)
Tutorial @ COMPSTAT 2010 5 Knowledge Discovery in Databases
 „Non-trivial process of identifying valid, novel,
potentially useful and ultimately understandable
patterns from data.―
(Fayyad et al., 1996)
 „Analysis of observational data sets to find
unsuspected relationships and summarize data in
novel ways that are both understandable ...

Sujets

Informations

Publié par
Nombre de lectures 97
Langue English
Poids de l'ouvrage 2 Mo

Exrait

Machine Learning and Association rules Petr Berka, Jan Rauch University of Economics, Prague {berka|rauch}@vse.cz Tutorial Outline  Statistics, machine learning and data mining – basic concepts, similarities and differences (P. Berka)  Machine Learning Methods and Algorithms – general overview and selected methods (P. Berka)  Break  GUHA Method and LISp-Miner System (J.Rauch) Tutorial @ COMPSTAT 2010 2 Part 1 Statistics, machine learning and data mining Statistics  A formal science that deals with collection, analysis, interpretation, explanation and presentation of (usually numerical) data.  The science of making effective use of numerical data relating to groups of individuals or experiments (wikipedia) Tutorial @ COMPSTAT 2010 4 Machine Learning  „The field of machine learning is concerned with the question of how to construct computer programs that automatically improve with experience.― (Mitchell, 1997)  „Things learn when they change their behavior in a way that makes them perform better in a future.― (Witten, Frank, 1999) Tutorial @ COMPSTAT 2010 5 Knowledge Discovery in Databases  „Non-trivial process of identifying valid, novel, potentially useful and ultimately understandable patterns from data.― (Fayyad et al., 1996)  „Analysis of observational data sets to find unsuspected relationships and summarize data in novel ways that are both understandable and useful to the data owner.‖ (Hand, Manilla, Smyth, 2001) Tutorial @ COMPSTAT 2010 6 The CRISP-DM Methodology Data Mining Tutorial @ COMPSTAT 2010 7 Data Machine StatisticsLearning Mining skill confirmatory acquisition data analysis empirical exploratory concept data learning analysis analytical descriptiveconcept statisticslearning Tutorial @ COMPSTAT 2010 8 Statistics vs. Machine Learing  Hypothesis driven  Data driven  Model oriented  Algorithm oriented  formulate hypothesis  formulate a task  collect data (in a  preprocess available controlled way) data  analyze data  apply (different) algorithms  interpret results  interpret results Tutorial @ COMPSTAT 2010 9 Terminological differences Machine Learning Statistics attribute variable target attribute, class dependent variable, response input attribute independent variable, predictor learning fitting, parameter estimation weights (in neural nets) parameters (in regression) error residuum Tutorial @ COMPSTAT 2010 10
  • Accueil Accueil
  • Univers Univers
  • Ebooks Ebooks
  • Livres audio Livres audio
  • Presse Presse
  • BD BD
  • Documents Documents