
Efficient Sequential Learning in Structured and Constrained Environments ; Apprentissage séquentiel efficace dans des environnements structurés avec contraintes

  • Additional Information
    • Contributors:
      Sequential Learning (SEQUEL); Centre Inria de l'Université de Lille; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 (CRIStAL); Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS)-Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS); Inria Lille Nord Europe - Laboratoire CRIStAL - Université de Lille; Michal Valko; Alessandro Lazaric
    • Publication Information:
      CCSD
    • Publication Date:
      2017
    • Collection:
      LillOA (HAL Lille Open Archive, Université de Lille)
    • Abstract:
      The main advantage of non-parametric models is that the accuracy of the model (degrees of freedom) adapts to the number of samples. The main drawback is the so-called "curse of kernelization": to learn the model we must first compute a similarity matrix among all samples, which requires quadratic space and time and is unfeasible for large datasets. Nonetheless, the underlying effective dimension (effective d.o.f.) of the dataset is often much smaller than its size, and we can replace the dataset with a subset (dictionary) of highly informative samples. Unfortunately, fast data-oblivious selection methods (e.g., uniform sampling) almost always discard useful information, while data-adaptive methods that provably construct an accurate dictionary, such as ridge leverage score (RLS) sampling, have a quadratic time/space cost.
      In this thesis we introduce a new single-pass streaming RLS sampling approach that sequentially constructs the dictionary, where each step compares a new sample only with the current intermediate dictionary and not with all past samples. We prove that the size of all intermediate dictionaries scales only with the effective dimension of the dataset, and therefore guarantee a per-step time and space complexity independent of the number of samples. This reduces the overall time required to construct provably accurate dictionaries from quadratic to near-linear, or even logarithmic when parallelized.
      Finally, for many non-parametric learning problems (e.g., K-PCA, graph SSL, online kernel learning) we show that we can use the generated dictionaries to compute approximate solutions in near-linear time that are both provably accurate and empirically competitive. ; The main advantage of non-parametric learning methods lies in the fact that the number of degrees of freedom of the learned model adapts automatically to the number of samples. These methods are, however, limited by the "curse of kernelization": learning the model first requires constructing a similarity matrix between all ...
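      The streaming idea in the abstract can be sketched in a few lines: each arriving sample is compared only to the current dictionary, its ridge leverage score is estimated from that comparison, and the sample is kept with probability proportional to the score. This is a simplified illustration, not the thesis's actual algorithm (which adds resparsification and formal guarantees); the function names `streaming_rls_dictionary` and `rbf`, the RBF kernel choice, and the oversampling factor `q` are all assumptions made for the sketch.

      ```python
      import numpy as np

      def rbf(X, Y, sigma=1.0):
          # Gaussian (RBF) kernel matrix between the rows of X and Y.
          d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
          return np.exp(-d2 / (2.0 * sigma ** 2))

      def streaming_rls_dictionary(X, lam=1.0, q=2.0, rng=None):
          """Single-pass sketch: keep each new point with probability
          proportional to its approximate ridge leverage score, computed
          against the current dictionary only, never against all past samples."""
          rng = np.random.default_rng(rng)
          dictionary = [0]  # always keep the first sample
          for t in range(1, len(X)):
              D = X[dictionary]
              k_tD = rbf(X[t:t + 1], D)          # similarities to dictionary only
              K_DD = rbf(D, D)                   # small m x m system, m = |dictionary|
              # Approximate ridge leverage score of x_t via the dictionary:
              # (k(x,x) - k_tD (K_DD + lam I)^{-1} k_tD^T) / lam, with k(x,x)=1 for RBF.
              resid = 1.0 - (k_tD @ np.linalg.solve(
                  K_DD + lam * np.eye(len(D)), k_tD.T))[0, 0]
              p = min(1.0, q * max(resid, 0.0) / lam)
              if rng.random() < p:
                  dictionary.append(t)
          return dictionary
      ```

      Because each step solves only an m x m system in the current dictionary size m, the per-step cost depends on the effective dimension rather than on the number of samples seen so far, which is the property the abstract emphasizes.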
    • Online Access:
      https://theses.hal.science/tel-01816904
      https://theses.hal.science/tel-01816904v1/document
      https://theses.hal.science/tel-01816904v1/file/main.pdf
    • Rights:
      info:eu-repo/semantics/OpenAccess
    • Accession Number:
      edsbas.4D69099D