• Media type: Text; Master Thesis; Electronic Thesis; E-Book
  • Title: Diversity Driven Parallel Data Mining
  • Contributor: Sampson, Oliver R. [Author]
  • Published: KOPS - The Institutional Repository of the University of Konstanz, 2013
  • Language: English
  • Keywords: Krimp ; Itemset Mining ; Data Mining
  • Origination:
  • Footnote: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.
  • Description: With increasing availability and power of parallel computational resources, attention is drawn to the question of how best to apply those resources. Instead of simply finding the same answers more quickly, this thesis describes how parallel computational resources are used to explore disparate regions of a solution space by using diversity to steer the solution paths away from each other, thereby discouraging strictly greedy behavior. The formulation of models in a concept/solution space and its relationship to a search space are described as well as common search algorithms with heuristics for time or space computationally prohibitive searches. Measures of diversity are introduced, and the application of a beam search to the solution space for the Krimp algorithm for frequent itemset mining is described. Experimental results show that it is indeed possible to get better results on real-world datasets with these methods. ; published
  • Access State: Open Access
  • Rights information: In Copyright