• Media type: E-Article
  • Title: Data ultrametricity and clusterability
  • Contributor: Simovici, Dan; Hua, Kaixun
  • Published: IOP Publishing, 2019
  • Published in: Journal of Physics: Conference Series, 1334 (2019) 1, Seite 012002
  • Language: Not determined
  • DOI: 10.1088/1742-6596/1334/1/012002
  • ISSN: 1742-6588; 1742-6596
  • Keywords: General Physics and Astronomy
  • Origination:
  • Footnote:
  • Description: Abstract The increasing needs of clustering massive datasets and the high cost of running clustering algorithms poses difficult problems for users. In this context it is important to determine if a data set is clusterable, that is, it may be partitioned efficiently into well-differentiated groups containing similar objects. We approach data clusterability from an ultrametric-based perspective. A novel approach to determine the ultrametricity of a dataset is proposed via a special type of matrix product, which allows us to evaluate the clusterability of the dataset. Furthermore, we show that by applying our technique to a dissimilarity space will generate the sub-dominant ultrametric of the dissimilarity.
  • Access State: Open Access