Show simple item record

dc.contributor.authorCerioli, Andrea
dc.contributor.authorGarcía Escudero, Luis Ángel 
dc.contributor.authorMayo Iscar, Agustín 
dc.contributor.authorRiani, Marco
dc.date.accessioned2018-10-05T21:57:07Z
dc.date.available2018-10-05T21:57:07Z
dc.date.issued2018
dc.identifier.citationJournal of Computational and Graphical Statistics, 2016, vol. 27, p. 404-416es
dc.identifier.issn1061-8600
dc.identifier.urihttp://uvadoc.uva.es/handle/10324/32023
dc.descriptionProducción Científica
dc.description.abstractDeciding the number of clusters k is one of the most difficult problems in clus- ter analysis. For this purpose, complexity-penalized likelihood approaches have been introduced in model-based clustering, such as the well known BIC and ICL crite- ria. However, the classi cation/mixture likelihoods considered in these approaches are unbounded without any constraint on the cluster scatter matrices. Constraints also prevent traditional EM and CEM algorithms from being trapped in (spurious) local maxima. Controlling the maximal ratio between the eigenvalues of the scatter matrices to be smaller than a xed constant c 1 is a sensible idea for setting such constraints. A new penalized likelihood criterion which takes into account the higher model complexity that a higher value of c entails, is proposed. Based on this criterion, a novel and fully automated procedure, leading to a small ranked list of optimal (k; c) couples is provided. A new plot called \car-bike" which provides a concise summary of the solutions is introduced. The performance of the procedure is assessed both in empirical examples and through a simulation study as a function of cluster overlap. Supplemental materials for the article are available online.es
dc.format.mimetypeapplication/pdfes
dc.language.isoenges
dc.publisherTaylor & Francis
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.classificationBIC
dc.subject.classificationCEM algorithm
dc.subject.classificationClustering
dc.subject.classificationEM algorithm
dc.subject.classificationICL
dc.subject.classificationMixtures
dc.titleFinding the number of normal groups in model-based clustering via constrained likelihoodses
dc.typeinfo:eu-repo/semantics/articlees
dc.rights.holder© 2018 American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America
dc.identifier.doi10.1080/10618600.2017.1390469
dc.relation.publisherversionhttps://www.tandfonline.com/doi/full/10.1080/10618600.2017.1390469
dc.peerreviewedSIes
dc.description.projectSpanish Ministerio de Economía y Competitividad, grant MTM2017-86061-C2-1-P, and by Consejería de Educación de la Junta de Castilla y León and FEDER, grant VA005P17 and VA002G18.es
dc.identifier.essn1537-2715
dc.rightsAtribución-NoComercial-SinDerivados 4.0 Internacional
dc.type.hasVersioninfo:eu-repo/semantics/acceptedVersion


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record