Dados Bibliográficos

AUTOR(ES) E.D. Widmer , J.-A. Gauthier , Philipp Bucher , Cédric Notredame
AFILIAÇÃO(ÕES) University of Geneva, Switzerland, University of Lausanne, Switzerland,, Swiss Institute of Bioinformatics and Swiss Institute for Experimental Cancer Research, Lausanne Switzerland, Centre National de la Recherche Scientifique, Marseille, France, and Centre for Genomic Regulation, Barcelona, Spain
ANO 2009
TIPO Artigo
PERIÓDICO Sociological Methods and Research
ISSN 0049-1241
E-ISSN 1552-8294
EDITORA Annual Reviews (United States)
DOI 10.1177/0049124109342065
CITAÇÕES 10
ADICIONADO EM 2025-08-18
MD5 2084a1046904a1db91ef8eae06283fef

Resumo

One major methodological problem in analysis of sequence data is the determination of costs from which distances between sequences are derived. Although this problem is currently not optimally dealt with in the social sciences, it has some similarity with problems that have been solved in bioinformatics for three decades. In this article, the authors propose an optimization of substitution and deletion/insertion costs based on computational methods. The authors provide an empirical way of determining costs for cases, frequent in the social sciences, in which theory does not clearly promote one cost scheme over another. Using three distinct data sets, the authors tested the distances and cluster solutions produced by the new cost scheme in comparison with solutions based on cost schemes associated with other research strategies. The proposed method performs well compared with other cost-setting strategies, while it alleviates the justification problem of cost schemes.

Ferramentas