TY - GEN
T1 - Exact and approximation algorithms for the inversion distance between two chromosomes
AU - Kececioglu, John
AU - Sankoff, David
N1 - Funding Information: Research of JK was supported by a postdoctoral fellowship from the Program in Mathematics and Molecular Biology of the University of California at Berkeley under NSF Grant DMS-8720208, and by a fellowship from the Centre de recherches mathdmatiques of the Universitd de MontrSal. Research of DS was supported by grants from the Natural Sciences and Engineering Research Council of Canada, and the Fonds pour la formation de chercheurs et l'aide h la recherche (Qudbec). DS is a fellow of the Canadian Institute for Advanced Research. Electronic mail addresses: kece@cs, ucdavis, edu and saxtkoff@ere, umontreal, ca. A full version of this paper will be appearing in Algorilhmica \[10\]. Publisher Copyright: © Springer-Verlag Berlin Heidelberg 1993.
PY - 1993
Y1 - 1993
N2 - Motivated by the problem in computational biology of reconstructing the series of chromosome inversions by which one organism evolved from another, we consider the proble m of computing the shortest series of reversals that transform one permutation to another, The permutations describe the order of genes on corresponding chromosomes, and a reversal takes an arbitrary substring of elements and reverses their order. For this problem we develop two algorithms: a greedy approximation algorithm that finds a solution provably close to optimal in O(n 2) time and O(n) space for an n element permutation, and a branch and bound exact algorithm that finds an optimal solution in O(mL(n, n)) time and O(n 2) space, where m is the size of the branch and bound search tree and L(n, n) is the time to solve a linear program of n variables and n constraints. The greedy algorithm is the first to come within a constant factor of the optimum, and guarantees a solution that uses no more than twice the minimum number of reversals. The lower and upper bounds of the branch and bound algorithm are a novel application of maximum weight matehings, shortest paths, and linear programming. In a series of experiments we study the performance of an implementation. For random permutations we find that the average difference between the upper and lower bounds is less than 3 reversals for n ≤ 50. Due to the tightness of these bounds we can solve to optimality random permutations on 30 elements in a few minutes of computer time.
AB - Motivated by the problem in computational biology of reconstructing the series of chromosome inversions by which one organism evolved from another, we consider the proble m of computing the shortest series of reversals that transform one permutation to another, The permutations describe the order of genes on corresponding chromosomes, and a reversal takes an arbitrary substring of elements and reverses their order. For this problem we develop two algorithms: a greedy approximation algorithm that finds a solution provably close to optimal in O(n 2) time and O(n) space for an n element permutation, and a branch and bound exact algorithm that finds an optimal solution in O(mL(n, n)) time and O(n 2) space, where m is the size of the branch and bound search tree and L(n, n) is the time to solve a linear program of n variables and n constraints. The greedy algorithm is the first to come within a constant factor of the optimum, and guarantees a solution that uses no more than twice the minimum number of reversals. The lower and upper bounds of the branch and bound algorithm are a novel application of maximum weight matehings, shortest paths, and linear programming. In a series of experiments we study the performance of an implementation. For random permutations we find that the average difference between the upper and lower bounds is less than 3 reversals for n ≤ 50. Due to the tightness of these bounds we can solve to optimality random permutations on 30 elements in a few minutes of computer time.
UR - http://www.scopus.com/inward/record.url?scp=84976632165&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84976632165&partnerID=8YFLogxK
U2 - 10.1007/bfb0029799
DO - 10.1007/bfb0029799
M3 - Conference contribution
SN - 9783540567646
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 87
EP - 105
BT - Combinatorial Pattern Matching - 4th Annual Symposium, CPM 1993, Proceedings
A2 - Apostolico, Alberto
A2 - Crochemore, Maxime
A2 - Galil, Zvi
A2 - Manber, Udi
PB - Springer-Verlag
T2 - Conference of the European Society for Fuzzy Logic and Technology, EUSFLAT 2017 and 16th International Workshop on Intuitionistic Fuzzy Sets and Generalized Nets, IWIFSGN 2017
Y2 - 11 September 2017 through 15 September 2017
ER -