47 results on '"Lyndon factorization"'
Search Results
2. Practical Evaluation of Lyndon Factors via Alphabet Reordering.
- Author
-
Albertini, Marcelo K. and Louza, Felipe A.
- Subjects
- *
FACTORIZATION , *PERMUTATIONS , *ALGORITHMS , *HEURISTIC - Abstract
We evaluate the influence of different alphabet orderings on the Lyndon factorization of a string. Experiments with Pizza&Chili datasets show that for most alphabet reorderings, the number of Lyndon factors is usually small, and the length of the longest Lyndon factor can be as large as the input string, which is unfavorable for algorithms and indexes that depend on the number of Lyndon factors. We present results with randomized alphabet permutations that can be used as a baseline to assess the effectiveness of heuristics and methods designed to modify the Lyndon factorization of a string via alphabet reordering. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
3. Lyndon Words Formalized in Isabelle/HOL
- Author
-
Holub, Štěpán, Starosta, Štěpán, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Moreira, Nelma, editor, and Reis, Rogério, editor
- Published
- 2021
- Full Text
- View/download PDF
4. Can We Replace Reads by Numeric Signatures? Lyndon Fingerprints as Representations of Sequencing Reads for Machine Learning
- Author
-
Bonizzoni, Paola, De Felice, Clelia, Petescia, Alessia, Pirola, Yuri, Rizzi, Raffaella, Stoye, Jens, Zaccagnino, Rocco, Zizza, Rosalba, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Martín-Vide, Carlos, editor, Vega-Rodríguez, Miguel A., editor, and Wheeler, Travis, editor
- Published
- 2021
- Full Text
- View/download PDF
5. Lyndon Words versus Inverse Lyndon Words: Queries on Suffixes and Bordered Words
- Author
-
Bonizzoni, Paola, De Felice, Clelia, Zaccagnino, Rocco, Zizza, Rosalba, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leporati, Alberto, editor, Martín-Vide, Carlos, editor, Shapira, Dana, editor, and Zandron, Claudio, editor
- Published
- 2020
- Full Text
- View/download PDF
6. Unveiling the Connection Between the Lyndon Factorization and the Canonical Inverse Lyndon Factorization via a Border Property
- Author
-
Kralovic, R, Kucera, A, Bonizzoni, P, De Felice, C, Riccardi, B, Zaccagnino, R, Zizza, R, Bonizzoni P., De Felice C., Riccardi B., Zaccagnino R., Zizza R., Kralovic, R, Kucera, A, Bonizzoni, P, De Felice, C, Riccardi, B, Zaccagnino, R, Zizza, R, Bonizzoni P., De Felice C., Riccardi B., Zaccagnino R., and Zizza R.
- Abstract
The notion of Lyndon word and Lyndon factorization has shown to have unexpected applications in theory as well as in developing novel algorithms on words. A counterpart to these notions are those of inverse Lyndon word and inverse Lyndon factorization. Differently from the Lyndon words, the inverse Lyndon words may be bordered. The relationship between the two factorizations is related to the inverse lexicographic ordering, and has only been recently explored. More precisely, a main open question is how to get an inverse Lyndon factorization from a classical Lyndon factorization under the inverse lexicographic ordering, named CFLin. In this paper we reveal a strong connection between these two factorizations where the border plays a relevant role. More precisely, we show two main results. We say that a factorization has the border property if a nonempty border of a factor cannot be a prefix of the next factor. First we show that there exists a unique inverse Lyndon factorization having the border property. Then we show that this unique factorization with the border property is the so-called canonical inverse Lyndon factorization, named ICFL. By showing that ICFL is obtained by compacting factors of the Lyndon factorization over the inverse lexicographic ordering, we provide a linear time algorithm for computing ICFL from CFLin.
- Published
- 2024
7. Numeric Lyndon-based feature embedding of sequencing reads for machine learning approaches.
- Author
-
Bonizzoni, P., Costantini, M., De Felice, C., Petescia, A., Pirola, Y., Previtali, M., Rizzi, R., Stoye, J., Zaccagnino, R., and Zizza, R.
- Subjects
- *
MACHINE learning , *PROTEIN structure prediction , *NUCLEOTIDE sequencing , *SEQUENCE alignment , *STRIPES - Abstract
Feature embedding methods have been proposed in the literature to represent sequences as numeric vectors to be used in some bioinformatics investigations, such as family classification and protein structure prediction. Recent theoretical results showed that the well-known Lyndon factorization preserves common factors in overlapping strings [1]. Surprisingly, the fingerprint of a sequencing read, which is the sequence of lengths of consecutive factors in variants of the Lyndon factorization of the read, is effective in capturing sequence similarities, suggesting it as basis for the definition of novel representations of sequencing reads. We propose a novel feature embedding method for Next-Generation Sequencing (NGS) data using the notion of fingerprint. We provide a theoretical and experimental framework to estimate the behaviour of fingerprints and of the k -mers extracted from it, called k-fingers , as possible feature embeddings for sequencing reads. As a case study to assess the effectiveness of such embeddings, we use fingerprints to represent RNA-Seq reads in order to assign them to the most likely gene from which they originated as fragments of transcripts of the gene. We provide an implementation of the proposed method in the tool lyn2vec , which produces Lyndon-based feature embeddings of sequencing reads. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
8. On the longest common prefix of suffixes in an inverse Lyndon factorization and other properties.
- Author
-
Bonizzoni, Paola, De Felice, Clelia, Zaccagnino, Rocco, and Zizza, Rosalba
- Subjects
- *
SUFFIXES & prefixes (Grammar) - Abstract
The Lyndon factorization of a word has been largely studied and recently variants of it have been introduced and investigated with different motivations. In particular, the canonical inverse Lyndon factorization ICFL (w) of a word w , introduced in [1] , maintains the main properties of the Lyndon factorization since it can be computed in linear time and it is uniquely determined. In this paper we investigate new properties of this factorization with the aim of exploring their use in some classical queries on w. The main property we prove is related to a classical query on words. We prove that there are relations between the length of the longest common prefix (or longest common extension) lcp (x , y) of two different suffixes x , y of a word w and the maximum length M of two consecutive factors of ICFL (w). More precisely, M is an upper bound on the length of lcp (x , y). A main tool used in the proof of the above result is a property that we state for factors m i with nonempty borders in ICFL (w) : a nonempty border of m i cannot be a prefix of the next factor m i + 1. Another interesting result relates sorting of global suffixes, i.e., suffixes of a word w , and sorting of local suffixes, i.e., suffixes of products of factors in ICFL (w). This is the counterpart for ICFL (w) of the compatibility property, proved in [2,3] for the Lyndon factorization. Roughly, the compatibility property allows us to extend the mutual order between suffixes of products of the (inverse) Lyndon factors to the suffixes of the whole word. The last property we prove focuses on the Lyndon factorizations of a word and its factors. It suggests that the Lyndon factorizations of two words sharing a common overlap could be used to capture the common overlap of these two words. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
9. Reprint of: Generalized Lyndon factorizations of infinite words.
- Author
-
Burcroff, Amanda and Winsor, Eric
- Subjects
- *
LINEAR orderings , *VOCABULARY - Abstract
A generalized lexicographic order on words is a lexicographic order where the total order of the (possibly infinite) alphabet depends on the position of the comparison. A generalized Lyndon word is a finite word which is strictly smallest among its class of rotations with respect to a generalized lexicographic order. This notion can be extended to infinite words: an infinite generalized Lyndon word is an infinite word which is strictly smallest among its class of suffixes. We positively answer a question of Dolce, Restivo, and Reutenauer by proving that every infinite word has a unique nonincreasing factorization into finite and infinite generalized Lyndon words. When this factorization has finitely many terms, we characterize the last term of the factorization. Our methods also show that the infinite generalized Lyndon words are precisely the words with infinitely many generalized Lyndon prefixes. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
10. Constructing and indexing the bijective and extended Burrows–Wheeler transform.
- Author
-
Bannai, Hideo, Kärkkäinen, Juha, Köppl, Dominik, and Pia̧tkowski, Marcin
- Subjects
- *
DATA compression , *INDEXING - Abstract
The Burrows–Wheeler transform (BWT) is a permutation whose applications are prevalent in data compression and text indexing. The bijective BWT is a bijective variant of it that has not yet been studied for text indexing applications. We fill this gap by proposing a self-index built on the bijective BWT. The self-index applies the backward search technique of the FM-index to find a pattern P with O (| P | lg | P |) backward search steps. Additionally, we propose the first linear-time construction algorithm that is based on SAIS, improving the best known result of O (n lg n / lg lg n) time to linear. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. Generalized Lyndon factorizations of infinite words.
- Author
-
Burcroff, Amanda and Winsor, Eric
- Subjects
- *
LINEAR orderings , *VOCABULARY - Abstract
A generalized lexicographic order on words is a lexicographic order where the total order of the (possibly infinite) alphabet depends on the position of the comparison. A generalized Lyndon word is a finite word which is strictly smallest among its class of rotations with respect to a generalized lexicographic order. This notion can be extended to infinite words: an infinite generalized Lyndon word is an infinite word which is strictly smallest among its class of suffixes. We positively answer a question of Dolce, Restivo, and Reutenauer by proving that every infinite word has a unique nonincreasing factorization into finite and infinite generalized Lyndon words. When this factorization has finitely many terms, we characterize the last term of the factorization. Our methods also show that the infinite generalized Lyndon words are precisely the words with infinitely many generalized Lyndon prefixes. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
12. Suffixes, Conjugates and Lyndon Words
- Author
-
Bonomo, Silvia, Mantaci, Sabrina, Restivo, Antonio, Rosone, Giovanna, Sciortino, Marinella, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Béal, Marie-Pierre, editor, and Carton, Olivier, editor
- Published
- 2013
- Full Text
- View/download PDF
13. Primitive Words and Lyndon Words in Automatic and Linearly Recurrent Sequences
- Author
-
Goč, Daniel, Saari, Kalle, Shallit, Jeffrey, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Dediu, Adrian-Horia, editor, Martín-Vide, Carlos, editor, and Truthe, Bianca, editor
- Published
- 2013
- Full Text
- View/download PDF
14. On the number of equal-letter runs of the Bijective Burrows-Wheeler Transform
- Author
-
Elena, Biagi, Davide, Cenzato, Liptak, Zsuzsanna, and Giuseppe, Romana
- Subjects
Lyndon factorization ,Bijective Burrows-Wheeler Transform · BWT · eBWT ,combinatorics on words ,compression ,Bijective Burrows-Wheeler Transform · BWT · eBWT, combinatorics on words, Lyndon factorization, compression - Published
- 2023
15. Lyndon Factorization Algorithms for Small Alphabets and Run-Length Encoded Strings
- Author
-
Sukhpal Singh Ghuman, Emanuele Giaquinta, and Jorma Tarhio
- Subjects
Lyndon factorization ,string algorithms ,run-length encoding ,Industrial engineering. Management engineering ,T55.4-60.8 ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
We present two modifications of Duval’s algorithm for computing the Lyndon factorization of a string. One of the algorithms has been designed for strings containing runs of the smallest character. It works best for small alphabets and it is able to skip a significant number of characters of the string. Moreover, it can be engineered to have linear time complexity in the worst case. When there is a run-length encoded string R of length ρ , the other algorithm computes the Lyndon factorization of R in O ( ρ ) time and in constant space. It is shown by experimental results that the new variations are faster than Duval’s original algorithm in many scenarios.
- Published
- 2019
- Full Text
- View/download PDF
16. Numeric Lyndon-based feature embedding of sequencing reads for machine learning approaches
- Author
-
Bonizzoni, P, Costantini, M, De Felice, C, Petescia, A, Pirola, Y, Previtali, M, Rizzi, R, Stoye, J, Zaccagnino, R, Zizza, R, Bonizzoni, P., Costantini, M., De Felice, C., Petescia, A., Pirola, Y., Previtali, M., Rizzi, R., Stoye, J., Zaccagnino, R., Zizza, R., Bonizzoni, P, Costantini, M, De Felice, C, Petescia, A, Pirola, Y, Previtali, M, Rizzi, R, Stoye, J, Zaccagnino, R, Zizza, R, Bonizzoni, P., Costantini, M., De Felice, C., Petescia, A., Pirola, Y., Previtali, M., Rizzi, R., Stoye, J., Zaccagnino, R., and Zizza, R.
- Abstract
Feature embedding methods have been proposed in the literature to represent sequences as numeric vectors to be used in some bioinformatics investigations, such as family classification and protein structure prediction. Recent theoretical results showed that the well-known Lyndon factorization preserves common factors in overlapping strings [1]. Surprisingly, the fingerprint of a sequencing read, which is the sequence of lengths of consecutive factors in variants of the Lyndon factorization of the read, is effective in capturing sequence similarities, suggesting it as basis for the definition of novel representations of sequencing reads. We propose a novel feature embedding method for Next-Generation Sequencing (NGS) data using the notion of fingerprint. We provide a theoretical and experimental framework to estimate the behaviour of fingerprints and of the k-mers extracted from it, called k-fingers, as possible feature embeddings for sequencing reads. As a case study to assess the effectiveness of such embeddings, we use fingerprints to represent RNA-Seq reads in order to assign them to the most likely gene from which they originated as fragments of transcripts of the gene. We provide an implementation of the proposed method in the tool lyn2vec, which produces Lyndon-based feature embeddings of sequencing reads.
- Published
- 2022
17. Inferring strings from Lyndon factorization.
- Author
-
Nakashima, Yuto, Okabe, Takashi, I, Tomohiro, Inenaga, Shunsuke, Bannai, Hideo, and Takeda, Masayuki
- Subjects
- *
FACTORIZATION , *STRING theory , *MONOTONIC functions , *ALGORITHMS , *MATHEMATICAL analysis - Abstract
The Lyndon factorization of a string w is a unique factorization ℓ 1 p 1 , … , ℓ m p m of w such that ℓ 1 , … , ℓ m is a sequence of Lyndon words that is monotonically decreasing in lexicographic order. In this paper, we consider the reverse-engineering problem on Lyndon factorization : Given a sequence S = ( ( s 1 , p 1 ) , … , ( s m , p m ) ) of ordered pairs of positive integers, find a string w whose Lyndon factorization corresponds to the input sequence S , i.e., the Lyndon factorization of w is in a form of ℓ 1 p 1 , … , ℓ m p m with | ℓ i | = s i for all 1 ≤ i ≤ m . Firstly, we show that there exists a simple O ( n ) -time algorithm if the size of the alphabet is unbounded, where n is the length of the output string. Secondly, we present an O ( n ) -time algorithm to compute a string over an alphabet of the smallest size. Thirdly, we show how to compute only the size of the smallest alphabet in O ( m ) time. Fourthly, we give an O ( m ) -time algorithm to compute an O ( m ) -size representation of a string over an alphabet of the smallest size. Finally, we propose an efficient algorithm to enumerate all strings whose Lyndon factorizations correspond to S . [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
18. Faster Lyndon factorization algorithms for SLP and LZ78 compressed text.
- Author
-
I, Tomohiro, Nakashima, Yuto, Inenaga, Shunsuke, Bannai, Hideo, and Takeda, Masayuki
- Subjects
- *
GEOMETRY , *MATRICES (Mathematics) , *ALGEBRA , *ALGORITHMS , *FACTORIZATION - Abstract
We present two efficient algorithms which, given a compressed representation of a string w of length N , compute the Lyndon factorization of w . Given a straight line program (SLP) S of size n that describes w , the first algorithm runs in O ( n 2 + P ( n , N ) + Q ( n , N ) n log n ) time and O ( n 2 + S ( n , N ) ) space, where P ( n , N ) , S ( n , N ) , Q ( n , N ) are respectively the pre-processing time, space, and query time of a data structure for longest common extensions (LCE) on SLPs. Given the Lempel–Ziv 78 encoding of size s for w , the second algorithm runs in O ( s log s ) time and space. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
19. Can We Replace Reads by Numeric Signatures? Lyndon Fingerprints as Representations of Sequencing Reads for Machine Learning
- Author
-
Martín-Vide, C, Vega-Rodríguez, MA, Wheeler, T, Bonizzoni, P, De Felice, C, Petescia, A, Pirola, Y, Rizzi, R, Stoye, J, Zaccagnino, R, Zizza, R, Bonizzoni, Paola, De Felice, Clelia, Petescia, Alessia, Pirola, Yuri, Rizzi, Raffaella, Stoye, Jens, Zaccagnino, Rocco, Zizza, Rosalba, Martín-Vide, C, Vega-Rodríguez, MA, Wheeler, T, Bonizzoni, P, De Felice, C, Petescia, A, Pirola, Y, Rizzi, R, Stoye, J, Zaccagnino, R, Zizza, R, Bonizzoni, Paola, De Felice, Clelia, Petescia, Alessia, Pirola, Yuri, Rizzi, Raffaella, Stoye, Jens, Zaccagnino, Rocco, and Zizza, Rosalba
- Abstract
Representations of biological sequences facilitating sequence comparison are crucial in several bioinformatics tasks. Recently, the Lyndon factorization has been proved to preserve common factors in overlapping reads, thus leading to the idea of using factorizations of sequences to define measures of similarity between reads. In this paper we propose as a signature of sequencing reads the notion of fingerprint, i.e., the sequence of lengths of consecutive factors in Lyndon-based factorizations of the reads. Surprisingly, fingerprints of reads are effective in preserving sequence similarities while providing a compact representation of the read, and so, k-mers extracted from a fingerprint, called k-fingers, can be used to capture sequence similarity between reads. We first provide a probabilistic framework to estimate the behaviour of fingerprints. Then we experimentally evaluate the effectiveness of this representation for machine learning algorithms for classifying biological sequences. In particular, we considered the problem of assigning RNA-Seq reads to the most likely gene from which they were generated. Our results show that fingerprints can provide an effective machine learning interpretable representation, successfully preserving sequence similarity.
- Published
- 2021
20. On the longest common prefix of suffixes in an inverse Lyndon factorization and other properties
- Author
-
Bonizzoni, P, De Felice, C, Zaccagnino, R, Zizza, R, Bonizzoni P., De Felice C., Zaccagnino R., Zizza R., Bonizzoni, P, De Felice, C, Zaccagnino, R, Zizza, R, Bonizzoni P., De Felice C., Zaccagnino R., and Zizza R.
- Abstract
The Lyndon factorization of a word has been largely studied and recently variants of it have been introduced and investigated with different motivations. In particular, the canonical inverse Lyndon factorization ICFL(w) of a word w, introduced in [1], maintains the main properties of the Lyndon factorization since it can be computed in linear time and it is uniquely determined. In this paper we investigate new properties of this factorization with the aim of exploring their use in some classical queries on w. The main property we prove is related to a classical query on words. We prove that there are relations between the length of the longest common prefix (or longest common extension) lcp(x,y) of two different suffixes x,y of a word w and the maximum length M of two consecutive factors of ICFL(w). More precisely, M is an upper bound on the length of lcp(x,y). A main tool used in the proof of the above result is a property that we state for factors mi with nonempty borders in ICFL(w): a nonempty border of mi cannot be a prefix of the next factor mi+1. Another interesting result relates sorting of global suffixes, i.e., suffixes of a word w, and sorting of local suffixes, i.e., suffixes of products of factors in ICFL(w). This is the counterpart for ICFL(w) of the compatibility property, proved in [2,3] for the Lyndon factorization. Roughly, the compatibility property allows us to extend the mutual order between suffixes of products of the (inverse) Lyndon factors to the suffixes of the whole word. The last property we prove focuses on the Lyndon factorizations of a word and its factors. It suggests that the Lyndon factorizations of two words sharing a common overlap could be used to capture the common overlap of these two words.
- Published
- 2021
21. On the longest common prefix of suffixes in an inverse Lyndon factorization and other properties
- Author
-
Rocco Zaccagnino, Rosalba Zizza, Paola Bonizzoni, Clelia De Felice, Bonizzoni, P, De Felice, C, Zaccagnino, R, and Zizza, R
- Subjects
General Computer Science ,LCP array ,Inverse ,Lyndon words ,Upper and lower bounds ,Lyndon factorization ,combinatorial algorithms on words ,Lyndon word ,Theoretical Computer Science ,Combinatorial algorithms on word ,Prefix ,Combinatorics ,Factorization ,Time complexity ,Mathematics - Abstract
The Lyndon factorization of a word has been largely studied and recently variants of it have been introduced and investigated with different motivations. In particular, the canonical inverse Lyndon factorization ICFL ( w ) of a word w, introduced in [1] , maintains the main properties of the Lyndon factorization since it can be computed in linear time and it is uniquely determined. In this paper we investigate new properties of this factorization with the aim of exploring their use in some classical queries on w. The main property we prove is related to a classical query on words. We prove that there are relations between the length of the longest common prefix (or longest common extension) lcp ( x , y ) of two different suffixes x , y of a word w and the maximum length M of two consecutive factors of ICFL ( w ) . More precisely, M is an upper bound on the length of lcp ( x , y ) . A main tool used in the proof of the above result is a property that we state for factors m i with nonempty borders in ICFL ( w ) : a nonempty border of m i cannot be a prefix of the next factor m i + 1 . Another interesting result relates sorting of global suffixes, i.e., suffixes of a word w, and sorting of local suffixes, i.e., suffixes of products of factors in ICFL ( w ) . This is the counterpart for ICFL ( w ) of the compatibility property, proved in [2] , [3] for the Lyndon factorization. Roughly, the compatibility property allows us to extend the mutual order between suffixes of products of the (inverse) Lyndon factors to the suffixes of the whole word. The last property we prove focuses on the Lyndon factorizations of a word and its factors. It suggests that the Lyndon factorizations of two words sharing a common overlap could be used to capture the common overlap of these two words.
- Published
- 2021
22. Finding an Optimal Alphabet Ordering for Lyndon Factorization Is Hard
- Author
-
Gibney, Daniel and Thankachan, Sharma V.
- Subjects
Lyndon Factorization ,String Algorithms ,Theory of computation → Problems, reductions and completeness ,Burrows-Wheeler Transform - Abstract
This work establishes several strong hardness results on the problem of finding an ordering on a string’s alphabet that either minimizes or maximizes the number of factors in that string’s Lyndon factorization. In doing so, we demonstrate that these ordering problems are sufficiently complex to model a wide variety of ordering constraint satisfaction problems (OCSPs). Based on this, we prove that (i) the decision versions of both the minimization and maximization problems are NP-complete, (ii) for both the minimization and maximization problems there does not exist a constant approximation algorithm running in polynomial time under the Unique Game Conjecture and (iii) there does not exist an algorithm to solve the minimization problem in time poly(|T|) ⋅ 2^o(σlog σ) for a string T over an alphabet of size σ under the Exponential Time Hypothesis (essentially the brute force approach of trying every alphabet order is hard to improve significantly)., LIPIcs, Vol. 187, 38th International Symposium on Theoretical Aspects of Computer Science (STACS 2021), pages 35:1-35:15
- Published
- 2021
- Full Text
- View/download PDF
23. Suffix array and Lyndon factorization of a text.
- Author
-
Mantaci, Sabrina, Restivo, Antonio, Rosone, Giovanna, and Sciortino, Marinella
- Abstract
The main goal of this paper is to highlight the relationship between the suffix array of a text and its Lyndon factorization. It is proved in [15] that one can obtain the Lyndon factorization of a text from its suffix array. Conversely, here we show a new method for constructing the suffix array of a text that takes advantage of its Lyndon factorization. The surprising consequence of our results is that, in order to construct the suffix array, the local suffixes inside each Lyndon factor can be separately processed, allowing different implementative scenarios, such as online, external and internal memory, or parallel implementations. Based on our results, the algorithm that we propose sorts the suffixes by starting from the leftmost Lyndon factors, even if the whole text or the complete Lyndon factorization are not yet available. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
24. A characterization of infinite smooth Lyndon words
- Author
-
Geneviève Paquin
- Subjects
lyndon words ,lyndon factorization ,smooth words ,extremal smooth words ,[info.info-dm] computer science [cs]/discrete mathematics [cs.dm] ,Mathematics ,QA1-939 - Abstract
Combinatorics
- Published
- 2010
- Full Text
- View/download PDF
25. Lyndon words versus inverse lyndon words: Queries on suffixes and bordered words
- Author
-
Bonizzoni, P, De Felice, C, Zaccagnino, R, Zizza, R, Bonizzoni P., De Felice C., Zaccagnino R., Zizza R., Bonizzoni, P, De Felice, C, Zaccagnino, R, Zizza, R, Bonizzoni P., De Felice C., Zaccagnino R., and Zizza R.
- Abstract
The Lyndon factorization of a word has been extensively studied in different contexts and several variants of it have been proposed. In particular, the canonical inverse Lyndon factorization ICFL, introduced in [5], maintains the main properties of the Lyndon factorization since it can be computed in linear time and it is uniquely determined. In this paper we investigate new properties of this factorization with the purpose of exploring its use in string queries. As a main result, we prove an upper bound on the length of the longest common extension (or longest common prefix) for two factors of a word w. This bound is at most the maximum length of two consecutive factors of ICFL(w). A tool used in the proof is a property that we state for factors with nonempty borders in ICFL(w): a nonempty border of a factor mi cannot be a prefix of the next factor mi+1. Another interesting result relates sorting of global suffixes, i.e., suffixes of a word w, and sorting of local suffixes, i.e., suffixes of the factors in ICFL(w). Finally, given a word w and a factor x of w, we prove that their Lyndon factorizations share factors, except for the first and last term of the Lyndon factorization of x. This property suggests that, given two words sharing a common overlap, their Lyndon factorizations could be used to capture the common overlap of these two words.
- Published
- 2020
26. The decrease value theorem with an application to permutation statistics
- Author
-
Foata, Dominique and Han, Guo-Niu
- Subjects
- *
BOUNDARY value problems , *PERMUTATIONS , *WREATH products (Group theory) , *STATISTICS , *GENERATING functions , *MATHEMATICAL symmetry , *CALCULUS - Abstract
Abstract: The decrease value theorem is restated and given a specialization more adapted to permutation statistic calculus. As an application, the computation of a factorial multivariable generating function for the wreath product of the cyclic group of finite order by the symmetric group is given in full detail. [Copyright &y& Elsevier]
- Published
- 2011
- Full Text
- View/download PDF
27. A characterization of infinite smooth Lyndon words.
- Author
-
Paquin, Geneviève
- Subjects
- *
FACTORIZATION , *MATHEMATICAL functions , *MATHEMATICS dictionaries , *LEXICOGRAPHY , *FINITENESS (Linguistics) - Abstract
In a recent paper, Brlek, Jamet and Paquin showed that some extremal infinite smooth words are also infinite Lyndon words. This result raises a natural question: are they the only ones? If no, what do the infinite smooth words that are also Lyndon words look like? In this paper, we give the answer, proving that the only infinite smooth Lyndon words are m{1< b}, with a,b even, m{1< b} and |Δ-11 with b odd, where mA is the minimal infinite smooth word with respect to the lexicographic order over a numerical alphabet A and Δ is the run-length encoding function. [ABSTRACT FROM AUTHOR]
- Published
- 2010
28. LOOK AND SAY FIBONACCI.
- Author
-
Séébold, Patrice
- Subjects
FIBONACCI sequence ,BINARY number system ,MATHEMATIC morphism ,FACTORIZATION of operators ,UNARY algebras ,COMBINATORICS - Abstract
The LS (Look and Say) derivative of a word is obtained by writing the number of consecutive equal letters when the word is spelled from left to right. For example, LS(1 1 2 3 3) = 2 1 1 2 2 3 (two 1, one 2, two 3). We start the study of the behaviour of binary words generated by morphisms under the LS operator, focusing in particular on the Fibonacci word. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
29. Smooth words on 2-letter alphabets having same parity
- Author
-
Brlek, S., Jamet, D., and Paquin, G.
- Subjects
- *
ALPHABETS , *ALGEBRAIC number theory , *FACTORS (Algebra) , *MATHEMATICS dictionaries , *ALGORITHMS - Abstract
In this paper, we consider smooth words over 2-letter alphabets , where are integers having same parity, with . We show that all are recurrent and that the closure of the set of factors under reversal holds for odd alphabets only. We provide a linear time algorithm computing the extremal words, w.r.t. lexicographic order. The minimal word is an infinite Lyndon word if and only if either and are odd, or are even. A connection is established between generalized Kolakoski words and maximal infinite smooth words over even 2-letter alphabets revealing new properties for some of the generalized Kolakoski words. Finally, the frequency of letters in extremal words is 1/2 for even alphabets, and for with odd, the frequency of ’s is . [Copyright &y& Elsevier]
- Published
- 2008
- Full Text
- View/download PDF
30. Properties of the Extremal Infinite Smooth Words.
- Author
-
Brlek, S., Melançcon, G., and Paquin, G.
- Subjects
- *
NUMBER theory , *ALGORITHMS , *FACTORIZATION , *FIBONACCI sequence , *MATHEMATICS - Abstract
Smooth words are connected to the Kolakoski sequence. We construct the maximal and the minimal infinite smooth words, with respect to the lexicographical order. The naive algorithm generating them is improved by using a reduction of the De Bruijn graph of their factors. We also study their Lyndon factorizations. Finally, we show that the minimal smooth word over the alphabet {1; 3} belongs to the orbit of the Fibonacci word. [ABSTRACT FROM AUTHOR]
- Published
- 2007
31. THE SEMIGROUP OF CONJUGATES OF A WORD.
- Author
-
HIGGINS, PETER M.
- Subjects
- *
SEMIGROUP algebras , *MONOIDS , *FACTORIZATION , *MATHEMATICAL mappings , *WORD problems (Mathematics) , *COMBINATORIAL group theory - Abstract
We introduce a finite semigroup associated with a conjugacy class of a word in the free monoid over a finite alphabet. Using properties of this semigroup we derive results on combinatorics on words. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
32. Lyndon factorization of the Prouhet words
- Author
-
Séébold, Patrice
- Subjects
- *
MATHEMATIC morphism , *FACTORIZATION , *MATHEMATICS - Abstract
Prouhet words are a natural generalization, over alphabets with more than two letters, of the well known binary Thue–Morse word.We give a unique factorization of these words in a sequence of decreasing Lyndon words, then generalizing such a decomposition given by Ido and Melanc¸on for the Thue–Morse word. [Copyright &y& Elsevier]
- Published
- 2003
- Full Text
- View/download PDF
33. Lyndon factorization of generalized words of Thue
- Author
-
Anton Černý
- Subjects
lyndon word ,lyndon factorization ,automatic sequence ,[info.info-dm] computer science [cs]/discrete mathematics [cs.dm] ,Mathematics ,QA1-939 - Abstract
The i-th symbol of the well-known infinite word of Thue on the alphabet \ 0,1\ can be characterized as the parity of the number of occurrences of the digit 1 in the binary notation of i. Generalized words of Thue are based on counting the parity of occurrences of an arbitrary word w∈\ 0,1\^+-0^* in the binary notation of i. We provide here the standard Lyndon factorization of some subclasses of this class of infinite words.
- Published
- 2002
- Full Text
- View/download PDF
34. Lyndon factorization of the Thue-Morse word and its relatives
- Author
-
Augustin Ido and Guy Melançon
- Subjects
morphisms ,lyndon factorization ,thue-morse word ,[info.info-dm] computer science [cs]/discrete mathematics [cs.dm] ,Mathematics ,QA1-939 - Abstract
We compute the Lyndon factorization of the Thue-Morse word. We also compute the Lyndon factorization of two related sequences involving morphisms that give rise to new presentations of these sequences.
- Published
- 1997
- Full Text
- View/download PDF
35. On the Size of Overlapping Lempel-Ziv and Lyndon Factorizations
- Author
-
Yuki Urabe and Yuto Nakashima and Shunsuke Inenaga and Hideo Bannai and Masayuki Takeda, Urabe, Yuki, Nakashima, Yuto, Inenaga, Shunsuke, Bannai, Hideo, Takeda, Masayuki, Yuki Urabe and Yuto Nakashima and Shunsuke Inenaga and Hideo Bannai and Masayuki Takeda, Urabe, Yuki, Nakashima, Yuto, Inenaga, Shunsuke, Bannai, Hideo, and Takeda, Masayuki
- Abstract
Lempel-Ziv (LZ) factorization and Lyndon factorization are well-known factorizations of strings. Recently, Kärkkäinen et al. studied the relation between the sizes of the two factorizations, and showed that the size of the Lyndon factorization is always smaller than twice the size of the non-overlapping LZ factorization [STACS 2017]. In this paper, we consider a similar problem for the overlapping version of the LZ factorization. Since the size of the overlapping LZ factorization is always smaller than the size of the non-overlapping LZ factorization and, in fact, can even be an O(log n) factor smaller, it is not immediately clear whether a similar bound as in previous work would hold. Nevertheless, in this paper, we prove that the size of the Lyndon factorization is always smaller than four times the size of the overlapping LZ factorization.
- Published
- 2019
- Full Text
- View/download PDF
36. Lyndon words versus inverse Lyndon words: queries on suffixes and bordered words
- Author
-
Rosalba Zizza, Clelia De Felice, Paola Bonizzoni, Rocco Zaccagnino, Bonizzoni, P, De Felice, C, Zaccagnino, R, and Zizza, R
- Subjects
FOS: Computer and information sciences ,050101 languages & linguistics ,Formal Languages and Automata Theory (cs.FL) ,05 social sciences ,String (computer science) ,Combinatorial algorithms on words ,Inverse ,Computer Science - Formal Languages and Automata Theory ,02 engineering and technology ,Lyndon words ,Article ,Combinatorics ,Factorization ,Lyndon factorization ,0202 electrical engineering, electronic engineering, information engineering ,FOS: Mathematics ,Mathematics - Combinatorics ,020201 artificial intelligence & image processing ,0501 psychology and cognitive sciences ,Combinatorics (math.CO) ,Time complexity ,Word (group theory) ,Mathematics - Abstract
Lyndon words have been largely investigated and showned to be a useful tool to prove interesting combinatorial properties of words. In this paper we state new properties of both Lyndon and inverse Lyndon factorizations of a word $w$, with the aim of exploring their use in some classical queries on $w$. The main property we prove is related to a classical query on words. We prove that there are relations between the length of the longest common extension (or longest common prefix) $lcp(x,y)$ of two different suffixes $x,y$ of a word $w$ and the maximum length $\mathcal{M}$ of two consecutive factors of the inverse Lyndon factorization of $w$. More precisely, $\mathcal{M}$ is an upper bound on the length of $lcp(x,y)$. This result is in some sense stronger than the compatibility property, proved by Mantaci, Restivo, Rosone and Sciortino for the Lyndon factorization and here for the inverse Lyndon factorization. Roughly, the compatibility property allows us to extend the mutual order between local suffixes of (inverse) Lyndon factors to the suffixes of the whole word. A main tool used in the proof of the above results is a property that we state for factors $m_i$ with nonempty borders in an inverse Lyndon factorization: a nonempty border of $m_i$ cannot be a prefix of the next factor $m_{i+1}$. The last property we prove shows that if two words share a common overlap, then their Lyndon factorizations can be used to capture the common overlap of the two words. The above results open to the study of new applications of Lyndon words and inverse Lyndon words in the field of string comparison., Comment: arXiv admin note: text overlap with arXiv:1705.10277
- Published
- 2019
- Full Text
- View/download PDF
37. Longest Lyndon Substring After Edit
- Author
-
Yuki Urabe and Yuto Nakashima and Shunsuke Inenaga and Hideo Bannai and Masayuki Takeda, Urabe, Yuki, Nakashima, Yuto, Inenaga, Shunsuke, Bannai, Hideo, Takeda, Masayuki, Yuki Urabe and Yuto Nakashima and Shunsuke Inenaga and Hideo Bannai and Masayuki Takeda, Urabe, Yuki, Nakashima, Yuto, Inenaga, Shunsuke, Bannai, Hideo, and Takeda, Masayuki
- Abstract
The longest Lyndon substring of a string T is the longest substring of T which is a Lyndon word. LLS(T) denotes the length of the longest Lyndon substring of a string T. In this paper, we consider computing LLS(T') where T' is an edited string formed from T. After O(n) time and space preprocessing, our algorithm returns LLS(T') in O(log n) time for any single character edit. We also consider a version of the problem with block edits, i.e., a substring of T is replaced by a given string of length l. After O(n) time and space preprocessing, our algorithm returns LLS(T') in O(l log sigma + log n) time for any block edit where sigma is the number of distinct characters in T. We can modify our algorithm so as to output all the longest Lyndon substrings of T' for both problems.
- Published
- 2018
- Full Text
- View/download PDF
38. Lyndon Factorization of Grammar Compressed Texts Revisited
- Author
-
Isamu Furuya and Yuto Nakashima and Tomohiro I and Shunsuke Inenaga and Hideo Bannai and Masayuki Takeda, Furuya, Isamu, Nakashima, Yuto, I, Tomohiro, Inenaga, Shunsuke, Bannai, Hideo, Takeda, Masayuki, Isamu Furuya and Yuto Nakashima and Tomohiro I and Shunsuke Inenaga and Hideo Bannai and Masayuki Takeda, Furuya, Isamu, Nakashima, Yuto, I, Tomohiro, Inenaga, Shunsuke, Bannai, Hideo, and Takeda, Masayuki
- Abstract
We revisit the problem of computing the Lyndon factorization of a string w of length N which is given as a straight line program (SLP) of size n. For this problem, we show a new algorithm which runs in O(P(n, N) + Q(n, N)n log log N) time and O(n log N + S(n, N)) space where P(n, N), S(n,N), Q(n,N) are respectively the pre-processing time, space, and query time of a data structure for longest common extensions (LCE) on SLPs. Our algorithm improves the algorithm proposed by I et al. (TCS '17), and can be more efficient than the O(N)-time solution by Duval (J. Algorithms '83) when w is highly compressible.
- Published
- 2018
- Full Text
- View/download PDF
39. Inverse Lyndon words and inverse Lyndon factorizations of words
- Author
-
Bonizzoni, P, De Felice, C, Zaccagnino, R, Zizza, R, Bonizzoni, P, De Felice, C, Zaccagnino, R, and Zizza, R
- Abstract
Motivated by applications to string processing, we introduce variants of the Lyndon factorization called inverse Lyndon factorizations. Their factors, named inverse Lyndon words, are in a class that strictly contains anti-Lyndon words, that is Lyndon words with respect to the inverse lexicographic order. The Lyndon factorization of a nonempty word w is unique but w may have several inverse Lyndon factorizations. We prove that any nonempty word w admits a canonical inverse Lyndon factorization, named ICFL(w), that maintains the main properties of the Lyndon factorization of w: it can be computed in linear time, it is uniquely determined, and it preserves a compatibility property for sorting suffixes. In particular, the compatibility property of ICFL(w) is a consequence of another result: any factor in ICFL(w) is a concatenation of consecutive factors of the Lyndon factorization of w with respect to the inverse lexicographic order.
- Published
- 2018
40. Suffix array and Lyndon factorization of a text
- Author
-
Antonio Restivo, Marinella Sciortino, Sabrina Mantaci, Giovanna Rosone, Mantaci, S, Restivo, A, Rosone, G, and Sciortino, M
- Subjects
Sorting suffixes ,BWT ,Suffix array ,Lyndon word ,Lyndon factorization ,Compressed suffix array ,Settore INF/01 - Informatica ,Generalized suffix tree ,Order (ring theory) ,Construct (python library) ,Sorting suffixe ,Theoretical Computer Science ,law.invention ,Computational Theory and Mathematics ,Factorization ,law ,Factor (programming language) ,Internal memory ,Discrete Mathematics and Combinatorics ,Arithmetic ,computer ,Mathematics ,computer.programming_language - Abstract
The main goal of this paper is to highlight the relationship between the suffix array of a text and its Lyndon factorization. It is proved in [15] that one can obtain the Lyndon factorization of a text from its suffix array. Conversely, here we show a new method for constructing the suffix array of a text that takes advantage of its Lyndon factorization. The surprising consequence of our results is that, in order to construct the suffix array, the local suffixes inside each Lyndon factor can be separately processed, allowing different implementative scenarios, such as online, external and internal memory, or parallel implementations. Based on our results, the algorithm that we propose sorts the suffixes by starting from the leftmost Lyndon factors, even if the whole text or the complete Lyndon factorization are not yet available.
- Published
- 2014
- Full Text
- View/download PDF
41. On the Size of Lempel-Ziv and Lyndon Factorizations
- Author
-
Juha Kärkkäinen and Dominik Kempa and Yuto Nakashima and Simon J. Puglisi and Arseny M. Shur, Kärkkäinen, Juha, Kempa, Dominik, Nakashima, Yuto, Puglisi, Simon J., Shur, Arseny M., Juha Kärkkäinen and Dominik Kempa and Yuto Nakashima and Simon J. Puglisi and Arseny M. Shur, Kärkkäinen, Juha, Kempa, Dominik, Nakashima, Yuto, Puglisi, Simon J., and Shur, Arseny M.
- Abstract
Lyndon factorization and Lempel-Ziv (LZ) factorization are both important tools for analysing the structure and complexity of strings, but their combinatorial structure is very different. In this paper, we establish the first direct connection between the two by showing that while the Lyndon factorization can be bigger than the non-overlapping LZ factorization (which we demonstrate by describing a new, non-trivial family of strings) it is always less than twice the size.
- Published
- 2017
- Full Text
- View/download PDF
42. Lyndon Factorization Algorithms for Small Alphabets and Run-Length Encoded Strings †.
- Author
-
Ghuman, Sukhpal Singh, Giaquinta, Emanuele, and Tarhio, Jorma
- Subjects
RUN-length encoding ,ALGORITHMS ,ALPHABET ,SPACETIME - Abstract
We present two modifications of Duval's algorithm for computing the Lyndon factorization of a string. One of the algorithms has been designed for strings containing runs of the smallest character. It works best for small alphabets and it is able to skip a significant number of characters of the string. Moreover, it can be engineered to have linear time complexity in the worst case. When there is a run-length encoded string R of length ρ , the other algorithm computes the Lyndon factorization of R in O (ρ) time and in constant space. It is shown by experimental results that the new variations are faster than Duval's original algorithm in many scenarios. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
43. Minimal Suffix and Rotation of a Substring in Optimal Time
- Author
-
Tomasz Kociumaka, Kociumaka, Tomasz, Tomasz Kociumaka, and Kociumaka, Tomasz
- Abstract
For a text of length $n$ given in advance, the substring minimal suffix queries ask to determine the lexicographically minimal non-empty suffix of a substring specified by the location of its occurrence in the text. We develop a data structure answering such queries optimally: in constant time after linear-time preprocessing. This improves upon the results of Babenko et al. (CPM 2014), whose trade-off solution is characterized by Theta(n log n) product of these time complexities. Next, we extend our queries to support concatenations of O(1) substrings, for which the construction and query time is preserved. We apply these generalized queries to compute lexicographically minimal and maximal rotations of a given substring in constant time after linear-time preprocessing. Our data structures mainly rely on properties of Lyndon words and Lyndon factorizations. We combine them with further algorithmic and combinatorial tools, such as fusion trees and the notion of order isomorphism of strings.
- Published
- 2016
- Full Text
- View/download PDF
44. Signed words and permutations, V; a sextuple distribution
- Author
-
Foata, Dominique and Han, Guo-Niu
- Published
- 2009
- Full Text
- View/download PDF
45. Lyndon factorization of the Prouhet words
- Author
-
Patrice Séébold
- Subjects
Sequence ,General Computer Science ,Thue–Morse morphism and word ,Generalization ,Unique factorization domain ,Binary number ,Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing) ,Lyndon words ,Theoretical Computer Science ,Algebra ,Combinatorics ,Lyndon factorization ,Factorization ,Computer Science::Discrete Mathematics ,Prouhet words ,Computer Science::Formal Languages and Automata Theory ,Word (group theory) ,Computer Science(all) ,Mathematics - Abstract
Prouhet words are a natural generalization, over alphabets with more than two letters, of the well known binary Thue–Morse word.We give a unique factorization of these words in a sequence of decreasing Lyndon words, then generalizing such a decomposition given by Ido and Melançon for the Thue–Morse word.
- Published
- 2003
- Full Text
- View/download PDF
46. Sorting suffixes of a text via its Lyndon Factorization
- Author
-
Mantaci, S., Restivo, A., Giovanna Rosone, Sciortino, M., Mantaci, S, Restivo, A, Rosone, G, and Sciortino, M
- Subjects
FOS: Computer and information sciences ,BWT ,Lyndon Factorization ,Settore INF/01 - Informatica ,Sorting Suffixes ,Lyndon Words ,Suffix array ,Computer Science - Data Structures and Algorithms ,Data_FILES ,Data Structures and Algorithms (cs.DS) ,Lyndon word ,Sorting suffixe - Abstract
The process of sorting the suffixes of a text plays a fundamental role in Text Algorithms. They are used for instance in the constructions of the Burrows-Wheeler transform and the suffix array, widely used in several fields of Computer Science. For this reason, several recent researches have been devoted to finding new strategies to obtain effective methods for such a sorting. In this paper we introduce a new methodology in which an important role is played by the Lyndon factorization, so that the local suffixes inside factors detected by this factorization keep their mutual order when extended to the suffixes of the whole word. This property suggests a versatile technique that easily can be adapted to different implementative scenarios., Submitted to the Prague Stringology Conference 2013 (PSC 2013)
- Published
- 2013
47. Look and Say Fibonacci
- Author
-
Patrice Séébold, Séébold, Patrice, Arithmétique informatique (ARITH), Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier (LIRMM), Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM), and Université Paul-Valéry - Montpellier 3 (UPVM)
- Subjects
Fibonacci number ,binary words ,General Mathematics ,Binary number ,0102 computer and information sciences ,02 engineering and technology ,[INFO.INFO-DM]Computer Science [cs]/Discrete Mathematics [cs.DM] ,Fibonacci word ,01 natural sciences ,Combinatorics ,Operator (computer programming) ,Morphism ,Factorization ,Look-and-say sequence ,0202 electrical engineering, electronic engineering, information engineering ,Mathematics ,Look and Say sequence ,morphisms ,Conway ,Computer Science Applications ,[INFO.INFO-DM] Computer Science [cs]/Discrete Mathematics [cs.DM] ,Lyndon factorization ,010201 computation theory & mathematics ,020201 artificial intelligence & image processing ,Software ,Word (group theory) - Abstract
The LS (Look and Say) derivative of a word is obtained by writing the number of consecutive equal letters when the word is spelled from left to right. For example, LS(11233) = 211223 (two 1, one 2, two 3). We start the study of the behaviour of binary words generated by morphisms under the LS operator, focusing in particular on the Fibonacci word., La dérivée LS d'un mot est obtenue en décrivant les blocs de lettres qui apparaissent quand on épelle le mot. Par exemple, LS(11233) = 211223 (deux 1, un 2, deux 3). Nous commençons l'étude de la transformation, par l'opération LS, des mots binaires engendrés par morphismes. Notre attention se porte ici en particulier sur le mot de Fibonacci.
- Published
- 2008
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.