On the possibility of a periodic word reconstruction from the subwords of fixed length
https://doi.org/10.22405/2226-8383-2021-22-1-57-66
Abstract
The problem being considered is the reconstruction of periodic words from a finite alphabet using multiset of fixed length subwords.
This is a special case of a more general problem of reconstruction with incomplete information and under restrictions on the words in question.
For some constraints on the multiset of subwords, conditions for possibility of reconstruction are obtained.
It is shown that a periodic word with period $$p$$ is uniquely determined by the multiset of its subwords of length $$k \geq \left\lfloor\frac{16}{7} \sqrt{p}\right\rfloor + 5$$.
For a word consisting of a non-periodic prefix of length $$q$$ and a periodic suffix with period $$p$$, repeated $$l$$ times, a similar estimate is obtained: $$k \geq \left\lfloor\frac{16}{7} \sqrt{P}\right\rfloor + 5$$, provided $$l \geq q^{\left\lfloor\tfrac{16}{7} \sqrt{P}\right\rfloor + 5}$$ where $$P = \max(p, q)$$.
About the Authors
Vasily Antonovich AlekseevRussian Federation
assistant at the department of informatics and computational
mathematics, assistant at the department of higher mathematics
Yuri Gennadievich Smetanin
Russian Federation
doctor of physical and mathematical sciences, chief researcher
References
1. Leontiev, V. K., & Smetanin, Y. G. 1988, “On the reconstruction of vectors from a set of their fragments”, Doklady Academii nauk, vol. 302, no. 6, pp. 1319-1322.
2. Manvel, B., Meyerowitz, A., Schwenk, A., Smith, K., & Stockmeyer, P. 1991, “Reconstruction of sequences”, Discrete Mathematics, 94(3), 209-219.
3. Skiena, S. S., & Sundaram, G. 1995, “Reconstructing strings from substrings”, Journal of Computational Biology, 2(2), 333-353.
4. Leontiev, V. K. 1995, “Words reconstruction by fragments problems and their applications”, Diskretniy analiz i issledovanie operaciy, 2(2), 26-48.
5. Gusfield, D. 1997, “Algorithms on stings, trees, and sequences: Computer science and computational biology”, Acm Sigact News, 28(4), 41-60.
6. Krasikov, I., & Roditty, Y. 1997, “On a reconstruction problem for sequences”, Journal of Combinatorial Theory, Series A, 77(2), 344-348.
7. Scott, A. D. 1997, “Reconstructing sequences”, Discrete Mathematics, 175(1-3), 231-238.
8. Borwein, P., Erd´elyi, T., & K´os, G. 1999, “Littlewood-type problems on [0, 1]”, Proceedings of the London Mathematical Society, 79(1), 22-46.
9. Levenshtein, V. I. 2001, “Efficient reconstruction of sequences from their subsequences or supersequences”, Journal of Combinatorial Theory, Series A, 93(2), 310-332.
10. Dudık, M., & Schulman, L. J. 2003, “Reconstruction from subsequences”, Journal of Combinatorial Theory, Series A, 103(2), 337-348.
11. Batu, T., Kannan, S., Khanna, S., & McGregor, A. 2004, “Reconstructing strings from random traces”, Departmental Papers (CIS), 173.
12. Kannan, S., & McGregor, A. 2005, “More on reconstructing strings from random traces: insertions and deletions”, Proceedings. International Symposium on Information Theory, 2005 (ISIT 2005), pp. 297-301, IEEE.
13. Lin, J., Keogh, E., Wei, L., & Lonardi, S. 2007, “Experiencing SAX: a novel symbolic representation of time series”, Data Mining and knowledge discovery, 15(2), 107-144.
14. Smetanin, Y. G., & Ul’yanov, M. V. 2013, “Determining the characteristics of kolmogorov complexity of time series: an approach based on symbolic descriptions”.
Review
For citations:
Alekseev V.A., Smetanin Yu.G. On the possibility of a periodic word reconstruction from the subwords of fixed length. Chebyshevskii Sbornik. 2021;22(1):57-66. (In Russ.) https://doi.org/10.22405/2226-8383-2021-22-1-57-66