Preview

Chebyshevskii Sbornik

Advanced search

On the possibility of a periodic word reconstruction from the subwords of fixed length

https://doi.org/10.22405/2226-8383-2021-22-1-57-66

Abstract

The problem being considered is the reconstruction of periodic words from a finite alphabet using multiset of fixed length subwords.
This is a special case of a more general problem of reconstruction with incomplete information and under restrictions on the words in question.
For some constraints on the multiset of subwords, conditions for possibility of reconstruction are obtained.
It is shown that a periodic word with period $$p$$ is uniquely determined by the multiset of its subwords of length $$k \geq \left\lfloor\frac{16}{7} \sqrt{p}\right\rfloor + 5$$.
For a word consisting of a non-periodic prefix of length $$q$$ and a periodic suffix with period $$p$$, repeated $$l$$ times, a similar estimate is obtained: $$k \geq \left\lfloor\frac{16}{7} \sqrt{P}\right\rfloor + 5$$, provided $$l \geq q^{\left\lfloor\tfrac{16}{7} \sqrt{P}\right\rfloor + 5}$$ where $$P = \max(p, q)$$.

About the Authors

Vasily Antonovich Alekseev
Moscow Institute of Physics and Technology (MIPT)
Russian Federation

assistant at the department of informatics and computational
mathematics, assistant at the department of higher mathematics



Yuri Gennadievich Smetanin
Federal Research Center «Computer Science and Control», Russian Academy of Sciences
Russian Federation

doctor of physical and mathematical sciences, chief researcher



References

1. Leontiev, V. K., & Smetanin, Y. G. 1988, “On the reconstruction of vectors from a set of their fragments”, Doklady Academii nauk, vol. 302, no. 6, pp. 1319-1322.

2. Manvel, B., Meyerowitz, A., Schwenk, A., Smith, K., & Stockmeyer, P. 1991, “Reconstruction of sequences”, Discrete Mathematics, 94(3), 209-219.

3. Skiena, S. S., & Sundaram, G. 1995, “Reconstructing strings from substrings”, Journal of Computational Biology, 2(2), 333-353.

4. Leontiev, V. K. 1995, “Words reconstruction by fragments problems and their applications”, Diskretniy analiz i issledovanie operaciy, 2(2), 26-48.

5. Gusfield, D. 1997, “Algorithms on stings, trees, and sequences: Computer science and computational biology”, Acm Sigact News, 28(4), 41-60.

6. Krasikov, I., & Roditty, Y. 1997, “On a reconstruction problem for sequences”, Journal of Combinatorial Theory, Series A, 77(2), 344-348.

7. Scott, A. D. 1997, “Reconstructing sequences”, Discrete Mathematics, 175(1-3), 231-238.

8. Borwein, P., Erd´elyi, T., & K´os, G. 1999, “Littlewood-type problems on [0, 1]”, Proceedings of the London Mathematical Society, 79(1), 22-46.

9. Levenshtein, V. I. 2001, “Efficient reconstruction of sequences from their subsequences or supersequences”, Journal of Combinatorial Theory, Series A, 93(2), 310-332.

10. Dudık, M., & Schulman, L. J. 2003, “Reconstruction from subsequences”, Journal of Combinatorial Theory, Series A, 103(2), 337-348.

11. Batu, T., Kannan, S., Khanna, S., & McGregor, A. 2004, “Reconstructing strings from random traces”, Departmental Papers (CIS), 173.

12. Kannan, S., & McGregor, A. 2005, “More on reconstructing strings from random traces: insertions and deletions”, Proceedings. International Symposium on Information Theory, 2005 (ISIT 2005), pp. 297-301, IEEE.

13. Lin, J., Keogh, E., Wei, L., & Lonardi, S. 2007, “Experiencing SAX: a novel symbolic representation of time series”, Data Mining and knowledge discovery, 15(2), 107-144.

14. Smetanin, Y. G., & Ul’yanov, M. V. 2013, “Determining the characteristics of kolmogorov complexity of time series: an approach based on symbolic descriptions”.


Review

For citations:


Alekseev V.A., Smetanin Yu.G. On the possibility of a periodic word reconstruction from the subwords of fixed length. Chebyshevskii Sbornik. 2021;22(1):57-66. (In Russ.) https://doi.org/10.22405/2226-8383-2021-22-1-57-66

Views: 360


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2226-8383 (Print)