Research group

Machine Learning and Information Management Lab

There are many different aspects to BIG DATA whether challenging Veracity or Variety they all require sophisticated statistical analysis, provided by machine learning. While Volume and Velocity are impossible without efficient technical solutions. Our research interests are spread among these directions. Most of our experience comes from the field of Information Retrieval and Databases.

Main research projects

Theoretical Machine Learning (ML)

Tree models, sequence analysis, ensembles, and GPU enabled ML

Search Engines (SE) and Information Retrieval (IR)

ML for Ranking, SE User Behaviour Analysis, SE performance, SE evaluation and Storage and processing of scientific and graph data

Information Management

Stream processing, declarative computation in BD environment. Efficient storage and index structures, e.g column-oriented DB. Optimization and execution of declarative queries and workflows. The holistic application, optimization, and tuning. Data quality. Consistency and high reliability.

Beside the research projects, we deliver special courses:

Students interested in research problems in the areas of our interest are welcome to join our lab. The best way to learn more about our research is to take our courses or attend our open seminars. New projects are launched regularly, sometimes it is also possible to join an ongoing project or extend its scope. Please contact the project leader for information on a specific project.

All students willing to join our projects must be skilled in either statistics or programming, preferably in both. The successful candidates will be invited to join one of the projects as a regular team member.

Publications

  • Igor Kuralenok, Natalia Starikova, Aleksandr Khvorov, and Julian Serdyuk

    The 27th ACM International Conference on Information and Knowledge Management (CIKM ’18), October 22–26, 2018, Torino, Italy. ACM, New York, NY, USA, 10 pages

  • Artem Trofimov
    In: Benczúr A. et al. (eds) New Trends in Databases and Information Systems. ADBIS 2018. Communications in Computer and Information Science, vol 909. Springer, Cham,
  • Igor Kuralenok, Artem Trofimov, Nikita Marshalkin, Boris Novikov
    In: Benczúr A., Thalheim B., Horváth T. (eds) Advances in Databases and Information Systems. ADBIS 2018. Lecture Notes in Computer Science, vol 11019. Springer, Cham,
  • Igor Kuralenok, Artem Trofimov, Nikita Marshalkin, Boris Novikov
    BeyondMR'18 Proceedings of the 5th ACM SIGMOD Workshop on Algorithms and Systems for MapReduce and Beyond,
  • Igor Kuralenok, Nikita Marshalkin, Artem Trofimov, Boris Novikov
    Proceedings of the Third Conference on Software Engineering and Information Management. Saint Petersburg, Russia. CEUR Workshop Proceedings, 2135,
  • Anastasia Tuchina, Valentin Grigorev, George Chernishev
    In Proceedings of the Third Conference on Software Engineering and Information Management. Saint Petersburg, Russia. CEUR Workshop Proceedings, 2135,
  • George Chernishev, Viacheslav Galaktionov, Valentin Grigorev, Evgeniy Klyuchikov, Kirill Smirnov
    Programming and Computer Software,
  • George Chernishev, Viacheslav Galaktionov, Valentin Grigorev, Evgeniy Klyuchikov, Kirill Smirnov
    In Proceedings of A.P. Ershov Informatics Conference (the PSI Conference Series, 11th edition), Moscow, Russia,
  • George Chernishev, Vyacheslav Galaktionov, Valentin Grigorev, Evgeniy Klyuchikov, Kirill Smirnov
    In Proceedings of the Second Conference on Software Engineering and Information Management. Saint Petersburg, Russia. CEUR Workshop Proceedings, 1864,
  • Nikita Bobrov, Anastasia Birillo, George Chernishev.
    Proceedings of the Second Conference on Software Engineering and Information Management. Saint Petersburg, Russia,
  • G. Chernishev, M. Akhin, B. Novikov, and V. Itsykson
    Message from the editors
    CEUR Workshop Proceedings, 1864,
  • Nikita Bobrov, George Chernishev, and Boris Novikov
    In Marite Kirikova, Kjetil Nørvåg, George A. Papadopoulos, Johann Gamper, Robert Wrembel, Jérôme Darmont, and Stefano Rizzi, editors, New Trends in Databases and Information Systems - ADBIS 2017 Short Papers and Workshops, AMSD, BigNovelTI, DAS, SW4CH, DC, Nicosia, Cyprus, September 24-27, 2017, Proceedings, volume 767 of Communications in Computer and Information Science, pages 275–284. Springer,
  • Nikita Bobrov, George Chernishev, Dmitry Grigoriev, and Boris Novikov
    In Yassine Ouhammou, Mirjana Ivanovic, Alberto Abelló, and Ladjel Bellatreche, editors, Model and Data Engineering - 7th International Conference, MEDI 2017, Barcelona, Spain, October 4-6, 2017, Proceedings, volume 10563 of Lecture Notes in Computer Science, pages 208–222. Springer,
  • G. Shalygina and B. Novikov
    CEUR Workshop Proceedings, 1864,
  • George Chernishev
    Journal of Big Data, 4:5,,
  • Vyacheslav Galaktionov, George Chernishev, Boris Novikov, and Dmitry Grigoriev
    Selected Papers of the XVIII International Conference on Data Analytics and Management in Data Intensive Domains (DAMDID/RCDL 2016), Ershovo, Moscow Region, Russia,
  • Valentin Grigorev and George Chernishev
    In Proceedings of the 2016 International Conference on Management of Data (SIGMOD '16). ACM, New York, NY, USA, 2251-2252.,
  • George Chernishev, Viacheslav Galaktionov, Valentin Grigorev, Evgeniy Klyuchikov, Kirill Smirnov, Andrey Terekhov
    In Proceedings of DAMDID / RCDL'2016 (local), pages 132–137, Ershovo,
  • Ilya Shkuratov and George Chernishev
    A parallel R-tree bulk-loading for shared-memory architecture
    In proceedings of CIMSP'15,
  • Chernishev George
    In: Morzy T., Valduriez P., Bellatreche L. (eds) New Trends in Databases and Information Systems. ADBIS 2015. Communications in Computer and Information Science, vol 539. Springer, Cham.,
  • Chernishev, G., Sevostyanov, V., Smirnov, K., and Shkuratov, I.
    In Selected Papers of XVI All-Russian Scientific Conference "Digital libraries: Advanced Methods and Technologies, Digital Collections Dubna, Russia,
  • Smirnov Kirill, Chernishev George, Fedotovsky Pavel, Erokhin George, Cherednik Kirill
    The Study of Multidimensional R-Tree-Based Index Scalability in Multicore Environment
    In: Voronkov A., Virbitskaite I. (eds) Perspectives of System Informatics. PSI 2014. Lecture Notes in Computer Science, vol 8974. Springer, Berlin, Heidelberg,
  • Kirill Smirnov, George Chernishev, Pavel Fedotovsky, George Erokhin and Kirill Cherednik
    R-tree re-evaluation effort: a report
    Technical report,
  • Fedotovsky P.V., Cherednik K.E., Chernishev G.A.
    To sort or not to sort: the evaluation of R-Tree and B+-Tree in transactional environment with ordered result requirement
    Труды Института системного программирования РАН. – Т. 26. – №. 4.,
  • Chernishev G.
    To Sort or not to Sort: The Evaluation of R-Tree and B+-Tree in Transactional Environment with Ordered Result Set Requirement
    SYRCoDIS. – Т. 1031. – С. 27-34.,
  • Федотовский П. В., Чернышев Г. А., Смирнов К. К.
    Реализация уровня изоляции Read Committed для древовидных структур данных
    Материалы третьей межвузовской научной конференции по проблемам информатики СПИСОК-2012,
  • Kirill K. Smirnov, Georgiy A. Chernishev
    ACM SIGMOD Programming Contest: an opportunity to study distinguished aspects of database systems and software engineering (in Russian)
    Компьютерные инструменты в образовании, 6,
  • Smirnov K., Chernishev G.
    Benchmarking Inter and Intra Operator Parallelism on Contemporary Desktop Hardware
    SYRCoDIS. – С. 62-67.,
  • Kirill Smirnov, George Chernishev
    On two methods of star query execution (in Russian)
    In proceedings of SPISOK conference p. 253-257.,
  • Smirnov K. K., Chernishev G. A.
    Networking and multithreading architectural aspects of distributed DBMS (in Russian)
    Программные продукты и системы. – №. 1.,
  • Smirnov K., Chernishev G.
    Empirical study of parallel SQL query execution
    Труды Института системного программирования РАН. – Т. 21.,
  • Kirill Smirnov, George Chernishev
    Distributed Database Query Engine”
    Contest Poster, ACM SIGMOD/PODS 2010, Indianapolis.,
  • George Chernishev, Kirill Smirnov
    ScienceDirect goes social: a social network for scientists integrated with online digital library
    Contest Poster, ACM SIGIR 2010, Geneve.,