Research group

Machine Learning and Information Management Lab

There are two different aspects of BIG DATA among the challenging Vs. Veracity and Variety require sophisticated statistical analysis, including machine learning. While Volume and Velocity are impossible without efficient technical solutions. Our research interests are spread among these directions. Most of our experience comes from the field of Information Retrieval and Databases. Currently we are focused on the following topics:

Theoretical Machine Learning (ML):

  • Tree models, sequence analysis, ensembles
  • GPU enabled ML

Search Engines (SE) and Information Retrieval (IR):

  • ML for Ranking, SE User Behaviour Analysis, SE performance, SE evaluation
  • Storage and processing of scientific, graph, etc. data

Information Management:

  • Stream processing, declarative computation in BD environment
  • Efficient storage and index structures, e.g column-oriented DB
  • Optimization and execution of declarative queries and workflows
  • Holistic application, optimization and tuning
  • Data quality
  • Consistency and high reliability

Besides research projects, we deliver the special courses:

Students interested in research problems in the areas of our interest are welcome to join our lab. The best way to learn more about our research is to take our courses or attend open seminars (to be done). New projects are launched regularly, sometimes it is also possible to join an ongoing project or extend its scope. Please contact the project leader for information on a specific project.

All students willing to join our projects must be skilled in either statistics, or programming, preferably in both. The best successful candidates will be invited to join one of projects as regular team members.

Publications

  • George Chernishev, Vyacheslav Galaktionov, Valentin Grigorev, Evgeniy Klyuchikov, Kirill Smirnov
    In Proceedings of the Second Conference on Software Engineering and Information Management. Saint Petersburg, Russia. CEUR Workshop Proceedings, 1864,
  • Nikita Bobrov, Anastasia Birillo, George Chernishev.
    Proceedings of the Second Conference on Software Engineering and Information Management. Saint Petersburg, Russia,
  • G. Chernishev, M. Akhin, B. Novikov, and V. Itsykson
    Message from the editors
    CEUR Workshop Proceedings, 1864,
  • Nikita Bobrov, George Chernishev, and Boris Novikov
    In Marite Kirikova, Kjetil Nørvåg, George A. Papadopoulos, Johann Gamper, Robert Wrembel, Jérôme Darmont, and Stefano Rizzi, editors, New Trends in Databases and Information Systems - ADBIS 2017 Short Papers and Workshops, AMSD, BigNovelTI, DAS, SW4CH, DC, Nicosia, Cyprus, September 24-27, 2017, Proceedings, volume 767 of Communications in Computer and Information Science, pages 275–284. Springer,
  • Nikita Bobrov, George Chernishev, Dmitry Grigoriev, and Boris Novikov
    In Yassine Ouhammou, Mirjana Ivanovic, Alberto Abelló, and Ladjel Bellatreche, editors, Model and Data Engineering - 7th International Conference, MEDI 2017, Barcelona, Spain, October 4-6, 2017, Proceedings, volume 10563 of Lecture Notes in Computer Science, pages 208–222. Springer,
  • George Chernishev, Viacheslav Galaktionov, Valentin Grigorev, Evgeniy Klyuchikov, Kirill Smirnov
    PosDB: a Distributed Column-Store Engine
    In Proceedings of A.P. Ershov Informatics Conference (the PSI Conference Series, 11th edition), Moscow, Russia,
  • G. Shalygina and B. Novikov
    CEUR Workshop Proceedings, 1864,
  • George Chernishev, Viacheslav Galaktionov, Valentin Grigorev, Evgeniy Klyuchikov, Kirill Smirnov
    PosDB: A Survey of Architecture
    Programming and Computer Software,
  • George Chernishev
    The design of an adaptive column-store system
    Journal of Big Data, 4:5,,
  • Vyacheslav Galaktionov, George Chernishev, Boris Novikov, and Dmitry Grigoriev
    Selected Papers of the XVIII International Conference on Data Analytics and Management in Data Intensive Domains (DAMDID/RCDL 2016), Ershovo, Moscow Region, Russia,
  • Valentin Grigorev and George Chernishev
    In Proceedings of the 2016 International Conference on Management of Data (SIGMOD '16). ACM, New York, NY, USA, 2251-2252.,
  • George Chernishev, Viacheslav Galaktionov, Valentin Grigorev, Evgeniy Klyuchikov, Kirill Smirnov, Andrey Terekhov
    In Proceedings of DAMDID / RCDL'2016 (local), pages 132–137, Ershovo,
  • Ilya Shkuratov and George Chernishev
    A parallel R-tree bulk-loading for shared-memory architecture
    In proceedings of CIMSP'15,
  • Chernishev George
    Towards Self-management in a Distributed Column-Store System
    In: Morzy T., Valduriez P., Bellatreche L. (eds) New Trends in Databases and Information Systems. ADBIS 2015. Communications in Computer and Information Science, vol 539. Springer, Cham.,
  • Chernishev, G., Sevostyanov, V., Smirnov, K., and Shkuratov, I.
    In Selected Papers of XVI All-Russian Scientific Conference "Digital libraries: Advanced Methods and Technologies, Digital Collections Dubna, Russia,
  • Smirnov Kirill, Chernishev George, Fedotovsky Pavel, Erokhin George, Cherednik Kirill
    The Study of Multidimensional R-Tree-Based Index Scalability in Multicore Environment
    In: Voronkov A., Virbitskaite I. (eds) Perspectives of System Informatics. PSI 2014. Lecture Notes in Computer Science, vol 8974. Springer, Berlin, Heidelberg,
  • Kirill Smirnov, George Chernishev, Pavel Fedotovsky, George Erokhin and Kirill Cherednik
    R-tree re-evaluation effort: a report
    Technical report,
  • Fedotovsky P.V., Cherednik K.E., Chernishev G.A.
    To sort or not to sort: the evaluation of R-Tree and B+-Tree in transactional environment with ordered result requirement
    Труды Института системного программирования РАН. – Т. 26. – №. 4.,
  • Chernishev G.
    To Sort or not to Sort: The Evaluation of R-Tree and B+-Tree in Transactional Environment with Ordered Result Set Requirement
    SYRCoDIS. – Т. 1031. – С. 27-34.,
  • Федотовский П. В., Чернышев Г. А., Смирнов К. К.
    Реализация уровня изоляции Read Committed для древовидных структур данных
    Материалы третьей межвузовской научной конференции по проблемам информатики СПИСОК-2012,
  • Kirill K. Smirnov, Georgiy A. Chernishev
    ACM SIGMOD Programming Contest: an opportunity to study distinguished aspects of database systems and software engineering (in Russian)
    Компьютерные инструменты в образовании, 6,
  • Smirnov K., Chernishev G.
    Benchmarking Inter and Intra Operator Parallelism on Contemporary Desktop Hardware
    SYRCoDIS. – С. 62-67.,
  • Kirill Smirnov, George Chernishev
    On two methods of star query execution (in Russian)
    In proceedings of SPISOK conference p. 253-257.,
  • Smirnov K. K., Chernishev G. A.
    Networking and multithreading architectural aspects of distributed DBMS (in Russian)
    Программные продукты и системы. – №. 1.,
  • Smirnov K., Chernishev G.
    Empirical study of parallel SQL query execution
    Труды Института системного программирования РАН. – Т. 21.,
  • Kirill Smirnov, George Chernishev
    Distributed Database Query Engine”
    Contest Poster, ACM SIGMOD/PODS 2010, Indianapolis.,
  • George Chernishev, Kirill Smirnov
    ScienceDirect goes social: a social network for scientists integrated with online digital library
    Contest Poster, ACM SIGIR 2010, Geneve.,