Automatic Documentation and Mathematical Linguistics
Hybrid journal (it can contain Open Access articles). ISSN (Print) 0005-1055, ISSN (Online) 1934-8371. Published by Springer-Verlag.
• On Object Representation by Bit Strings for the VKF-Method
Pages: 113 - 116
Abstract: An algorithm for the bit-string encoding of objects described by discrete attributes with a binary similarity operation on each attribute value is proposed and analyzed. In this encoding, the bitwise multiplication operation must correspond to the componentwise similarity operation.
PubDate: 2018-05-01
DOI: 10.3103/s0005105518030044
Issue No: Vol. 52, No. 3 (2018)

• Ontology Libraries on the Web: Status and Prospects
• Authors: E. M. Beniaminov
Pages: 117 - 120
Abstract: This paper reviews the use of computer ontologies for Internet data linking and knowledge representation. We discuss a trend in organizing ontology libraries and servers for the joint development of ontologies and their application. Ontology libraries are regarded as public Web resources. We pay special attention to the design of ontology libraries, their debugging, and directions for their development.
PubDate: 2018-05-01
DOI: 10.3103/s0005105518030020
Issue No: Vol. 52, No. 3 (2018)

• Scientometric Analysis of the Current State and Prospects of the Development of Quantum Technologies
• Authors: V. O. Tolcheev
Pages: 121 - 133
Abstract: Scientometric analysis of a rapidly developing interdisciplinary area that covers problems of the creation and development of quantum technologies is carried out. General characteristics of the subject are given and key areas of research are identified. The publication activities of different countries are evaluated for the 2000–2016 period based on the analysis of papers indexed in the Web of Science database. The leading countries (United States and China) are determined. The most actively progressing research areas are considered. A short-term prediction on the expected number of publications for the United States and China is made. Particular attention is paid to the publication activities of Russian scientists on subject categories of the Web of Science index. The intensity of international cooperation in the field of quantum technologies and the degree of internationalization of research are studied.
PubDate: 2018-05-01
DOI: 10.3103/s000510551803007x
Issue No: Vol. 52, No. 3 (2018)

• Semantic Annotating of Text Documents: Basic Concepts and Taxonomic Approach
• Authors: M. P. Kogalovskii
Pages: 134 - 141
Abstract: One of the tools for the semantic enrichment of the content of information resources is semantic annotating, which makes it possible to comment on and evaluate annotated resources and their fragments and to carry out a semantic search on their basis. The use of the taxonomic approach simultaneously allows classifying the subjects of annotating and generating new scientometric indicators. This paper considers the essence of semantic annotating, defines the basic concepts, discusses the general model of semantic annotation and the taxonomic approach to presenting the semantics of annotations, and gives examples of taxonomies based on various properties of annotations. The implementation of semantic annotating in the Socionet scientific information system is considered as an example.
PubDate: 2018-05-01
DOI: 10.3103/s0005105518030032
Issue No: Vol. 52, No. 3 (2018)

• The Tools of a Machine Grammar of the Russian Language (based on G.G. Belonogov)
• Authors: I. V. Ablov; V. N. Kozichev; A. V. Shirmanov; Al-dr A. Khoroshilov; Al-ey A. Khoroshilov
Pages: 142 - 156
Abstract: The principles and methods of creating program and declarative tools of a machine grammar of the Russian language are considered. These tools were based on original algorithms developed by the scientific staff of VINITI (All-Russian Institute for Scientific and Technical Information), the 27th Central Scientific Research Institute, Ministry of Defense (Russia), and the Informatics and Management Federal Research Center of the Russian Academy of Sciences (FIC IU RAS). Declarative tools, which are a complex of dictionaries and grammatical tables in machine form, were created on the basis of large-scale studies of large volumes of polythematic textual information (measured in tens of millions of words) using linguistic-statistical methods. The complex of declarative tools consists of grammatical tables and machine dictionaries that include the main types of inflectional and derivational transformations, as well as representative dictionaries of word stems. Unique algorithms of machine grammar of the Russian language were developed through the use of these declarative tools. The described tools are now widely used in a number of industrial information systems for solving complex problems of automatic processing and semantic analysis of textual information.
PubDate: 2018-05-01
DOI: 10.3103/s0005105518030068
Issue No: Vol. 52, No. 3 (2018)

• The Methodological Basis of Ontological Documentary Information Modeling
• Authors: N. V. Maksimov
Pages: 57 - 72
Abstract: This article discusses some ontological properties that determine the possibilities of structural representation and processing of the meaning of documentary information. The concept frameworks for constructing a typified taxonomy of objects and top-level ontology relationships are proposed on the basis of analyzing object lifecycle processes and using the functional cybernetic model of an elementary process.
PubDate: 2018-03-01
DOI: 10.3103/s0005105518020036
Issue No: Vol. 52, No. 2 (2018)

• The Linear Estimation Problem and Information in Big-Data Systems
• Authors: P. V. Golubtsov
Pages: 73 - 79
Abstract: This paper addresses the problem of transforming the optimal linear estimation procedure in such a way that separate fragments of initial data are processed individually and concurrently. A representation of intermediate information is proposed that allows an algorithm to concurrently extract this information from each initial data set, combine it, and use it for estimation. It is shown that an ordering is induced on the constructed information space that reflects the concept of information quality.
PubDate: 2018-03-01
DOI: 10.3103/s0005105518020024
Issue No: Vol. 52, No. 2 (2018)

• Evaluating the Relationship between Bibliometrics and Altmetrics: A Case Study of Russian Publications SCI-E (2015)
• Authors: A. N. Libkind; V. A. Markusova; V. G. Bogorov
Pages: 80 - 90
Abstract: A data array of Russian publications for 2015 registered in the SCI-E DB was evaluated for the correlation between traditional bibliometrics and altmetrics. A special methodology based on the Kendall rank correlation coefficient was developed for this purpose. In the case of publications, the correlation between the citedness of a publication (the value of the TC field in the SCI-E DB) and its usage (fields U1 and U2) was evaluated. In the case of source journals for the indicated publications, the correlation was determined between the Cited_Half-Life value and the values of journal-usage metrics, which were determined as the mean U1 (U2) value taken for all publications of the journal. It was established that the nature of journal distribution according to the Cited_Half-Life value of the source journals for Russian publications practically coincides with the nature of the distribution of source journals for publications of the entire global flow: the Russian distribution shifted somewhat toward larger Cited_Half-Life values (the median of the global distribution was 6.1, while that of the Russian distribution was 6.2). This shift increased sharply upon considering the number of Russian and global journal publications, respectively (the median of the global distribution was 5.8, while that of the Russian distribution was 6.8).
PubDate: 2018-03-01
DOI: 10.3103/s0005105518020061
Issue No: Vol. 52, No. 2 (2018)

• Library Sites as Seen through the Lens of Web Analytics
• Authors: N. S. Redkina
Pages: 91 - 96
Abstract: This paper considers the potential of web-analytic tools for a detailed study of the main fields of work of libraries in the web environment, study of user behavior, and analysis of the relevance of library information products and services offered on the sites. It presents the parameters for assessing the audience of library web resources (attendance, traffic sources, geography of users, etc.) using plug-ins, log-analyzers, counters, ratings, and services (Google Analytics, Yandex.Metrics, and Hotlog). The sites of several world libraries are analyzed by marketing tools (Hubspot Website Grader, Nibbler, CheckTrust, Google PageSpeed Insights, and Hemingway).
PubDate: 2018-03-01
DOI: 10.3103/s0005105518020073
Issue No: Vol. 52, No. 2 (2018)

• On the Influence of the Conceptual Apparatus on Setting the Tasks of Information Support of Innovation Activities
• Authors: L. K. Bobrov; I. P. Medyankina
Pages: 97 - 107
Abstract: Various interpretations of the innovative economy, innovation activity, and innovation concepts are given. A consequence of the undefined terminology is the diversity of the semantic load of the term innovation infrastructure. The resulting variety of approaches to solving the problems of information support of innovations is illustrated based on the example of analysis of regulatory and program documents of eight regions of the Siberian Federal District. As an alternative to creating a difficult-to-implement single federal point of reference in the information space, it is proposed to build a network of regional systems that provide navigation and access to appropriate resources for the information provision of innovation activities. In connection with external information resources, a model of the life cycle of innovations is presented, covering all stages from the generation of ideas to the withdrawal of an innovative product from the market.
PubDate: 2018-03-01
DOI: 10.3103/s000510551802005x
Issue No: Vol. 52, No. 2 (2018)

• Development of Means for the Formation of a Corporate Distributed Register (Blockchain)
• Authors: A. Yu. Shcherbakov
Pages: 108 - 112
Abstract: The problem of developing methods for forming the atoms of a corporate distributed register (blockchain), which provides secure distributed storage of data in an associated chain, is discussed; an example of the application of the proposed methods for the implementation of a conditional tax payment through a smart contract is considered.
PubDate: 2018-03-01
DOI: 10.3103/s0005105518020085
Issue No: Vol. 52, No. 2 (2018)

• The Metascientific and Philosophical Bases of Information Ethics
• Authors: V. A. Kanke
Pages: 1 - 6
Abstract: The development of axiological theories posed an acute problem of their enrichment with an ethical component. A method for transferring ethics to the scientific field is shown, which consists of including the concept of maximizing the well-being of all people affected by any situation in all axiological theories. As a formal science, informatics does not contain the ethical concept of maximizing well-being itself; however, due to its engagement in cross-disciplinary relationships with all axiological sciences, it possesses an exclusively relevant ethical relativity. Determining the metascientific bases of the ethical relativity of informatics provides opportunities for its multifaceted study from the viewpoint of science instead of poorly understood metaphysics.
PubDate: 2018-01-01
DOI: 10.3103/s0005105518010041
Issue No: Vol. 52, No. 1 (2018)

• Studying the Functionality and Webometric Indicators of Specialized Science-Related Websites
• Authors: Yu. M. Brumshteyn; E. Yu. Vas’kovskii
Pages: 7 - 23
Abstract: A review is provided of the purposes, functionality, and webometric indicators (WMIs) for the websites of the leading international organizations that aggregate specialized scientific information, including citation and scientometric systems; websites designed to support national publications; etc. The WMIs of foreign websites are compared with those of the functionally similar Russian online resources. An overview is given of the structure of foreign and Russian websites that publish information related to dissertations, their extended abstracts, and their full-text versions. The terms of user access to these materials are described. The WMIs of these websites are compared. An analysis is conducted of the functionality and WMIs of the following online resources: the leading foreign and Russian websites that provide access to autonomous information–analytical and information systems for scientific journals; websites of individual journals related to scientometric studies and information management (services) in science.
PubDate: 2018-01-01
DOI: 10.3103/s0005105518010028
Issue No: Vol. 52, No. 1 (2018)

• TextJSM: Text Sentiment Analysis Method
• Authors: E. V. Kotelnikov
Pages: 24 - 34
Abstract: The TextJSM method of text sentiment analysis is proposed, based on the JSM method of automated hypothesis generation. Two versions of the TextJSM method are presented: one for solving predictive problems and one for descriptive problems. Parallel implementation of the main stages of both versions is considered. Experimental studies based on the ROMIP 2011–2012 seminar text corpora show the superiority of the developed method over other data mining methods.
PubDate: 2018-01-01
DOI: 10.3103/s0005105518010089
Issue No: Vol. 52, No. 1 (2018)

• The Rate of Convergence to the Limit of the Probability of Encountering an Accidental Similarity in the Presence of Counter Examples
Pages: 35 - 37
Abstract: This paper refines the main result of [1], where the limit $$1 - e^{-a} - a e^{-a}\left[1 - e^{-c\sqrt{a}}\right]$$ was proved for the probability of encountering an accidental similarity between two parent examples without $$m = c\sqrt{n}$$ counter examples if each parent example and counter example is described by a series of $$\sqrt{n}$$ independent Bernoulli trials with success probability $$p = \sqrt{a/n}$$. In this paper, we prove that the rate of convergence to the limit is proportional to $$n^{1/2}$$.
PubDate: 2018-01-01
DOI: 10.3103/s0005105518010090
Issue No: Vol. 52, No. 1 (2018)

• The Concept of Information in Big Data Processing
• Authors: P. V. Golubtsov
Pages: 38 - 43
Abstract: The need to transform existing algorithms in Big Data Systems is considered. The transformation must allow independent and parallel processing of separate fragments of data. The characteristic aspects of a well-organized intermediate compact form of information and its natural algebraic properties are studied and an illustrative example is provided.
PubDate: 2018-01-01
DOI: 10.3103/s000510551801003x
Issue No: Vol. 52, No. 1 (2018)

• Analysis of Spaces of Similarity Generated by a Fact Base in JSM Problems
• Authors: S. M. Gusakova
Pages: 44 - 50
Abstract: In this paper, we investigate spaces of similarity generated by fact bases of intelligent JSM systems and present a classification for the set of potential hypotheses. Conditions on similarity spaces are imposed to reduce the number of classes in this classification. The results obtained in this work make it possible to estimate the set of hypotheses (including minimal ones) found by inductive reasoning and can be used to solve the problems of opinion analysis and formation of a social structure.
PubDate: 2018-01-01
DOI: 10.3103/s0005105518010065
Issue No: Vol. 52, No. 1 (2018)

• The Phonetic Alphabet of the Chechen Language as a Basis of a Speech-Synthesis System
• Authors: E. S. Izrailova
Pages: 51 - 55
Abstract: This paper describes the principles for developing the AZBAT phonetic alphabet, which was created by analogy with the DARPAbet phonetic alphabet of the English language and is oriented to creating the speech corpus and system for synthesis of Chechen speech. The experience of developers of other phonetic alphabets and databases was used; account was also taken of the features of pronunciation and graphics, rules of compatibility, and variability of phonemes, which had been described in the works of well-known Chechen philologists. The classification of vowel and consonant phonemes is given, according to which each phoneme has the attributes that are necessary to implement the program code. The designed system for synthesis of Chechen speech is assigned a basic set of acoustic phonetic elements that consists of diphones and allophones. This set will be compiled to build an acoustic phonetic database that is the basis of a system for automatic synthesis of Chechen speech.
PubDate: 2018-01-01
DOI: 10.3103/s0005105518010077
Issue No: Vol. 52, No. 1 (2018)

• Blockchain Technology: Universal Structure and Requirements
• Authors: M. R. Biktimirov; A. V. Domashev; P. A. Cherkashin; A. Yu. Shcherbakov
Pages: 235 - 238
Abstract: The project elaboration of the configuration and mathematical model of distributed blockchain data storage, blockchain applications for implementing various information technologies, and blockchain requirements that stem from its analytical and structural features are considered.
PubDate: 2017-11-01
DOI: 10.3103/s0005105517060036
Issue No: Vol. 51, No. 6 (2017)

• Constructing an Attribute-Based Search Subsystem for Identity Management Systems when Implementing Integration Solutions
• Authors: A. V. Belov; M. I. Nezhurina; A. D. Shestova
Pages: 239 - 243
Abstract: This paper addresses the problem of designing an attribute-based search subsystem when integrating an identity management (IDM) system with modern complex systems (e.g., enterprise resource planning (ERP) systems) that have granular access control. When implementing integration solutions with a large number of roles and users, the context search preset in IDM systems proves to be inadequate. We propose and implement a solution to this problem that consists in changing the approach to the organization of a role search and description and present an approach to determining the optimal number of attributes required for an efficient search.
PubDate: 2017-11-01
DOI: 10.3103/s0005105517060024
Issue No: Vol. 51, No. 6 (2017)

