Subjects -> LIBRARY AND INFORMATION SCIENCES (Total: 392 journals)
    - DIGITAL CURATION AND PRESERVATION (13 journals)
    - LIBRARY ADMINISTRATION (1 journal)
    - LIBRARY AND INFORMATION SCIENCES (378 journals)

LIBRARY AND INFORMATION SCIENCES (378 journals)

Showing 1 - 200 of 379 Journals sorted by number of followers
Library & Information Science Research     Hybrid Journal   (Followers: 1821)
Journal of Librarianship and Information Science     Hybrid Journal   (Followers: 1337)
Library Hi Tech     Hybrid Journal   (Followers: 1140)
Journal of Information Science     Hybrid Journal   (Followers: 1112)
Journal of Academic Librarianship     Hybrid Journal   (Followers: 1100)
Library Management     Hybrid Journal   (Followers: 977)
The Electronic Library     Hybrid Journal   (Followers: 976)
Library Quarterly     Full-text available via subscription   (Followers: 941)
Global Knowledge, Memory and Communication     Hybrid Journal   (Followers: 882)
Journal of Information Literacy     Open Access   (Followers: 858)
Library Hi Tech News     Hybrid Journal   (Followers: 788)
Information Technology and Libraries     Open Access   (Followers: 736)
New Library World     Hybrid Journal   (Followers: 684)
Journal of Library & Information Services in Distance Learning     Hybrid Journal   (Followers: 635)
Information Retrieval     Hybrid Journal   (Followers: 616)
Information Sciences     Hybrid Journal   (Followers: 602)
International Journal on Digital Libraries     Hybrid Journal   (Followers: 580)
Information Processing & Management     Hybrid Journal   (Followers: 567)
Information Systems Research     Full-text available via subscription   (Followers: 557)
College & Research Libraries     Open Access   (Followers: 528)
Evidence Based Library and Information Practice     Open Access   (Followers: 461)
Journal of Library and Information Science     Open Access   (Followers: 444)
International Information & Library Review     Hybrid Journal   (Followers: 437)
The Information Society: An International Journal     Hybrid Journal   (Followers: 406)
Library Trends     Full-text available via subscription   (Followers: 390)
Library and Information Research     Open Access   (Followers: 363)
Forensic Science International: Digital Investigation     Full-text available via subscription   (Followers: 344)
Annals of Library and Information Studies (ALIS)     Open Access   (Followers: 337)
International Journal of Library Science     Open Access   (Followers: 303)
Canadian Journal of Information and Library Science     Full-text available via subscription   (Followers: 289)
College & Research Libraries News     Partially Free   (Followers: 286)
Bioinformatics     Hybrid Journal   (Followers: 283)
The Reference Librarian     Hybrid Journal   (Followers: 267)
College & Undergraduate Libraries     Hybrid Journal   (Followers: 261)
IFLA Journal     Hybrid Journal   (Followers: 261)
Library Leadership & Management     Open Access   (Followers: 261)
Journal of Electronic Resources Librarianship     Hybrid Journal   (Followers: 259)
Journal of Library Administration     Hybrid Journal   (Followers: 254)
Library Collections, Acquisitions, and Technical Services     Hybrid Journal   (Followers: 253)
Communications in Information Literacy     Open Access   (Followers: 244)
Data Technologies and Applications     Hybrid Journal   (Followers: 236)
American Libraries     Partially Free   (Followers: 223)
Journal of the Medical Library Association     Open Access   (Followers: 222)
Code4Lib Journal     Open Access   (Followers: 218)
Journal of Information & Knowledge Management     Hybrid Journal   (Followers: 214)
International Journal of Information Management     Hybrid Journal   (Followers: 212)
Cataloging & Classification Quarterly     Hybrid Journal   (Followers: 207)
Journal of Library Metadata     Hybrid Journal   (Followers: 206)
Australian Library Journal     Full-text available via subscription   (Followers: 198)
Journal of Documentation     Hybrid Journal   (Followers: 195)
portal: Libraries and the Academy     Full-text available via subscription   (Followers: 189)
Ariadne Magazine     Open Access   (Followers: 185)
Journal of Hospital Librarianship     Hybrid Journal   (Followers: 184)
Behavioral & Social Sciences Librarian     Hybrid Journal   (Followers: 179)
Aslib Proceedings     Hybrid Journal   (Followers: 172)
Library & Information History     Hybrid Journal   (Followers: 165)
American Archivist     Hybrid Journal   (Followers: 161)
EDUCAUSE Review     Full-text available via subscription   (Followers: 161)
Research Library Issues     Free   (Followers: 159)
The Serials Librarian     Hybrid Journal   (Followers: 156)
The Library : The Transactions of the Bibliographical Society     Hybrid Journal   (Followers: 154)
New Review of Academic Librarianship     Hybrid Journal   (Followers: 151)
Book History     Full-text available via subscription   (Followers: 149)
Against the Grain     Partially Free   (Followers: 143)
Library Technology Reports     Full-text available via subscription   (Followers: 141)
Journal of eScience Librarianship     Open Access   (Followers: 134)
DESIDOC Journal of Library & Information Technology     Open Access   (Followers: 105)
Archives and Museum Informatics     Hybrid Journal   (Followers: 99)
Australian Academic & Research Libraries     Full-text available via subscription   (Followers: 99)
European Journal of Information Systems     Hybrid Journal   (Followers: 95)
Online Information Review     Hybrid Journal   (Followers: 91)
Journal of Librarianship and Scholarly Communication     Open Access   (Followers: 88)
International Journal of Digital Curation     Open Access   (Followers: 85)
Information Technologies & International Development     Open Access   (Followers: 84)
Journal of Electronic Publishing     Open Access   (Followers: 77)
Serials Review     Hybrid Journal   (Followers: 75)
Journal of Education in Library and Information Science - JELIS     Full-text available via subscription   (Followers: 74)
International Journal of Digital Library Systems     Full-text available via subscription   (Followers: 74)
Journal of Interlibrary Loan Document Delivery & Electronic Reserve     Hybrid Journal   (Followers: 69)
LIBER Quarterly : The Journal of the Association of European Research Libraries     Open Access   (Followers: 68)
Archival Science     Hybrid Journal   (Followers: 66)
Ethics and Information Technology     Hybrid Journal   (Followers: 66)
Journal of the Canadian Health Libraries Association / Journal de l'Association des bibliothèques de la santé du Canada     Open Access   (Followers: 66)
Library Philosophy and Practice     Open Access   (Followers: 66)
Insights : the UKSG journal     Open Access   (Followers: 65)
Practical Academic Librarianship : The International Journal of the SLA Academic Division     Open Access   (Followers: 65)
MIS Quarterly : Management Information Systems Quarterly     Hybrid Journal   (Followers: 63)
Journal of Management Information Systems     Full-text available via subscription   (Followers: 60)
Science & Technology Libraries     Hybrid Journal   (Followers: 59)
Journal of Information Technology     Hybrid Journal   (Followers: 56)
The Bottom Line: Managing Library Finances     Hybrid Journal   (Followers: 56)
Alexandria : The Journal of National and International Library and Information Issues     Full-text available via subscription   (Followers: 56)
Journal of Health & Medical Informatics     Open Access   (Followers: 54)
Partnership : the Canadian Journal of Library and Information Practice and Research     Open Access   (Followers: 54)
Archives and Manuscripts     Hybrid Journal   (Followers: 52)
International Journal of Legal Information     Full-text available via subscription   (Followers: 51)
Library & Archival Security     Hybrid Journal   (Followers: 49)
Bangladesh Journal of Library and Information Science     Open Access   (Followers: 47)
OCLC Systems & Services     Hybrid Journal   (Followers: 46)
Community & Junior College Libraries     Hybrid Journal   (Followers: 45)
Information Discovery and Delivery     Hybrid Journal   (Followers: 44)
Journal of Access Services     Hybrid Journal   (Followers: 40)
Medical Reference Services Quarterly     Hybrid Journal   (Followers: 40)
VINE Journal of Information and Knowledge Management Systems     Hybrid Journal   (Followers: 40)
Journal of the Society of Archivists     Hybrid Journal   (Followers: 36)
Scholarly and Research Communication     Open Access   (Followers: 36)
Public Library Quarterly     Hybrid Journal   (Followers: 32)
Journal of Archival Organization     Hybrid Journal   (Followers: 31)
Information & Culture : A Journal of History     Full-text available via subscription   (Followers: 31)
Australasian Public Libraries and Information Services     Full-text available via subscription   (Followers: 31)
Journal of the Association for Information Systems     Open Access   (Followers: 31)
Research Evaluation     Hybrid Journal   (Followers: 30)
Foundations and Trends® in Information Retrieval     Full-text available via subscription   (Followers: 30)
Information     Open Access   (Followers: 29)
International Journal of Information Retrieval Research     Full-text available via subscription   (Followers: 29)
Information Systems Frontiers     Hybrid Journal   (Followers: 27)
International Journal of Intellectual Property Management     Hybrid Journal   (Followers: 26)
International Journal of Information Privacy, Security and Integrity     Hybrid Journal   (Followers: 26)
Proceedings of the American Society for Information Science and Technology     Hybrid Journal   (Followers: 26)
Health Information Management Journal     Hybrid Journal   (Followers: 26)
Journal of the Institute of Conservation     Hybrid Journal   (Followers: 25)
Access     Full-text available via subscription   (Followers: 24)
Nordic Journal of Information Literacy in Higher Education     Open Access   (Followers: 24)
South African Journal of Libraries and Information Science     Open Access   (Followers: 23)
Sci-Tech News     Open Access   (Followers: 23)
LASIE : Library Automated Systems Information Exchange     Free   (Followers: 22)
Journal of Information, Communication and Ethics in Society     Hybrid Journal   (Followers: 22)
NASIG Newsletter     Open Access   (Followers: 21)
InCite     Full-text available via subscription   (Followers: 20)
Georgia Library Quarterly     Open Access   (Followers: 20)
LOEX Quarterly     Full-text available via subscription   (Followers: 20)
RBM : A Journal of Rare Books, Manuscripts, and Cultural Heritage     Open Access   (Followers: 20)
Urban Library Journal     Open Access   (Followers: 19)
El Profesional de la Informacion     Full-text available via subscription   (Followers: 18)
Journal of Research on Libraries and Young Adults     Open Access   (Followers: 18)
International Journal of Web Portals     Full-text available via subscription   (Followers: 17)
Communication Booknotes Quarterly     Hybrid Journal   (Followers: 16)
Theological Librarianship : An Online Journal of the American Theological Library Association     Open Access   (Followers: 16)
Perspectives in International Librarianship     Open Access   (Followers: 16)
Biblioteca Universitaria     Open Access   (Followers: 16)
Collection and Curation     Hybrid Journal   (Followers: 15)
Manuscripta     Full-text available via subscription   (Followers: 15)
Bibliotheca Orientalis     Full-text available via subscription   (Followers: 14)
International Journal of Business Information Systems     Hybrid Journal   (Followers: 14)
International Journal of Information Technology, Communications and Convergence     Hybrid Journal   (Followers: 14)
Notes     Full-text available via subscription   (Followers: 14)
Online Journal of Public Health Informatics     Open Access   (Followers: 14)
Alexandría : Revista de Ciencias de la Información     Open Access   (Followers: 14)
Anales de Documentacion     Open Access   (Followers: 14)
Journal of Educational Media, Memory, and Society     Full-text available via subscription   (Followers: 13)
Biblios     Open Access   (Followers: 13)
International Journal of Intercultural Information Management     Hybrid Journal   (Followers: 12)
Alsic : Apprentissage des Langues et Systèmes d'Information et de Communication     Open Access   (Followers: 12)
Journal of Information Technology Teaching Cases     Hybrid Journal   (Followers: 12)
Journal of Religious & Theological Information     Hybrid Journal   (Followers: 11)
Universal Access in the Information Society     Hybrid Journal   (Followers: 11)
InterActions: UCLA Journal of Education and Information     Open Access   (Followers: 11)
International Journal of Information and Decision Sciences     Hybrid Journal   (Followers: 11)
Journal of Information Systems     Full-text available via subscription   (Followers: 11)
Kansas Library Association College & University Libraries Section Proceedings     Open Access   (Followers: 11)
Journal of Information Engineering and Applications     Open Access   (Followers: 10)
Journal of Global Information Management     Full-text available via subscription   (Followers: 9)
Southeastern Librarian     Open Access   (Followers: 9)
e & i Elektrotechnik und Informationstechnik     Hybrid Journal   (Followers: 8)
JLIS.it     Open Access   (Followers: 8)
International Journal of Multicriteria Decision Making     Hybrid Journal   (Followers: 8)
JISTEM : Journal of Information Systems and Technology Management     Open Access   (Followers: 8)
International Journal of Multimedia Information Retrieval     Partially Free   (Followers: 8)
BIBLOS - Revista do Departamento de Biblioteconomia e História     Open Access   (Followers: 7)
New Review of Information Networking     Hybrid Journal   (Followers: 7)
Idaho Librarian     Free   (Followers: 7)
Slavic & East European Information Resources     Hybrid Journal   (Followers: 6)
Egyptian Informatics Journal     Open Access   (Followers: 6)
Informaatiotutkimus     Open Access   (Followers: 5)
Revista Interamericana de Bibliotecología     Open Access   (Followers: 5)
CIC. Cuadernos de Informacion y Comunicacion     Open Access   (Followers: 5)
Bridgewater Review     Open Access   (Followers: 5)
Bilgi Dünyası     Open Access   (Followers: 5)
Open Systems & Information Dynamics     Hybrid Journal   (Followers: 4)
ProInflow : Journal for Information Sciences     Open Access   (Followers: 4)
Nordic Journal of Library and Information Studies     Open Access   (Followers: 4)
International Journal of Cooperative Information Systems     Hybrid Journal   (Followers: 4)
OJS på dansk     Open Access   (Followers: 4)
Investigación Bibliotecológica     Open Access   (Followers: 4)
Revista Española de Documentación Científica     Open Access   (Followers: 4)
International Journal of Organisational Design and Engineering     Hybrid Journal   (Followers: 3)
Journal of Information Systems Teaching Notes     Hybrid Journal   (Followers: 3)
HLA News     Full-text available via subscription   (Followers: 3)
Encontros Bibli : revista eletrônica de biblioteconomia e ciência da informação     Open Access   (Followers: 3)
SLIS Student Research Journal     Open Access   (Followers: 3)
VRA Bulletin     Open Access   (Followers: 3)
Türk Kütüphaneciliği : Turkish Librarianship     Open Access   (Followers: 2)
Información, Cultura y Sociedad     Open Access   (Followers: 2)
Revista General de Información y Documentación     Open Access   (Followers: 2)
Informação & Informação     Open Access   (Followers: 2)
In Monte Artium     Full-text available via subscription   (Followers: 1)
Knjižnica : Revija za Področje Bibliotekarstva in Informacijske Znanosti     Open Access   (Followers: 1)
Documentación de las Ciencias de la Información     Open Access   (Followers: 1)
Palabra Clave (La Plata)     Open Access  
Liinc em Revista     Open Access  


Similar Journals
International Journal on Digital Libraries
Journal Prestige (SJR): 0.441
Citation Impact (citeScore): 2
Number of Followers: 580  
 
  Hybrid Journal (It can contain Open Access articles)
ISSN (Print) 1432-1300 - ISSN (Online) 1432-5012
Published by Springer-Verlag  [2468 journals]
  • Coverage and similarity of bibliographic databases to find most relevant literature for systematic reviews in education

      Abstract: Systematic literature reviews in educational research have become a popular research method. A key point here is the choice of bibliographic databases to reach a maximum probability of finding all potentially relevant literature that deals with the research question analyzed in a systematic literature review. Guidelines and handbooks on reviews recommend proper databases and information sources for education, along with specific search strategies. However, in many disciplines, among them educational research, there is a lack of evidence on the relevance of databases that need to be considered to find relevant literature and lessen the risk of missing relevant publications. Educational research is an interdisciplinary field and has no core database. Instead, the field is covered by multiple disciplinary and multidisciplinary information sources that have either a national or international focus. In this article, we discuss the relevance of seven databases in systematic literature reviews in education, based on results of an empirical data analysis of three recently published reviews. To evaluate the relevance of a database, the relevant literature of those reviews served as the gold standard. Results indicate that discipline-specific databases outperform international multidisciplinary sources, and a combination of discipline-specific international and national sources is most efficient in finding a high proportion of relevant literature. The article discusses the relevance of the databases in relation to their coverage of relevant literature, while considering practical implications for researchers performing a systematic literature search. We thus present evidence for proper database choices for educational and discipline-related systematic literature reviews.
      PubDate: 2023-05-24
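
The gold-standard evaluation described in this abstract can be illustrated with a small sketch. It assumes that coverage means the share of gold-standard records a database (or a combination of databases) contains. The record identifiers, database names, and function names below are hypothetical and are not taken from the article.

```python
from typing import Dict, Iterable, Set


def coverage(database: Set[str], gold: Set[str]) -> float:
    """Share of gold-standard records that the database contains."""
    if not gold:
        raise ValueError("gold standard must not be empty")
    return len(database & gold) / len(gold)


def combined_coverage(databases: Iterable[Set[str]], gold: Set[str]) -> float:
    """Coverage reached when several databases are searched together."""
    merged: Set[str] = set()
    for db in databases:
        merged |= db
    return coverage(merged, gold)


# Hypothetical data: four relevant records and three databases.
gold = {"r1", "r2", "r3", "r4"}
holdings: Dict[str, Set[str]] = {
    "discipline_db": {"r1", "r2", "r3", "x9"},
    "national_db": {"r3", "r4"},
    "multidisciplinary_db": {"r1", "x7"},
}

for name, records in holdings.items():
    print(name, coverage(records, gold))

print(
    "discipline + national",
    combined_coverage([holdings["discipline_db"], holdings["national_db"]], gold),
)
```

Records outside the gold standard (here `x9` and `x7`) do not affect coverage; they would matter only for a precision-style measure, which this sketch leaves out.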
       
  • Correction: Beyond translation: engaging with foreign languages in a digital library

      PubDate: 2023-05-19
       
  • Self-training involving semantic-space finetuning for semi-supervised multi-label document classification

      Abstract: Self-training is an effective solution for semi-supervised learning, in which both labeled and unlabeled data are leveraged for training. However, the application scenarios of existing self-training frameworks are mostly confined to single-label classification. Applying self-training in the multi-label scenario is difficult because, unlike in single-label classification, there is no constraint of mutual exclusion over categories, and the vast number of possible label vectors makes the discovery of credible predictions harder. To realize effective self-training in the multi-label scenario, we propose ML-DST and ML-DST+, which utilize contextualized document representations of pretrained language models. A BERT-based multi-label classifier and newly designed weighted loss functions for finetuning are proposed. Two label propagation-based algorithms, SemLPA and SemLPA+, are also proposed to enhance multi-label prediction. Their similarity measure is iteratively improved through semantic-space finetuning, by which the semantic space consisting of document representations is finetuned to better reflect learnt label correlations. High-confidence label predictions are recognized by examining the prediction score on each category separately, and are in turn used for both classifier finetuning and semantic-space finetuning. According to our experimental results, the performance of our approach consistently exceeds the representative baselines under different label rates, demonstrating the superiority of our proposed approach.
      PubDate: 2023-05-11
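
The idea of judging each category's prediction score separately can be sketched as follows. This is only an illustration of per-category confidence filtering, not the ML-DST method itself: the function name and the two thresholds (`hi`, `lo`) are assumptions chosen for the example.

```python
from typing import List, Optional, Sequence


def pseudo_labels(
    scores: Sequence[float], hi: float = 0.9, lo: float = 0.1
) -> List[Optional[int]]:
    """Turn per-category scores into pseudo-labels, one category at a time.

    A score at or above `hi` becomes a confident positive (1), a score at or
    below `lo` a confident negative (0), and anything in between stays
    undecided (None) so it is not used for finetuning.
    The thresholds are hypothetical, not values from the article.
    """
    labels: List[Optional[int]] = []
    for score in scores:
        if score >= hi:
            labels.append(1)
        elif score <= lo:
            labels.append(0)
        else:
            labels.append(None)
    return labels


# One unlabeled document scored on three categories.
print(pseudo_labels([0.95, 0.50, 0.02]))
```

Because categories are not mutually exclusive, a document can receive several confident positives at once, which is why each score is thresholded independently rather than taking the single best category.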
       
  • DETEXA: declarative extensible text exploration and analysis through SQL

      Abstract: Metadata enrichment through text mining techniques is becoming one of the most significant tasks in digital libraries. Due to the exponential increase of open access publications, several new challenges have emerged. Raw data are usually big, unstructured, and come from heterogeneous data sources. In this paper, we introduce a text analysis framework implemented in extended SQL that exploits the scalability characteristics of modern database management systems. The purpose of this framework is to provide the opportunity to build performant end-to-end text mining pipelines which include data harvesting, cleaning, processing, and text analysis at once. SQL is selected due to its declarative nature which offers fast experimentation and the ability to build APIs so that domain experts can edit text mining workflows via easy-to-use graphical interfaces. Our experimental analysis demonstrates that the proposed framework is very effective and achieves significant speedup, up to three times faster, in common use cases compared to other popular approaches.
      PubDate: 2023-05-10
       
  • Predicting answer acceptability for question-answering system

      Abstract: Question-answering (QA) platforms such as Stack Overflow, Quora, and Stack Exchange have become favourite places to exchange knowledge with community users. Finding answers to simple or complex questions is easier on QA platforms nowadays. Due to the large number of responses from users all around the world, these community QA (CQA) systems are currently facing massive problems. Stack Overflow allows users to ask questions and give answers or comments on others’ posts. Stack Overflow also rewards users whose posts are appreciated by the community with reputation points. An accepted answer earns the answerer the most reputation points, and more reputation points unlock more website privileges. Hence, each answerer needs to get their answer accepted. Very little research has been done to check whether a user’s answer will be accepted or not. This paper proposes a model that predicts answer acceptability and its reason. The model’s findings help the answerer learn whether the answer is likely to be accepted; if the predicted probability of acceptance is low, the answerer might revise their answer immediately. A comparison with the state-of-the-art literature confirmed that the proposed model achieves better performance.
      PubDate: 2023-05-05
       
  • Deep author name disambiguation using DBLP data

      Abstract: In the academic world, the number of scientists grows every year and so does the number of authors sharing the same names. Consequently, it is challenging to assign newly published papers to their respective authors. Therefore, author name ambiguity is considered a critical open problem in digital libraries. This paper proposes an author name disambiguation approach that links author names to their real-world entities by leveraging their co-authors and domain of research. To this end, we use data collected from the DBLP repository that contains more than 5 million bibliographic records authored by around 2.6 million co-authors. Our approach first groups authors who share the same last names and same first name initials. The author within each group is identified by capturing the relation with his/her co-authors and area of research, represented by the titles of the validated publications of the corresponding author. To this end, we train a neural network model that learns from the representations of the co-authors and titles. We validated the effectiveness of our approach by conducting extensive experiments on a large dataset.
      PubDate: 2023-05-04
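
The first step described in this abstract, grouping authors who share a last name and first-name initial, can be sketched as a simple blocking function. The sketch assumes names are written as "First Last" with whitespace between parts; the function names and sample names are hypothetical, and real DBLP name strings may need more careful parsing.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

BlockKey = Tuple[str, str]


def blocking_key(full_name: str) -> BlockKey:
    """Return (last name, first-name initial), both lower-cased."""
    parts = full_name.strip().split()
    if not parts:
        raise ValueError("author name must not be empty")
    return parts[-1].lower(), parts[0][0].lower()


def block_authors(names: Iterable[str]) -> Dict[BlockKey, List[str]]:
    """Group name strings into candidate blocks that share a key."""
    blocks: Dict[BlockKey, List[str]] = defaultdict(list)
    for name in names:
        blocks[blocking_key(name)].append(name)
    return dict(blocks)


blocks = block_authors(["Jane Smith", "John Smith", "J. Smith", "Ann Lee"])
for key, members in blocks.items():
    print(key, members)
```

Blocking only narrows the candidates. Deciding which names inside a block refer to the same person is the part the abstract assigns to the neural model trained on co-author and title representations.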
       
  • Retrievability in an integrated retrieval system: an extended study

      Abstract: Retrievability measures the influence a retrieval system has on the access to information in a given collection of items. This measure can help in making an evaluation of the search system based on which insights can be drawn. In this paper, we investigate the retrievability in an integrated search system consisting of items from various categories, particularly focussing on datasets, publications and variables in a real-life digital library. The traditional metrics, that is, the Lorenz curve and Gini coefficient, are employed to visualise the diversity in retrievability scores of the three retrievable document types (specifically datasets, publications, and variables). Our results show a significant popularity bias with certain items being retrieved more often than others. Particularly, it has been shown that certain datasets are more likely to be retrieved than other datasets in the same category. In contrast, the retrievability scores of items from the variable or publication category are more evenly distributed. We have observed that the distribution of document retrievability is more diverse for datasets as compared to publications and variables.
      PubDate: 2023-04-28
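
The two inequality measures named in this abstract can be computed from a list of retrievability scores as sketched below. The sketch uses the standard rank-based formula for the Gini coefficient; the function names and sample scores are illustrative and do not come from the study.

```python
from typing import List, Sequence


def lorenz_curve(scores: Sequence[float]) -> List[float]:
    """Cumulative share of total retrievability, items sorted ascending."""
    ordered = sorted(scores)
    total = sum(ordered)
    if total == 0:
        raise ValueError("scores must contain at least one non-zero value")
    points = [0.0]
    running = 0.0
    for score in ordered:
        running += score
        points.append(running / total)
    return points


def gini(scores: Sequence[float]) -> float:
    """Gini coefficient: 0 for equal scores, (n - 1) / n for maximal skew."""
    ordered = sorted(scores)
    n = len(ordered)
    total = sum(ordered)
    if n == 0 or total == 0:
        raise ValueError("scores must contain at least one non-zero value")
    weighted = sum(rank * score for rank, score in enumerate(ordered, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


print(gini([1, 1, 1, 1]))    # every item equally retrievable
print(gini([0, 0, 0, 4]))    # one item receives all retrievals
print(lorenz_curve([1, 3]))
```

A higher Gini value for one document type than for another indicates that retrievals are concentrated on fewer items of that type, which is the kind of comparison the abstract reports for datasets versus publications and variables.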
       
  • A discovery system for narrative query graphs: entity-interaction-aware document retrieval

      Abstract: Finding relevant publications in the scientific domain can be quite tedious: accessing large-scale document collections often means formulating an initial keyword-based query followed by many refinements to retrieve a sufficiently complete, yet manageable set of documents to satisfy one’s information need. Since keyword-based search limits researchers to formulating their information needs as a set of unconnected keywords, retrieval systems try to guess each user’s intent. In contrast, distilling short narratives of the searchers’ information needs into simple, yet precise entity-interaction graph patterns provides all information needed for a precise search. As an additional benefit, such graph patterns may also feature variable nodes to flexibly allow for different substitutions of entities taking a specified role. An evaluation over the PubMed document collection quantifies the gains in precision for our novel entity-interaction-aware search. Moreover, we perform expert interviews and a questionnaire to verify the usefulness of our system in practice. This paper extends our previous work by giving a comprehensive overview of the discovery system that realizes narrative query graph retrieval.
      PubDate: 2023-04-24
       
  • Towards automated meta-review generation via an NLP/ML pipeline in different stages of the scholarly peer review process

      Abstract: With the ever-increasing number of submissions in top-tier conferences and journals, finding good reviewers and meta-reviewers is becoming increasingly difficult. Writing a meta-review is not straightforward as it involves a series of sub-tasks, including making a decision on the paper based on the reviewers’ recommendations and their confidence in those recommendations, mitigating disagreements among the reviewers, and other similar tasks. In this work, we develop a novel approach to automatically generate meta-reviews that are decision-aware and that also take into account a set of relevant sub-tasks in the peer-review process. More specifically, we first predict the recommendation scores and confidence scores for the reviews, which we then use to predict the decision on a particular manuscript. Finally, we utilize the decision signals for generating the meta-reviews using a transformer-based seq2seq architecture. Our proposed pipelined approach for automatic decision-aware meta-review generation achieves significant performance improvement over the standard summarization baselines as well as relevant prior works on this problem. We make our code available at https://github.com/saprativa/seq-to-seq-decision-aware-mrg.
      PubDate: 2023-04-24
       
  • Approximate nearest neighbor for long document relationship labeling in digital libraries

      Abstract: Relationship tagging of long text documents is a growing need in information science, spurred by the emergence of multi-million book bibliographic digital libraries. Large digital libraries offer an unprecedented glimpse into cultural history through their collections, but the combination of collection scale and document length complicates their study, given that prior work on large corpora has dealt primarily with much shorter texts. This study presents and evaluates an approach for fast retrieval on long texts, which leverages a chunk-and-aggregate approach with document sub-units to capture nuanced similarity relationships at scales which are not otherwise tractable. This approach is evaluated on book relationships from the HathiTrust Digital Library and shows strong results for relationships beyond exact duplicates. Finally, we argue for the value of approximate nearest neighbor search for narrowing the search space for downstream classification and retrieval contexts.
      PubDate: 2023-04-15
       
  • Creating and validating a scholarly knowledge graph using natural language processing and microtask crowdsourcing

      Abstract: Due to the growing number of scholarly publications, finding relevant articles becomes increasingly difficult. Scholarly knowledge graphs can be used to organize the scholarly knowledge presented within those publications and represent them in machine-readable formats. Natural language processing (NLP) provides scalable methods to automatically extract knowledge from articles and populate scholarly knowledge graphs. However, NLP extraction is generally not sufficiently accurate and, thus, fails to generate high granularity quality data. In this work, we present TinyGenius, a methodology to validate NLP-extracted scholarly knowledge statements using microtasks performed with crowdsourcing. TinyGenius is employed to populate a paper-centric knowledge graph, using five distinct NLP methods. We extend our previous work of the TinyGenius methodology in various ways. Specifically, we discuss the NLP tasks in more detail and include an explanation of the data model. Moreover, we present a user evaluation where participants validate the generated NLP statements. The results indicate that employing microtasks for statement validation is a promising approach despite the varying participant agreement for different microtasks.
      PubDate: 2023-04-05
       
  • Referencing behaviours across disciplines: publication types and common metadata for defining bibliographic references

      Abstract: In this work, we investigate existing citation practices by analysing a large set of journal articles to measure which metadata are used across the various scholarly disciplines, independently of the particular citation style adopted, for defining bibliographic references. We selected the most cited journals in each of the 27 subject areas listed in the SCImago Journal Rank in the 2015–2017 triennium according to the SCImago total cites ranking. Each journal in the sample was represented by five articles (in PDF format) from the most recent issue published in October 2019, for a total of 729 articles. We extracted all 34,140 bibliographic references in the reference lists of these articles. Finally, we detected the types of cited works in each discipline and the structure of bibliographic references and in-text reference pointers for each type of cited work. By analysing the data gathered, we observed that the bibliographic references in our sample referenced 36 different types of cited works. Such a considerable variety of publications revealed the existence of particular citing behaviours in scientific articles that varied from subject area to subject area.
      PubDate: 2023-03-27
       
  • Scientific document processing: challenges for modern learning methods

      Abstract: Neural network models enjoy success on language tasks related to Web documents, including news and Wikipedia articles. However, the characteristics of scientific publications pose specific challenges that have yet to be satisfactorily addressed: the discourse structure of scientific documents, which is crucial in scholarly document processing (SDP) tasks; the interconnected nature of scientific documents; and their multimodal nature. We survey modern neural network learning methods that tackle these challenges: those that can model the discourse structure and interconnectivity of scientific documents and exploit their multimodal nature. We also highlight efforts to collect large-scale datasets and tools developed to enable effective deep learning deployment for SDP. We conclude with a discussion of upcoming trends and recommend future directions for pursuing neural natural language processing approaches for SDP.
      PubDate: 2023-03-24
       
  • The digitization of historical astrophysical literature with highly
           localized figures and figure captions

      Abstract: Scientific articles published prior to the “age of digitization” in the late 1990s contain figures which are “trapped” within their scanned pages. While progress has been made in extracting figures and their captions, there is currently no robust method for this process. We present a YOLO-based method for use on scanned pages, after they have been processed with optical character recognition (OCR), which uses both grayscale and OCR features. We focus our efforts on translating the intersection-over-union (IOU) metric from the field of object detection to document layout analysis and quantify “high localization” levels as an IOU of 0.9. When applied to the astrophysics literature holdings of the NASA Astrophysics Data System, we find F1 scores of 90.9% (92.2%) for figures (figure captions) with the IOU cut-off of 0.9, which is a significant improvement over other state-of-the-art methods.
      PubDate: 2023-03-22
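
The intersection-over-union criterion mentioned in this abstract can be illustrated with a minimal sketch. This is not the authors' code: the box format `(x_min, y_min, x_max, y_max)` and the function names `iou` and `is_highly_localized` are assumptions made for illustration; only the 0.9 cut-off comes from the abstract.

```python
from typing import Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in page pixel coordinates.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Return the intersection-over-union of two axis-aligned boxes."""
    # Width and height of the overlap region (zero if the boxes are disjoint).
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_highly_localized(pred: Box, truth: Box, threshold: float = 0.9) -> bool:
    """Hypothetical helper: count a detection as correct only at IOU >= threshold."""
    return iou(pred, truth) >= threshold
```

Under this sketch, a predicted figure box counts as a true positive only if it overlaps the ground-truth box almost exactly, which is stricter than the looser cut-offs often used in general object detection.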
       
  • Beyond translation: engaging with foreign languages in a digital library

      Abstract: Digital libraries can enable their patrons to go beyond modern language translations and to engage directly with sources in more languages than any individual could study, much less master. Translations should be viewed not so much as an end but as an entry point into the sources that they represent. In the case of highly studied sources, one or more experts can curate the network of annotations that support such reading. A digital library should, however, automatically create a serviceable first version of such a multilingual edition. Such a service is possible but benefits from (if it does not require) a new generation of increasingly well-designed machine-readable translations, lexica, grammars, and encyclopedias. This paper reports on exploratory work that uses the Homeric epics to explore this wider topic and on the more general application of the results.
      PubDate: 2023-03-19
       
  • CH-Bench: a user-oriented benchmark for systems for efficient distant
           reading (design, performance, and insights)

      Abstract: Data science deals with the discovery of information from large volumes of data. The data studied by scientists in the humanities include large textual corpora. An important objective is to study the ideas and expectations of a society regarding specific concepts, like “freedom” or “democracy,” both for today’s society and even more for societies of the past. Studying the meaning of words using large corpora requires efficient systems for text analysis, so-called distant reading systems. Making such systems efficient calls for a specification of the necessary functionality and clear expectations regarding typical workloads. However, both are currently unclear, and there is no benchmark to evaluate distant reading systems. In this article, we propose such a benchmark, with the following innovations: As a first step, we collect and structure various information needs of the target users. We then formalize the notion of word context to facilitate the analysis of specific concepts. Using this notion, we formulate queries in line with the information needs of users. Finally, based on this, we propose concrete benchmark queries. To demonstrate the benefit of our benchmark, we conduct an evaluation with two objectives. First, we aim at insights regarding the content of different corpora, i.e., whether and how their size and nature (e.g., popular and broad literature or specific expert literature) affect results. Second, we benchmark different data management technologies. This has allowed us to identify performance bottlenecks.
      PubDate: 2023-03-15
      DOI: 10.1007/s00799-023-00347-4
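
The abstract does not give the paper's formal definition of word context, so the following is only a rough sketch of one common reading: the tokens within a fixed window around each occurrence of a target word. The function name `word_contexts`, the `window` parameter, and the simple regex tokenizer are all assumptions for illustration.

```python
import re
from typing import List


def word_contexts(text: str, target: str, window: int = 3) -> List[List[str]]:
    """Collect the tokens within `window` positions of each occurrence of `target`.

    Assumption: context is a symmetric token window; the benchmark itself
    may define context differently.
    """
    # Naive tokenization: lowercase runs of word characters.
    tokens = re.findall(r"\w+", text.lower())
    target = target.lower()

    contexts = []
    for i, token in enumerate(tokens):
        if token == target:
            left = tokens[max(0, i - window):i]
            right = tokens[i + 1:i + 1 + window]
            contexts.append(left + right)
    return contexts
```

A distant reading query could then aggregate these windows over a corpus, for example by counting which words co-occur most often with “freedom” in different periods.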
       
  • DeepMetaGen: an unsupervised deep neural approach to generate
           template-based meta-reviews leveraging on aspect category and sentiment
           analysis from peer reviews

      Abstract: Peer reviews form an essential part of scientific communication. Scholarly peer review is probably the most accepted way to evaluate research papers by involving multiple experts to review the concerned research independently. Usually, the area chair, the program chair, or the editor makes a decision by weighing the reviewers’ judgments and communicates it to the author by writing a meta-review that summarizes the review comments. With the exponential rise in research paper submissions and the corresponding rise in the reviewer pool, it becomes stressful for the chairs/editors to manage conflicts, arrive at a consensus, and also write an informative meta-review. In this work, we propose a novel deep neural network-based approach for generating meta-reviews in an unsupervised fashion. To generate consistent meta-reviews, we use a generic template, where the task is to slot-fill the template with the generated meta-review text. We consider the setting where only peer reviews with no summaries or meta-reviews are provided and propose an end-to-end neural network model to perform unsupervised opinion-based abstractive summarization. We first use an aspect-based sentiment analysis model, which classifies the review sentences with the corresponding aspects (e.g., novelty, substance, soundness, etc.) and sentiment. We then extract opinion phrases from reviews for the corresponding aspect and sentiment labels. Next, we train a Transformer model to reconstruct the original reviews from these extractions. Finally, we filter the selected opinions according to their aspect and/or sentiment at the time of summarization. The selected opinions of each aspect are used as input to the trained Transformer model, which uses them to construct an opinion summary. The idea is to give a concise meta-review that maximizes information coverage by focusing on aspects and sentiment present in the review, coherence, readability, and redundancy. We evaluate our model on human-written template-based meta-reviews to show that our framework outperforms competitive baselines. We believe that template-based meta-review generation focusing on aspect and sentiment will help the editor/chair in decision-making and assist the meta-reviewer in writing better and more informative meta-reviews. We make our code available at https://github.com/sandeep82945/Unsupervised-meta-review-generation.
      PubDate: 2023-03-10
      DOI: 10.1007/s00799-023-00348-3
       
  • Implications of an ecospatial indigenous perspective on digital
           information organization and access

      Abstract: The digitalisation of indigenous knowledge has been challenging, considering epistemological differences and the lack of involvement of indigenous people. Drawing from our most recent community projects in Namibia, we share insights on indigenous ecospatial worldviews guiding the design of digital information organization of, and access to, indigenous knowledge. With emerging technologies, such as augmented and virtual reality, offering new opportunities for richer and more meaningful spatial and embodied accounts of indigenous knowledge, we re-imagine digital libraries inclusive of indigenous people and their worldviews.
      PubDate: 2023-03-07
      DOI: 10.1007/s00799-023-00353-6
       
  • Transliterating Latin to Amharic scripts using user-defined rules and
           character mappings

      Abstract: As social media platforms become increasingly accessible, individuals’ usage of new forms of textual communication (posts, comments, chats, etc.) on social media using local language scripts such as Amharic has increased tremendously. However, many users prefer to post comments in Latin script instead of local ones due to the availability of more convenient forms of character input using Latin keyboards. In existing Latin to Amharic transliteration systems, the lack of consideration of double consonants and double vowels has caused transliteration errors. Further, as existing systems use multiple character mapping conventions, social media texts are susceptible to a wide variety of user adoptions during script production. The current systems have failed to address these gaps and adoptions. In this work, we present the RBLatAm (Rule-Based Latin to Amharic) transliteration system, a generic rule-based system that converts Amharic words which have been written using Latin script back into their native Amharic script. The system is based on mapping rules engineered from three existing transliteration systems (Microsoft, Google, SERA), additional rules for double consonants, and conventions adopted on social media by speakers of Amharic. When tested on transliterated Amharic words of non-named entities and named entities of persons, the system achieves an accuracy of 75.8% and 84.6%, respectively. The system also correctly transliterates words reported as errors in previous studies. This system drastically improves the basis for performing research on text mining for Amharic language texts by being able to process such texts even if they have originally been produced in Latin script.
      PubDate: 2023-03-02
      DOI: 10.1007/s00799-023-00346-5
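
As a rough illustration of rule-based transliteration with character mappings, the sketch below applies a greedy longest-match lookup over a tiny mapping table. It is not the RBLatAm system: the table `TOY_MAP` is a hypothetical six-entry sample in a SERA-like style, the function name `transliterate` is an assumption, and the system's double-consonant and social media rules are not modelled here.

```python
from typing import Dict

# Hypothetical toy table (SERA-like); a real system would need the full
# Ethiopic syllabary plus the additional rules described in the abstract.
TOY_MAP: Dict[str, str] = {
    "se": "\u1230",  # ሰ
    "la": "\u120b",  # ላ
    "le": "\u1208",  # ለ
    "s": "\u1235",   # ስ
    "l": "\u120d",   # ል
    "m": "\u121d",   # ም
}


def transliterate(word: str, mapping: Dict[str, str] = TOY_MAP) -> str:
    """Convert Latin-script input to Ethiopic using greedy longest-match rules."""
    text = word.lower()
    max_len = max(len(key) for key in mapping)
    out = []
    i = 0
    while i < len(text):
        # Try the longest candidate first so "se" wins over "s" followed by "e".
        for size in range(min(max_len, len(text) - i), 0, -1):
            chunk = text[i:i + size]
            if chunk in mapping:
                out.append(mapping[chunk])
                i += size
                break
        else:
            # No rule matched: pass the character through unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)
```

With this toy table, `transliterate("selam")` matches the rules `se`, `la`, and `m` in turn. Trying longer keys first is what keeps a consonant-vowel pair from being split into a bare consonant plus a stray vowel.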
       
  • Design, realization, and user evaluation of the ARCA system for exploring
           a digital library

      Abstract: This paper presents ARCA, a software system that enables semantic search and exploration over a book catalog. The purpose of this work is twofold: to propose a general paradigm for a semantic enrichment workflow and to evaluate a visual approach to information retrieval based on extracted information and existing knowledge graphs. ARCA has been designed and implemented following a user-centered design approach. Two different releases of the system have been incrementally and iteratively developed and evaluated. The first release was used to evaluate the quality and usefulness of the extracted data. The second release, whose design was a refinement based on the previous evaluation results, was assessed by several users. Moreover, a comparative test with other information retrieval systems was conducted in order to study the potential added value of the system. ARCA is employed in a real editorial scenario to visually search and explore the books of a publishing house.
      PubDate: 2022-12-16
      DOI: 10.1007/s00799-022-00343-0
       
 
JournalTOCs
School of Mathematical and Computer Sciences
Heriot-Watt University
Edinburgh, EH14 4AS, UK
Email: journaltocs@hw.ac.uk
Tel: +00 44 (0)131 4513762
 


JournalTOCs © 2009-