Publisher: Oxford University Press   (Total: 393 journals)

Showing 1 - 200 of 393 Journals sorted alphabetically
Acta Biochimica et Biophysica Sinica     Hybrid Journal   (Followers: 5, SJR: 0.881, h-index: 38)
Adaptation     Hybrid Journal   (Followers: 8, SJR: 0.111, h-index: 4)
Advances in Nutrition     Hybrid Journal   (Followers: 42, SJR: 2.075, h-index: 36)
Aesthetic Surgery J.     Hybrid Journal   (Followers: 6, SJR: 1.538, h-index: 35)
African Affairs     Hybrid Journal   (Followers: 65, SJR: 1.512, h-index: 46)
Age and Ageing     Hybrid Journal   (Followers: 86, SJR: 1.611, h-index: 107)
Alcohol and Alcoholism     Hybrid Journal   (Followers: 18, SJR: 0.935, h-index: 80)
American Entomologist     Full-text available via subscription   (Followers: 6)
American Historical Review     Hybrid Journal   (Followers: 150, SJR: 0.652, h-index: 43)
American J. of Agricultural Economics     Hybrid Journal   (Followers: 40, SJR: 1.441, h-index: 77)
American J. of Clinical Nutrition     Hybrid Journal   (Followers: 146, SJR: 3.771, h-index: 262)
American J. of Epidemiology     Hybrid Journal   (Followers: 171, SJR: 3.047, h-index: 201)
American J. of Hypertension     Hybrid Journal   (Followers: 25, SJR: 1.397, h-index: 111)
American J. of Jurisprudence     Hybrid Journal   (Followers: 18)
American J. of Legal History     Full-text available via subscription   (Followers: 9, SJR: 0.151, h-index: 7)
American Law and Economics Review     Hybrid Journal   (Followers: 27, SJR: 0.824, h-index: 23)
American Literary History     Hybrid Journal   (Followers: 15, SJR: 0.185, h-index: 22)
Analysis     Hybrid Journal   (Followers: 21)
Animal Frontiers     Hybrid Journal  
Annals of Behavioral Medicine     Hybrid Journal   (Followers: 14, SJR: 2.112, h-index: 98)
Annals of Botany     Hybrid Journal   (Followers: 35, SJR: 1.912, h-index: 124)
Annals of Oncology     Hybrid Journal   (Followers: 47, SJR: 4.362, h-index: 173)
Annals of the Entomological Society of America     Full-text available via subscription   (Followers: 8, SJR: 0.642, h-index: 53)
Annals of Work Exposures and Health     Hybrid Journal   (Followers: 29, SJR: 0.837, h-index: 57)
AoB Plants     Open Access   (Followers: 4, SJR: 0.78, h-index: 10)
Applied Economic Perspectives and Policy     Hybrid Journal   (Followers: 16, SJR: 0.884, h-index: 31)
Applied Linguistics     Hybrid Journal   (Followers: 55, SJR: 1.749, h-index: 63)
Applied Mathematics Research eXpress     Hybrid Journal   (Followers: 1, SJR: 0.779, h-index: 11)
Arbitration Intl.     Full-text available via subscription   (Followers: 20)
Arbitration Law Reports and Review     Hybrid Journal   (Followers: 14)
Archives of Clinical Neuropsychology     Hybrid Journal   (Followers: 30, SJR: 0.96, h-index: 71)
Aristotelian Society Supplementary Volume     Hybrid Journal   (Followers: 3, SJR: 0.102, h-index: 20)
Arthropod Management Tests     Hybrid Journal   (Followers: 2)
Astronomy & Geophysics     Hybrid Journal   (Followers: 42, SJR: 0.144, h-index: 15)
Behavioral Ecology     Hybrid Journal   (Followers: 51, SJR: 1.698, h-index: 92)
Bioinformatics     Hybrid Journal   (Followers: 291, SJR: 4.643, h-index: 271)
Biology Methods and Protocols     Hybrid Journal  
Biology of Reproduction     Full-text available via subscription   (Followers: 10, SJR: 1.646, h-index: 149)
Biometrika     Hybrid Journal   (Followers: 20, SJR: 2.801, h-index: 90)
BioScience     Hybrid Journal   (Followers: 30, SJR: 2.374, h-index: 154)
Bioscience Horizons : The National Undergraduate Research J.     Open Access   (Followers: 1, SJR: 0.213, h-index: 9)
Biostatistics     Hybrid Journal   (Followers: 17, SJR: 1.955, h-index: 55)
BJA : British J. of Anaesthesia     Hybrid Journal   (Followers: 165, SJR: 2.314, h-index: 133)
BJA Education     Hybrid Journal   (Followers: 64, SJR: 0.272, h-index: 20)
Brain     Hybrid Journal   (Followers: 68, SJR: 6.097, h-index: 264)
Briefings in Bioinformatics     Hybrid Journal   (Followers: 45, SJR: 4.086, h-index: 73)
Briefings in Functional Genomics     Hybrid Journal   (Followers: 3, SJR: 1.771, h-index: 50)
British J. for the Philosophy of Science     Hybrid Journal   (Followers: 33, SJR: 1.267, h-index: 38)
British J. of Aesthetics     Hybrid Journal   (Followers: 26, SJR: 0.217, h-index: 18)
British J. of Criminology     Hybrid Journal   (Followers: 585, SJR: 1.373, h-index: 62)
British J. of Social Work     Hybrid Journal   (Followers: 86, SJR: 0.771, h-index: 53)
British Medical Bulletin     Hybrid Journal   (Followers: 7, SJR: 1.391, h-index: 84)
British Yearbook of Intl. Law     Hybrid Journal   (Followers: 31)
Bulletin of the London Mathematical Society     Hybrid Journal   (Followers: 4, SJR: 1.474, h-index: 31)
Cambridge J. of Economics     Hybrid Journal   (Followers: 61, SJR: 0.957, h-index: 59)
Cambridge J. of Regions, Economy and Society     Hybrid Journal   (Followers: 10, SJR: 1.067, h-index: 22)
Cambridge Quarterly     Hybrid Journal   (Followers: 9, SJR: 0.1, h-index: 7)
Capital Markets Law J.     Hybrid Journal   (Followers: 2)
Carcinogenesis     Hybrid Journal   (Followers: 2, SJR: 2.439, h-index: 167)
Cardiovascular Research     Hybrid Journal   (Followers: 13, SJR: 2.897, h-index: 175)
Cerebral Cortex     Hybrid Journal   (Followers: 45, SJR: 4.827, h-index: 192)
CESifo Economic Studies     Hybrid Journal   (Followers: 17, SJR: 0.501, h-index: 19)
Chemical Senses     Hybrid Journal   (Followers: 1, SJR: 1.436, h-index: 76)
Children and Schools     Hybrid Journal   (Followers: 5, SJR: 0.211, h-index: 18)
Chinese J. of Comparative Law     Hybrid Journal   (Followers: 4)
Chinese J. of Intl. Law     Hybrid Journal   (Followers: 22, SJR: 0.737, h-index: 11)
Chinese J. of Intl. Politics     Hybrid Journal   (Followers: 8, SJR: 1.238, h-index: 15)
Christian Bioethics: Non-Ecumenical Studies in Medical Morality     Hybrid Journal   (Followers: 10, SJR: 0.191, h-index: 8)
Classical Receptions J.     Hybrid Journal   (Followers: 25, SJR: 0.1, h-index: 3)
Clean Energy     Open Access  
Clinical Infectious Diseases     Hybrid Journal   (Followers: 62, SJR: 4.742, h-index: 261)
Clinical Kidney J.     Open Access   (Followers: 3, SJR: 0.338, h-index: 19)
Communication Theory     Hybrid Journal   (Followers: 21, SJR: 2.62, h-index: 53)
Communication, Culture & Critique     Hybrid Journal   (Followers: 25)
Community Development J.     Hybrid Journal   (Followers: 27, SJR: 0.47, h-index: 28)
Computer J.     Hybrid Journal   (Followers: 9, SJR: 0.371, h-index: 47)
Conservation Physiology     Open Access   (Followers: 2)
Contemporary Women's Writing     Hybrid Journal   (Followers: 9, SJR: 0.111, h-index: 3)
Contributions to Political Economy     Hybrid Journal   (Followers: 5, SJR: 0.313, h-index: 10)
Critical Values     Full-text available via subscription  
Current Developments in Nutrition     Open Access  
Current Legal Problems     Hybrid Journal   (Followers: 27)
Current Zoology     Full-text available via subscription   (Followers: 1, SJR: 0.999, h-index: 20)
Database : The J. of Biological Databases and Curation     Open Access   (Followers: 8, SJR: 1.068, h-index: 24)
Digital Scholarship in the Humanities     Hybrid Journal   (Followers: 14)
Diplomatic History     Hybrid Journal   (Followers: 20, SJR: 0.296, h-index: 22)
DNA Research     Open Access   (Followers: 5, SJR: 2.42, h-index: 77)
Dynamics and Statistics of the Climate System     Open Access   (Followers: 3)
Early Music     Hybrid Journal   (Followers: 15, SJR: 0.124, h-index: 11)
Economic Policy     Hybrid Journal   (Followers: 39, SJR: 2.052, h-index: 52)
ELT J.     Hybrid Journal   (Followers: 24, SJR: 1.26, h-index: 23)
English Historical Review     Hybrid Journal   (Followers: 51, SJR: 0.311, h-index: 10)
English: J. of the English Association     Hybrid Journal   (Followers: 14, SJR: 0.144, h-index: 3)
Environmental Entomology     Full-text available via subscription   (Followers: 11, SJR: 0.791, h-index: 66)
Environmental Epigenetics     Open Access   (Followers: 3)
Environmental History     Hybrid Journal   (Followers: 26, SJR: 0.197, h-index: 25)
EP-Europace     Hybrid Journal   (Followers: 2, SJR: 2.201, h-index: 71)
Epidemiologic Reviews     Hybrid Journal   (Followers: 9, SJR: 3.917, h-index: 81)
ESHRE Monographs     Hybrid Journal  
Essays in Criticism     Hybrid Journal   (Followers: 16, SJR: 0.1, h-index: 6)
European Heart J.     Hybrid Journal   (Followers: 54, SJR: 6.997, h-index: 227)
European Heart J. - Cardiovascular Imaging     Hybrid Journal   (Followers: 8, SJR: 2.044, h-index: 58)
European Heart J. - Cardiovascular Pharmacotherapy     Full-text available via subscription   (Followers: 1)
European Heart J. - Quality of Care and Clinical Outcomes     Hybrid Journal  
European Heart J. Supplements     Hybrid Journal   (Followers: 7, SJR: 0.152, h-index: 31)
European J. of Cardio-Thoracic Surgery     Hybrid Journal   (Followers: 9, SJR: 1.568, h-index: 104)
European J. of Intl. Law     Hybrid Journal   (Followers: 173, SJR: 0.722, h-index: 38)
European J. of Orthodontics     Hybrid Journal   (Followers: 4, SJR: 1.09, h-index: 60)
European J. of Public Health     Hybrid Journal   (Followers: 20, SJR: 1.284, h-index: 64)
European Review of Agricultural Economics     Hybrid Journal   (Followers: 10, SJR: 1.549, h-index: 42)
European Review of Economic History     Hybrid Journal   (Followers: 28, SJR: 0.628, h-index: 24)
European Sociological Review     Hybrid Journal   (Followers: 40, SJR: 2.061, h-index: 53)
Evolution, Medicine, and Public Health     Open Access   (Followers: 10)
Family Practice     Hybrid Journal   (Followers: 14, SJR: 1.048, h-index: 77)
FEMS Microbiology Ecology     Hybrid Journal   (Followers: 10, SJR: 1.687, h-index: 115)
FEMS Microbiology Letters     Hybrid Journal   (Followers: 22, SJR: 1.126, h-index: 118)
FEMS Microbiology Reviews     Hybrid Journal   (Followers: 27, SJR: 7.587, h-index: 150)
FEMS Yeast Research     Hybrid Journal   (Followers: 14, SJR: 1.213, h-index: 66)
Food Quality and Safety     Open Access  
Foreign Policy Analysis     Hybrid Journal   (Followers: 23, SJR: 0.859, h-index: 10)
Forest Science     Hybrid Journal   (Followers: 4, SJR: 0.872, h-index: 59)
Forestry: An Intl. J. of Forest Research     Hybrid Journal   (Followers: 16, SJR: 0.903, h-index: 44)
Forum for Modern Language Studies     Hybrid Journal   (Followers: 6, SJR: 0.108, h-index: 6)
French History     Hybrid Journal   (Followers: 32, SJR: 0.123, h-index: 10)
French Studies     Hybrid Journal   (Followers: 20, SJR: 0.119, h-index: 7)
French Studies Bulletin     Hybrid Journal   (Followers: 10, SJR: 0.102, h-index: 3)
Gastroenterology Report     Open Access   (Followers: 2)
Genome Biology and Evolution     Open Access   (Followers: 12, SJR: 3.22, h-index: 39)
Geophysical J. Intl.     Hybrid Journal   (Followers: 35, SJR: 1.839, h-index: 119)
German History     Hybrid Journal   (Followers: 22, SJR: 0.437, h-index: 13)
GigaScience     Open Access   (Followers: 3)
Global Summitry     Hybrid Journal   (Followers: 1)
Glycobiology     Hybrid Journal   (Followers: 14, SJR: 1.692, h-index: 101)
Health and Social Work     Hybrid Journal   (Followers: 55, SJR: 0.505, h-index: 40)
Health Education Research     Hybrid Journal   (Followers: 13, SJR: 0.814, h-index: 80)
Health Policy and Planning     Hybrid Journal   (Followers: 24, SJR: 1.628, h-index: 66)
Health Promotion Intl.     Hybrid Journal   (Followers: 21, SJR: 0.664, h-index: 60)
History Workshop J.     Hybrid Journal   (Followers: 29, SJR: 0.313, h-index: 20)
Holocaust and Genocide Studies     Hybrid Journal   (Followers: 26, SJR: 0.115, h-index: 13)
Human Communication Research     Hybrid Journal   (Followers: 13, SJR: 2.199, h-index: 61)
Human Molecular Genetics     Hybrid Journal   (Followers: 8, SJR: 4.288, h-index: 233)
Human Reproduction     Hybrid Journal   (Followers: 72, SJR: 2.271, h-index: 179)
Human Reproduction Update     Hybrid Journal   (Followers: 17, SJR: 4.678, h-index: 128)
Human Rights Law Review     Hybrid Journal   (Followers: 60, SJR: 0.7, h-index: 21)
ICES J. of Marine Science: J. du Conseil     Hybrid Journal   (Followers: 51, SJR: 1.233, h-index: 88)
ICSID Review     Hybrid Journal   (Followers: 12)
ILAR J.     Hybrid Journal   (Followers: 2, SJR: 1.099, h-index: 51)
IMA J. of Applied Mathematics     Hybrid Journal   (SJR: 0.329, h-index: 26)
IMA J. of Management Mathematics     Hybrid Journal   (SJR: 0.351, h-index: 20)
IMA J. of Mathematical Control and Information     Hybrid Journal   (Followers: 2, SJR: 0.661, h-index: 28)
IMA J. of Numerical Analysis - advance access     Hybrid Journal   (SJR: 2.032, h-index: 44)
Industrial and Corporate Change     Hybrid Journal   (Followers: 10, SJR: 1.37, h-index: 81)
Industrial Law J.     Hybrid Journal   (Followers: 34, SJR: 0.184, h-index: 15)
Inflammatory Bowel Diseases     Hybrid Journal   (Followers: 42, SJR: 1.994, h-index: 107)
Information and Inference     Free  
Integrative and Comparative Biology     Hybrid Journal   (Followers: 7, SJR: 1.911, h-index: 90)
Interacting with Computers     Hybrid Journal   (Followers: 10, SJR: 0.529, h-index: 59)
Interactive CardioVascular and Thoracic Surgery     Hybrid Journal   (Followers: 6, SJR: 0.743, h-index: 35)
Intl. Affairs     Hybrid Journal   (Followers: 56, SJR: 1.264, h-index: 53)
Intl. Data Privacy Law     Hybrid Journal   (Followers: 31)
Intl. Health     Hybrid Journal   (Followers: 5, SJR: 0.835, h-index: 15)
Intl. Immunology     Hybrid Journal   (Followers: 3, SJR: 1.613, h-index: 111)
Intl. J. for Quality in Health Care     Hybrid Journal   (Followers: 34, SJR: 1.593, h-index: 69)
Intl. J. of Constitutional Law     Hybrid Journal   (Followers: 64, SJR: 0.613, h-index: 19)
Intl. J. of Epidemiology     Hybrid Journal   (Followers: 198, SJR: 4.381, h-index: 145)
Intl. J. of Law and Information Technology     Hybrid Journal   (Followers: 5, SJR: 0.247, h-index: 8)
Intl. J. of Law, Policy and the Family     Hybrid Journal   (Followers: 30, SJR: 0.307, h-index: 15)
Intl. J. of Lexicography     Hybrid Journal   (Followers: 10, SJR: 0.404, h-index: 18)
Intl. J. of Low-Carbon Technologies     Open Access   (Followers: 1, SJR: 0.457, h-index: 12)
Intl. J. of Neuropsychopharmacology     Open Access   (Followers: 3, SJR: 1.69, h-index: 79)
Intl. J. of Public Opinion Research     Hybrid Journal   (Followers: 9, SJR: 0.906, h-index: 33)
Intl. J. of Refugee Law     Hybrid Journal   (Followers: 35, SJR: 0.231, h-index: 21)
Intl. J. of Transitional Justice     Hybrid Journal   (Followers: 12, SJR: 0.833, h-index: 12)
Intl. Mathematics Research Notices     Hybrid Journal   (Followers: 1, SJR: 2.052, h-index: 42)
Intl. Political Sociology     Hybrid Journal   (Followers: 36, SJR: 1.339, h-index: 19)
Intl. Relations of the Asia-Pacific     Hybrid Journal   (Followers: 22, SJR: 0.539, h-index: 17)
Intl. Studies Perspectives     Hybrid Journal   (Followers: 9, SJR: 0.998, h-index: 28)
Intl. Studies Quarterly     Hybrid Journal   (Followers: 44, SJR: 2.184, h-index: 68)
Intl. Studies Review     Hybrid Journal   (Followers: 20, SJR: 0.783, h-index: 38)
ISLE: Interdisciplinary Studies in Literature and Environment     Hybrid Journal   (Followers: 1, SJR: 0.155, h-index: 4)
ITNOW     Hybrid Journal   (Followers: 1, SJR: 0.102, h-index: 4)
J. of African Economies     Hybrid Journal   (Followers: 14, SJR: 0.647, h-index: 30)
J. of American History     Hybrid Journal   (Followers: 45, SJR: 0.286, h-index: 34)
J. of Analytical Toxicology     Hybrid Journal   (Followers: 14, SJR: 1.038, h-index: 60)
J. of Antimicrobial Chemotherapy     Hybrid Journal   (Followers: 14, SJR: 2.157, h-index: 149)
J. of Antitrust Enforcement     Hybrid Journal   (Followers: 1)
J. of Applied Poultry Research     Hybrid Journal   (Followers: 4, SJR: 0.563, h-index: 43)
J. of Biochemistry     Hybrid Journal   (Followers: 41, SJR: 1.341, h-index: 96)
J. of Burn Care & Research     Hybrid Journal   (Followers: 9, SJR: 0.713, h-index: 57)
J. of Chromatographic Science     Hybrid Journal   (Followers: 18, SJR: 0.448, h-index: 42)
J. of Church and State     Hybrid Journal   (Followers: 11, SJR: 0.167, h-index: 11)
J. of Communication     Hybrid Journal   (Followers: 50, SJR: 3.327, h-index: 82)
J. of Competition Law and Economics     Hybrid Journal   (Followers: 35, SJR: 0.442, h-index: 16)
J. of Complex Networks     Hybrid Journal   (Followers: 2, SJR: 1.165, h-index: 5)
J. of Computer-Mediated Communication     Open Access   (Followers: 26, SJR: 2.878, h-index: 80)
J. of Conflict and Security Law     Hybrid Journal   (Followers: 13, SJR: 0.196, h-index: 15)
J. of Consumer Research     Full-text available via subscription   (Followers: 41, SJR: 4.896, h-index: 121)
J. of Crohn's and Colitis     Hybrid Journal   (Followers: 9, SJR: 1.543, h-index: 37)
J. of Cybersecurity     Hybrid Journal   (Followers: 3)
J. of Deaf Studies and Deaf Education     Hybrid Journal   (Followers: 8, SJR: 0.69, h-index: 36)

Database : The Journal of Biological Databases and Curation
  [SJR: 1.068]   [H-I: 24]   [8 followers]
  This is an Open Access journal
   ISSN (Online) 1758-0463
   Published by Oxford University Press  [393 journals]
  • Signalling maps in cancer research: construction and data analysis

    • Authors: Kondratova M; Sompairac N, Barillot E, et al.
      Abstract: Generation and usage of high-quality molecular signalling network maps can be augmented by standardizing notations, establishing curation workflows and application of computational biology methods to exploit the knowledge contained in the maps. In this manuscript, we summarize the major aims and challenges of assembling information in the form of comprehensive maps of molecular interactions. Mainly, we share our experience gained while creating the Atlas of Cancer Signalling Network. In the step-by-step procedure, we describe the map construction process and suggest solutions for map complexity management by introducing a hierarchical modular map structure. In addition, we describe the NaviCell platform, a computational technology using Google Maps API to explore comprehensive molecular maps similar to geographical maps and explain the advantages of semantic zooming principles for map navigation. We also provide the outline to prepare signalling network maps for navigation using the NaviCell platform. Finally, several examples of cancer high-throughput data analysis and visualization in the context of comprehensive signalling maps are presented.
      PubDate: Mon, 09 Apr 2018 00:00:00 GMT
  • An entropy-reducing data representation approach for bioinformatic data

    • Authors: McCulloch A; Jauregui R, Maclean P, et al.
      Abstract: Non-semantic approaches to bioinformatic data analysis have potential relevance where semantic resources such as annotated finished reference genomes are lacking, such as in the analysis and utilisation of growing amounts of sequence data from non-model organisms, often associated with sequence-based agricultural, aqua-cultural and environmental sampling studies and commercial services. Even where rich semantic resources are available, semantic approaches to problems such as contrasting and comparing reference assemblies, and utilising multiple references in parallel to avoid reference bias, are costly and difficult to fully automate. We introduce and discuss a non-semantic data representation approach intended mainly for bioinformatic data called non-semantic labelling. Non-semantic labelling involves tensorially combining multiple kinds of model-based entropy-reducing data representation, with multiple representation models, so as to map both data and models into dual metric representation spaces, with goals of both reducing the statistical complexity of the data, and highlighting latent structure via machine learning and statistical analyses conducted within the dual representation spaces. As part of the framework, we introduce a novel algebraic abstraction of data representation mappings, and present four proof-of-concept examples of its application, to problems such as comparing and contrasting sequence assemblies, utilisation of multiple references for annotation and development of quality control diagnostics in a variety of high-throughput sequencing contexts. Database URL:
      PubDate: Thu, 05 Apr 2018 00:00:00 GMT
  • Expert curation for building network-based dynamical models: a case study
           on atherosclerotic plaque formation

    • Authors: Bekkar A; Estreicher A, Niknejad A, et al.
      Abstract: Knowledgebases play an increasingly important role in scientific research, where the expert curation of biological knowledge in forms that are amenable to computational analysis (using ontologies, for example) provides a significant added value and enables new types of computational analyses for high throughput datasets. In this work, we demonstrate how expert curation can also play a more direct role in research, by supporting the use of network-based dynamical models to study a specific biological process. This curation effort is focused on the regulatory interactions between biological entities, such as genes or proteins and compounds, which may interact with each other in a complex manner, including regulatory complexes and conditional dependencies between co-regulators. This critical information has to be captured and encoded in a computable manner, which is far beyond the current capabilities of automatically constructed networks. As a case study, we report here the prior knowledge network constructed by the sysVASC consortium to model the biological events leading to the formation of atherosclerotic plaques, during the onset of cardiovascular disease and discuss some specific examples to illustrate the main pitfalls and added value provided by the expert curation during this endeavor. Database URL:
      PubDate: Wed, 04 Apr 2018 00:00:00 GMT
  • A tutorial of diverse genome analysis tools found in the CoGe web-platform
           using Plasmodium spp. as a model

    • Authors: Castillo A; Nelson A, Haug-Baltzell A, et al.
      Abstract: Integrated platforms for storage, management, analysis and sharing of large quantities of omics data have become fundamental to comparative genomics. CoGe is an online platform designed to manage and study genomic data, enabling both data- and hypothesis-driven comparative genomics. CoGe’s tools and resources can be used to organize and analyse both publicly available and private genomic data from any species. Here, we demonstrate the capabilities of CoGe through three example workflows using 17 Plasmodium genomes as a model. Plasmodium genomes present unique challenges for comparative genomics due to their rapidly evolving and highly variable genomic AT/GC content. These example workflows are intended to serve as templates to help guide researchers who would like to use CoGe to examine diverse aspects of genome evolution. In the first workflow, trends in genome composition and amino acid usage are explored. In the second, changes in genome structure and the distribution of synonymous (Ks) and non-synonymous (Kn) substitution values are evaluated across species with different levels of evolutionary relatedness. In the third workflow, microsyntenic analyses of multigene families’ genomic organization are conducted using two Plasmodium-specific gene families—serine repeat antigen, and cytoadherence-linked asexual gene—as models. In general, these example workflows show how to achieve quick, reproducible and shareable results using the CoGe platform. We were able to replicate previously published results, as well as leverage CoGe’s tools and resources to gain additional insight into various aspects of Plasmodium genome evolution. Our results highlight the usefulness of the CoGe platform, particularly in understanding complex features of genome evolution. Database URL:
      PubDate: Tue, 03 Apr 2018 00:00:00 GMT
  • dbAMEPNI: a database of alanine mutagenic effects for
           protein–nucleic acid interactions

    • Authors: Liu L; Xiong Y, Gao H, et al.
      Abstract: Protein–nucleic acid interactions play essential roles in various biological activities such as gene regulation, transcription, DNA repair and DNA packaging. Understanding the effects of amino acid substitutions on protein–nucleic acid binding affinities can help elucidate the molecular mechanism of protein–nucleic acid recognition. Until now, no comprehensive and updated database of quantitative binding data on alanine mutagenic effects for protein–nucleic acid interactions is publicly accessible. Thus, we developed a new database of Alanine Mutagenic Effects for Protein-Nucleic Acid Interactions (dbAMEPNI). dbAMEPNI is a manually curated, literature-derived database, comprising over 577 alanine mutagenic data with experimentally determined binding affinities for protein–nucleic acid complexes. It contains several important parameters, such as dissociation constant (Kd), Gibbs free energy change (ΔΔG), experimental conditions and structural parameters of mutant residues. In addition, the database provides an extended dataset of 282 single alanine mutations with only qualitative data (or descriptive effects) of thermodynamic information. Database URL:
      PubDate: Mon, 02 Apr 2018 00:00:00 GMT
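The dbAMEPNI abstract above lists the dissociation constant (Kd) and the Gibbs free energy change (ΔΔG) as stored parameters. The two are linked by the standard thermodynamic relation ΔΔG = RT ln(Kd_mutant / Kd_wild-type). The following is a minimal sketch of that relation, not code from the database. The function name, the default temperature of 298.15 K and the kcal/mol units are assumptions made for illustration, and the database's own sign convention is not stated in the abstract.

```python
import math

# Gas constant in kcal/(mol*K); the choice of kcal/mol is an assumption.
R_KCAL = 1.987204e-3


def ddg_from_kd(kd_wild_type, kd_mutant, temperature_k=298.15):
    """Return ddG = RT * ln(Kd_mutant / Kd_wild_type) in kcal/mol.

    Hypothetical helper. With this sign convention a positive value
    means the mutation weakens binding (larger Kd).
    """
    if kd_wild_type <= 0 or kd_mutant <= 0:
        raise ValueError("dissociation constants must be positive")
    return R_KCAL * temperature_k * math.log(kd_mutant / kd_wild_type)


if __name__ == "__main__":
    # A tenfold loss of affinity (1 nM to 10 nM) at 25 degrees Celsius.
    print(round(ddg_from_kd(1e-9, 1e-8), 2))
```

Both Kd values must be in the same units, since only their ratio enters the formula.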
  • Probabilistic and machine learning-based retrieval approaches for
           biomedical dataset retrieval

    • Authors: Karisani P; Qin Z, Agichtein E.
      Abstract: The bioCADDIE dataset retrieval challenge brought together different approaches to retrieval of biomedical datasets relevant to a user’s query, expressed as a text description of a needed dataset. We describe experiments in applying a data-driven, machine learning-based approach to biomedical dataset retrieval as part of this challenge. We report on a series of experiments carried out to evaluate the performance of both probabilistic and machine learning-driven techniques from information retrieval, as applied to this challenge. Our experiments with probabilistic information retrieval methods, such as query term weight optimization, automatic query expansion and simulated user relevance feedback, demonstrate that automatically boosting the weights of important keywords in a verbose query is more effective than other methods. We also show that although there is a rich space of potential representations and features available in this domain, machine learning-based re-ranking models are not able to improve on probabilistic information retrieval techniques with the currently available training data. The models and algorithms presented in this paper can serve as a viable implementation of a search engine to provide access to biomedical datasets. The retrieval performance is expected to be further improved by using additional training data that is created by expert annotation, or gathered through usage logs, clicks and other processes during natural operation of the system. Database URL:
      PubDate: Wed, 28 Mar 2018 00:00:00 GMT
  • Biopanning data bank 2018: hugging next generation phage display

    • Authors: He B; Jiang L, Duan Y, et al.
      Abstract: The 2018 update of the biopanning data bank (BDB) stores phage display data sequenced by Sanger sequencing and next generation sequencing technologies. In this work, we upgraded the database with more biopanning data sets and several new features, including (i) incorporation of next generation biopanning data and the unselected population where the target is not determined and the round of screening is zero; (ii) addition of sequencing information; (iii) improvement of browsing and searching systems and 3D chemical structure viewer; (iv) integration of standalone tools for target-unrelated peptides analysis within conventional phage display and next generation phage display (NGPD) data. In the current version of BDB (released on 19 January 2018), the database houses 3291 sets of biopanning data collected from 1540 published articles, including 95 NGPD data sets and 3196 traditional biopanning data sets. The BDB database serves as an important and comprehensive resource for developing peptide ligands. Database URL: The BDB database is available at
      PubDate: Tue, 27 Mar 2018 00:00:00 GMT
  • CITGeneDB: a comprehensive database of human and mouse genes enhancing or
           suppressing cold-induced thermogenesis validated by perturbation
           experiments in mice

    • Authors: Li J; Deng S, Wei G, et al.
      Abstract: Cold-induced thermogenesis increases energy expenditure and can reduce body weight in mammals, so the genes involved in it are thought to be potential therapeutic targets for treating obesity and diabetes. In the quest for more effective therapies, a great deal of research has been conducted to elucidate the regulatory mechanism of cold-induced thermogenesis. Over the last decade, a large number of genes that can enhance or suppress cold-induced thermogenesis have been discovered, but a comprehensive list of these genes is lacking. To fill this gap, we examined all of the annotated human and mouse genes and curated those demonstrated to enhance or suppress cold-induced thermogenesis by in vivo or ex vivo experiments in mice. The results of this highly accurate and comprehensive annotation are hosted on a database called CITGeneDB, which includes a searchable web interface to facilitate broad public use. The database will be updated as new genes are found to enhance or suppress cold-induced thermogenesis. It is expected that CITGeneDB will be a valuable resource in future explorations of the molecular mechanism of cold-induced thermogenesis, helping pave the way for new obesity and diabetes treatments. Database URL:
      PubDate: Fri, 23 Mar 2018 00:00:00 GMT
  • GEOMetaCuration: a web-based application for accurate manual curation of
           Gene Expression Omnibus metadata

    • Authors: Li Z; Li J, Yu P.
      Abstract: Metadata curation has become increasingly important for biological discovery and biomedical research because a large amount of heterogeneous biological data is currently freely available. To facilitate efficient metadata curation, we developed an easy-to-use web-based curation application, GEOMetaCuration, for curating the metadata of Gene Expression Omnibus datasets. It can eliminate mechanical operations that consume precious curation time and can help coordinate curation efforts among multiple curators. It improves the curation process by introducing various features that are critical to metadata curation, such as a back-end curation management system and a curator-friendly front-end. The application is based on a commonly used web development framework of Python/Django and is open-sourced under the GNU General Public License V3. GEOMetaCuration is expected to benefit the biocuration community and to contribute to computational generation of biological insights using large-scale biological data. An example use case can be found at the demo website: URL:
      PubDate: Fri, 23 Mar 2018 00:00:00 GMT
  • Improved ontology-based similarity calculations using a study-wise
           annotation model

    • Authors: Köhler S.
      Abstract: A typical use case of ontologies is the calculation of similarity scores between items that are annotated with classes of the ontology. For example, in differential diagnostics and disease gene prioritization, the human phenotype ontology (HPO) is often used to compare a query phenotype profile against gold-standard phenotype profiles of diseases or genes. The latter have long been constructed as flat lists of ontology classes, which, as we show in this work, can be improved by exploiting existing structure and information in annotation datasets or full text disease descriptions. We derive a study-wise annotation model of diseases and genes and show that this can improve the performance of semantic similarity measures. Inferred weights of individual annotations are one reason for this improvement, but more importantly using the study-wise structure further boosts the results of the algorithms according to precision-recall analyses. We test the study-wise annotation model for diseases annotated with classes from the HPO and for genes annotated with gene ontology (GO) classes. We incorporate this annotation model into similarity algorithms and show how this leads to improved performance. This work adds weight to the need for enhancing simple list-based representations of disease or gene annotations. We show how study-wise annotations can be automatically derived from full text summaries of disease descriptions and from the annotation data provided by the GO Consortium and how semantic similarity measures can utilize this extended annotation model. Database URL:
      PubDate: Fri, 23 Mar 2018 00:00:00 GMT
  • TISSUES 2.0: an integrative web resource on mammalian tissue expression

    • Authors: Palasca O; Santos A, Stolte C, et al.
      Abstract: Database (2018), doi: 10.1093/database/bay003
      PubDate: Fri, 16 Mar 2018 00:00:00 GMT
  • Finding relevant biomedical datasets: the UC San Diego solution for the
           bioCADDIE Retrieval Challenge

    • Authors: Wei W; Ji Z, He Y, et al.
      Abstract: The number and diversity of biomedical datasets grew rapidly in the last decade. A large number of datasets are stored in various repositories, with different formats. Existing dataset retrieval systems lack the capability of cross-repository search. As a result, users spend time searching datasets in known repositories, and they typically do not find new repositories. The biomedical and healthcare data discovery index ecosystem (bioCADDIE) team organized a challenge to solicit new indexing and searching strategies for retrieving biomedical datasets across repositories. We describe the work of one team that built a retrieval pipeline and examined its performance. The pipeline used online resources to supplement dataset metadata, automatically generated queries from users’ free-text questions, produced high-quality retrieval results and achieved the highest inferred Normalized Discounted Cumulative Gain among competitors. The results showed that it is a promising solution for cross-database, cross-domain and cross-repository biomedical dataset retrieval.Database URL:
      PubDate: Fri, 16 Mar 2018 00:00:00 GMT
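      The abstract above reports results as inferred Normalized Discounted Cumulative Gain (infNDCG), which estimates NDCG from incomplete relevance judgments. As a rough illustration of the underlying metric only, the sketch below computes plain NDCG over a fully judged ranking; it is not the challenge's evaluation code, and the function names are hypothetical.

```python
import math


def dcg(gains):
    """Discounted cumulative gain: the gain at 1-based rank i is divided by log2(i + 1)."""
    return sum(g / math.log2(rank + 1) for rank, g in enumerate(gains, start=1))


def ndcg(ranked_gains, k=None):
    """Plain NDCG for a list of graded relevance values given in ranked order.

    Assumption: the ideal ranking is built from the same judged list, so every
    relevant item is assumed to appear somewhere in ranked_gains.
    """
    k = len(ranked_gains) if k is None else k
    ideal_dcg = dcg(sorted(ranked_gains, reverse=True)[:k])
    return dcg(ranked_gains[:k]) / ideal_dcg if ideal_dcg > 0 else 0.0
```

      A ranking that already lists the most relevant datasets first scores 1.0, and pushing relevant items down the list lowers the score.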
  • PvaxDB: a comprehensive structural repository of Plasmodium vivax proteome

    • Authors: Singh A; Kaushik R, Kuntal H, et al.
      Abstract: The severity of malaria caused by Plasmodium vivax worldwide and its resistance to the available general antimalarial drugs have created an urgent need for comprehensive insight into its biology and biochemistry in order to develop novel vaccines and therapeutics. P. vivax comprises 5392 proteins, most of them predicted, of which 4211 are soluble proteins, and 2205 of these belong to the blood and liver stages of the malarial cycle. Presently available public resources report functional annotation (gene ontology) for only 28% (627 proteins) of the enzymatic soluble proteins, and experimental structures have been determined for only 42 proteins of the P. vivax proteome. Given this severe paucity of structural and functional data, we have generated structures of 2205 soluble proteins, validated them thoroughly, identified their binding pockets (including active sites) and annotated their function, increasing the coverage from the existing 28% to 100%. We have pooled all this information and created a database, PvaxDB, which furnishes extensive sequence, structure, ligand binding site and functional information. We believe PvaxDB could be helpful in identifying novel protein drug targets and expediting the development of new drugs to combat malaria. This is also the first attempt to create a reliable, comprehensive computational structural repository of all the soluble proteins of P. vivax. Database URL:
      PubDate: Wed, 14 Mar 2018 00:00:00 GMT
  • Baseline and extensions approach to information retrieval of complex
           medical data: Poznan's approach to the bioCADDIE 2016

    • Authors: Cieslewicz A; Dutkiewicz J, Jedrzejek C.
      Abstract: Information retrieval from biomedical repositories has become a challenging task because of their increasing size and complexity. To facilitate research aimed at improving the search for relevant documents, various information retrieval challenges have been launched. In this article, we present the improved medical information retrieval systems designed by Poznan University of Technology and Poznan University of Medical Sciences as a contribution to the bioCADDIE 2016 challenge, a task focusing on information retrieval from a collection of 794 992 datasets generated from 20 biomedical repositories. The system developed by our team utilizes the Terrier 4.2 search platform enhanced by a query expansion method using word embeddings. This approach, after post-challenge modifications and improvements (with particular regard to assigning proper weights for original and expanded terms), allowed us to achieve the second best infNDCG measure (0.4539) compared with the challenge results, and an infAP of 0.3978. This demonstrates that proper utilization of word embeddings can be a valuable addition to the information retrieval process. Some analysis is provided on related work involving other bioCADDIE contributions. We discuss the possibility of improving our results by using better word embedding schemes to find candidates for query expansion. Database URL:
      PubDate: Mon, 12 Mar 2018 00:00:00 GMT
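      The abstract above describes query expansion with word embeddings and separate weights for original and expanded terms. A minimal sketch of that general idea follows; the toy vectors, the weight values and the function names are assumptions made for illustration and do not reproduce the authors' Terrier-based system.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand_query(terms, embeddings, top_n=1, original_weight=1.0, expansion_weight=0.3):
    """Return {term: weight}: original terms keep full weight, and each one
    contributes its top_n nearest embedding neighbours at a reduced weight
    scaled by cosine similarity."""
    weighted = {t: original_weight for t in terms}
    for t in terms:
        if t not in embeddings:
            continue  # no vector available, so nothing to expand from
        neighbours = sorted(
            ((cosine(embeddings[t], vec), cand)
             for cand, vec in embeddings.items() if cand not in terms),
            reverse=True,
        )
        for sim, cand in neighbours[:top_n]:
            weighted[cand] = max(weighted.get(cand, 0.0), expansion_weight * sim)
    return weighted


# Hypothetical 3-dimensional vectors; real embeddings have hundreds of dimensions.
TOY_EMBEDDINGS = {
    "cancer": [0.90, 0.10, 0.00],
    "tumor": [0.85, 0.15, 0.05],
    "mouse": [0.00, 0.20, 0.90],
    "murine": [0.05, 0.25, 0.85],
}
```

      With these toy vectors, expanding the query "cancer" adds "tumor" at a weight just below 0.3 while "cancer" keeps weight 1.0, so an expanded term can never outweigh the user's own term.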
  • SPTEdb: a database for transposable elements in salicaceous plants

    • Authors: Yi F; Jia Z, Xiao Y, et al.
      Abstract: Although transposable elements (TEs) play significant roles in the structural, functional and evolutionary dynamics of salicaceous plant genomes, the accurate identification, definition and classification of TEs are still inadequate. In this study, we identified 18 393 TEs from Populus trichocarpa, Populus euphratica and Salix suchowensis using a combination of signature-based, similarity-based and de novo methods, and annotated them into 1621 families. A comprehensive and user-friendly web-based database, SPTEdb, was constructed and made available to researchers. SPTEdb enables users to browse, retrieve and download TE sequences from the database. Several analysis tools, including BLAST, HMMER, GetORF and Cut sequence, were also integrated into SPTEdb to help users mine the TE data easily and effectively. In summary, SPTEdb will facilitate the study of TE biology and functional genomics in salicaceous plants. Database URL:
      PubDate: Fri, 09 Mar 2018 00:00:00 GMT
  • YummyData: providing high-quality open life science data

    • Authors: Yamamoto Y; Yamaguchi A, Splendiani A.
      Abstract: Many life science datasets are now available via Linked Data technologies, meaning that they are represented in a common format (the Resource Description Framework), and are accessible via standard APIs (SPARQL endpoints). While this is an important step toward developing an interoperable bioinformatics data landscape, it also creates a new set of obstacles, as it is often difficult for researchers to find the datasets they need. Different providers frequently offer the same datasets, with different levels of support: as well as having more or less up-to-date data, some providers add metadata to describe the content, structures, and ontologies of the stored datasets while others do not. We currently lack a place where researchers can go to easily assess datasets from different providers in terms of metrics such as service stability or metadata richness. We also lack a space for collecting feedback and improving data providers’ awareness of user needs. To address this issue, we have developed YummyData, which consists of two components. One periodically polls a curated list of SPARQL endpoints, monitoring the states of their Linked Data implementations and content. The other presents the information measured for the endpoints and provides a forum for discussion and feedback. YummyData is designed to improve the findability and reusability of life science datasets provided as Linked Data and to foster its adoption. It is freely accessible at URL:
      PubDate: Fri, 09 Mar 2018 00:00:00 GMT
  • The SNPcurator: literature mining of enriched SNP-disease associations

    • Authors: Tawfik N; Spruit M.
      Abstract: The uniqueness of each human's genetic structure has motivated the shift from the current practice of medicine to a more tailored one. This personalized medicine revolution would not be possible today without the genetics data collected from genome-wide association studies (GWASs) that investigate the relation between different phenotypic traits and single-nucleotide polymorphisms (SNPs). The huge increase in the volume of published literature poses a challenge to the conventional manual curation process, which is becoming more and more expensive. This research aims at automatically extracting the SNP associations of any given disease together with their reported statistical significance (P-value) and odds ratio, as well as cohort information such as size and ethnicity. Our evaluation illustrates that SNPcurator was able to replicate a large number of SNP-disease associations that were also reported in the NHGRI-EBI Catalog of published GWASs. SNPcurator was also tested by eight external genetics experts, who queried the system to examine diseases of their choice, and was found to be efficient and satisfactory. We conclude that the text-mining-based system has great potential for helping researchers and scientists, especially in their preliminary genetics research. SNPcurator is publicly available at URL:
      PubDate: Thu, 08 Mar 2018 00:00:00 GMT
  • NDDVD: an integrated and manually curated Neurodegenerative Diseases
           Variation Database

    • Authors: Yang Y; Xu C, Liu X, et al.
      Abstract: Neurodegenerative diseases (NDDs) are associated with genetic variations including point substitutions, copy number alterations, insertions and deletions. At present, a few genetic variation repositories have been created for some individual NDDs; however, these databases need to be integrated and expanded to cover all NDDs for systems biological investigation. Here we build a relational database, termed NDDVD, to integrate all the variations of NDDs using the Leiden Open Variation Database (LOVD) platform. The items in NDDVD are collected manually from PubMed or extracted from existing variation databases. The cross-disease database includes over 6374 genetic variations of 289 genes associated with 37 different NDDs. The patterns, conservation and biological functions of variations in different NDDs are statistically compared, and a user-friendly interface is provided for NDDVD at:
      PubDate: Mon, 05 Mar 2018 00:00:00 GMT
  • Micropublication: incentivizing community curation and placing unpublished
           data into the public domain

    • Authors: Raciti D; Yook K, Harris T, et al.
      Abstract: Large volumes of data generated by research laboratories coupled with the required effort and cost of curation present a significant barrier to inclusion of these data in authoritative community databases. Further, many publicly funded experimental observations remain invisible to curation simply because they are never published: results often do not fit within the scope of a standard publication; trainee-generated data are forgotten when the experimenter (e.g. student, post-doc) leaves the lab; results are omitted from science narratives due to publication bias where certain results are considered irrelevant for the publication. While authors are in the best position to curate their own data, they face a steep learning curve to ensure that appropriate referential tags, metadata, and ontologies are applied correctly to their observations, a task sometimes considered beyond the scope of their research and other numerous responsibilities. Getting researchers to adopt a new system of data reporting and curation requires a fundamental change in behavior among all members of the research community. To solve these challenges, we have created a novel scholarly communication platform that captures data from researchers and directly delivers them to information resources via Micropublication. This platform incentivizes authors to publish their unpublished observations along with associated metadata by providing a deliberately fast and lightweight but still peer-reviewed process that results in a citable publication. Our long-term goal is to develop a data ecosystem that improves reproducibility and accountability of publicly funded research and in turn accelerates both basic and translational discovery.Database URL:
      PubDate: Fri, 02 Mar 2018 00:00:00 GMT
  • BioDataome: a collection of uniformly preprocessed and automatically
           annotated datasets for data-driven biology

    • Authors: Lakiotaki K; Vorniotakis N, Tsagris M, et al.
      Abstract: The biotechnology revolution generates omics data at an exponentially growing pace. Biological data mining therefore demands automatic, 'high quality' curation efforts to organize biomedical knowledge into online databases. BioDataome is a database of uniformly preprocessed and disease-annotated omics data that aims to promote and accelerate the reuse of public data. We followed the same preprocessing pipeline for each biological mart (microarray gene expression, RNA-Seq gene expression and DNA methylation) to produce datasets ready for downstream analysis, and automatically annotated them with disease-ontology terms. We also flag datasets that share common samples and automatically discover control samples in case-control studies. Currently, BioDataome includes ∼5600 datasets and ∼260 000 samples spanning ∼500 diseases, and can easily be used in large-scale experiments and meta-analyses. All datasets are publicly available for querying and downloading via the BioDataome web application. We demonstrate BioDataome's utility by presenting exploratory data analysis examples. We have also developed the BioDataome R package, found at: URL:
      PubDate: Fri, 02 Mar 2018 00:00:00 GMT
  • AntiTbPdb: a knowledgebase of anti-tubercular peptides

    • Authors: Usmani S; Kumar R, Kumar V, et al.
      Abstract: Tuberculosis is a global menace, caused by Mycobacterium tuberculosis, responsible for millions of premature deaths every year. In the era of drug-resistant tuberculosis, peptide-based therapeutics may provide an alternative to small-molecule drugs. To create the knowledgebase AntiTbPdb, experimentally validated anti-tubercular and anti-mycobacterial peptides were compiled from the literature. We curated 10 652 research articles and 35 patents to extract anti-tubercular peptides and annotated these peptides manually. The knowledgebase has 1010 entries, each providing extensive information about an anti-tubercular peptide such as its sequence, chemical modification, chirality, nature and source of origin. The tertiary structures of these anti-tubercular peptides, containing natural as well as chemically modified residues, were predicted using PEPstrMOD and I-TASSER. In addition to structural information, the database maintains other peptide properties such as physicochemical properties. Numerous web-based tools have been integrated for data retrieval, browsing, sequence similarity search and peptide mapping. To assist a wide range of users, we developed a responsive website suitable for smartphone, tablet and desktop. Database URL:
      PubDate: Wed, 28 Feb 2018 00:00:00 GMT
  • miRwayDB: a database for experimentally validated microRNA-pathway
           associations in pathophysiological conditions

    • Authors: Das S; Saha P, Chakravorty N.
      Abstract: MicroRNAs (miRNAs) are well-known as key regulators of diverse biological pathways. A series of experimental evidences have shown that abnormal miRNA expression profiles are responsible for various pathophysiological conditions by modulating genes in disease associated pathways. In spite of the rapid increase in research data confirming such associations, scientists still do not have access to a consolidated database offering these miRNA-pathway association details for critical diseases. We have developed miRwayDB, a database providing comprehensive information of experimentally validated miRNA-pathway associations in various pathophysiological conditions utilizing data collected from published literature. To the best of our knowledge, it is the first database that provides information about experimentally validated miRNA mediated pathway dysregulation as seen specifically in critical human diseases and hence indicative of a cause-and-effect relationship in most cases. The current version of miRwayDB collects an exhaustive list of miRNA-pathway association entries for 76 critical disease conditions by reviewing 663 published articles. Each database entry contains complete information on the name of the pathophysiological condition, associated miRNA(s), experimental sample type(s), regulation pattern (up/down) of miRNA, pathway association(s), targeted member of dysregulated pathway(s) and a brief description. In addition, miRwayDB provides miRNA, gene and pathway score to evaluate the role of a miRNA regulated pathways in various pathophysiological conditions. The database can also be used for other biomedical approaches such as validation of computational analysis, integrated analysis and prediction of computational model. It also offers a submission page to submit novel data from recently published studies. We believe that miRwayDB will be a useful tool for miRNA research community.Database URL:
      PubDate: Wed, 28 Feb 2018 00:00:00 GMT
  • Prevention of data duplication for high throughput sequencing repositories

    • Authors: Gabdank I; Chan E, Davidson J, et al.
      Abstract: Prevention of unintended duplication is one of the ongoing challenges many databases have to address. Working with high-throughput sequencing data, the complexity of that challenge increases with the complexity of the definition of a duplicate. In a computational data model, a data object represents a real entity like a reagent or a biosample. This representation is similar to how a card represents a book in a paper library catalog. Duplicated data objects not only waste storage, they can mislead users into assuming the model represents more than the single entity. Even if it is clear that two objects represent a single entity, data duplication opens the door to potential inconsistencies between the objects since the content of the duplicated objects can be updated independently, allowing divergence of the metadata associated with the objects. Analogously to a situation in which a catalog in a paper library would contain by mistake two cards for a single copy of a book. If these cards are listing simultaneously two different individuals as current book borrowers, it would be difficult to determine which borrower (out of the two listed) actually has the book. Unfortunately, in a large database with multiple submitters, unintended duplication is to be expected. In this article, we present three principal guidelines the Encyclopedia of DNA Elements (ENCODE) Portal follows in order to prevent unintended duplication of both actual files and data objects: definition of identifiable data objects (I), object uniqueness validation (II) and de-duplication mechanism (III). In addition to explaining our modus operandi, we elaborate on the methods used for identification of sequencing data files. Comparison of the approach taken by the ENCODE Portal vs other widely used biological data repositories is provided.Database URL:
      PubDate: Tue, 27 Feb 2018 00:00:00 GMT
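      The abstract above lists object uniqueness validation as one of its guidelines and mentions methods for identifying sequencing data files. One common way to identify a file by its content is a checksum; the sketch below assumes an MD5 digest and a hypothetical in-memory registry, neither of which is stated in the abstract, and it is not the ENCODE Portal's implementation.

```python
import hashlib


class FileRegistry:
    """Hypothetical registry that rejects a second accession for identical content."""

    def __init__(self):
        self._accession_by_digest = {}

    def register(self, accession, content):
        """Record `content` (bytes) under `accession`; raise if another accession
        already holds a file with the same MD5 digest."""
        digest = hashlib.md5(content).hexdigest()
        existing = self._accession_by_digest.get(digest)
        if existing is not None and existing != accession:
            raise ValueError(f"{accession} duplicates {existing} (md5 {digest})")
        self._accession_by_digest[digest] = accession
        return digest
```

      Registering the same bytes under two different accessions raises an error at submission time, which is the point at which a duplicate is cheapest to catch. A real repository would compute the digest by streaming the file and would keep the index in a database rather than in memory.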
  • Updated regulation curation model at the Saccharomyces Genome Database

    • Authors: Engel S; Skrzypek M, Hellerstedt S, et al.
      Abstract: The Saccharomyces Genome Database (SGD) provides comprehensive, integrated biological information for the budding yeast Saccharomyces cerevisiae, along with search and analysis tools to explore these data, enabling the discovery of functional relationships between sequence and gene products in fungi and higher organisms. We have recently expanded our data model for regulation curation to address regulation at the protein level in addition to transcription, and are presenting the expanded data on the ‘Regulation’ pages at SGD. These pages include a summary describing the context under which the regulator acts, manually curated and high-throughput annotations showing the regulatory relationships for that gene and a graphical visualization of its regulatory network and connected networks. For genes whose products regulate other genes or proteins, the Regulation page includes Gene Ontology enrichment analysis of the biological processes in which those targets participate. For DNA-binding transcription factors, we also provide other information relevant to their regulatory function, such as DNA binding site motifs and protein domains. As with other data types at SGD, all regulatory relationships and accompanying data are available through YeastMine, SGD’s data warehouse based on InterMine.Database URL:
      PubDate: Tue, 27 Feb 2018 00:00:00 GMT
  • The NCBI BioCollections Database

    • Authors: Sharma S; Ciufo S, Starchenko E, et al.
      Abstract: The rapidly growing set of GenBank submissions includes sequences that are derived from vouchered specimens. These are associated with culture collections, museums, herbaria and other natural history collections, both living and preserved. Correct identification of the specimens studied, along with a method to associate the sample with its institution, is critical to the outcome of related studies and analyses. The National Center for Biotechnology Information BioCollections Database was established to allow the association of specimen vouchers and related sequence records to their home institutions. This process also allows cross-linking from the home institution for quick identification of all records originating from each collection.Database URL:
      PubDate: Fri, 23 Feb 2018 00:00:00 GMT
  • TransAtlasDB: an integrated database connecting expression data, metadata
           and variants

    • Authors: Adetunji M; Lamont S, Schmidt C.
      Abstract: High-throughput transcriptome sequencing (RNAseq) is the universally applied method for target-free transcript identification and gene expression quantification, and it generates huge amounts of data. Difficulty in accessing such data and interpreting results can be a major impediment to formulating suitable hypotheses; an innovative storage solution that addresses limitations such as hard disk storage requirements, efficiency and reproducibility is therefore paramount. With a uniform data storage and retrieval mechanism, various data can be compared and easily investigated. We present TransAtlasDB, a system that incorporates a hybrid architecture of both relational and NoSQL databases for fast and efficient storage, processing and querying of large datasets from transcript expression analysis with corresponding metadata, as well as gene-associated variants (such as SNPs) and their predicted gene effects. TransAtlasDB provides a data model for accurate storage of the large amount of data derived from RNAseq analysis, together with two ways of interacting with the database: command-line data management workflows, written in Perl, that simplify the storage and manipulation of the massive amounts of data generated from RNAseq analysis, and a web interface. The database application is currently modeled to handle analysis data from agricultural species and will be expanded to include more species groups. Overall, TransAtlasDB aims to serve as an accessible repository for the large, complex result files derived from RNAseq gene expression profiling and variant analysis. Database URL:
      PubDate: Fri, 23 Feb 2018 00:00:00 GMT
  • AllerGAtlas 1.0: a human allergy-related genes database

    • Authors: Liu J; Liu Y, Wang D, et al.
      Abstract: Allergy is a detrimental hypersensitive response to innocuous environmental antigens, caused by the interaction between environmental factors and multiple genetic predispositions. In the past decades, hundreds of allergy-related genes have been identified that illuminate the epidemiology and pathogenesis of allergic diseases and are associated with better endophenotypes, novel biomarkers, early-life risk factors and individual differences in treatment responses. However, information on these allergy-related genes is dispersed across thousands of publications. Here, we present AllerGAtlas, a manually curated database of human allergy-related genes, which contains 1195 well-annotated human allergy-related genes identified by text mining and manual curation. AllerGAtlas will be a valuable bioinformatics resource for searching human allergy-related genes and exploring their functions in allergy for experimental research. Database URL:
      PubDate: Thu, 22 Feb 2018 00:00:00 GMT
  • dbDEPC 3.0: the database of differentially expressed proteins in human
           cancer with multi-level annotation and drug indication

    • Authors: Yang Q; Zhang Y, Cui H, et al.
      Abstract: Proteins are major effectors of biological functions, and differentially expressed proteins (DEPs) are widely reported as biomarkers in pathological mechanism, prognosis prediction as well as treatment targeting in cancer research. High-throughput technology of mass spectrometry (MS) has identified large amounts of DEPs in human cancers. Through mining published researches with detailed experiment information, dbDEPC was the first database aimed to provide a systematic resource for the storage and query of the DEPs generated by MS in cancer research. It was updated to dbDEPC 2.0 in 2012. Here, we provide another updated version of dbDEPC, with improvement of database contents and enhanced web interface. The current version of dbDEPC 3.0 contains 11 669 unique DEPs in 26 different cancer types. Multi-level annotations of DEPs have been firstly introduced this time, including cancer-related peptide amino acid variations, post-translational modifications and drug information. Moreover, these multi-level annotations can be displayed in the biological networks, which can benefit integrative analysis. Finally, an online enrichment analysis tool has been developed, to support a KEGG enrichment analysis and to browse the relationship among interested protein list and known DEPs in KEGG pathways. In summary, dbDEPC 3.0 provides a comprehensive resource for accessing integrated and highly annotated DEPs in human cancer.Database URL:
      PubDate: Thu, 22 Feb 2018 00:00:00 GMT
  • Identification of errors in the IEDB using ontologies

    • Authors: Vita R; Overton J, Peters B.
      Abstract: The Immune Epitope Database (IEDB) is a free online resource that has manually curated over 18 500 references from the scientific literature. Our database presents experimental data relating to the recognition of immune epitopes by the adaptive immune system in a structured, searchable manner. In order to be consistent and accurate in our data representation across many different journals, authors and curators, we have implemented several quality control measures, such as curation rules, controlled vocabularies and links to external ontologies and other resources. Ontologies and other resources have greatly benefited the IEDB through improved search interfaces, easier curation practices, interoperability between the IEDB and other databases and the identification of errors within our dataset. Here, we will elaborate on how ontology mapping and usage can be used to find and correct errors in a manually curated database.Database URL:
      PubDate: Thu, 22 Feb 2018 00:00:00 GMT
  • GAN: a platform of genomics and genetics analysis and application in Nicotiana

    • Authors: Yang S; Zhang X, Li H, et al.
      Abstract: Nicotiana is an important Solanaceae genus and plays a significant role in modern biological research. Massive Nicotiana biological data have emerged from in-depth genomics and genetics studies. To move from big data to big discovery, large-scale analysis and application with new platforms is critical. Based on this data accumulation, a comprehensive platform of Genomics and Genetics Analysis and Application in Nicotiana (GAN) has been developed and is publicly available. GAN consists of four main sections. (i) Sources: a total of 5267 germplasm lines, along with detailed descriptions of associated characteristics, are available on the Germplasm page, which can be queried using eight different inquiry modes. Seven fully sequenced species with accompanying sequences and detailed genomic annotation are available on the Genomics page. (ii) Genetics: detailed descriptions of 10 genetic linkage maps constructed from different parents, 2239 KEGG metabolic pathway maps and 209 945 gene families across all catalogued genes, along with two co-linearity maps combining N. tabacum with available tomato and potato linkage maps, are available here. Furthermore, 3 963 119 genome-SSRs, 10 621 016 SNPs, 12 388 PIPs and 102 895 reverse transcription-polymerase chain reaction primers can be used and searched on the Markers page. (iii) Tools: the genome browser JBrowse and five online bioinformatics tools, Blast, Primer3, SSR-detect, Nucl-Protein and E-PCR, are provided on the JBrowse and Tools pages. (iv) Auxiliary: all the datasets are shown on a Statistics page and are available for download on a Download page. In addition, the user's manual is provided on a Manual page in English and Chinese. GAN provides a user-friendly Web interface for searching, browsing and downloading the genomics and genetics datasets in Nicotiana. As far as we can ascertain, GAN is the most comprehensive source of bio-data available, and the most applicable resource for breeding, gene mapping, gene cloning, the study of the origin and evolution of polyploidy, and related studies in Nicotiana. Database URL:
      PubDate: Wed, 21 Feb 2018 00:00:00 GMT
  • FAIR principles and the IEDB: short-term improvements and a long-term
           vision of OBO-foundry mediated machine-actionable interoperability

    • Authors: Vita R; Overton J, Mungall C, et al.
      Abstract: The Immune Epitope Database (IEDB) has the mission of making published experimental data relating to the recognition of immune epitopes easily available to the scientific public. By presenting curated data in a searchable database, we have liberated it from the tables and figures of journal articles, making it more accessible and usable by immunologists. Recently, the principles of Findability, Accessibility, Interoperability and Reusability have been formulated as goals that data repositories should meet to enhance the usefulness of their data holdings. Here we examine how the IEDB complies with these principles and identify broad areas of success, but also areas for improvement. We describe short-term improvements to the IEDB that are being implemented now, as well as a long-term vision of true 'machine-actionable interoperability', which we believe will require community agreement on standardization of knowledge representation that can be built on top of the shared use of ontologies.
      PubDate: Mon, 19 Feb 2018 00:00:00 GMT
  • miRToolsGallery: a tag-based and rankable microRNA bioinformatics
           resources database portal

    • Authors: Chen L; Heikkinen L, Wang C, et al.
      Abstract: Hundreds of bioinformatics tools have been developed for MicroRNA (miRNA) investigations including those used for identification, target prediction, structure and expression profile analysis. However, finding the correct tool for a specific application requires the tedious and laborious process of locating, downloading, testing and validating the appropriate tool from a group of nearly a thousand. In order to facilitate this process, we developed a novel database portal named miRToolsGallery. We constructed the portal by manually curating > 950 miRNA analysis tools and resources. In the portal, a query to locate the appropriate tool is expedited by being searchable, filterable and rankable. The ranking feature is vital to quickly identify and prioritize the more useful from the obscure tools. Tools are ranked via different criteria including the PageRank algorithm, date of publication, number of citations, average of votes and number of publications. miRToolsGallery provides links and data for the comprehensive collection of currently available miRNA tools with a ranking function which can be adjusted using different criteria according to specific requirements.Database URL:
      PubDate: Mon, 19 Feb 2018 00:00:00 GMT
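The adjustable ranking described in this abstract can be pictured with a minimal sketch. The tool records, field names and the `rank_tools` helper below are hypothetical and are not taken from miRToolsGallery; the sketch only shows how one list can be re-sorted by a selectable criterion (PageRank-based ranking is not covered here).

```python
from dataclasses import dataclass


@dataclass
class Tool:
    name: str
    citations: int
    average_vote: float
    year: int  # year of publication


# Hypothetical example records, invented for illustration only.
TOOLS = [
    Tool("tool_a", citations=120, average_vote=3.5, year=2012),
    Tool("tool_b", citations=45, average_vote=4.8, year=2016),
    Tool("tool_c", citations=300, average_vote=4.1, year=2009),
]

# Each criterion maps a tool to the value it is ranked by.
CRITERIA = {
    "citations": lambda tool: tool.citations,
    "votes": lambda tool: tool.average_vote,
    "date": lambda tool: tool.year,
}


def rank_tools(tools, criterion):
    """Return the tools sorted from highest to lowest on the chosen criterion."""
    if criterion not in CRITERIA:
        raise ValueError(f"unknown criterion: {criterion}")
    return sorted(tools, key=CRITERIA[criterion], reverse=True)


if __name__ == "__main__":
    for criterion in CRITERIA:
        print(criterion, [tool.name for tool in rank_tools(TOOLS, criterion)])
```

Keeping the criteria in a dictionary means a new ranking rule can be added without touching the sorting code.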
  • Fungal Stress Database (FSD)––a repository of fungal stress
           physiological data

    • Authors: Orosz E; van de Wiele N, Emri T, et al.
      Abstract: The construction of the Fungal Stress Database (FSD) was initiated and fueled by two major goals. First, some outstandingly important groups of filamentous fungi, including the aspergilli, possess remarkable capabilities to adapt to a wide spectrum of environmental stress conditions, but the underlying mechanisms of this stress tolerance remain to be elucidated. Second, the lack of any satisfactory interlaboratory standardization of stress assays, e.g. the widely used stress agar plate experiments, often hinders the direct comparison and discussion of stress physiological data gained for various fungal species by different research groups. In order to overcome these difficulties and to promote multilevel, e.g. combined comparative physiology-based and comparative genomics-based, stress research in filamentous fungi, we constructed FSD, which currently stores 1412 photos of Aspergillus colonies grown under precisely defined stress conditions. This study involved a total of 18 Aspergillus strains representing 17 species, with two different strains for Aspergillus niger, and covered six different stress conditions. Stress treatments were selected considering the frequency of various stress tolerance studies published in the last decade in the aspergilli and included oxidative (H2O2, menadione sodium bisulphite), high-osmolarity (NaCl, sorbitol), cell wall integrity (Congo Red) and heavy metal (CdCl2) stress exposures. In the future, we would like to expand this database to accommodate further fungal species and stress treatments. URL:
      PubDate: Mon, 12 Feb 2018 00:00:00 GMT
  • OliveNet™: a comprehensive library of compounds from Olea europaea

    • Authors: Bonvino N; Liang J, McCord E, et al.
      Abstract: Accumulated epidemiological, clinical and experimental evidence has indicated the beneficial health effects of the Mediterranean diet, which is typified by the consumption of virgin olive oil (VOO) as a main source of dietary fat. At the cellular level, compounds derived from various olive (Olea europaea) matrices have demonstrated potent antioxidant and anti-inflammatory effects, which are thought to account, at least in part, for their biological effects. Research efforts are expanding into the characterization of compounds derived from Olea europaea; however, the considerable diversity and complexity of the vast array of chemical compounds have made their precise identification and quantification challenging. As such, only a relatively small subset of olive-derived compounds has been explored for their biological activity and potential health effects to date. Although there is adequate information describing the identification or isolation of olive-derived compounds, this information is not easily searchable, especially when attempting to acquire chemical or biological properties. Therefore, we have created the OliveNet™ database containing a comprehensive catalogue of compounds identified from matrices of the olive, including the fruit, leaf and VOO, as well as in the wastewater and pomace accrued during oil production. From a total of 752 compounds, chemical analysis was sufficient for 676 individual compounds, which have been included in the database. The database is curated and comprehensively referenced, containing information for the 676 compounds, which are divided into 13 main classes and 47 subclasses. Importantly, with respect to current research trends, the database includes 222 olive phenolics, which are divided into 13 subclasses. To our knowledge, OliveNet™ is currently the only curated open access database with a comprehensive collection of compounds associated with Olea europaea. Database URL:
      PubDate: Mon, 12 Feb 2018 00:00:00 GMT
  • TISSUES 2.0: an integrative web resource on mammalian tissue expression

    • Authors: Palasca O; Santos A, Stolte C, et al.
      Abstract: Physiological and molecular similarities between organisms make it possible to translate findings from simpler experimental systems (model organisms) into more complex ones, such as humans. This translation facilitates the understanding of biological processes under normal or disease conditions. Researchers aiming to identify the similarities and differences between organisms at the molecular level need resources collecting multi-organism tissue expression data. We have developed a database of gene–tissue associations in human, mouse, rat and pig by integrating multiple sources of evidence: transcriptomics covering all four species and proteomics (human only), manually curated and mined from the scientific literature. Through a scoring scheme, these associations are made comparable across all sources of evidence and across organisms. Furthermore, the scoring produces a confidence score assigned to each of the associations. The TISSUES database (version 2.0) is publicly accessible through a user-friendly web interface and as part of the STRING app for Cytoscape. In addition, we analyzed the agreement between datasets, across and within organisms, and found that the agreement is mainly affected by the quality of the datasets rather than by the technologies used or organisms compared. Database URL:
      PubDate: Mon, 12 Feb 2018 00:00:00 GMT
  • Worldwide Protein Data Bank biocuration supporting open access to
           high-quality 3D structural biology data

    • Authors: Young J; Westbrook J, Feng Z, et al.
      Abstract: The Protein Data Bank (PDB) is the single global repository for experimentally determined 3D structures of biological macromolecules and their complexes with ligands. The worldwide PDB (wwPDB) is the international collaboration that manages the PDB archive according to the FAIR principles: Findability, Accessibility, Interoperability and Reusability. The wwPDB recently developed OneDep, a unified tool for deposition, validation and biocuration of structures of biological macromolecules. All data deposited to the PDB undergo critical review by wwPDB Biocurators. This article outlines the importance of biocuration for structural biology data deposited to the PDB and describes wwPDB biocuration processes and the role of expert Biocurators in sustaining a high-quality archive. Structural data submitted to the PDB are examined for self-consistency, standardized using controlled vocabularies, cross-referenced with other biological data resources and validated for scientific/technical accuracy. We illustrate how biocuration is integral to PDB data archiving, as it facilitates accurate, consistent and comprehensive representation of biological structure data, allowing efficient and effective usage by research scientists, educators, students and the curious public worldwide. Database URL:
      PubDate: Wed, 07 Feb 2018 00:00:00 GMT
  • FishTEDB: a collective database of transposable elements identified in the
           complete genomes of fish

    • Authors: Shao F; Wang J, Xu H, et al.
      Abstract: Transposable elements (TEs) are important for host gene regulation and genome evolution. Consensus sequences of TEs can assist investigators in accelerating studies on TE origins, amplification, functions and evolution, as well as comparative analyses and prediction of TEs in different species. In evolution, physiology, ecology and heredity research, fish are important models. However, to date, no comprehensive resource for TE consensus sequences exists for fish. Here, we collected genome-wide data and developed a novel database, FishTEDB, including 27 bony fishes, 1 cartilaginous fish, 1 lamprey and 1 lancelet. De novo, structure-based and homology-based approaches were combined to detect TEs. The database is open-source and user-friendly, and users can browse, search and download all data. FishTEDB also provides GetORF, BLAST and HMMER tools to analyze sequences. Database URL:
      PubDate: Tue, 16 Jan 2018 00:00:00 GMT
  • CellExpress: a comprehensive microarray-based cancer cell line and
           clinical sample gene expression analysis online system

    • Authors: Lee Y; Lee C, Lai L, et al.
      Abstract: With the advancement of high-throughput technologies, gene expression profiles in cell lines and clinical samples are widely available in the public domain for research. However, a challenge arises when trying to perform a systematic and comprehensive analysis across independent datasets. To address this issue, we developed a web-based system, CellExpress, for analyzing the gene expression levels in more than 4000 cancer cell lines and clinical samples obtained from public datasets and user-submitted data. First, a normalization algorithm can be utilized to reduce the systematic biases across independent datasets. Next, a similarity assessment of gene expression profiles can be achieved through a dynamic dot plot, along with a distance matrix obtained from principal component analysis. Subsequently, differentially expressed genes can be visualized using hierarchical clustering. Several statistical tests and analytical algorithms are implemented in the system for dissecting gene expression changes based on the groupings defined by users. Lastly, users are able to upload their own microarray and/or next-generation sequencing data to perform a comparison of their gene expression patterns, which can help classify user data, such as stem cells, into different tissue types. In conclusion, CellExpress is a user-friendly tool that provides a comprehensive analysis of gene expression levels in both cell lines and clinical samples. The website is freely available, and the source code is available under the MIT License. Database URL:
      PubDate: Fri, 12 Jan 2018 00:00:00 GMT
  • A generic workflow for effective sampling of environmental vouchers with
           UUID assignment and image processing

    • Authors: Triebel D; Reichert W, Bosert S, et al.
      Abstract: Sampling of biological and environmental vouchers in the field is rather challenging, particularly under adverse habitat conditions and when various activities need to be handled simultaneously. The workflow described here includes five procedural steps, which result in professional sampling and the generation of universally identifiable data. In preparation for the field campaign, sample containers need to be labelled with universally unique identifier (UUID)-QR-codes. At the collection site, labelled containers, sampled material and attached supplementary information are imaged using a GNSS- or GPS-enabled smartphone or camera. Image processing, tagging and data storage as a CSV text file are subsequently carried out in a field station or laboratory. For this purpose, the newly implemented tool DiversityImageInspector is used. It addresses combined image and data processing in such a context, including the extraction of the QR-coded UUID from the image content and the extraction of geodata and time information from the Exif image header. The import of the resulting data files into a relational database or another kind of data management system is optional but recommended. If applied, the import might be guided by a data transformation tool with a compliant schema as described here. The new approach is also discussed with regard to implications for virtual research environments and data publication networks. Database URL:
      PubDate: Tue, 09 Jan 2018 00:00:00 GMT
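Two steps of the workflow in this abstract, UUID labelling and CSV storage, can be sketched with the Python standard library. This is a minimal illustration and not the DiversityImageInspector implementation: the column names, the helper functions and the sample coordinates are assumptions, and QR-code rendering and Exif extraction are left out because they would need third-party libraries.

```python
import csv
import io
import uuid

# Hypothetical column layout for one record per imaged sample container.
FIELDS = ["uuid", "latitude", "longitude", "timestamp"]


def make_labels(count):
    """Return `count` random (version 4) UUID strings, one per container."""
    return [str(uuid.uuid4()) for _ in range(count)]


def records_to_csv(records):
    """Serialize a list of record dictionaries as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


if __name__ == "__main__":
    labels = make_labels(2)
    # Invented coordinates and time, standing in for values read from Exif.
    records = [
        {
            "uuid": label,
            "latitude": 48.16,
            "longitude": 11.50,
            "timestamp": "2018-01-09T10:00:00Z",
        }
        for label in labels
    ]
    print(records_to_csv(records))
```

Because the identifiers are generated before the field campaign, each CSV row can later be matched to its physical container without any central registry.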
  • YAAM: Yeast Amino Acid Modifications Database

    • Authors: Ledesma L; Sandoval E, Cruz-Martínez U, et al.
      Abstract: Proteins are dynamic molecules that regulate a myriad of cellular functions; these functions may be regulated by protein post-translational modifications (PTMs) that mediate the activity, localization and interaction partners of proteins. Thus, understanding the meaning of a single PTM or the combination of several of them is essential to unravel the mechanisms of protein regulation. Yeast Amino Acid Modification (YAAM) is a comprehensive database that contains information on 121 921 protein residues that are post-translationally modified in the yeast model Saccharomyces cerevisiae. All the PTMs contained in YAAM have been confirmed experimentally. The YAAM database maps PTM residues in a 3D canvas for 680 proteins with a known 3D structure. The structure can be visualized and manipulated using the most common web browsers without the need for any additional plugin. The aim of our database is to retrieve and organize data about the location of modified amino acids, providing information in a concise but comprehensive and user-friendly way and enabling users to find relevant information on PTMs. Given that PTMs influence almost all aspects of the biology of both healthy and diseased cells, identifying and understanding PTMs is critical in the study of molecular and cell biology. YAAM allows users to perform multiple searches, with up to three modifications at the same residue, making it possible to explore possible regulatory mechanisms for some proteins. Using the YAAM search engine, we found three different PTMs of lysine residues involved in protein translation. This suggests an important regulatory mechanism for protein translation that needs to be further studied. Database URL:
      PubDate: Tue, 09 Jan 2018 00:00:00 GMT
  • To increase trust, change the social design behind aggregated biodiversity
           data

    • Authors: Franz N; Sterner B.
      Abstract: Growing concerns about the quality of aggregated biodiversity data are lowering trust in large-scale data networks. Aggregators frequently respond to quality concerns by recommending that biologists work with original data providers to correct errors ‘at the source.’ We show that this strategy falls systematically short of a full diagnosis of the underlying causes of distrust. In particular, trust in an aggregator is not just a feature of the data signal quality provided by the sources to the aggregator, but also a consequence of the social design of the aggregation process and the resulting power balance between individual data contributors and aggregators. The latter have created an accountability gap by downplaying the authorship and significance of the taxonomic hierarchies, frequently called ‘backbones’, that they generate, and which are in effect novel classification theories that operate at the core of the data-structuring process. The Darwin Core standard for sharing occurrence records plays an under-appreciated role in maintaining the accountability gap, because this standard lacks the syntactic structure needed to preserve the taxonomic coherence of data packages submitted for aggregation, potentially leading to inferences that no individual source would support. Since high-quality data packages can mirror competing and conflicting classifications, i.e. unsettled systematic research, this plurality must be accommodated in the design of biodiversity data integration. Looking forward, a key directive is to develop new technical pathways and social incentives for experts to contribute directly to the validation of taxonomically coherent data packages as part of a greater, trustworthy aggregation process.
      PubDate: Thu, 04 Jan 2018 00:00:00 GMT
  • HTT-DB: new features and updates

    • Authors: Dotto B; Carvalho E, da Silva A, et al.
      Abstract: Horizontal Transfer (HT) of genetic material between species is a common phenomenon among Bacteria and Archaea species, and several databases are available for information retrieval and data mining. However, little attention has been given to this phenomenon among eukaryotic species, mainly due to the lower proportion of these events. In recent years, a large number of new HT events involving eukaryotic species have been reported in the literature, highlighting the need for a common repository to keep the scientific community up to date and describe overall trends. Recently, we published the first HT database focused on HT of transposable elements among eukaryotes: the Horizontal Transposon Transfer DataBase: Database URL: ( 8080/httdatabase/). Here, we present new features and updates of this unique database: (i) its expansion to include virus-host exchange of genetic material, which we called Horizontal Virus Transfer (HVT) and (ii) the availability of a web server for HT detection, where we implemented the online version of vertical and horizontal inheritance consistence analysis (VHICA), an R package developed for HT detection. These improvements will help researchers to navigate through known HVT cases, make data-informed decisions and export figures based on keyword searches. Moreover, the availability of VHICA as an online tool will make this software easily reachable even for researchers with little or no computational knowledge, and will foster our capability to detect new HT events in a wide variety of taxa. Database URL:
      PubDate: Thu, 04 Jan 2018 00:00:00 GMT