Archive for the ‘Discussion’ Category
One year ago (21st March 2014 to be exact) we contacted Helen Duce, the Head of E-Publishing at Maney Publishing, because after Maney migrated to Atypon’s new e-publishing platform (Literatum), JournalTOCs was unable to crawl the TOC RSS feeds of Maney’s journals.
JournalTOCs uses simple and effective RSS feeds to get the latest articles from over 25,000 journals. To fetch them, it uses a very basic form of the simple, but still effective, wget Unix command:
wget -O newtocs.tmp "journal-RSS-feed-URL" 2>&1
That is it: a wget call that has nothing to hide and does not use any of wget’s rich options to force crawling.
As we can only communicate with the publishers, we couldn’t discuss the problem directly with Atypon. So, we contacted Maney many times. While Helen was very helpful, Atypon was telling Maney that everything was OK at their end, but we knew that we were being refused access to the RSS feeds.
Today, Helen gave us the good news that Maney has finally heard back from Atypon on this issue. It turns out that our IP range was blocked by Maney Online (Atypon) because of “abuse monitoring”: JournalTOCs was crawling content (RSS feeds), and Atypon flagged that crawling as abuse.
Fortunately the misunderstanding has been resolved. Atypon has recognised that crawling RSS feeds is not abuse. The very reason for having RSS feeds is to enable other services to crawl and reuse your feeds to facilitate the widest dissemination of your content, which at the end of the day will benefit your business because it increases the number of visitors to your site.
We are glad to be able to access the RSS feeds of Maney again. We will restore the Maney journals that were selected by the JournalTOCs Index and start to update their TOCs. In the last year, usage (number of followers) of Maney’s journals has decreased at JournalTOCs, but we hope that once users see that Maney’s journals are being updated, they will start to follow them again.
Publishers that are changing platforms should check that their RSS feeds remain accessible to aggregators and discovery services. By working together, publishers, discovery services, aggregators and e-publishing platforms can have a positive impact on the dissemination of research.
“the success of these systems [link resolvers and knowledgebases] and services is ultimately dependent upon the cooperation of the various players across the supply chain of electronic resource metadata”
(van Ballegooie, Marlene (2015) Knowledgebases: The Cornerstone of E-Resource Management and Access. Serials Review 40(4) pp. 259-266. DOI: 10.1080/00987913.2014.977127)
LM created LibTOC thanks to a JournalTOCs Premium license, which gave LM full access to up-to-date information from the entire JournalTOCs database as well as premium access to daily updates of journal metadata. LM didn’t renew the license in July 2013 and as a consequence LibTOC lost access to up-to-date journal information.
The agreement between LM and JournalTOCs was intended to provide LM with privileged access to the JournalTOCs database to power the LibNet system, which was launched by LM last year.
Almost every day, journal titles are transferred between publishers, cease publication or have their URLs changed, and new titles are launched. Using the JournalTOCs Premium API, services can keep track of those changes in a systematic and automated way. In particular, JournalTOCs can identify when the URL of a journal’s TOC RSS feed has been changed or removed, or when a new TOC RSS feed is made available. Thus, through its customised APIs, JournalTOCs constantly provides up-to-date journal metadata to other current awareness services. For each journal, the information includes:
- subject classification
- RSS feeds URL
- homepage URL
- access rights
- e-ISSN and print-ISSN numbers
- number of followers at JournalTOCs
- last issue publication date
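To illustrate how a current awareness service might use these fields, here is a minimal sketch that holds the listed metadata in a record and compares two snapshots of the same journal to see what happened to its TOC RSS feed. The class name, field names and the feed_change helper are hypothetical and chosen only for this example; they are not the actual response format or functions of the JournalTOCs Premium API. The sample values are made up.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class JournalRecord:
    """Hypothetical record mirroring the fields listed above."""
    subject: str
    rss_url: Optional[str]       # None when no TOC RSS feed is available
    homepage_url: str
    access_rights: str
    e_issn: Optional[str]
    print_issn: Optional[str]
    followers: int
    last_issue_date: Optional[str]


def feed_change(old: JournalRecord, new: JournalRecord) -> str:
    """Classify what happened to a journal's feed URL between two snapshots."""
    if old.rss_url == new.rss_url:
        return "unchanged"
    if old.rss_url is None:
        return "added"
    if new.rss_url is None:
        return "removed"
    return "changed"


# Two snapshots of the same (invented) journal, before and after a platform move.
before = JournalRecord(
    subject="Ecology",
    rss_url="http://example.org/old/feed.rss",
    homepage_url="http://example.org/journal",
    access_rights="Subscription",
    e_issn="0000-0000",
    print_issn=None,
    followers=12,
    last_issue_date="2013-06-01",
)
after = replace(before, rss_url="http://example.org/new/feed.rss")

print(feed_change(before, after))
```

A service that stores yesterday's record for each journal can run a comparison like this against today's record and only re-crawl, or alert a librarian, when the result is not "unchanged".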
Many would argue that there is no excuse for software developers not to support old browsers, that is, browsers that were released more than five years ago or that do not support the advanced web apps commonly used in modern websites.
Some will point out that developers should apply standards that all browsers support, and that the whole point of well-formed HTML is that it should render in any browser.
But what about the security vulnerabilities commonly found in older browsers, and what about support for the rich and interactive web apps that have transformed the way we interact with websites nowadays? Shouldn’t those two reasons be enough to convince anyone to upgrade their browser? Our experience with the NHS, the major UK health service, has shown us that sometimes the answer is no.
JournalTOCs is used by hundreds of professionals from the NHS. Sometimes we receive enquiries from NHS librarians, who are using JournalTOCs to support the current awareness demands of their patrons. A recurrent question, asked by those librarians in a rather apologetic manner, is whether JournalTOCs web pages will work and render without problems in the browser used by many in the NHS, which is the old version 7 of Microsoft Internet Explorer (IE7). Those librarians are pleased to learn that JournalTOCs has been developed to work with IE7 as well as with newer browser versions.
IE7 was released by Microsoft in October 2006. It was shipped as the default browser in Windows Vista systems and was offered as a replacement for IE6 on Windows XP systems. IE7 was superseded by IE8 in March 2009, which in turn was replaced by IE9, released in March 2011. IE9 no longer supports Windows XP systems. IE7 is now a seven-year-old browser. However, it is estimated that IE7’s global market share is still 4%.
The issue becomes relevant in particular when you need to provide an external web service to NHS users. Probably a sizable chunk of the IE7 market share comes from the NHS and other departments from the UK government such as the Department for Work and Pensions (DWP). The NHS alone has more than 800,000 workstations and laptops nationwide, where IE7 is installed by default.
Why is an organisation with the importance of the NHS letting its staff use a seven-year-old browser that has already been superseded by two versions? And why IE only? The clue to the answer lies in the fact that the NHS is one of those organisations that are more concerned with maintaining the stability of their major critical intranets than with being compatible with external services and websites that are occasionally used by their staff. Google can be omnipresent and very important for millions of users, and can afford to stop supporting old browsers (“Modern browsers for modern applications”) and develop its own browser, but that will not deter those organisations from continuing to use a browser that is closely tied to their enterprise intranets.
As long as critical NHS enterprise applications still depend on IE7, JournalTOCs will continue supporting IE7. We understand that enterprise applications are not easy to upgrade. They deal with booking services, expense claims, corporate accounts, staffing changes, CRM systems, payroll, etc. Upgrading these expensive systems is not a trivial task. It is a process that is full of risks. So, it makes sense that these systems are upgraded at long intervals, with the process being rigorously controlled and methodically run. It also makes sense that JournalTOCs should remain useful to staff working in the NHS and in national organisations in other countries that are in a similar situation.
As we know, there is a wide range in the quality of the RSS feeds that publishers produce to announce the latest issues or articles published in their journals. But we wonder if there is any correlation between the quality of a journal and the quality of its RSS feeds. In particular, what about the best journals, that is, the journals with the highest impact, the most-cited articles and the most prolific content? Are their TOC RSS feeds a reflection of their outstanding position and quality?
Surely the publishers of the top journals are aware of the advantages of providing excellent RSS feeds (with rich content, tagged with standard elements and focused on enabling re-usability and early awareness). We can get a good idea of the quality of the RSS feeds of those top journals by checking that their RSS feeds are valid and well formed, follow the RSS specifications for scholarly publishers, and in particular make use of the main RSS 1.0 modules recommended by the “Recommendations on RSS Feeds for Scholarly Publishers”, namely the Dublin Core and PRISM modules. We are carrying out such an analysis, which will take some time. In the meantime we can check the RSS feeds of the winners of the ALPSP Award for Best New Journal 2012, recently announced.
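As a rough illustration of the kind of check described above, the following sketch tests whether a feed is well-formed XML and whether its RSS 1.0 items carry any Dublin Core or PRISM elements. It is only a sketch under stated assumptions, not the analysis we are running: the function name check_feed and the sample feed are invented for this example, PRISM is detected by a namespace prefix because publishers use different PRISM versions, and full validation against the RSS specifications is not attempted.

```python
import xml.etree.ElementTree as ET

RSS_NS = "http://purl.org/rss/1.0/"            # RSS 1.0 namespace
DC_NS = "http://purl.org/dc/elements/1.1/"     # Dublin Core elements
# Assumption: match any PRISM version by the common start of its namespace URI.
PRISM_PREFIX = "http://prismstandard.org/namespaces/"


def check_feed(xml_text):
    """Report well-formedness and Dublin Core / PRISM usage of an RSS 1.0 feed."""
    report = {"well_formed": False, "items": 0,
              "uses_dc": False, "uses_prism": False}
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return report  # not well-formed XML, nothing more to inspect
    report["well_formed"] = True
    for item in root.iter("{%s}item" % RSS_NS):
        report["items"] += 1
        for child in item:
            # ElementTree writes qualified names as "{namespace}localname".
            if child.tag.startswith("{" + DC_NS + "}"):
                report["uses_dc"] = True
            elif child.tag.startswith("{" + PRISM_PREFIX):
                report["uses_prism"] = True
    return report


# Invented sample feed with one article, used only to exercise the function.
SAMPLE = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns="http://purl.org/rss/1.0/"
         xmlns:dc="http://purl.org/dc/elements/1.1/"
         xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">
  <channel rdf:about="http://example.org/toc">
    <title>Example Journal</title>
    <link>http://example.org/</link>
    <description>Latest issue</description>
  </channel>
  <item rdf:about="http://example.org/article/1">
    <title>An example article</title>
    <link>http://example.org/article/1</link>
    <dc:creator>A. Author</dc:creator>
    <prism:volume>1</prism:volume>
  </item>
</rdf:RDF>"""

print(check_feed(SAMPLE))
```

A feed that passes this check is not necessarily a good feed, but one that reports no Dublin Core or PRISM elements is unlikely to contain the metadata needed for OpenURL resolution.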
It is interesting to note that Postmedieval, from Palgrave Macmillan, which is the winner of the ALPSP Award for Best New Journal 2012, is also among the journals with the best TOC RSS feeds.
The TOC RSS feeds of Postmedieval include all the metadata required to support efficient reuse (e.g. OpenURL resolution) and dissemination (e.g. current awareness) of latest articles, making Postmedieval a good example of how to use RSS feeds.
Similarly, the winner of the Highly Commended Certificate (Methods in Ecology and Evolution, from the British Ecological Society and Wiley-Blackwell) as well as the shortlisted journals (Cancer Discovery, from the American Association for Cancer Research, and Physical Review X, from the American Physical Society) have excellent TOC RSS feeds.
Clearly there is a direct relationship between the quality of those new journals and the quality of their RSS feeds. In a future post we will report on the results of our analysis of the RSS feeds collected by JournalTOCs to determine whether the top journals tend to have the best TOC RSS feeds or not.
Postmedieval TOC RSS feeds:
Methods in Ecology and Evolution TOC RSS feeds:
Physical Review X TOC RSS feeds:
Predatory publishers are already damaging the reputation of Open Access. Unfortunately, the uncontrolled proliferation of new Open Access journals is also negatively impacting on the standing of the Open Access movement.
Among the 3,850 Open Access journals currently indexed by JournalTOCs, we find that on average two cease publishing or disappear altogether every month. In addition, we have noticed that various Open Access journals indexed by JournalTOCs are struggling to continue publishing new issues. The temptation for some of those journals to publish “anything” is real.
The questions we would like to ask our friends at DOAJ are:
1. How many of the Open Access journals, registered with DOAJ, have ceased to publish?
2. Can DOAJ provide us with an API to help us to detect the OA journals that no longer exist?
On average, JournalTOCs receives 10 requests per day to add new Open Access journals to its database. In most cases, those journals do not meet our selection criteria and consequently they are not added to JournalTOCs.
Open Access journals are helping researchers to boost their number of publications and citations. For example Prof. Syed Tauseef Mohyud-Din has achieved an impressive number of 350 new papers published in less than four years. However, aren’t we abusing the current explosion of spurious scholarly Open Access journals? Is the peer-review model working in the same way for both Open Access and commercial “traditional” publishers? Many questions are still to be answered regarding Open Access.