|
Home | Issues | eMJA shop | My account | Classifieds | Contact | More... | Topics | Search |
→ For editorial comment, see Lundberg
→ Previous article in this issue
→ Contents list for this issue
→ More articles on Journalology and publishing
→ Download a pdf version of this article
Gloom or glee? Each September, journal editors and publishers anxiously await news of a particular figure from the Institute of Scientific Information (ISI) in Philadelphia, USA. The figure's value is promptly met with despair or delight. We are referring, of course, to the Impact Factor (IF) and the ritual surrounding its release that has come to dominate the editors' and publishers' calendar.
Two of us, as editors ourselves, participate in this practice, albeit reluctantly and with rising apprehension that scientific publishing is being undermined by "numerology". What was introduced as an aid to librarians more than four decades ago to guide their selection of scientific journals has become, in the view of some people, an inappropriate means of scrutinising an applicant's "track record" when allocating research funds or considering academic promotions.1-4 Why inappropriate? Because the IF (defined as "the number of citations to a journal's articles published in the previous two years divided by the number of articles published by that journal during those two years"4-6) is conceptually and technically flawed, on a number of grounds:
the quality of published material cannot be constrained by time — the two-year period set by the ISI for citations is arbitrary;7
the number of journals in the ISI's database is a minute proportion of those published;4
reviews are cited more frequently than original research, thus favouring journals that opt for these articles as part of a publishing strategy;1
the IF does not take into account self-citations, which amount to a third of all citations;8
errors are common in reference lists (occurring in up to a quarter of references), inevitably affecting IF accuracy;9 and
the assumption of a positive link between citations and quality is ill-founded, in that we cite articles for diverse reasons, including to refer to research judged suspect or poor.10
If these flaws were not enough to instil scepticism about the IF's validity and precision, then the arbitrary assumption that the quality of a specific article correlates with the IF of the journal in which it appears is entirely ill-founded and, in itself, sufficient to warrant concern about its continuing use. As editors and researchers, we are duty-bound to analyse this assumption rigorously. What do we find?
Consider two psychiatric journals published for a general readership: the Australian and New Zealand Journal of Psychiatry (ANZJP) and the Canadian Journal of Psychiatry (CJP). Applying the ISI's own "Web of Science" database,11 we can examine the purported link between IF, citations and the worth of a particular article. For instance, if we calculate the proportion of citations to all articles in each journal that is accounted for by the most-cited 50% of papers, a striking pattern emerges. In the case of the ANZJP, the most-cited 50% of papers published between 1990 and 1995 account for 94% (range, 91% [1992] to 98% [1990]) of all citations to articles in that journal. Figures for the CJP are virtually identical: 94% (range, 91% [1991] to 96% [1994]). A blunt summary of these findings is that half the articles published in both journals receive virtually no citations. We can conclude from the data (and from comparable findings from other journals, such as those in cardiology10) that to determine the academic worth of a paper from the IF of the journal in which it appears is ill-conceived and misleading.
In an era of evidence-based medicine, its proponents avow that scientific progress can only be achieved by dint of diligent scrutiny of available data. Is it not incongruous, then, that the scientific community continues to cling to such an inadequate tool as the IF? What's more, the "parent" IF has spawned a range of flawed offspring, including "Scope-adjusted IF", "Discipline-specific IF", "Journal-specific influence factor", "Immediacy index" and "Cited half-life". As if that were not disconcerting enough, lo and behold, ISI recently faced a new rival, albeit short-lived (the venture collapsed in the wake of a threat from ISI to sue for violation of intellectual property rights). "PrestigeFactor.com" was launched in 2001, enticing us to ditch the IF and supplant it with another measure of journal quality, the "Prestige Factor" (PF).12 Its proponents boldly asserted that the PF provided "truer value" than the IF.12 There was, however, a fly in the ointment. Despite minor refinement (eg, the PF separated review articles from research reports and included citations to journal articles over the previous three years versus two), the underlying premise of both measures — that quality and number of citations are inextricably linked — was identical. It is also worth reporting the contemporary practice of open-access "e-journals" tracking their most popular articles through "hit rates".13,14 Again, we doubt that a popularity poll can indicate academic merit and fear that it may be misconstrued in this way.
We have reached a watershed — either we persevere with the notion that citations lie at the heart of scientific quality or we make a clean break. The latter option is attracting growing support. For instance, Richard Frackowiak, Dean of the Institute of Neurology in London, asserts that current measures are crude and reliance on them in making hiring-and-firing decisions is counterproductive.15 Zach Hall, a leading figure in US research, sees numerical methods as "excuses for not thinking".15 David Adam, a writer for Nature, highlights the absurdity of the situation in Finland, where government funding of university hospitals utilises a sliding scale corresponding to the IF of journals in which researchers publish their work.16 A notable development is a similar questioning among scientific bodies. The Deutsche Forschungsgemeinschaft, Germany's central research organisation, has promulgated innovative guidelines emphasising qualitative criteria in evaluating published material.17 As they posit, "Publications must be read and critically compared with the relevant state of the art and to the contributions of other individuals and working groups".17 Admirable but vexing. How are we to judge quality objectively?
We have grappled with this challenge and devised an option for the ANZJP. Five international members of the journal's advisory board were invited in 2001 to identify that year's "top" articles. The quintet, selected on the basis of scholarship, professional integrity and knowledge of scientific psychiatry, were asked to select three publications per issue that satisfied one or more of the following criteria:
adds consequentially to the field through original, innovative research findings;
expands or challenges current knowledge;
opens additional areas for new research activity;
opens a pathway to advance knowledge;
integrates discoveries obtained by different approaches and/or disciplines through creative synthesis, thus bringing new insights to bear on original research; and
reflects critically on research findings to guide the direction of further research.
The data were collated, and the titles of the nine articles gaining the most votes were announced in the April 2002 issue of the journal and posted on the websites of the Royal Australian and New Zealand College of Psychiatrists and Blackwell Publishing. This procedure is familiar in that it is, in essence, an extension of peer review. While by no means foolproof, we proffer this approach as a fresh way to establish the quality of the individual article. We sought feedback from the judges and learned that the task is feasible and the clarity and utility of the assessment criteria are satisfactory. The judges also found the assignment personally rewarding, even enjoyable! We are examining the method's reliability. Testing validity, of course, is more taxing given the lack of an objective yardstick.
Interestingly, our experiment has been echoed by another initiative to highlight meritorious papers, namely the "Faculty of 1000" (F1000). Launched in November 2001 by the publishers of BioMed Central (a collection of wholly electronic biomedical "journals"),18,19 F1000 aims to identify the best papers in the basic biological sciences through the eyes of a "faculty" of over 1000 selected scientists who are experts in their fields. Thus, for instance, cell biology is divided into 18 categories, and the faculty members for each category select two to four papers each month from any journal, ranking them as "recommended", "must read" or "exceptional". The experts also briefly explain their choices. We applaud this initiative and see our own effort as complementary. Indeed, it is commonsensical that more than one method of identifying outstanding papers should be instituted, as there cannot be an absolute consensus.
We invite editors, publishers and authors to consider trying our experiment. We selected a new set of judges, again all distinguished figures in international psychiatry, to undertake a similar task for articles published in 2002. After this replication, we hope to be well placed to determine whether the method needs change. It would be disingenuous of us to conceal our fantasy that the annual "gloom or glee" ritual will be supplanted by published lists of "best quality articles", with their authors duly acknowledged. We may even witness the ISI collating the results, including the names of the judges and the criteria applicable for participating journals (consensually agreed criteria across all journals would be ideal). The implications are clear: successful authors will be able to cite articles that have made it to the "top" when documenting their academic track record, whatever the purpose (eg, applying for research grants). Depending on a measure devoid of any rational link to the appraisal of academic worth will be but a hazy memory.
Rivendell Unit, Child Adolescent and Family Psychiatric Services, Thomas Walker Hospital, Concord West, NSW.
Garry Walter, PhD, FRANZCP, Director; and Editor, Australasian Psychiatry; Karen Fisher, MB BS, Research Officer, Central Sydney Area Health Service.Department of Psychiatry, University of Melbourne, Fitzroy, VIC.
Sidney Bloch, PhD, FRANZCP, Professor; and Editor, Australian and New Zealand Journal of Psychiatry.Department of Psychological Medicine, Research Unit, Rozelle Hospital, Rozelle, NSW.
Glenn Hunt, MSc, PhD, Senior Research Fellow, University of Sydney.Correspondence: Dr Garry Walter, Thomas Walker Hospital ("Rivendell"), Hospital Road, Concord West, NSW 2138. gwalterATmail.usyd.edu.au
AntiSpam note: To avoid spam, authors' email addresses are written with AT in place of the usual symbol, and we have removed "mail to" links. Replace AT with the correct symbol to get a valid address.
©The Medical Journal of Australia 2003 www.mja.com.au Print ISSN: 0025-729X Online ISSN: 1326-5377
George Lundberg. The "omnipotent" Science Citation Index Impact Factor Med J Aust 2003; 178 (6): 253-254. [Editorials] <http://www.mja.com.au/public/issues/178_06_170303/editorials_170303-2.html>
|
Home | Issues | eMJA shop | My account | Classifieds | More... | Contact | Topics | Search |