Timing and location count when announcing particle physics results

August 7, 2009 | 7:10 pm

Two interesting papers have recently come out on arXiv about citation and readership habits in the high-energy physics community. The first to appear, arXiv:0906.5418 by me and two of my colleagues at CERN, discussed the large number of citations that papers posted to arXiv get, compared to those in HEP that are not posted to arXiv. While this is, at least in part, attributable to selection effects, we found two striking pieces of data:

1) Papers on arXiv are cited before they are published in journals. In fact 20 percent of the citations that articles receive in their first two years occur during the time before publication.

2) Physicists using SPIRES to access the literature are presented with a choice of clicking on the arXiv version or the journal version. For one month we looked at how often users clicked on each of these links for articles that were both submitted to arXiv and published, and found that 80% of the clicks went to arXiv. Another large set of users use arXiv directly, and thus more than 80% of readers in HEP prefer arXiv versions to published versions, when given a choice.

Together, these points make it clear that researchers in HEP don’t use journals to communicate scientific ideas. They may notice a paper is published, and they certainly value the peer-review and other functions provided by the journals, but they don’t communicate using the journals; instead they use arXiv, which is much faster.

The second paper , by Ginsparg and Haque, examines how the position of an article in the daily arXiv listings affects the number of citations it gets. Papers are listed on arXiv in the order they are submitted, with each new round of submissions starting at 4 p.m. EST daily. Since most physicists communicate via arXiv, and the arXiv lists new submissions daily, the position of the paper on these lists affects the citation count. Papers at the top of the list get more citations.

Ginsparg and Haque show that the boost is due to two effects. One is due purely to the higher visibility, and is about 50 percent or so, depending on the subfield. The other effect is a self-selection effect, i.e. researchers who know their paper is good want it to be first. They will work hard to submit it at 4:00:01 p.m. to get it on the top. This effect is of a similar size to that of the visibility effect.

So, by studying the systems that HEP uses to communicate (arXiv, SPIRES, and journals) we see that physicists in HEP are quite savvy about the communication tools they use and the ways those tools work. Researchers understand that work on arXiv is unpublished initially, but will eventually be peer-reviewed, and are willing to cite it during the interim period. They understand that the versions on arXiv are generally updated to match the final journal article, and prefer to use the consistent interface and access provided by arXiv. They understand that the arXiv daily listings are important and heavily read, thus they are willing to go to some lengths to submit papers right at 4:00:01 to get them on the top of the lists.

What new features will these communication systems provide that HEP scientists will adopt? Stay tuned. I think there are more exciting things to come…

Travis Brooks

1 Comment »

Top-cited high-energy physics articles during 2008

January 14, 2009 | 2:12 pm

Each year, the SPIRES team compiles a list of the most-cited research articles in high-energy physics. Here we present the most recent addition to this collection. As usual, the Particle Data Group’s Review of Particle Physics tops the 2008 list of the most highly cited papers. The top ten is composed of papers in observational astrophysics/cosmology, string theory, and extra dimensions.

Keep in mind that citation counts can never be exact: There is something like a 5% error in most of these numbers. Please do not fret about number 32 versus 33, as this is often not a statistically significant difference. Remember the detailed warning about the accuracy of these counts. A collection of lists for topcited papers in 2008 for individual eprint archives is available.

Also note that the counts shown, and used in the rankings, are the counts as of Thursday, January 8, 2009. Further, the counts shown by the ranking are only the cites satisfying the criteria for that list. Actual citation numbers in the database may change as corrections are made and papers are added, the links will take you to the updated numbers. The lists, however, will not update.

  1. 1994
    Review of Particle Physics
    By Particle Data Group
    Citations are counted for all versions of the RPP, most recent version is:
    Published in:Phys.Lett.B667:1,2008
    [391 Total citations in HEP]
    [32872 Total Citations to all copies of RPP in HEP]
  2. 924
    Wilkinson Microwave Anisotropy Probe (WMAP) three year results: implications for cosmology
    By WMAP Collaboration
    Published in:Astrophys.J.Suppl.170:377,2007
    (arXiv:astro-ph/0603449)
    [3245 Total citations in HEP]
  3. 731
    The Large N limit of superconformal field theories and supergravity
    By Juan Martin Maldacena (Harvard U.).
    Published in:Adv.Theor.Math.Phys.2:231-252,1998, Int.J.Theor.Phys.38:1113-1133,1999
    (arXiv:hep-th/9711200)
    [5812 Total citations in HEP]
  4. 658
    Five-Year Wilkinson Microwave Anisotropy Probe (WMAP) Observations: Cosmological Interpretation
    By WMAP Collaboration
    HEP Record
    (arXiv:0803.0547)
    [683 Total citations in HEP]
  5. 500
    Maps of dust IR emission for use in estimation of reddening and CMBR foregrounds
    By David J. Schlegel (Durham U.), Douglas P. Finkbeiner, Marc Davis (UC, Berkeley).
    Published in:Astrophys.J.500:525,1998
    (arXiv:astro-ph/9710327)
    [3910 Total citations in HEP]
  6. 492
    First year Wilkinson Microwave Anisotropy Probe (WMAP) observations: Determination of cosmological parameters
    By WMAP Collaboration
    Published in:Astrophys.J.Suppl.148:175,2003
    (arXiv:astro-ph/0302209)
    [5133 Total citations in HEP]
  7. 491
    Measurements of Omega and Lambda from 42 high redshift supernovae
    By Supernova Cosmology Project
    Published in:Astrophys.J.517:565-586,1999
    (arXiv:astro-ph/9812133)
    [4168 Total citations in HEP]
  8. 480
    Anti-de Sitter space and holography
    By Edward Witten (Princeton, Inst. Advanced Study).
    Published in:Adv.Theor.Math.Phys.2:253-291,1998
    (arXiv:hep-th/9802150)
    [3900 Total citations in HEP]
  9. 462
    Observational evidence from supernovae for an accelerating universe and a cosmological constant
    By Supernova Search Team
    Published in:Astron.J.116:1009-1038,1998
    (arXiv:astro-ph/9805201)
    [3981 Total citations in HEP]
  10. 425
    Gauge theory correlators from noncritical string theory
    By S.S. Gubser, Igor R. Klebanov, Alexander M. Polyakov (Princeton U.).
    Published in:Phys.Lett.B428:105-114,1998
    (arXiv:hep-th/9802109)
    [3433 Total citations in HEP]
  11. Read the rest of this entry »

Travis Brooks

1 Comment »

Springer Publishing to acquire BioMedCentral

October 9, 2008 | 6:04 am

“Springer to acquire BioMedCentral.” Those of us in physics may well wonder what this headline has to do with us. “BioMed” is not usually associated with high-energy physics (HEP), and Springer is, to most physicists, a big publisher that we occasionally deal with when submitting to Eur.Phys.J. or similar journals.

However, it is worth remembering that BioMedCentral (BMC) has a physics journal as well. The recently launched PhysMathCentral (an arm of BMC) journal PMC Physics A is an exciting new Open Access journal in HEP, so the sale does affect physics literature. I certainly hope that PhysMathCentral will continue to be a leading Open Access journal in the field, paving the way for broader Open Access in the future of HEP publishing. BMC has, since its founding in 2000, been a leader in Open Access, becoming the world’s first for-profit Open Access publisher and the world’s largest Open Access publisher of any kind. In light of this, Springer’s acquisition carries with it some interesting questions.

Will Springer begin to adopt BMC’s commitment to Open Access?

Read the rest of this entry »

Travis Brooks

1 Comment »

LHC Goes Open Access

September 17, 2008 | 9:18 am

As noted elsewhere, Elsevier has recently announced that:

Initial experimental results submitted to Physics Letters B and Nuclear Physics B by the LHC experiments will be made available through Elsevier’s sponsored access option at no cost to the experiments.

About one year ago Springer made a similar statement, extended to all HEP experimental papers, announcing

…all experimental papers submitted to and accepted by The European Physical Journal C – Particles and Fields will be published with full, online open access without any fees being incurred by the authors.

These moves by Elsevier and Springer mean that the peer-reviewed papers will not only be available for any and all parties to read, but that neither the authors nor their institutions will need to pay for this Open Access.

Of course, these sorts of donations to the HEP community (Elsevier and Springer incur very real costs to review, edit, and publish these papers, which they are here donating to the community) can be seen as a marketing ploy, designed to attract the goodwill of the HEP community and, with it, several of what one surely expects to be notable, highly-cited articles in the field.

However, the interesting bit is not that Elsevier and Springer might attempt to obtain high-quality articles for their journal, after all, that is their responsibility to their shareholders. Indeed the interesting thing here is that they feel that providing Open Access for LHC papers will actually attract LHC papers. Remember that authors in HEP pay essentially no fees in any of the primary journals in which they publish, so they do not really have a direct incentive to publish Open Access. Why then would a publisher (and indeed clearly smart ones like Elsevier and Springer) feel that providing Open Access for free would be a selling point?

Indeed this is the real story behind this action: that Elsevier are responding to new forces in the market for scholarly publishing. The rise of Open Access movements in general, and SCOAP3 within HEP, are a sign that the scientific community is starting to ask for more control over the dissemination of their work. Researchers are increasingly asking for journals that fit their mode of discourse, rather than journals that are bound to traditional ways that might not fit their optimal working style.

In addition the LHC experiments have made clear statements that they will publish their articles under Open Access conditions, and, as reported in symmetry, chose to publish key papers about the construction of their experiments as Open Access in JINST.  These publishers appear to think that Open Access is something researchers want, and that providing it will get them a leg up in the marketplace.

The real story is that researchers, librarians, and other Open Access advocates have changed the rules of the game, so that not only does the world’s largest scientific publisher feel that giving away articles free is good business, but they are probably right.

Travis Brooks

3 Comments »

Projects in SPIRES land

September 16, 2008 | 8:05 am

As I’ve written here before, I’m pretty busy these days working on INSPIRE, the system that will revamp and replace SPIRES. My colleagues at CERN, DESY, and Fermilab are also working hard to build something new and useful for the HEP community, while continuing to provide the SPIRES service, and in fact I’m writing this post from CERN, where many of us involved in INSPIRE are meeting to work on the project and get some good code written.

I’ve also written about SLAC’s involvement in SCOAP3, which is a separate project, but also aimed at maintaining the high-quality of literature in the community. Many of us involved in INSPIRE are also supporters of SCOAP3′s goal of Open Access to the peer-reviewed literature of HEP, and several of my colleagues are quite busy in this regard as well.

In the midst of all this work, my friend and colleague at the DESY library, Annette Holtkamp, somehow found time in to give an interview to a journalist about these projects that she’s working on. The interview is very detailed and presents a really nice overview of both INSPIRE and SCOAP3. Since I am also involved in these projects, I was quite happy to see such a nice summary of what I’ve been working on this last year, reminding me what I have done, where I want to go, and why.

Travis Brooks

3 Comments »

Can you predict Nobel Prize winners by counting citations?

August 27, 2008 | 8:11 am

The Nobel Prize Medal for Physics and Chemistry

The Nobel Prize Medal for Physics and Chemistry

In this recently posted paper, Yves Gingras and Matthew L. Wallace analyze the bibliometrics surrounding the physics and chemistry Nobel Prize winners and nominees of the past 106 years. They were curious: Could we predict the winners of the upcoming Nobel Prizes on the basis of how often their work is cited by other scientists?

The answer turns out to be no.

In the first half of the 1900s, citations did have some predictive power. A scientist would make a crucial discovery and write a paper (or papers) about it in a scientific journal. Other researchers would then cite these results in their own publications. Just about the time the scientist’s papers became the most heavily cited in the field, he or she would be awarded a Nobel.

In the last 30 years, however, it appears that physics–and chemistry as well–have become larger and more diverse, so that citations are spread among many seminal papers and researchers. Nobel laureates are no longer necessarily near the top of the citation rankings at the time of their awards.

Read the rest of this entry »

Travis Brooks

5 Comments »

SLAC, Lawrence Berkeley Lab, and J-Lab join SCOAP3

June 27, 2008 | 6:26 am

As announced on the SCOAP3 website, three more Department of Energy laboratories have joined the open-access consortium: Stanford Linear Accelerator Center, Lawrence Berkeley National Laboratory, and Jefferson Lab. They join four other DOE labs–Argonne, Pacific Northwest National Lab, Los Alamos, and Fermilab–that have pledged to redirect their high-energy physics journal subscription funds to SCOAP3, and thus help make peer-reviewed literature in the field freely available to all.

As reported in an Oct/Nov 07 symmetry article on the initiative:

If it works, no one will have to pay to read most particle physics results. The journals that publish most of the research in the field will be available free online to anyone, anywhere and any time. Money to run the journals—including the cost of having experts review each article before it sees print—would instead come from funding agencies, laboratories and libraries through a consortium called SCOAP3, the Sponsoring Consortium for Open Access Publishing in Particle Physics. This would give journals a stable source of funding while reducing the total cost to libraries and readers.

The proposal is the latest twist in open-access publishing, a worldwide movement whose goal is to pull down barriers to the free flow of information while preserving a system that has kept watch over the integrity of science for nearly 350 years.

Physics, with its long history of openly sharing research results, seems an ideal testing ground for such an idea. Fifty years ago physicists began circulating mimeographed preprints, or unpublished papers, as a way of getting results out more quickly. The mimeograph gave way to the copying machine and the computer; today’s physicists post theories and experimental results on arXiv.org, an Internet clearinghouse set up in 1991 to make it easier for them to swap information. .. As early as 1961, librarians at Stanford Linear Accelerator Center in California started keeping track of circulating preprints, an effort that evolved into SPIRES.

A growing number of other institutions in the United States, as well as organizations in 16 other countries, support this new model of paying publishers directly for the services that the scientific community needs, rather than paying indirectly for these services via subscriptions.

The SLAC Research Library has been a supporter of SCOAP3 in the United States from the beginning, helping to organize and educate libraries in various institutions around the country about the SCOAP3 model and why it fits the HEP community so well. However, it took some work to ensure that the library’s subscription costs were accurately tallied, and furthermore, during this time the entire SLAC library moved to a temporary location while our building is being renovated.

After the dust had settled in our new digs, we were able to take some time to understand the details of our pledge, and ensure that SLAC management understood SCOAP3 and why it would benefit SLAC and the HEP community as a whole. Making it clear that many scientists support SCOAP3, including the major Large Hadron Collider collaborations and HEPAP, the DOE’s High Energy Physics Advisory Panel, was incredibly useful since it helped everyone see how valuable those in the community feel open access to be. I am proud that SLAC was able to make this pledge of support now, and I am excited about the future of SCOAP3.

Travis Brooks

1 Comment »

SPIRES to become INSPIRE

May 29, 2008 | 4:31 am

Here at SPIRES we’ve been worried for quite some time that our software infrastructure was getting old. SPIRES continues to be a useful service for the community, but the time it takes the administrators to change things in the system is increasing, as we continually run up against the cracks of a 30-year-old system. Our recent survey revealed not only a strong user base, but also let us know that our users want more from us that we can currently provide.

Fortunately, CERN, DESY, Fermilab, and SLAC have joined forces to rectify the situation by creating INSPIRE. By combining the successful SPIRES database, curated at DESY, Fermilab, and SLAC, with the Invenio digital library technology developed at CERN, INSPIRE will offer the functionalities and quality of service which the high-energy physics user community has grown to expect from SPIRES. It will develop long-sought improvements providing access to the entire corpus of the HEP literature with full-text Google-like search capabilities and enabling innovative text- and data-mining applications, among a host of other desired features tailored specifically to the HEP community.

I’ve just come back from DESY, where we announced this project and got lots of positive feedback from our colleagues at other information services like arXiv, ADS, and journal publishers. I’m excited too, as our team at the four labs is very strong and I’m happy that we will finally be able to deliver many of the services that users have been requesting for years, but haven’t been able to provide.

Travis Brooks

2 Comments »

HEP Information Resources Survey

May 9, 2008 | 5:27 am

In Spring 2007 more than 2000 high-energy physicists took the time to answer a survey about HEP information systems that was put together by the libraries at CERN, DESY, Fermilab, and SLAC. My colleagues and I have now compiled these results and some analysis of them in arXiv:0804.2701 [cs.DL]

We found the HEP users tend to prefer the systems that have grown within their community, such as SPIRES and arXiv. This is probably not surprising to people within HEP; however, other fields often rely on more general systems that are not as tailored to the behavior of the specific researchers. The overwhelming success of the community systems in HEP may prove to be of interest to other areas of research. In addition, those of us who run these systems in HEP got some much-needed feedback about what we are doing right, and what we need to be doing better!

From the abstract:

Access to previous results is of paramount importance in the scientific process. Recent progress in information management focuses on building e-infrastructures for the optimization of the research workflow, through both policy-driven and user-pulled dynamics. For decades, High-Energy Physics (HEP) has pioneered innovative solutions in the field of information management and dissemination. In light of a transforming information environment, it is important to assess the current usage of information resources by researchers, and HEP provides a unique test-bed for this assessment. A survey of about 10% of practitioners in the field reveals usage trends and information needs. Community-based services, such as the pioneering arXiv and SPIRES systems, largely answer the need of the scientists, with a limited but increasing fraction of younger users relying on Google. Commercial services offered by publishers or database vendors are essentially unused in the field. The survey offers an insight into the most important features that users require to optimize their research workflow. These results inform the future evolution of information management in HEP and, as these researchers are traditionally “early adopters” of innovation in scholarly communication, can inspire developments of disciplinary repositories serving other communities.

We’d like to again thank those researchers who participated in this survey, which has already begun to guide us in delivering better services to the HEP community. You will certainly be hearing more about these new developments in the coming months, and we will continue to involve the community in the design and construction of these resources.

Travis Brooks

No Comments »

2007 Topcites Clouds

April 22, 2008 | 5:33 am

Last year a colleague of mine showed me tagcrowd.com, which is a neat way to take a bunch of text and visualize, very roughly, the important concepts in the text. The site performs a frequency analysis of words in the text, and displays it in the intuitive fashion we have all come to know and love when reading blogs or flickr or other tagging sites. However, tag clouds don’t have to be about tags, it can work for any mass of text.

Below, I took the titles of all 51 2007 Topcites from SPIRES (as well as abstracts from 37 of them and keywords from DESY for 27 of them) and threw them at the TagCrowd generator to see what it thought. So without further ado, here is an unintelligent, algorithmic, and probably very biased picture of the important concepts in HEP in 2007.

2007 Topcites Cloud

Travis Brooks

No Comments »