<$BlogRSDURL$>
Christina's LIS Rant
Wednesday, April 28, 2004
  Using Google search results as a measure of “fame”
Found via Physics Web 27 April 2004.
"Fame in science is different to fame in other areas of life according to physicists at Clarkson University in the US. Daniel ben-Avraham and colleagues have shown that the fame of a scientist - as measured by the number of hits on Google - is directly proportional to their merit as measured by the number of research papers they have published. Such a relationship is not found for other groups such as sportsmen or actors (J P Bagrow et al. 2004 arXiv/cond-mat/0404515)."

There are several large problems with this article. First, the author’s definition of fame is “how well linked we are in… the World Wide Web.” Yet their measurement of how well linked they are only uses the proprietary Google PageRank feature. If the Google algorithms were published, or relied solely on linking, then this might be justifiable. As it stands, however, one is never certain what tweaks have been made and why top sites have dropped out of the results (see discussion here. Also, there are many documented instances of errors in the PageRank as well as efforts to artificially inflate the rank of a given page (see, for example, this Search Engine Watch article.

Second, this “fame” is compared to achievement which is then equated to merit. The model for this study used an obvious, less disputable metric: the number of enemy planes shot down by a fighter pilot. What metric is appropriate in measuring the merit or achievement of a physicist? The authors use number of publications appearing on www.arxiv.org/cond-mat! They do nothing to counteract occasions when the head of a research group appends his name to every paper written by his group. Also, wouldn’t it make more sense to weigh articles eventually published in Nature or Science or other high impact journals more heavily than never published (and never reviewed) eprints? Using the number of citations in cond-mat is circular reasoning because the first few hits in Google generally come from there.

The mathematics of the article are most likely correct, but the entire basis of the study is not. Bibliometrics of science is a fascinating subject, but there are many pitfalls to be avoided; it appears that these authors have hit them all.
 
Comments: Post a Comment


Powered by Blogger

This is my blog on library and information science. I'm into Sci/Tech libraries, special libraries, personal information management, sci/tech scholarly comms.... My name is Christina Pikas and I'm a librarian in a physics, astronomy, math, computer science, and engineering library. I'm also a doctoral student at Maryland. Any opinions expressed here are strictly my own and do not necessarily reflect those of my employer or CLIS. You may reach me via e-mail at cpikas {at} gmail {dot} com.

Site Feed (ATOM)

Add to My Yahoo!

Creative Commons License
Christina's LIS Rant by Christina K. Pikas is licensed under a Creative Commons Attribution 3.0 United States License.

Christina Kirk Pikas

Laurel , Maryland , 20707 USA
Most Recent Posts
-- University of Maryland's CIP
-- BloggerCon II, the Wired Coverage
-- BloggerCon II, the NT Times Coverage
-- BloggerConII: The Emotional Life of Blogs with Jo...
-- Revisiting the "don't believe citation analysis" i...
-- BloggerConII: International Session with Rebecca ...
-- I interupt with an announcement: my new Blog Sear...
-- BloggerConII: Librarianesque Session with Jessica...
-- Notes and thoughts from BloggerConII: Journalism ...
-- Notes and thoughts from BloggerConII: Opening Cer...
ARCHIVES
02/01/2004 - 03/01/2004 / 03/01/2004 - 04/01/2004 / 04/01/2004 - 05/01/2004 / 05/01/2004 - 06/01/2004 / 06/01/2004 - 07/01/2004 / 07/01/2004 - 08/01/2004 / 08/01/2004 - 09/01/2004 / 09/01/2004 - 10/01/2004 / 10/01/2004 - 11/01/2004 / 11/01/2004 - 12/01/2004 / 12/01/2004 - 01/01/2005 / 01/01/2005 - 02/01/2005 / 02/01/2005 - 03/01/2005 / 03/01/2005 - 04/01/2005 / 04/01/2005 - 05/01/2005 / 05/01/2005 - 06/01/2005 / 06/01/2005 - 07/01/2005 / 07/01/2005 - 08/01/2005 / 08/01/2005 - 09/01/2005 / 09/01/2005 - 10/01/2005 / 10/01/2005 - 11/01/2005 / 11/01/2005 - 12/01/2005 / 12/01/2005 - 01/01/2006 / 01/01/2006 - 02/01/2006 / 02/01/2006 - 03/01/2006 / 03/01/2006 - 04/01/2006 / 04/01/2006 - 05/01/2006 / 05/01/2006 - 06/01/2006 / 06/01/2006 - 07/01/2006 / 07/01/2006 - 08/01/2006 / 08/01/2006 - 09/01/2006 / 09/01/2006 - 10/01/2006 / 10/01/2006 - 11/01/2006 / 11/01/2006 - 12/01/2006 / 12/01/2006 - 01/01/2007 / 01/01/2007 - 02/01/2007 / 02/01/2007 - 03/01/2007 / 03/01/2007 - 04/01/2007 / 04/01/2007 - 05/01/2007 / 05/01/2007 - 06/01/2007 / 06/01/2007 - 07/01/2007 / 07/01/2007 - 08/01/2007 / 08/01/2007 - 09/01/2007 / 09/01/2007 - 10/01/2007 / 10/01/2007 - 11/01/2007 / 11/01/2007 - 12/01/2007 / 12/01/2007 - 01/01/2008 / 01/01/2008 - 02/01/2008 / 02/01/2008 - 03/01/2008 / 03/01/2008 - 04/01/2008 / 04/01/2008 - 05/01/2008 / 05/01/2008 - 06/01/2008 / 06/01/2008 - 07/01/2008 / 07/01/2008 - 08/01/2008 / 08/01/2008 - 09/01/2008 / 09/01/2008 - 10/01/2008 / 10/01/2008 - 11/01/2008 / 11/01/2008 - 12/01/2008 / 12/01/2008 - 01/01/2009 / 01/01/2009 - 02/01/2009 / 02/01/2009 - 03/01/2009 / 03/01/2009 - 04/01/2009 / 04/01/2009 - 05/01/2009 / 05/01/2009 - 06/01/2009 / 08/01/2010 - 09/01/2010 /

Some of what I'm scanning

Locations of visitors to this page

Search this site
(gigablast)

(google api)
How this works

Where am I?

N 39 W 76