Archive for January, 2006

Tale of Two Tiananmens

Tuesday, January 31st, 2006

Y’know, what gets me isn’t just the Google bowing to censorship in this whole what’s fit for China escapade, but the amazing thoroughness of the block.

it’s downright chilling, is what it is

and I don’t mean chilling because some nation could mandate that instead of this page of image-search results their own people should see this ‘cleaned‘ page of results. I mean it’s chilling that one single private corporation should have the ability to robotically discern that this image from rollins.edu should be forbidden while this image from spspku.bjmu.edu.cn should not. Nor is it just a slanting of the weights: Way down at Page 10, Google.cn is still squeeky clean. Happy smiling people holding hands.

Maybe I’m just not keeping the faith, but it is my experience, in my half-century or so of being a human, that whether or not Larry Page et al would really stoop to using this means for any other ends is moot because way way down the line, as Sun Microsystems recently admitted with it’s Java Creator CD disclaimer on it’s privacy clause …

“Sun, as a global company, may transfer your personal information to countries which may not provide an adequate level of protection. Sun, however, is committed to providing a suitable & consistent level of protection for your personal information regardless of the country in which it resides.”

A tip to Bub for the tip on that, and while you may see differently, I read them as basically being honest about the clause most others simply slyly omit: “you can trust us, but we can’t vouch for our partners” — things age, connections extend, and you never really know who’s going to be the NEXT Emperor of Rome, or what sort of sense of ethics and fair-play they might have.

source: Tale of Two Tiananmens

OA is the future

Tuesday, January 31st, 2006
Kuan-Teh Jeang, Open Access And Public Archiving: The Future Of Scientific Publishing?
NIH Catalyst
, Jan-Feb, 2006 (accessible only to NIH employees). (Thanks to Jennifer Heffelfinger.) Excerpt:
Traditional journals, like print journalism, remain the dominant force at the moment. However, slowly but surely, the open-access web and electronically based upstarts are gaining traction. Indeed, a senior science writer at the New York Times recently told me –when asked how the Times sees its free web-based competitors– “We’re running scared!”…Now, the pervasiveness of the Internet offers the potential for numerous additional communities –within or outside academia, in rich and in poor nations– to access previously guarded knowledge. Such access is in keeping not only with the concept that publicly funded science should be shared without charge, but also with the tradition long embraced by scientists that access to large databases such as the genomes of animals and plants and archives like PubMed should be free and public.

Nonetheless, broad acceptance of open-access publishing is at a tipping point. Several factors may yet influence its success or failure. The first is the economics of publishing for a wide audience. The web promises to be a low-cost venue that can reach, with unparalleled rapidity, large numbers of geographically dispersed and economically disparate parties.

Contrast this availability with the rising cost of the traditional print model, which threatens affordability by even the best-funded libraries in wealthy nations. For example, United Kingdom statistics show that between 1998 and 2003, the average subscription price of academic journals rose by 58 percent while retail prices increased by only 11 percent.

A second factor is public demand in developed and developing worlds. The view that at-large access to scientific data is not needed because of lack of public interest is incongruent with empirical experience. Existing numbers indicate that only one-third of the users of PubMed are academicians and researchers, whereas two-thirds are the “public” –clearly not indifferent. As science moves increasingly toward globalization, access models that transcend professional classifications, national boundaries, and accidents of birth are timely and necessary….

Currently, NIH, the Howard Hughes Medical Institute, the United Kingdom’s Wellcome Trust, Germany’s Max-Planck Society and Deutsche Forschungsgemeinschaft, and France’s CNRS and INSERM have all encouraged their funded researchers to deposit peer-reviewed articles into publicly accessible repositories. The two major publishers of open-access journals –Public Library of Science (PLoS) and Biomed Central– have also adopted policies of directly and immediately depositing their published works into PubMed Central….I have an interest in the evolution of scientific publishing. Twelve years ago I helped start a traditional journal, the Journal of Biomedical Science, which I edited for more than 10 years. Two years ago, I left that project to found Retrovirology, an exclusively web-based open-access journal.

Although I have an abiding loyalty to my scientific societies and feel that they deserve continuing revenue streams, my personal read of the winds of change is that open-access publishing and publicly accessible digital repositories like PubMed Central may well be the dominant future players….Based on the acceptance that Retrovirology has gained within my scientific community, it seems to me that scientists do look beyond the cover of a journal to recognize the value of open accessibility to their work. Our journal caters to a relatively small cohort of retrovirologists, but it is accessed steadily 1,000 times each day, 30,000 times each month. These numbers are disproportionate to our known academic audience and suggest that a significant percentage of our readers are members of the public who value and trust our content. Public access, public trust, and public archives –are these not the wave of the future of scientific publishing?

source: OA is the future

More on the British Library digitization project

Tuesday, January 31st, 2006
Helen Beckett, Preserving our digital heritage, ComputerWeekly, January 31, 2006. A detailed look at the British Library project to digitize its holdings, some of which is funded and assisted by Microsoft as part of the Open Content Alliance. The article focuses more on digital preservation than OA.

source: More on the British Library digitization project

More on the NIH policy and CURES Act

Tuesday, January 31st, 2006

The Winter 2006 issue of ARL’s Federal Relations and Information Policy is now online. Section V.B is on the NIH public-access policy and V.C is on the CURES Act. Excerpt:

Based on a review of statistics detailing grantee deposit rates, the NIH Public Access Working Group, comprised of key stakeholders including members of the library community, recommended that researchers be required to deposit articles in PMC in lieu of the current policy which is voluntary. Ann Wolpert, Director of Libraries, MIT, is a member of the Working Group. The library community strongly supports this recommendation. ARL will continue to monitor the NIH policy and work with others in the community, SPARC and the Alliance for Taxpayer Access (ATA) in particular, on this evolving policy….

Introduced on December 14, 2005, by Senators Joe Lieberman (D-CT) and Thad Cochran (R-MS), the bipartisan “American Center for Cures Act of 2005” would expedite the development of new therapies and cures for life-threatening diseases. One provision in the bill calls for free public access to articles stemming from research funded by agencies of the Department of Health and Human Services (DHHS), including NIH, the Centers for Disease Control and Prevention, and the Agency for Healthcare Research and Quality. Under the proposed legislation, articles published in a peer-reviewed journal would be required to be made publicly available within 6 months via NIH’s PubMed Central online digital archive. The library associations note that although some final electronic manuscripts are made available on PubMed Central, many are not—and delays in posting research on PubMed sometimes thwart public access to important articles for up to a year. The library announcement is available at www.librarycopyrightalliance.org. ARL will promote the public access provision in the CURES legislation.

source: More on the NIH policy and CURES Act

Weblog Spam and Adversarial Classification

Monday, January 30th, 2006

Dr. Dave, author of the Spam Karma WordPress antispam plugin, has posted an
interesting article about new weblog-spammer
tactics
:

These spams do not present most of the idiotic traits of their lower
colleagues: they do not try cramming hundreds of URLs or inserting hundreds
of easily spotted junk keywords in the comment content. Instead, they use
only the dedicated name and homepage fields to sneak in spam URL and
keywords. The comment content is often perfectly innocuous, sometimes even
topical (by copying parts of another comment or a trackbacking post). All in
all, these spams could easily be missed by a human moderator who wouldn’t
look carefully at the contact name and URL.

(Thanks to Kelson Vibber for the pointer
to this.)

In other words, he is noting what we noticed in email anti-spam; that what
works well one year, is likely to degrade over time as the spammers attempt to
evade it, and one has to keep working to keep up.

The best term for this appears to be adversarial
classification
. Anti-spam
activities fall into this category, and it often means that classic text
classification algorithms aren’t suitable — after all, the Reuters-21578
dataset

never tried to evade your classifier ;)

In a similar vein, this MS research
paper
is interesting:

Previous work on adversarial classification has made the unrealistic
assumption that the attacker has perfect knowledge of the classifier. …. We
present efficient algorithms for reverse engineering linear classifiers with
either continuous or Boolean features and demonstrate their effectiveness
using real data from the domain of spam filtering.

It’s akin to John Graham-Cumming’s work looking into how a spammer could get
past a bayesian filter “from the outside”, but with more techniques, and
examining MS’ MaxEnt algorithm, too. PDF
here
, well worth
a read.

(By the way, I’m in the process of moving house, so if you send me an email, it
may take a while for me to reply. This situation is likely to prevail for the
next few weeks, for what it’s worth — fun.)

This post was written by Justin, source: Weblog Spam and Adversarial Classification

More on OA to theses and dissertations

Monday, January 30th, 2006
Diane Le Hénaff and Catherine Thiolon, Gérer et diffuser les thèses électroniques : un choix politique pour un enjeu scientifique, Documentaliste, October 2005. Only an abstract (in French) is free online for non-subscribers, at least so far. Here is Erik Arfeuille’s translation of the title and abstract:
Managing and disseminating electronic theses: policy decisions for scientific stakes. Now that the concept of open archives has been accepted by the scientific community, open access to theses has become a major preoccupation for institutes of higher education and research. Disseminating electronic theses is a key concern in providing visibility for and access to scientific documents that although not published has been validated. Following a review of the techniques used to deposit, process and disseminate theses, this article describes STAR, the French plan for depositing, publicizing and archiving this type of record, and insists on the scientific issues of a national policy on electronic dissemination of theses.

source: More on OA to theses and dissertations

Profile of the Sudan Archive Project

Monday, January 30th, 2006
Sudan archiving project turns dry-as-dust documents into bits for easy access, Balancing Act News Update, Issue no. 290. An unsigned news story. (Thanks to Eve Gray.) Excerpt:
Archives conjure up images of rows of shelves with documents gathering dust. And this is precisely the problem a Sudan archiving project had when it wanted to digitalise the archives. In Lokichoggio in northern Kenya, just over the border from southern Sudan, Daniel Large remembers that dust was the main problem “Scanning old documents was difficult among the dust to the point where it clogged the scanner”, writes Isabelle Gross. Large is the project manager of the Sudan Open Archive Pilot Project – a scheme that aims to digitally preserve the documents left by various humanitarian organisations in Sudan and to make them accessible to the public via a website. As Large explains, back in 1989 UNICEF -Operation Lifeline Sudan was only meant to last for three months, but in reality humanitarian work went on for more than 15 years, involving more than a dozen of other NGOs working under UNICEF’s umbrella….In Daniel’s opinion, once preserved, these [documents] will help reveal such things as the history of constraints to aid operations, the evolution of the conflict and changing conditions in locations throughout Southern Sudan over these years. He adds that there is lots of documentation currently scattered across many locations in Kenya and Sudan. Some of these are in a vulnerable condition, including some documents produced after the peace agreement in 1972. The project began after a meeting in Amsterdam between UNICEF and the members of the Rift Valley Institute. The idea was to turn the documents left by UNICEF and other NGOs into usable resources for NGO field practitioners, and more generally, for the Sudanese people, giving them the opportunity to access contemporary and historical knowledge about their country. Large throws his spotlight on the general problem of archives in the humanitarian sector. In his view, just as information is fundamental to the effectiveness and impact of emergency response, knowledge of the history of aid and development operations is important to programme design, implementation and evaluation. Documents from the past can have a practical purpose in the present, but only if they can be readily accessed. The widespread “amnesia” resulting from emergency response and development is mostly a result of a lack of institutional memory and of high staff turnover….According to Daniel Large, the fact that the [Greenstone] software is open source and free will ensure cheap and easy archive accessibility among NGO practitioners and more generally between people in Sudan….Furthermore the concept behind the project is highly transferable. Unfortunately many countries around the world such as East Timor or Afghanistan have been in turmoil over the past few years. These events have resulted in a loss of documented history for the people and the NGO working there. Short- or long-term history suggests a choice of how anybody can write about the history of its own country. How, for instance, can one dispute the position of the frontier when one can only rely on human memory? Although human memories fade over 50 years, properly archived and accessible documentation could give a more concrete guide. In the meantime the experience of the many NGO that have been involved in aid work has made Sudan’s history more accessible to the world. Balancing Act will publishing the URL to the Sudan Open Archive Pilot Project as soon as it becomes available.

source: Profile of the Sudan Archive Project

Proposed new OA journal of anthropology

Monday, January 30th, 2006
Matthew Wolf-Meyer has writiten a proposal for a new OA journal of anthropology, After Culture: Emergent Anthropologies. (Thanks to Kambiz Kamrani on Anthropology.net.) From the proposal:
The purpose of After Culture: Emergent Anthropologies (hereafter AC:EA) is to allow an international group of graduate students to work alongside a similarly international group of faculty in the production of a journal that publishes primarily graduate student scholarship. Graduate students will produce peer-reviews as well as deliberate on and implement editorial policy, all with the support of a diverse group of faculty (who will participate in the peer-review process and provide advice on editorial policy)….[I]f given AAA approval, the journal will be published by the University of California Press and made available through AnthroSource….The total cost of one year’s worth of publications (2 issues, 200 pages) is approximately $3200 (based on University of California Press figures and including the costs of formatting, online storage and publicity). I have approached the president of the National Association of Student Anthropologists (NASA) about the possibility of aligning AC:EA with NASA, thereby becoming the official publication of NASA. I am currently discussing the possibility of funds being provided by the Society of Cultural Anthropology and the American Ethnological Society. The AAA journals will soon be receiving payments from the institutional subscription fees based upon how often the journal and its contents are cited and accessed through AnthroSource. It is hoped that with its provocative title and contemporary content that AC:EA will receive enough funds to cover some of its publishing costs. AC:EA may also attempt to solicit funds through the AAA meeting registration process (similar to the donations that people make for childcare).

source: Proposed new OA journal of anthropology

Counting the OA journals

Monday, January 30th, 2006
Heather Morrison, Trends in refereed journals / open and toll access, Imaginary Journal of Poetic Economics, January 29, 2006. Excerpt:
Data on scholarly, peer-reviewed journals from three sources is presented and analyzed. Ulrich’s reports 1,253 scholarly, peer-reviewed open access journals, about 5% of the journals in this category. The number of new journal start-ups recorded in Ulrich’s since 2001 appears to be fairly steady since 2001, both for all scholarly, peer-reviewed journals, and for open access scholarly peer-reviewed journals. The largest number of open access peer-reviewed journal start-ups recorded was in 2004, the last year for which data is likely complete, with a total of 99. DOAJ includes a total of 2,009 open access journals as of today. One possible source of the discrepancy in numbers could be an english-language bias in Ulrich’s; of the academic journals listed in Ulrich’s, almost 90% are in the english language, while DOAJ appears to represent a much broader linguistic spectrum. Both DOAJ and Ulrich’s list considerably fewer open access journals than are found in Jan Sczcepanski’s list, over 4,700 journals as of December 2005. There are several possible reasons for this. One is that many academic journals are not necessarily peer-reviewed; for example, only about 40% of the journals listed as academic / scholarly in Ulrich’s are peer-reviewed. If we assume that 40% of the journals in Jan Szczepanski’s list are peer-reviewed, the total would be 1,880 - very close to the DOAJ figure of 2,009. There are reasons to think that all available figures for open access journals are underestimates. Jan Sczcepanski’s, the longest list available, for example, focuses on social sciences, humanities, and math; it is also primarily the work of one individual working on a volunteer basis.

source: Counting the OA journals

Profile of David Goodman

Monday, January 30th, 2006
Heather Morrison, David Goodman: Ardent Open Access Advocate, OA Librarian, January 29, 2006. Another installment in Heather’s celebration of librarians who fight for OA. Excerpt:
Dr. David Goodman of the Palmer School of Library and Information Studies, Long Island University, has been an ardent advocate of open access for many years….Many of us know David through his long-standing, and much appreciated, contributions to the Liblicense discussion list…as well as his contributions to the SPARC Open Access Forum….Some of David’s formal writings can be found in David’s E-LIS or D-LIST. David edited a November 2004 special issue of Serials Review on Open Access - his own The Criteria for Open Access is highly recommended as a well-balanced overview, particularly for the novice advocate. Through his long-standing involvement with the renowned and highly innovative Charleston conference, David succeeded in developing an OA-centric 2004 conference theme. Like others in the open access movement, however, many of David’s best works are not the formal publications. When we are working in an arena like open access, timing is critical - policymakers are discussing the issues now, not next year; publishers are reviewing their procedures now, too. Rather than waiting for the formal publication process, with its academic rewards but inevitable delay, David often shares his knowledge, opinions, and even preliminary data with us right away.

source: Profile of David Goodman

Dutch biometric passports cracked, ID info stealable

Monday, January 30th, 2006

[blogs.sun.com]


Dutch biometric passports cracked, ID info stealable It’s as well that there is lively debate on this subject, from a technical standpoint as well as the more expected civil liberties folk… The Dutch passport, one of the first to include “biometric” information which can be read digitally in accordance with new internatiomal legislation, has been cracked from a range of 10 metres such that cleartext of the data stored therein is extracted. Kudos to the Riscure labs in Delft for publishing.


Links, etc, at Dave’s blog.



Tags:

[Comment Link for RSS]

source: Dutch biometric passports cracked, ID info stealable

Google Goof - Perils of Case-Sensitivity in Censorship…

Monday, January 30th, 2006

Edit/Synopsis:
A bug exists in the code that Google uses to enforce censorship of its
recently announced Chinese service; the trivial bug - that
search-strings with Capitalised Words generate uncensored output -
could provide a (presumably temporary) mechanism for Chinese nationals
to bypass their government censorship.


Simon Phipps just messaged me:
(window grabs added infix for illustration)



Try this:



http://images.google.cn/images?q=Tiananmen


Result Of A Capitalised Word Google Search In China


__tiananmen-capitalised-jump.html


and then


http://images.google.cn/images?q=tiananmen


__tiananmen-lowercase-jump.html


result of a lowercase word google search in china



…and so it looks like Google’s pro-China censorship keyword list is
in all-lowercase, and the text matching is case-sensitive.


Oopsie.


Enjoy the liberation whilst you can, citizens!



Tags:

[Comment Link for RSS]

source: Google Goof - Perils of Case-Sensitivity in Censorship…

Google Goof - Perils of Case-Sensitivity in Censorship…

Sunday, January 29th, 2006

Simon Phipps just messaged me:
(window grabs added infix for illustration)



Try this:



http://images.google.cn/images?q=Tiananmen


Result Of A Capitalised Word Google Search In China


__tiananmen-capitalised-jump.html


and then


http://images.google.cn/images?q=tiananmen


__tiananmen-lowercase-jump.html


result of a lowercase word google search in china



…and so it looks like Google’s pro-China censorship keyword list is
in all-lowercase, and the text matching is case-sensitive.


Oopsie.


Enjoy the liberation whilst you can, citizens!



Tags:

[Comment Link for RSS]

source: Google Goof - Perils of Case-Sensitivity in Censorship…

Lorcan Dempsey on institutional repositories

Sunday, January 29th, 2006

Lorcan Dempsey, Networkflows, Lorcan Dempsey’s blog, January 28, 2006. Excerpt:

As more of our working, learning and playing lives moves onto the network we need better workflow support. One can state one of the major challenges facing libraries in these terms. Historically, users have built their workflow around the services the library provides. As we move forward, the reverse will increasingly be the case. On the network, the library needs to build its services around its users’ work- and learn-flows (networkflows). This may provide one way of thinking about institutional repositories. I tend to think about institutional repositories as ways of automating particular processes in support of particular institutional goals. Now, one of the discussion points around insitutional repositories is about which goals they support: open access, curation of institutional intellectual assets, reputation management. And which processes? Over time, it is clear that what we now call institutional repositories will be part of wider research process support. What is currently the institutional repository will be a component of the workflow/curation/disclosure apparatus that develops to support research activities….

A couple of interesting recent indications of direction….I have mentioned before the impact of research assessment on local infrastructure, particularly in the UK and Australia where there is a need to record and report research outputs. Some support issues are discussed in an interesting White Paper by Les Carr and John MacColl. This is one output of the IRRA project, which has also released some software which extends eprints.org and Dspace to provide better workflow support for the Research Assessment Exercise.

source: Lorcan Dempsey on institutional repositories

OA to medical books

Sunday, January 29th, 2006

Dean Giustini, Open access to the digital medical atheneum - work in progress, UBC Google Scholar Blog, January 28, 2006. Excerpt:

Open access to high-quality, digitized versions of the most influential medical books in history is improving, all the time. The National Library of Medicine’s History of Medicine Division and the British Library have notable digitization projects worth exploring. NLM’s amazing historical collections examine various facets of medical history, and include Islamic manuscripts, searchable images and even the Vesalius
De humani corporis fabrica
. The NLM version of the anatomy classic includes audio commentary, and online magnifying and “page turning” technology. Google"s Book Search is typical of current digitization efforts - it’s very much a work in progress. The great medical texts of history - such as Harvey’s Circulation of the Blood - are not yet digitized but others mention Harvey’s landmark book or are translations. Text versions are available on Bartleby’s as are writings by Lister and even Pasteur. Try an advanced search on the Web for specific digital versions. Googling for medical texts in the digital atheneum is getting easier. But first, if you can, browse specific portals such as MLA’s and the AAHM. Two of Canada’s best collections in the history of medicine are located at the UBC Woodward Library and McGill"s Osler Library of the History of Medicine. Sir William Osler was a bibliofile and gave a collection of 8,000 medical books to McGill. It will take time to view Osler’s complete collection online. Digitization is hard on books, and some texts will likely never be digitized. At present, however, search for static images using Google"s Image search, view progress at the Gutenberg project and the Internet Archive. For a good starting point, browse sites selected by McGill’s librarians and search for history papers on PubMed, the IndexCat or Google scholar.

Update. Klaus Graf points out by email that the Bibliothèque interuniversitaire de médecine (BIUM) lists over 400 OA medical books, mostly in French, and over 3,000 OA medical texts of all kinds, in many languages.

source: OA to medical books

Comparing ROAR and OpenDOAR

Sunday, January 29th, 2006

Stevan Harnad, ROAR to DOAR, Open Access Archivangelism, January 29, 2006. Comparing the strengths and weaknesses of the Registry of Open Access Repositories (ROAR) and the Directory of Open Access Repositories (OpenDOAR). Excerpt:

DOAR is mostly just re-doing, funded, what Tim [Brody] had already done, unfunded (with ROAR). DOAR so far covers about 3/5 of the archives in ROAR and 1/2 the number in OAIster, and does not yet measure or provide a way to display the time-course of their growth in contents or number, as ROAR does. (DOAR will need Tim’s Celestial to do that.)

However, DOAR does provide an OAI Base URL in what looks (to my eyes: DOAR does not yet give tallies) to be a much larger proportion of archives than ROAR does (c. 80%), and this is presumably because DOAR has directly contacted each archive individually for which the OAI Base URL was missing.

(This is…perhaps too much to expect from an unfunded doctoral student, primarily working on his thesis! The solution of course is for archives to expose their own OAI Base URLs for harvesters to pick up automatically, and this will of course be the ultimate outcome. For now, there is no Registry that all archives use or aspire to be covered by. If DOAR incorporates all of the useful features of ROAR (especially celestial), and adds value, it may succeed in becoming that Registry. So far, ROAR’s periodic calls to Archives to register have insufficient success. Most of ROAR’s new archives for the past year or more have been hand-imported by me and Tim! At least DOAR will be funded to do that thankless task, from now on!)

The second potentially useful feature of DOAR is that it seems to classify separately the different content types and (I think — I’m not sure) that DOAR has checked that those are all full-texts (rather than just bibiographic metadata: DOAR will have to make this more explicit in their documentation)….

Right now, the DOAR entry for an archive looks a lot like a library card catalogue entry for a journal or a book (perhaps by analogy with DOAJ) or even a collection. This does not quite make sense to me, since users do not consult or use individual online institutional archives as they do for individual books or journals or collections. For one thing, most of the archives will be university IRs. Most universities produce contents of all of the types listed, and in all of the subjects listed; and rarely will any user want all/only, say, articles on subject X from individual institution Y: They will instead use an OAI harvester and service-provider like OAIster or citebase or citeseer or even google scholar, that searches across all institutions on that subject, or even all subjects.

Update. Also see the discussion thread on Harnad’s comments in the AmSci OA Forum.

source: Comparing ROAR and OpenDOAR

DRM and handicap accessibility

Saturday, January 28th, 2006
The January issue of INDICARE is devoted to DRM and handicap accessibility. From Knud Böhle’s editorial introducing the issue:
Disabled persons, especially blind, partially sighted and other print disabled people have to rely on exemptions within copyright law allowing them to effectively use assistive technologies even in cases where the content is protected by TPMs. The three articles dealing with this subject make us aware of the troubles still existing, but also of the solutions at hand. When talking about this subject it is important to have in mind that blind and visually impaired people are consumers like you and me, and that improving accessibility is not only to the benefit of this group, but for all of us.

source: DRM and handicap accessibility

On the value added by journals

Saturday, January 28th, 2006

Carol Tenopir, The Value of the Container, TCMNet, January 27, 2006. Apparently forthcoming from Library Joural. (Thanks to William Walsh.) In order to focus on OA-related issues, I’m cutting a good discussion of the effect of the long tail on journals, as opposed to books. Worth reading. Excerpt:

Why all the fuss about electronic journals? That was the question raised by Michael Gorman, the outspoken president of the American Library Association (ALA), at a session on Future of Libraries at the recent Online Information Meeting in London. What we want is articles, said Gorman, calling the idea of putting them together in things called journals irrelevant. We dont need e-journals, said the controversial Gorman. Articles should be put together by our interests, not the editors. The real problem, according to Gorman, is that there is no viable economic model. Buying all articles (including those no one reads) is not sustainable. The comments got me thinking about the containers in which we package information. When the entire text is digitized and searchable through various search engines, traditional containers might not matter anymore. The concept of a journal may not matter now that we have article e-print servers and institutional repositories….I conduct surveys to find out how faculty, students, and others use scholarly information. All find relevant scholarly articles by searching databases, the web, and e-print servers, but a journal often delivers value greater than the sum of its article parts. For current awareness, readers in many disciplines browse through entire print or electronic issues of journals. They select journals they trust, based on past experience and such factors as the journals affiliation, prestige, and reputation with scholars in their discipline. An issue devoted to a special topic or a bundled collection of related articles in a journal can guide readers to related material that they might not read otherwise.

Even when searching for articles on a specific topic, readers often find the journal title indicates its scope and quality. In health-related fields the stamp of peer review plus journal prestige is vital.

Comment. There are two kinds of added value to keep distinct. One is the kind added by peer review, especially peer review by a known editorial board with its particular disciplinary orientation and standards. The second is the kind added by clustering articles together. OA is compatible with both, but some OA proposals would preserve the first without preserving the second.

source: On the value added by journals

Google explains its decision to work with Chinese censors

Saturday, January 28th, 2006
Andrew McLaughlin, Google’s senior policy lawyer, has blogged an explanation of Google’s decision to censor the Chinese version of its index. Excerpt:
We know that many people are upset about this decision, and frankly, we understand their point of view. This wasn’t an easy choice, but in the end, we believe the course of action we’ve chosen will prove to be the right one. Launching a Google domain that restricts information in any way isn’t a step we took lightly. For several years, we’ve debated whether entering the Chinese market at this point in history could be consistent with our mission and values. Our executives have spent a lot of time in recent months talking with many people, ranging from those who applaud the Chinese government for its embrace of a market economy and its lifting of 400 million people out of poverty to those who disagree with many of the Chinese government’s policies, but who wish the best for China and its people. We ultimately reached our decision by asking ourselves which course would most effectively further Google’s mission to organize the world’s information and make it universally useful and accessible. Or, put simply: how can we provide the greatest access to information to the greatest number of people? Filtering our search results clearly compromises our mission. Failing to offer Google search at all to a fifth of the world’s population, however, does so far more severely. Whether our critics agree with our decision or not, due to the severe quality problems faced by users trying to access Google.com from within China, this is precisely the choice we believe we faced. By launching Google.cn and making a major ongoing investment in people and infrastructure within China, we intend to change that. No, we’re not going to offer some Google products, such as Gmail or Blogger, on Google.cn until we’re comfortable that we can do so in a manner that respects our users’ interests in the privacy of their personal communications. And yes, Chinese regulations will require us to remove some sensitive information from our search results. When we do so, we’ll disclose this to users, just as we already do in those rare instances where we alter results in order to comply with local laws in France, Germany and the U.S. Obviously, the situation in China is far different than it is in those other countries; while China has made great strides in the past decades, it remains in many ways closed. We aren’t happy about what we had to do this week, and we hope that over time everyone in the world will come to enjoy full access to information. But how is that full access most likely to be achieved? We are convinced that the Internet, and its continued development through the efforts of companies like Google, will effectively contribute to openness and prosperity in the world. Our continued engagement with China is the best (perhaps only) way for Google to help bring the tremendous benefits of universal information access to all our users there. We’re in this for the long haul. In the years to come, we’ll be making significant and growing investments in China. Our launch of google.cn, though filtered, is a necessary first step toward achieving a productive presence in a rapidly changing country that will be one of the world’s most important and dynamic for decades to come. To some people, a hard compromise may not feel as satisfying as a withdrawal on principle, but we believe it’s the best way to work toward the results we all desire.

source: Google explains its decision to work with Chinese censors

OA and LOCKSS for Alaska state publications

Saturday, January 28th, 2006
Dan Cornwall has blogged some notes on how the State of Alaska provides OA to state publications and preserves them with LOCKSS.

source: OA and LOCKSS for Alaska state publications