Re: Coverage loads - quality of data

Re: Coverage loads - quality of data Sitko, Michelle 18 Oct 2008 19:47 UTC
Peter astutely points out, "inaccurate metadata leads to incomplete
searches, which leads to undiscovered content, which leads to unrenewed
subscriptions, which leads to lost revenue."  This argument alone should
significantly increase publisher and content provider interest in improving
data loads.  In addition, researchers will undoubtedly appreciate being able
to reveal and identify needed articles in older serial titles (i.e. from
additional missed title changes/mergers/splits).  This is especially
important as an increasing number of older issues/titles are being added to
e-content and need to remain readily accessible in this new format.

Lessons learned after decades worth of institutions contributing high
quality records according rule interpretations set to specific standards
into major bibliographic database utilities obviously needs to be further
tapped into. I have a great deal of appreciation for the folks who work with
and understand the nuances that are inherent and necessary in the building
of effective metadata access. A good deal of gratitude needs to go out to
those who are willing to give so generously of their time and expertise
through participation in KBART's efforts to improve the data exchange among
and between the members of the digital resource supply chain.  And, a
special thanks goes to the team at Serials Solutions for not only
recognizing this shortfall well before many others took serious note, but
for also staying the course.  Hopefully, the 2009 report will bring some
welcome news.

****** *** ***** *** ***** *** ***** ***
Michelle Sitko
Associate Professor
Coordinator of Collection Management Services/Head, Serials Department
Learning Resources Center
Marywood University Library
Email: sitko@marywood.edu

-----Original Message-----
From: SERIALST: Serials in Libraries Discussion Forum
[mailto:SERIALST@list.uvm.edu]On Behalf Of McCracken, Peter
Sent: Wednesday, October 15, 2008 1:32 PM
To: SERIALST@LIST.UVM.EDU
Subject: Re: [SERIALST] Coverage loads - quality of data

I'd like to chime in on this topic on two levels: first, as co-chair of the
KBART group mentioned below, and second, as a co-founder of Serials
Solutions.

First, I feel strongly that KBART will be able to create a positive impact
on the transfer of data among and between members of the e-resource supply
chain. We have a great group of individuals from all parts of the supply
chain who have been putting a lot of work into the project so far. The
working group first met in March, and given that we only meet monthly, much
has been done so far. We anticipate releasing our report in early 2009.

Anyone can follow our progress at NISO's KBART page
(http://www.niso.org/workrooms/kbart) or by joining the monitoring group at
http://www.niso.org/lists/kbart_interest/.

We *are* focusing on the issue of inaccurate data being delivered by content
providers, and I think it is obviously a very important one. Charlie Rapple
and I, the two KBART co-chairs, have given presentations about KBART
everywhere we can, including through a number of conferences and meetings
aimed directly at publishers. Several members of the KBART team, chaired by
Adam Chandler (Cornell Univ), will be presenting about KBART at the
Charleston Conference next week.

We see an important role in educating publishers about the metadata they
deliver. In a nutshell, inaccurate metadata leads to incomplete searches,
which leads to undiscovered content, which leads to unrenewed subscriptions,
which leads to lost revenue. Hopefully, that argument will carry some
weight.

In addition, if content providers are able to significantly enhance the
quality of the data they deliver, then it may be that libraries will be able
to feel confident in using files directly from the content providers, rather
than passing them through ERAMS vendors like Serials Solutions.

So to go to my second point, I'd like to thank Judith Stokes for her
comments about Serials Solutions -- they were welcomed heartily in the
office, by the way -- but make one point that I've made in other places
before.

I feel that you should, in fact, blame Serials Solutions when you find
inaccurate data, even if we didn't introduce it. We're never going to have
perfect data, but you can bet we're going to try! Libraries pay us to manage
this data, and we see it as a critical part of what we do. We realized long
ago that if we didn't have a way of correcting the data we get from content
providers, we'd never be able to deliver correct data to our customers. So
we built a "rules management module," in which our catalogers and
knowledgebase editors research and then write rules to correct inaccurate
data that comes into our system. (All of these editors are in our Seattle
office, and, thanks to the nearby UW iSchool, all have MLSs, are earning
them, or have significant experience working in libraries before joining
Serials Solutions.) We currently have many thousands of rules written to
correct errors coming in from hundreds of databases.

We very much welcome corrections from clients, as Judith says. We can't
check everything (in fact, there's a lot we can't check at all, since we
don't have access to the database), and we rely on clients to help us
discover these errors. And what I love most is that when a librarian reports
an error to us, we correct it not just for that person's institution and
users, but for *everyone* who has access to that database.

We are willing to take the blame, even if we didn't introduce the error,
because we see our role as improving the way libraries connect their patrons
with their content. We can best do that through improving the data we
manage. So when aggregators report incorrect data, please do tell us,
because we *can* fix it, we *can* tell you what you can actually access in
those databases, and we can help you get patrons to resources that it seems
aggregators don't even know they're offering.

Thanks,

Peter McCracken

-----Original Message-----
From: SERIALST: Serials in Libraries Discussion Forum
[mailto:SERIALST@list.uvm.edu] On Behalf Of Ercelawn, Ann
Sent: Tuesday, October 14, 2008 4:22 PM
To: SERIALST@LIST.UVM.EDU
Subject: Re: [SERIALST] Coverage loads - quality of data

The KBART Group is working on this problem
(http://www.niso.org/workrooms/kbart).
But publishers need to hear from librarians that the quality of data at
publisher web sites matters.

Ann

-----Original Message-----
From: SERIALST: Serials in Libraries Discussion Forum
[mailto:SERIALST@LIST.UVM.EDU] On Behalf Of Chad Hutchens
Sent: Tuesday, October 14, 2008 2:20 PM
To: SERIALST@LIST.UVM.EDU
Subject: Re: [SERIALST] Coverage loads - quality of data

I agree...this is a big problem in a lot of ways.  There's a report out
that
was done in the UK in 2007 I believe.  It's a very interesting read that
describes what is going on with this entire problem.

A very good read for those interested:

http://www.uksg.org/resolvers

--
Chad Hutchens
Electronic Resources Librarian
University of Wyoming Libraries
Dept 3334, 1000 E University Ave.
Laramie, WY 82071-20000
Ph: (307) 766-5560

> From: Lucy Wrightington <lxw08@HEALTH.STATE.NY.US>
> Reply-To: "SERIALST: Serials in Libraries Discussion Forum"
> <SERIALST@LIST.UVM.EDU>
> Date: Tue, 14 Oct 2008 09:51:09 -0400
> To: "SERIALST: Serials in Libraries Discussion Forum"
<SERIALST@LIST.UVM.EDU>
> Subject: Re: [SERIALST] Coverage loads - quality of data
>
> A huge and growing problem that no one so far seems able/willing to
tackle.
> I report these to Ebsco A-to-Z all the time, but they are dependent on
the
> publisher loads.
> Former titles are getting lost as they don't show up in the databases
at
> all.
> Many publishers are guilty of this.
> Who's doing it right? Science Direct and PubMed Central to name a
couple.
> They should be the accepted model.
> Any ideas would be welcome on how the library community can bring
pressure
> to fix this.
>
> Lucy Wrightington, Senior Librarian
> Dickerman Library
> Wadsworth Center, N.Y. State Dept. of Health
> Empire State Plaza, Albany, NY 12201
>
>
>
>
>
>              "Stokes, Judith"
>              <JStokes@RIC.EDU>
>              Sent by:
To
>              "SERIALST:                SERIALST@LIST.UVM.EDU
>              Serials in
cc
>              Libraries
>              Discussion Forum"
Subject
>              <SERIALST@LIST.UV         Re: [SERIALST] Coverage loads -
>              M.EDU>                    quality of data
>
>
>              10/14/2008 09:39
>              AM
>
>
>              Please respond to
>                 "SERIALST:
>                 Serials in
>                  Libraries
>              Discussion Forum"
>              <SERIALST@LIST.UV
>                   M.EDU>
>
>
>
>
>
>
> When we find errors and report them to Serials Solutions they are
> cooperative -- enthusiastic, even, about getting it right. On the
other
> hand, if the data comes from an aggregator like Proquest which does
just
> what you reported -- lump all holdings under the current title and not
even
> cross ref from the old title -- it will just keep coming in wrong over
and
> over again. Getting the aggregators to change is a different story
> altogether. I've had no luck with that.
>
> Good luck,
> Judith Stokes
>
> Judith E. Stokes
> Serials/E-resources Librarian
> Rhode Island College
> 600 Mount Pleasant Avenue
> Providence, RI 02908-1991
> 401.456.8165
>
>
> -----Original Message-----
> From: SERIALST: Serials in Libraries Discussion Forum
> [mailto:SERIALST@list.uvm.edu] On Behalf Of Cahill, Helen
> Sent: Monday, October 13, 2008 8:08 PM
> To: SERIALST@LIST.UVM.EDU
> Subject: [SERIALST] Coverage loads - quality of data
>
> Hello all,
>
> I wonder if there is anybody out there who has assessed the quality of
data
> being offered by the coverage load vendors? I'm principally interested
in
> Serials Solutions, Ebsco A-Z, and III's CASE product, but would also
> welcome comments on any others.
>
> Here is an example from the coverage loads for ACM: "SIGART bulletin"
was
> published 1990-1998 with previous and later titles. There is (to my
> cataloguing mind) a problem over the coverage that is available from
SS,
> EAZ and CASE: they list the coverage for SIGART bulletin to be
1970-1998,
> and don't have any listing for the previous title. I've looked in a
few
> catalogues (randomly) and it seems to me that libraries are simply
> accepting that (wrong) coverage data. How do your patrons find the
online
> version of "SIGART newsletter"?
>
> Has that bothered anybody out there enough to have attempted to get
these
> vendors to properly match the coverage to the title runs? Or, are we
so
> seriously understaffed world-wide that we can't either do the checking
&
> correcting or pressure the vendors to produce accurate information?
Has
> anybody ever offered to clean up the data offered by these vendors to
> benefit all others?
>
> I'm feeling like this is going to develop into one of those Publisher
vs
> Vendor, IT vs Cataloguer debates, but I'm always mindful of what our
> library patrons want to see when they look on our OPACs.
>
> Thanks!
>
> Helen Cahill
> Cataloguer, Collection Services
> Massey University Library
> Private Bag 11054
> Palmerston North 4442
> NEW ZEALAND
>
> Ph: + 64 6 350 5799 ext 7876
> Fax: + 64 6 350 5692
> emai: H.Cahill@massey.ac.nz<mailto:H.Cahill@massey.ac.nz>
> http://library.massey.ac.nz
>
>
>
> IMPORTANT NOTICE:  This e-mail and any attachments may contain
confidential or
> sensitive information which is, or may be, legally privileged or
otherwise
> protected by law from further disclosure.  It is intended only for the
> addressee.  If you received this in error or from someone who was not
> authorized to send it to you, please do not distribute, copy or use it
or any
> attachments.  Please notify the sender immediately by reply e-mail and
delete
> this from your system. Thank you for your cooperation.