[Papyrus-L] ISI Import

Mon Mar 26 13:48:44 EST 2001

I've attached the modified Web of Science import filter that Rene mentions
below. The filter is somewhat different from Dave's but it gives good
results. See WOS_NEW.TXT (attached) for more details.

Also, when saving search results in Web of Science, you should choose "Save
to file" rather than "Export....". The export option imports records
directly into a ProCite or Reference Manager database. "Save to file" saves
the results in a text file.

----- Original Message -----
From: "Hessling, Rene" <Hessling at biologie.uni-osnabrueck.de>
To: <papyrus-l at rsd.com>
Sent: Monday, 26 March, 2001 1:54 AM
Subject: AW: [Papyrus-L] ISI Import

Dave,

Sorry but I can't get the import to work with your filter! I've run through
the steps you mentioned: I've got version 7.0.16c and the latest import.exe,
I extracted the import.flb into my papyrus folder, I copied the WEB SCI 2000
format.... and still I can't import a single entry! In ISI I create a text
file ('export to reference software') of marked articles. I then rename the
*.cgi document to a *.txt (prior to saving). Do you have any idea what I
might be doing wrong?

In the meantime, I received a format from Neil Harris, which works fine.
There's no hassel with the BP and EP entry either. Since he was so kind as
to send his file to me directly, I'll leave it to him to distribute it to
the group or to RSD. Dave, perhaps you could take a look at it for
comparision. It looks quite different!

Rene Hessling

> -----Ursprüngliche Nachricht-----
> Von: Dave Goldman, Research Software Design [mailto:dave at rsd.com]
> Gesendet am: Freitag, 23. März 2001 18:56
> An: papyrus-l at rsd.com
> Betreff: Re: [Papyrus-L] ISI Import
>
> Hanne N. Waltenburg wrote:
>
> >I have found a way to import from Web of Science. I use the format in
> >the attached format library (zipped) - it is probably based on the
> >original from RSD, but I have made some adjustment.
> >
> >I have chosen to add keywords and abstract to the file, and
> then I save
> >the records to a txt file.
> >
> >However, this text file needs to be edited slightly, so I open it in
> >Word:
> >Change page numbers:
> >BP 307
> >EP 310
> >is changed to
> >BP 307-310
> >And then the file is saved as "MS-DOS Text" (NOT: Text only)
> >
> >After this the import runs smoothly.
>
>
> As of Papyrus Version 7.0.16c there is no longer a need for
> the page number
> editing. Here are our official instructions for importing
> from the Web of
> Science:
>
> --------------------------------------------------------------
> ---------------
> Use the format provided in our IMPORT.FLB format library
> named "WEB SCI
> 2000". This format was last updated in November 2000.
>
> Caveat #1: Make sure that you have a recent copy of
> IMPORT.FLB. You can
> download the latest edition from <http://www.rsd.com/Formats7.html>.
>
> Caveat #2: Make sure that you are using Version 7.0.16c of Papyrus. If
> necessary, you can download an update patch from
> <http://www.rsd.com/Patches7.html>.
>
> Caveat #3: When you run the import, stick to the suggested
> Fussiness Level
> of "Tolerant".
> --------------------------------------------------------------
> ---------------
>
> The WEB SCI 2000 format does import both keywords and abstracts.
>
>
> -- Dave Goldman (dave at rsd.com)              Research Software Design
>    503/796-1368, fax 503-452-8920           617 SW Hume Street
>    The PAPYRUS Bibliography System          Portland OR
> 97219-4458 (U.S.A.)
>
>    Technical Support: support at rsd.com       Other Questions:
> info at rsd.com
>                        WWW Site: http://www.rsd.com/
>
>
>
> _______________________________________________
> Papyrus-L mailing list
> Papyrus-L at rsd.com
> http://www.pairlist.net/mailman/listinfo/papyrus-l
>

_______________________________________________
Papyrus-L mailing list
Papyrus-L at rsd.com
http://www.pairlist.net/mailman/listinfo/papyrus-l
-------------- next part --------------
Format-codes for Articles:

          1  Authors                   15  Title
          2  Year                      16  Journal Name
          3  Also Print                17  Journal Abbrev
          4  Accession Number          18  Journal Abbrev, without periods
          5  Location                  19  Journal Series
          6  Affiliation/Address       20  Volume
          7  Email                     21  Issue
          8  Field B                   22  Supplement
          9  Field C                   23  Day & Month
         10  Abstract                  24  Issue Title
         11  Comments                  25  Editors of Issue
         12  Keywords (All)            26  Ed./Eds. (for Issue Editors)
         13  Major Keywords            27  From page#
         14  Minor Keywords            28  From page#-Thru page#
                                       29  pp. From page#-Thru page#

Dave's Web of Science format for articles:
[<<0|>>PT J|AU1|TI15[|SO16][|C0][|DE12][|DT0][|ID12][|AB10][|RP0][|C{1[6]][|EM6]|BP28|EP28[<<|0>>|JI17]<<|0>>|PY2[|PD23][|VL20][[|IS][|PN]21[-22]][|SU0][|SI24][<<|0>>|WP11[/]]][PT S|AU1|TI15|SO24[|C0][|DE12][|ID12][|AB10][|RP0][|C{1[6]][|EM6]|BP28|EP28<<|0>>|SE16|PY2[|PD23][|VL20][[|IS][|PN]21[-22]]]<<|0>>

Modified Web of Science format for articles:
[FN0|][VR0|]PT0[|AU1]|TI15[|SO16][|LA0][|DT0][|NR0][|SN0][|PU0][|C{10][|DE12][|ID12][|AB10][|CR0][|TC0]|BP28|EP28[|PG0][|JI17][|PY2][|PD23][|VL20][|IS21][|PN0][|GA0][|PI0][|WP7][|RP6][|J{90][|PA0][|UT0]|ER0

Below are the export tags given in the ISI online help. The modified import format (for articles) includes all of these tags (most in [] since they may  not be in every record). I've found this approach gives good results (high success rate), and is easy to modify (since it is simple to follow). I routinely include (if available): Author keywords (DE), KeyWords Plus (ID), Abstract (AB), and Reprint address (RP). I also import Publisher web address (WP), the corresponding authors e-mail address, into Custom Field A (renamed 'Email'), although this rarely included in the Web of Science database (CC Search via WebSPIRS is much better in this regard). Dave's import format is more 'intelligent' (flexible), but in my experience fails more often and is slower (it needs to make more decisions).

Since the vast majority of what I import is covered by the article format, I haven't modified Dave's Chapter import format (below). The chapter format as it stands also regularly fails with publication types other than 'journal' (e.g. book in series). ISI is somewhat to blame here; their parsing of data into the fields for these other publication types is often inconsistent.

Dave's Web of Science format for Chapters:
<<0|>>PT S|AU1|TI15[|SO24][|C0][|DE12][|ID12][|AB10][|RP0][|C{1[6]][|EM6]|BP31|EP31[<<|0>>|SO17]<<|J0>>[|SE22][|BS22]|PY2[|PD26][|VL19][|PN26][|IS26][|PN26][|SU0]|GA0|PU27|PI28|PA0[|WP11[/]]<<|0>>

Export Tags
===========
FN 		File type
VR 		File format version number
PT 		Publication type (e.g., book, journal, book in series)
AU 		Author(s)
TI 		Article title
SO 		Full source title
LA 		Language
DT 		Document type
NR 		Cited reference count
SN 		ISSN
PU 		Publisher
C1 		Research addresses
DE 		Author keywords
ID 		KeyWords Plus
AB 		Abstract
CR 		Cited references
TC 		Times cited
BP 		Beginning page
EP 		Ending page
PG 		Page count
JI 		ISO source title abbreviation
SE 		Book series title
BS 		Book series subtitle
PY 		Publication year
PD 		Publication date
VL 		Volume
IS 		Issue
PN 		Part number
SU 		Supplement
SI 		Special issue
GA 		ISI document delivery number
PI 		Publisher city
WP 		Publisher web address
RP 		Reprint address
CP 		Cited patent
J9 		29-character source title abbreviation
PA 		Publisher address
UT 		ISI unique article identifier
ER 		End of record

Each export tag identifies a data element. Tags are not included unless the data elements they identify are present in the record.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: WOS_NEW.FLB
Type: application/octet-stream
Size: 9728 bytes
Desc: not available
Url : http://five.pairlist.net/pipermail/papyrus-l/attachments/20010326/417b7e5d/WOS_NEW.obj