[Papyrus-L] ISI Import
Neil S. Harris
neil.harris at biology.ualberta.ca
Mon Mar 26 13:48:44 EST 2001
I've attached the modified Web of Science import filter that Rene mentions
below. The filter is somewhat different from Dave's but it gives good
results. See WOS_NEW.TXT (attached) for more details.
Also, when saving search results in Web of Science, you should choose "Save
to file" rather than "Export....". The export option imports records
directly into a ProCite or Reference Manager database. "Save to file" saves
the results in a text file.
----- Original Message -----
From: "Hessling, Rene" <Hessling at biologie.uni-osnabrueck.de>
To: <papyrus-l at rsd.com>
Sent: Monday, 26 March, 2001 1:54 AM
Subject: Re: [Papyrus-L] ISI Import
Dave,
Sorry but I can't get the import to work with your filter! I've run through
the steps you mentioned: I've got version 7.0.16c and the latest import.exe,
I extracted the import.flb into my papyrus folder, I copied the WEB SCI 2000
format.... and still I can't import a single entry! In ISI I create a text
file ('export to reference software') of marked articles. I then rename the
*.cgi document to a *.txt (prior to saving). Do you have any idea what I
might be doing wrong?
In the meantime, I received a format from Neil Harris, which works fine.
There's no hassle with the BP and EP entry either. Since he was so kind as
to send his file to me directly, I'll leave it to him to distribute it to
the group or to RSD. Dave, perhaps you could take a look at it for
comparison. It looks quite different!
Rene Hessling
> -----Original Message-----
> From: Dave Goldman, Research Software Design [mailto:dave at rsd.com]
> Sent: Friday, 23 March 2001 18:56
> To: papyrus-l at rsd.com
> Subject: Re: [Papyrus-L] ISI Import
>
> Hanne N. Waltenburg wrote:
>
> >I have found a way to import from Web of Science. I use the format in
> >the attached format library (zipped) - it is probably based on the
> >original from RSD, but I have made some adjustments.
> >
> >I have chosen to add keywords and abstract to the file, and then I save
> >the records to a txt file.
> >
> >However, this text file needs to be edited slightly, so I open it in
> >Word:
> >Change page numbers:
> >BP 307
> >EP 310
> >is changed to
> >BP 307-310
> >And then the file is saved as "MS-DOS Text" (NOT: Text only)
> >
> >After this the import runs smoothly.
>
>
> As of Papyrus Version 7.0.16c there is no longer a need for the page number
> editing. Here are our official instructions for importing from the Web of
> Science:
>
> -----------------------------------------------------------------------------
> Use the format provided in our IMPORT.FLB format library named "WEB SCI
> 2000". This format was last updated in November 2000.
>
> Caveat #1: Make sure that you have a recent copy of IMPORT.FLB. You can
> download the latest edition from <http://www.rsd.com/Formats7.html>.
>
> Caveat #2: Make sure that you are using Version 7.0.16c of Papyrus. If
> necessary, you can download an update patch from
> <http://www.rsd.com/Patches7.html>.
>
> Caveat #3: When you run the import, stick to the suggested Fussiness Level
> of "Tolerant".
> -----------------------------------------------------------------------------
>
> The WEB SCI 2000 format does import both keywords and abstracts.
>
>
> -- Dave Goldman (dave at rsd.com)       Research Software Design
>    503/796-1368, fax 503-452-8920      617 SW Hume Street
>    The PAPYRUS Bibliography System     Portland OR 97219-4458 (U.S.A.)
>
>    Technical Support: support at rsd.com    Other Questions: info at rsd.com
>    WWW Site: http://www.rsd.com/
>
>
>
> _______________________________________________
> Papyrus-L mailing list
> Papyrus-L at rsd.com
> http://www.pairlist.net/mailman/listinfo/papyrus-l
>
-------------- next part --------------
Format-codes for Articles:

 1  Authors                 15  Title
 2  Year                    16  Journal Name
 3  Also Print              17  Journal Abbrev
 4  Accession Number        18  Journal Abbrev, without periods
 5  Location                19  Journal Series
 6  Affiliation/Address     20  Volume
 7  Email                   21  Issue
 8  Field B                 22  Supplement
 9  Field C                 23  Day & Month
10  Abstract                24  Issue Title
11  Comments                25  Editors of Issue
12  Keywords (All)          26  Ed./Eds. (for Issue Editors)
13  Major Keywords          27  From page#
14  Minor Keywords          28  From page#-Thru page#
                            29  pp. From page#-Thru page#
Dave's Web of Science format for articles:
[<<0|>>PT J|AU1|TI15[|SO16][|C0][|DE12][|DT0][|ID12][|AB10][|RP0][|C{1[6]][|EM6]|BP28|EP28[<<|0>>|JI17]<<|0>>|PY2[|PD23][|VL20][[|IS][|PN]21[-22]][|SU0][|SI24][<<|0>>|WP11[/]]][PT S|AU1|TI15|SO24[|C0][|DE12][|ID12][|AB10][|RP0][|C{1[6]][|EM6]|BP28|EP28<<|0>>|SE16|PY2[|PD23][|VL20][[|IS][|PN]21[-22]]]<<|0>>
Modified Web of Science format for articles:
[FN0|][VR0|]PT0[|AU1]|TI15[|SO16][|LA0][|DT0][|NR0][|SN0][|PU0][|C{10][|DE12][|ID12][|AB10][|CR0][|TC0]|BP28|EP28[|PG0][|JI17][|PY2][|PD23][|VL20][|IS21][|PN0][|GA0][|PI0][|WP7][|RP6][|J{90][|PA0][|UT0]|ER0
Below are the export tags given in the ISI online help. The modified import format (for articles) includes all of these tags (most in [] since they may not be in every record). I've found that this approach gives good results (a high success rate) and is easy to modify, since it is simple to follow. I routinely include (if available): Author keywords (DE), KeyWords Plus (ID), Abstract (AB), and Reprint address (RP). I also import Publisher web address (WP), the corresponding author's e-mail address, into Custom Field A (renamed 'Email'), although this is rarely included in the Web of Science database (CC Search via WebSPIRS is much better in this regard). Dave's import format is more 'intelligent' (flexible), but in my experience it fails more often and is slower (it needs to make more decisions).
Since the vast majority of what I import is covered by the article format, I haven't modified Dave's Chapter import format (below). The chapter format as it stands also regularly fails with publication types other than 'journal' (e.g. book in series). ISI is somewhat to blame here; their parsing of data into the fields for these other publication types is often inconsistent.
Dave's Web of Science format for Chapters:
<<0|>>PT S|AU1|TI15[|SO24][|C0][|DE12][|ID12][|AB10][|RP0][|C{1[6]][|EM6]|BP31|EP31[<<|0>>|SO17]<<|J0>>[|SE22][|BS22]|PY2[|PD26][|VL19][|PN26][|IS26][|PN26][|SU0]|GA0|PU27|PI28|PA0[|WP11[/]]<<|0>>
Export Tags
===========
FN File type
VR File format version number
PT Publication type (e.g., book, journal, book in series)
AU Author(s)
TI Article title
SO Full source title
LA Language
DT Document type
NR Cited reference count
SN ISSN
PU Publisher
C1 Research addresses
DE Author keywords
ID KeyWords Plus
AB Abstract
CR Cited references
TC Times cited
BP Beginning page
EP Ending page
PG Page count
JI ISO source title abbreviation
SE Book series title
BS Book series subtitle
PY Publication year
PD Publication date
VL Volume
IS Issue
PN Part number
SU Supplement
SI Special issue
GA ISI document delivery number
PI Publisher city
WP Publisher web address
RP Reprint address
CP Cited patent
J9 29-character source title abbreviation
PA Publisher address
UT ISI unique article identifier
ER End of record
Each export tag identifies a data element. Tags are not included unless the data elements they identify are present in the record.
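As a rough illustration of how these tags structure a "Save to file" text file, here is a minimal Python sketch that splits such a file into records and builds the combined page range (BP 307 / EP 310 becoming 307-310) that used to be edited by hand. It assumes, and this is not stated in the ISI help text above, that each field starts with a two-character tag followed by a space, that continuation lines are indented, and that a bare ER line closes a record. The sample record is invented, and parse_isi and page_range are hypothetical helper names; none of this is part of Papyrus or the ISI software.

```python
import re

# Assumption: a field line is a two-character tag, one space, then the value.
TAG_LINE = re.compile(r"^([A-Z][A-Z0-9]) (.*)$")


def parse_isi(text):
    """Split a tagged export into a list of {tag: [values]} dicts."""
    records, current, last_tag = [], {}, None
    for raw in text.splitlines():
        line = raw.rstrip()
        if not line:
            continue
        if line == "ER":  # End of record
            records.append(current)
            current, last_tag = {}, None
            continue
        match = TAG_LINE.match(line)
        if match:
            last_tag = match.group(1)
            current.setdefault(last_tag, []).append(match.group(2).strip())
        elif last_tag is not None and raw[:1].isspace():
            # Assumption: indented lines continue the previous tag.
            current[last_tag].append(line.strip())
    if current:  # tolerate a final record with no ER line
        records.append(current)
    return records


def page_range(record):
    """Combine BP and EP into 'from-thru'; fall back to whichever exists."""
    bp = record.get("BP", [""])[0]
    ep = record.get("EP", [""])[0]
    if bp and ep:
        return f"{bp}-{ep}"
    return bp or ep


# Invented sample record, for illustration only.
SAMPLE = """\
PT J
AU Smith, J
   Jones, K
TI An example title
BP 307
EP 310
PY 2001
ER
"""

records = parse_isi(SAMPLE)
print(records[0]["AU"])
print(page_range(records[0]))
```

Since tags are only present when their data exist, the sketch looks fields up with a default rather than assuming every record carries every tag.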
-------------- next part --------------
A non-text attachment was scrubbed...
Name: WOS_NEW.FLB
Type: application/octet-stream
Size: 9728 bytes
Desc: not available
Url : http://five.pairlist.net/pipermail/papyrus-l/attachments/20010326/417b7e5d/WOS_NEW.obj