- 図書書誌レコードの重複同定方法：LC MARCと筑波大学附属図書館洋書所蔵ファイルを例として
- Automatic Identification of Duplicate Monographic Records in LC/MARC and University of Tsukuda Library Catalog File
- No.21, p.169-180
The purpose of this study is to develop an improved procedure for the automatic identification of duplicate monographic records in two catalog record files.
Following procedures are used.
(1) Selecting bibliographic elements for identification.
(2) Converting the form of these elements to the unified key form for matching.
(3) Matching these keys.
Test files used are LC/MARC (109, 430 records) and University of Tsukuba Library catalog file (127, 608 records). Author, title, publisher, and edition statement are chosen as identifiers and eighteen conversion methods are examined.
When author, title and publisher keys (not converted) are used, the match rate is very low. But, it is possible to bring the match rate up to 86.9%～96.5% by the conversion (delite delimitors, convert to capital letters etc.) Author key is low matching rate than other keys. By using the combination of these four converted keys, it is possible to identify nearly 80% of the all duplicated records in two files.