Contents of the archive
How to search
Record display pages

Contents of the archive

The Taos Archive contains every piece of information from the following types of record: authority, bibliographic, holdings, and item. Patron records, circulation transactions (those linking items with patrons) and fines/fees were not retained in the Archive, either for privacy reasons or because the data has been aggregated into the data warehouse.

Records for all three Taos databases (Ethnomusicology, Film/Television, and UCLA) are stored in the Archive. At present, all records are indexed together, so a search may retrieve records from any or all of the original databases.

Archive Contents

Record type                             Count
Authority records                   1,089,722
Bibliographic records               5,221,631
Holdings records                    7,171,735
Item records                        9,307,788
Bib keywords (selected fields)    112,470,637

How to search

Your results will vary depending on the index you use:

Index: Author word(s)
Results: Bibliographic records
Examples: mark twain; association medical
Notes: Fields indexed: 100, 110, 111, 130, 700, 710, 711, 730

Index: Call number
Results: Holdings records
Examples: KF135; NA2 .C34 1994
Notes: Fields indexed: 852 ($h + $i, $j). Spaces are ignored for searching, but all other characters are indexed. This is a left-anchored, right-truncated search.

Index: DBCN
Results: Authority, bibliographic & holdings records
Examples: 02-AAA-1111; 07-ABC-1234
Notes: Fields indexed: 001. OK to search with or without the hyphen.

Index: ISBN
Results: Bibliographic records
Examples: 0872929094
Notes: Fields indexed: 020. OK to search with or without the hyphen.

Index: ISSN
Results: Bibliographic records
Examples: 1080-210X
Notes: Fields indexed: 022. OK to search with or without the hyphen.

Index: Item barcode
Results: Holdings records
Examples: L0073620833
Notes: Fields indexed: 876 $p, 877 $p, 878 $p

Index: Keyword(s)
Results: Bibliographic records
Examples: twain tom sawyer; journal medical
Notes: Fields indexed: 001, 020, 022, 100, 110, 111, 130, 240, 242, 243, 245, 246, 247, 700, 710, 711, 730, 900

Index: OCLC number
Results: Bibliographic records
Examples: 40483669; 00999996
Notes: Fields indexed: 035. OCLC numbers are normalized to 8 digits; include leading zeros as needed.

Index: Orion1 number
Results: Bibliographic records
Examples: TC0035174
Notes: Fields indexed: 935

Index: Title word(s)
Results: Bibliographic records
Examples: federal reporter; mittens gloves
Notes: Fields indexed: 240, 242, 243, 245, 246, 247
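
The notes for the identifier and call number indexes can be illustrated with a short Python sketch. This is not the Archive's own code: the function names are hypothetical, and the sketch assumes that hyphens are simply dropped before comparison and that call number matching is case-sensitive, neither of which is stated above.

```python
def strip_hyphens(value: str) -> str:
    """Drop hyphens so '1080-210X' and '1080210X' compare equal.

    Assumption: the index treats hyphenated and unhyphenated forms
    as the same search term (DBCN, ISBN, ISSN).
    """
    return value.strip().replace("-", "")


def normalize_oclc(number: str) -> str:
    """Pad an OCLC number with leading zeros to 8 digits."""
    digits = number.strip()
    if not digits.isdigit():
        raise ValueError(f"not a numeric OCLC number: {number!r}")
    return digits.zfill(8)


def normalize_call_number(value: str) -> str:
    """Remove all whitespace; every other character is kept."""
    return "".join(value.split())


def call_number_matches(query: str, stored: str) -> bool:
    """Left-anchored, right-truncated match: the query must be a prefix
    of the stored call number once spaces are ignored in both.
    Assumption: the comparison is case-sensitive."""
    return normalize_call_number(stored).startswith(normalize_call_number(query))


if __name__ == "__main__":
    print(normalize_oclc("999996"))
    print(strip_hyphens("02-AAA-1111"))
    print(call_number_matches("NA2.C34", "NA2 .C34 1994"))
```

Under these assumptions, a query for NA2 or NA2.C34 would match the stored call number NA2 .C34 1994, while a query for C34 would not, because the match is anchored at the left.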

All bibliographic searches are run against a single keyword index, which includes all subfields except $6 from all fields specified above. Text was normalized by keeping English letters (A-Z) and numerals (0-9), stripping apostrophes (e.g., don't became dont), and replacing all other punctuation with spaces (e.g., 1,000 became 1 000). All diacritics were removed (e.g., für became fur); this works well for most Western European languages, but with varying success for others.
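
The normalization rules above can be approximated in a few lines of Python, which may help when predicting how a search term will be matched. This is a minimal sketch and not the routine used to build the index: the function name is hypothetical, diacritics are removed by Unicode decomposition (one possible method), and lower-casing and whitespace collapsing are assumptions that the description above does not mention.

```python
import re
import unicodedata


def normalize_keyword_text(text: str) -> str:
    """Approximate the keyword normalization described above."""
    # Decompose accented letters, then drop the combining marks (für -> fur).
    decomposed = unicodedata.normalize("NFD", text)
    unaccented = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Strip apostrophes outright (don't -> dont); the curly form is included
    # as an assumption.
    no_apostrophes = unaccented.replace("'", "").replace("\u2019", "")
    # Replace anything that is not an English letter, digit, or whitespace
    # with a space (1,000 -> 1 000).
    spaced = re.sub(r"[^A-Za-z0-9\s]", " ", no_apostrophes)
    # Assumption: collapse runs of whitespace and fold case.
    return " ".join(spaced.split()).lower()


if __name__ == "__main__":
    for sample in ["don't", "1,000", "für", "Tom Sawyer's Adventures"]:
        print(sample, "->", normalize_keyword_text(sample))
```

In this sketch, letters that have no decomposed form (for example ø) are not English letters after the first step, so they are replaced by a space; this is one way the "varying success" outside Western European languages can show up.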

Use care when doing keyword searches: while the database server's performance is excellent, if your search retrieves thousands of records, it can take a long time to return all of the results to your browser. Formulate your searches to be as specific as possible.

Record display pages

Most of the record display pages are similar to their Taos counterparts, and need only minimal explanation:

Taos MARC records were not stored in Unicode (although the Cataloging client used Unicode for editing). The Archive contains the records as they were stored within Taos, encoded in MARC-8, but displays the records in Unicode for convenience. Known problems with the Unicode display:

Since the Archive preserves data just as it was extracted from Taos, display oddities are almost certainly caused by data errors. However, if you need to be certain of the raw content of an archived record, or if an error occurs while using the Archive, please email LIT or call x5-7557.

Last updated: 22 Feb 2005