QA catalogue for analysing library data

British Library     last data update: 2020-11-26     number of records: 18,787,911

Serials analysis

These scores are calculated for each continuing resource, that is, each record whose type of record (LDR/6) is language material ('a') and whose bibliographic level (LDR/7) is serial component part ('b'), integrating resource ('i') or serial ('s').
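The selection rule above can be sketched in a few lines of Python. This is only an illustration, not the catalogue's own code: the function name `is_continuing_resource` is hypothetical, and the leader is assumed to be available as a plain 24-character string in which LDR/6 and LDR/7 are the characters at zero-based indexes 6 and 7.

```python
# Bibliographic levels (LDR/7) that count as continuing resources:
# serial component part, integrating resource, serial.
CONTINUING_LEVELS = {"b", "i", "s"}


def is_continuing_resource(leader: str) -> bool:
    """Return True if the leader describes language material (LDR/6 = 'a')
    with a continuing-resource bibliographic level (LDR/7 in b, i, s)."""
    if len(leader) < 8:
        # Too short to carry LDR/6 and LDR/7; treat as not selected.
        return False
    return leader[6] == "a" and leader[7] in CONTINUING_LEVELS


# Example leaders (made up for illustration): a serial and a monograph.
serial_leader = "00000nas a2200000 a 4500"
monograph_leader = "00000nam a2200000 a 4500"
selected = [ldr for ldr in (serial_leader, monograph_leader)
            if is_continuing_resource(ldr)]
```

Only the first leader passes the filter, because its LDR/7 is 's' while the second has 'm' (monograph).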

The calculation is based on a slightly modified version of the method published by Jamie Carlstone in the following paper:

Jamie Carlstone (2017) Scoring the Quality of E-Serials MARC Records Using Java, Serials Review, 43:3-4, pp. 271-277, DOI: 10.1080/00987913.2017.1350525 URL: https://www.tandfonline.com/doi/full/10.1080/00987913.2017.1350525

histogram

  • y: number of records
  • x: number of authority names in one record
Each record having ... gets a score based on a number of criteria. Each criterion results in a positive or negative score. The final score is the sum of these criterion scores.
criterion | score
date1 (008/07-10) is unknown ('uuuu') | -3
place of publication (008/15-17) is unknown (~ 'xx.+') | -1
publication language (008/35-37) is unknown ('xxx') | -1
has authentication code (042$a) | 7
encoding level (LDR/17) is Full level (' ') or Full level, material not examined ('1') or Full level input by OCLC participants ('I') | 7
encoding level (LDR/17) is Added from a batch process ('M'), 'L', or Minimal level input by OCLC participants ('K'), or Minimal level ('7') | 1
has 006 field | 1
has publisher (260) | 1
has production, publication, distribution (264) | 1
has publication frequency (310) | 1
has content type (336) | 1
has dates of publication (362) | 1
has source of description (588) | 1
has no subject headings | -5
for each subject heading | 1
authentication code (042$a) = 'pcc' | 100
date1 begins with '0' | -100
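As a rough illustration, the criteria listed above can be expressed as a small scoring function. This is a minimal sketch under stated assumptions, not the catalogue's actual implementation (which, like Carlstone's original, is written in Java). The record representation (leader string, 40-character 008 string, set of field tags, list of 042$a values, subject heading count), the names `score_serial` and `make_008`, and the component keys are all hypothetical. The sketch assumes the final score is the plain sum of all component scores, reads Date 1 from 008 positions 07 to 10, and uses the authentication code 'pcc'. Which fields count as subject headings is left to the caller.

```python
import re

FULL_LEVELS = {" ", "1", "I"}           # LDR/17 values scored as full level
MINIMAL_LEVELS = {"M", "L", "K", "7"}   # LDR/17 values scored as M, L, K or 7
PRESENCE_TAGS = ("006", "260", "264", "310", "336", "362", "588")


def score_serial(leader, field_008, tags, auth_codes, subject_count):
    """Return (component scores, total) for one continuing resource.

    All parameters are assumptions of this sketch: `tags` is the set of
    field tags present, `auth_codes` the list of 042$a values, and
    `subject_count` the number of subject headings.
    """
    date1 = field_008[7:11]      # 008/07-10
    place = field_008[15:18]     # 008/15-17
    language = field_008[35:38]  # 008/35-37
    level = leader[17]           # LDR/17

    scores = {
        "date1_unknown": -3 if date1 == "uuuu" else 0,
        # Pattern copied literally from the criteria list.
        "place_unknown": -1 if re.match(r"xx.+", place) else 0,
        "language_unknown": -1 if language == "xxx" else 0,
        "has_042": 7 if auth_codes else 0,
        "encoding_full": 7 if level in FULL_LEVELS else 0,
        "encoding_mlk7": 1 if level in MINIMAL_LEVELS else 0,
        "no_subject": -5 if subject_count == 0 else 0,
        "subjects": subject_count,  # one point per subject heading
        "is_pcc": 100 if "pcc" in auth_codes else 0,
        "date1_starts_with_0": -100 if date1.startswith("0") else 0,
    }
    for tag in PRESENCE_TAGS:
        scores["has_" + tag] = 1 if tag in tags else 0

    return scores, sum(scores.values())


def make_008(date1, place, language):
    """Hypothetical helper: build a 40-character 008 for the example."""
    return ("200101" + "c" + date1 + "9999" + place
            + " " * 17 + language + " " + "d")


# Made-up example: full-level serial with 260, 310, 362, a 'pcc' code
# and two subject headings.
components, total = score_serial(
    "00000nas a2200000 a 4500",
    make_008("1999", "enk", "eng"),
    {"260", "310", "362"},
    ["pcc"],
    2,
)
# total is 7 (042) + 7 (full level) + 3 (fields) + 2 (subjects) + 100 (pcc) = 119
```

Returning the per-criterion scores alongside the total keeps each component available for its own histogram.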

components

The histograms of the individual components:

2. Date 1 is unknown

3. Country of publication is unknown

4. Publication language is unknown

5. Authentication code is empty

6. Encoding level is full

7. Encoding level is M, L, K or 7

8. 006 is present

9. Publisher 260 (AACR2) is present

10. Publisher 264 (RDA) is present

11. Publication frequency is present

12. Content Type (336) is present

13. Dates of Publication (362) is present

14. Source of Description Note (588) is present

15. No subject is present

16. Subject is present

17. Authentication Code is pcc

18. First date (008/07) starts with 0

19. Encoding level is abbreviated
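A per-component histogram of this kind can be derived by counting, for each component, how many records received each score value. The sketch below is an assumption about how such counts might be produced, not a description of the catalogue's own pipeline. The function name `component_histograms` and the component keys in the example are hypothetical.

```python
from collections import Counter, defaultdict


def component_histograms(records):
    """Map each component name to a Counter of {score value: record count}.

    `records` is assumed to be an iterable of dicts, one per record,
    mapping a component name to the score that record received.
    """
    histograms = defaultdict(Counter)
    for scores in records:
        for component, value in scores.items():
            histograms[component][value] += 1
    return histograms


# Made-up scores for three records.
example = [
    {"has_006": 1, "subjects": 3},
    {"has_006": 0, "subjects": 3},
    {"has_006": 1, "subjects": 0},
]
hist = component_histograms(example)
# hist["has_006"] counts two records with score 1 and one with score 0.
```

Each Counter gives the x values (scores) and y values (number of records) for one component's histogram.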