A vision for making pre-1978 Copyright records more available. — Our goals for the project

A principal goal of the Copyright Office is to digitize the content of the card catalog records. This work is already underway.  The card catalog is considered the most up-to-date index to copyright records prior to 1978. It has been updated over time to reflect corrections and changes sometimes with handwritten annotations and sometimes with new cards. The digitization and indexing of no other Copyright records can provide the completeness and accuracy that can be realized from the digitization and indexing of the card catalog. To the extent it remains in paper form, this nucleus of the non-digital Copyright Office records will remain off-line. Many of the works recorded in the card catalog are still under copyright protection. As presently envisioned a searchable data record would be created from the cards and each data record would have links to images of the respective cards, applications, record book pages and documents.

Another index to copyright records is the Catalog of Copyright Entries (CCE) which was published periodically between 1891 and 1977.  Scanning of the CCE volumes is also underway.  If a high success rate for optically recognizing, verifying, and parsing the character strings in the scanned images could be achieved followed by automatic indexing, it could provide an alternative search capability as well as achieving to a degree the preservation goal. It would also provide a means for better searching of the CCE data for situations where that is recommended such as a search for works pre-dating 1938. But the CCEs do not contain all of the updates that have been made to the catalog cards and they do not include assignment and transfers of copyrights, so they cannot provide the full record of ownership of a particular copyright. Nevertheless, a second goal is to digitize the contents of the CCEs and to explore ways to capitalize on the results of applying OCR.  Including links to the CCE page images in the respective data records is also being considered.

A third goal is to digitize the content of the bound Record Books of applications and requests for copyright registration and to link each image to the respective data record. Microfilm copies exist for all Record Books and scanning of the microfilm might facilitate digitization. However, the quality of the output may not be as good as scanning the Record Book pages and may not be acceptable.

A fourth goal is to link the existent PDF images of assignments and transfers of copyrights, made from the microfilm copies of the documents, to their respective data record. An assessment will be made of the quality of the PDF images and if found acceptable the PDF files will be used as is. If they are not acceptable this goal will be expanded to determine what would be required and what it would cost to produce improved images from the microfilm.

Achieving these goals could result in a single searchable database of copyright records with links to images of currently non-digital records. The database could eventually cover the period from 1790 to the present.

4 Comments

  1. Michael Carroll
    December 16, 2011 at 10:55 am

    This is an extremely important initiative. Digital access to the historical record of copyright registration will be a significant benefit to those researching the copyright status of pre-1978 works and for the scholarly community.

    I hope that in addition to simply OCRing the records, these will also be marked up in a standard XML schema?

    Kudos for making this a priority.

    -Mike

  2. Chris
    January 4, 2012 at 10:07 am

    I’d really like to see this move forward quickly. It’s an important goal, and, honestly, in the government IP record realm, the CO is playing catch up to the USPTO, who have done a pretty amazing job with their records.

    Best of luck!

  3. Barbara – So many years ago
    February 23, 2012 at 11:04 am

    I was disappointed today when I decided to look up some music I copyrighted in the early 1970s, just to discover that it is not online. I really hope this information becomes accessible. My copyright has since expired, but I can’t even remember what year I submitted it. Sure would like to be able look this up and point to my grandchildren some day… Your grandma wrote a few songs back in the day and here’s proof.

  4. David Hayes
    March 9, 2012 at 7:47 am

    A message for Barbara and persons interested in copyrights of the 1970s:

    Your copyright registration should be in the Catalog of Copyright Entries edition published at the time of your registration. Images of these book-format catalogs are available online, enabling you to show your grandchildren the listing. The listing will include the registration number, which will enable anyone visiting the Copyright Office to request and inspect (and also photograph) your copyright application.

    What’s more, your copyright has not expired. A 1992 law granted automatic renewal of copyright to works copyrighted after 1963. Thus, although you may have been told in the early 1970s that your copyright would expire in 28 years unless you renewed at the time of expiration, you were spared that task by later legislation.

Add a Comment

This blog is governed by the general rules of respectful civil discourse. You are fully responsible for everything that you post. The content of all comments is released into the public domain unless clearly stated otherwise. Your submission may be subject to disclosure under the Freedom of Information Act (FOIA). The Library of Congress does not control the content posted. Nevertheless, the Library of Congress may monitor any user-generated content as it chooses and reserves the right to remove content for any reason whatever, without consent. Gratuitous links to sites are viewed as spam and may result in removed comments. We further reserve the right, in our sole discretion, to remove a user's privilege to post content on the Library site. Read our Comment and Posting Policy.

Required fields are indicated with an * asterisk.