This action might not be possible to undo. Are you sure you want to continue?
BookID is an automated copyright protection system that helps authors and publishers identify unauthorized use of their works on Scribd. The system uses algorithms to compare reference samples of works in the BookID database to materials uploaded to Scribd’s platforms.
BookID uses a sophisticated, proprietary algorithm to analyze the computer-readable text in reference samples of copyright works for semantic data such as word count, letter frequency, phrase comparison, and other elements. The aggregated data is then encoded into a "fingerprint" of the work that is stored on a secure server inaccessible to the Internet. No copyrighted content is stored or made available to the public by BookID.
Every document uploaded to Scribd is scanned by BookID. Content that matches, with a certain degree of confidence, any BookID fingerprint is blocked and does not appear on the site.
BookID’s unique, semantic approach drastically reduces misidentifications and false positives, and increases the chances of detecting unauthorized copies of documents in the BookID database even where those copies have been altered.
Fingerprints from reference copies of copyrighted works will be generated for inclusion in the BookID database in the following situations:
Scribd continues to evaluate and enhance its BookID system. In its current form, the system has some notable limitations.
For example, BookID relies upon computer-readable text in digital documents. Content scanned from paper sources may not contain computer-readable text data, making those sources unsuitable for use as references. Similarly, digital documents encoded with optical character recognition (OCR) technology may contain garbled or partial computer- readable text data. This may be true regardless of whether the document is readable to humans. These conditions make it very difficult, if not impossible, to detect matches.
In addition, BookID’s fingerprint methodology cannot detect specific keywords, titles, author names, copyright notices, or other disclaimers that are part of a document's text. For example, BookID cannot be programmed to block all documents that contain your name. Likewise, BookID cannot translate to and from different languages. If a BookID thumbprint is encoded from an English-language document, BookID will only detect subsequent uploads in English. BookID cannot detect cover images, illustrations, and sheet music at this time.
While these limitations affect only a fraction of the content uploaded to Scribd, they explain why the system cannot be used to detect all copies or uses of a given copyrighted work on the service.
The BookID database may contain reference samples from educational textbooks and other works that contain long excerpts of classic literature, religious texts, legal documents, and government publications that are typically in the public domain. This can occasionally result in the removal of uncopyrighted, authorized, or public domain material from Scribd.
While occasional false positives are inevitable, we are diligently tuning BookID to reduce their incidence. Unfortunately, the volume of reference samples and uploads to Scribd prevent any sort of manual oversight or notification prior to effecting removals. If you suspect your content was removed improperly, click here and Scribd will evaluate whether specific content may be restored.
BookID for Authors and Publishers is a free program for approved content providers that wish to voluntarily upload copyrighted content to BookID. Click here to learn more.