Patent-to-Patent Similarity: A Vector Space Model
39 Pages. Posted: 30 Dec 2015. Last revised: 19 Aug 2016.
Date Written: July 30, 2016
Current measures of patent similarity rely on the manual classification of patents into taxonomies. In this project, we leverage information retrieval theory and Big Data methods to develop an automated measure of patent-to-patent similarity. We validate the measure and demonstrate that it significantly improves upon existing patent classification systems. Moreover, we illustrate how a pairwise similarity comparison of every pair of patents in the USPTO patent space can open new avenues of research in economics, management, and public policy. We make the data available for future scholarship through the Patent Research Foundation.
Keywords: patent data, technology space, similarity, relatedness