Patent Similarity Data and Innovation Metrics

Journal of Empirical Legal Studies, 2020

26 Pages Posted: 28 Aug 2020

See all articles by Ryan Whalen

Ryan Whalen

The University of Hong Kong - Faculty of Law

Alina Lungeanu

Northwestern University - School of Communication

Leslie DeChurch

Northwestern University - School of Communication

Noshir Contractor

Northwestern University - McCormick School of Engineering and Applied Science; Northwestern University; Northwestern University; Northwestern University

Date Written: July 21, 2020

Abstract

We introduce and describe the Patent Similarity Data set, comprising vector space model-based similarity scores for United States utility patents. The data set provides approximately 640 million pre-calculated similarity scores, as well as the code and computed vectors required to calculate further pairwise similarities. In addition to the raw data, we introduce measures that leverage patent similarity to provide insight into innovation and intellectual property law issues of interest to both scholars and policymakers. Code is provided in accompanying scripts to assist researchers in obtaining the data set, joining it with other available patent data, and using it in their research.

Keywords: Patent, Doc2Vec, Patent Similarity, Patent Distance

JEL Classification: K30, Y10, O34

Suggested Citation

Whalen, Ryan and Lungeanu, Alina and DeChurch, Leslie and Contractor, Noshir, Patent Similarity Data and Innovation Metrics (July 21, 2020). Journal of Empirical Legal Studies, 2020, Available at SSRN: https://ssrn.com/abstract=3657462 or http://dx.doi.org/10.2139/ssrn.3657462

Ryan Whalen (Contact Author)

The University of Hong Kong - Faculty of Law ( email )

Pokfulam Road
Hong Kong, Hong Kong
China

Alina Lungeanu

Northwestern University - School of Communication ( email )

70 Arts Circle Drive
Office of the Dean
Evanston, IL 60208
United States

Leslie DeChurch

Northwestern University - School of Communication ( email )

70 Arts Circle Drive
Office of the Dean
Evanston, IL 60208
United States

Noshir Contractor

Northwestern University - McCormick School of Engineering and Applied Science ( email )

2145 Sheridan Road
Evanston, IL 60208-1230
United States

Northwestern University ( email )

2145 Sheridan Road
Room C210
Evanston, IL 60208
United States
2173906270 (Phone)

Northwestern University ( email )

Evanston, IL
United States
2173906270 (Phone)

Northwestern University ( email )

70 Arts Circle Drive
Office of the Dean
Evanston, IL 60208
United States
2173906270 (Phone)

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
205
Abstract Views
840
Rank
283,650
PlumX Metrics