US Patent Similarity Data

Dataset

Description

Pairwise semantic similarity measures for US utility patents. Includes measures for citing/cited patent pairs, 100 most-similar patents for each patent, and doc2vec vectors for each patent. Second edition includes .npy file needed to generate new text embeddings using the pre-trained model.
Date made availableNov 25 2019
PublisherZENODO

Cite this