Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Searching in cooperative patent classification: comparison between keyword and concept-based search

Montecchi, Tiziano, Russo, Davide and Liu, Ying ORCID: 2013. Searching in cooperative patent classification: comparison between keyword and concept-based search. Advanced Engineering Informatics 27 (3) , pp. 335-345. 10.1016/j.aei.2013.02.002

Full text not available from this repository.


International patent corpus is a gigantic source containing today about 80 million of documents. Every patent is manually analyzed by patent officers and then classified by a specific code called Patent Class (PC). Cooperative Patent Classification CPC is the new classification system introduced since January 2013 in order to standardize the classification systems of all major patent offices. Like keywords for papers, PCs point to the core of the invention, describing concisely what they contain inside. Most of patents strategies are based on PC as filter for results therefore the selection of relevant PCs is often a primary and crucial activity. This task is considered particularly challenging and only few tools have been specially developed for this purpose. The most efficient tools are provided by patent offices of EPO and WIPO. This paper analyzes their PCs search strategy (mainly based on keyword-based engines) in order to identify main limitations in terms of missing relevant PCs (recall) and non-relevant results (precision). Patents have been processed by KOM, a semantic patent search tool developed by the authors. Unlike all other PC search tools, KOM uses semantic parser and many knowledge bases for carrying out a conceptual patent search. Its functioning is described step by step through a detailed analysis pointing out the benefits of a concept-based search vis-à-vis a keyword-based search. An exemplary case is proposed dealing with CPCs describing the sterilization of contact lenses. Comparison could be likewise conducted on other PCs such as International (IPC), European (ECLA) or United States (USPC) patent classification codes.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Centre for Advanced Manufacturing Systems At Cardiff (CAMSAC)
Subjects: T Technology > TA Engineering (General). Civil engineering (General)
Uncontrolled Keywords: Concept-based search; Patent classification; Patent mining
Publisher: Elsevier
ISSN: 1474-0346
Last Modified: 25 Oct 2022 07:59

Citation Data

Cited 54 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item