1 PATIRIS Permanent Observatory on Patenting by Italian Universities and Public Research Institutes. Enrico Forti University College London & University of Bologna Maurizio Sobrero University of Bologna Loredana Guglielmetti Ministry of Economic Development
2 Patiris patiris.uibm.gov.it Permanent Observatory on Patenting by Italian Universities and Public Research Institutes. Open-data and Dynamic Reports. ITALIAN ENGLISH
3 The Project Sponsor Loredana Guglielmetti Ministry of Economic Development Dipartimento per l'impresa e l'internalizzazione - Direzione Generale Lotta alla Contraffazione Ufficio Italiano Brevetti e Marchi Design and Development Enrico Forti, Maurizio Sobrero University College London, Department of Management Science & Innovation University of Bologna, Department of Management Technical Support & Web Development Fulvio Di Marco Epoca Ricerca Patent Data Questel Orbit UIBM Assignee Name Disambiguation All the names of the Italian universities and research institutes have been harmonized and de-duplicated with technical support from Questel (Martine Massiera, Rossella Osella).
4 Patiris: Objectives & Approach Objectives Promoting the value of Italian research Diffusion of data, indicators, and analyses about the innovative productivity of Italian public research institutes Potential Users Policy Makers Patent Information Professionals Private Companies Media General Public Approach A reliable database with all the patents of Italian research institutes. Disambiguation of assignee names Invention-based patent families (FAMPAT) Updated twice a year A web service based on open data and dynamic reports Intuitive data exploration via dynamic reports and open-data Built-in search engine allowing easy data retrieval and export. Visually rich timelines to explore in detail individual patent families and documents.
5 Methodology Invention-based Patent Families: FAMPAT (Questel Orbit, widely used by NETVAL members) 96 Institutes: 94 Italian Universities (identified by the Italian Ministry of Education - CNR (National Research Council) ENEA (National Agency for New Technologies, Energy and Sustainable Development) INPADOC Questel - Orbit FAMPAT Patiris
6 Methodology Patent Identification Strategy INPADOC Documents that have at least one of the 96 observed institutes listed as assignee. Name disambiguation Questel - Orbit FAMPAT Patiris
7 Assignee Name Disambiguation The Problem Lack of unique IDs assigned to assignees by the various international patent authorities. For decades the institutes listed as assignees have been identified (e.g. at the time of the application) only via the name and few other parameters (e.g. address) encoded as text strings. Common Issues Data entry mistakes Incorrect translations Abbreviations Artefacts generated during the process of scanning paper documents using OCR technology Name changes or mergers between different institutions
8 Assignee Name Disambiguation Lack of unique IDs allows a significant number of name variants for the assignees puts a serious burden on anyone interested in carrying out research based on assignee names A single institution needs to be matched to multiple variants of a common name. Building search strings that include all name variations is the only way to correctly identify all potentially relevant documents.
9 Assignee Name Disambiguation Objective Capture the variety of names present in the stock of patents of the 96 monitored institutes. Robustness Checks All the identified name variants and all the documents have been manually inspected by the researchers. Approach Identify all the observable name variations appearing in the documents. Iterative process: 1. search string A 2. examination of the patents retrieved 3. Search string B We proceeded incrementally considering each institution in our sample as assignee and each name variation.
10 Assignee Name Disambiguation How Bad is the Issue? Each monitored institution has on average 11 variants Name variants make analysis and research difficult and costly (e.g. 100 variations for CNR; 24 for the University of Bologna) Examples UNIBERUSHITA DETSURI SUTOUDEI DAY CAGLIARI University of Cagliari EHNEHA EHNTE FOR NEW Teknolodzhi L EHNERGIJA EL ENVIRONMENT ENEA, National Agency for New Technologies, Energy and Sustainable Economic Development
11 Assignee Name Disambiguation Outcome The final database contains all name variants observed on the patent documents we were able to retrieve matched with the normal name listed on Fully integrated in Patiris and in the Questel Orbit database. Benefits A database that allows to match all the observed name variants with the normalized names of those institutes that are listed as assignee of at least one granted patent or patent application. Freely available.
12 Assignee Name Disambiguation The data provider (Questel Orbit) kindly agreed to implement the normalized names directly in their master database. The results of the disambiguation we carried out within the project is now available to the widest number of stakeholders. Future-proof Missing data? New name variants? Any issue with the data can be reported directly to Questel. Questel will check the issue and will edit the Orbit master database. The database powering Patiris will be updated twice per year.
13 Patiris: Home 9 Dynamic Modules
14 Patiris: Module 1 Ministero dello Sviluppo Economico
15 Patiris: Module 2 Ministero dello Sviluppo Economico
16 Patiris: Module 3 Ministero dello Sviluppo Economico
17 Patiris: Module 4 Ministero dello Sviluppo Economico
18 Patiris: Module 5 Ministero dello Sviluppo Economico
19 Patiris: Module 6 Ministero dello Sviluppo Economico
20 Patiris: Module 7 Ministero dello Sviluppo Economico
21 Patiris: Module 8 Ministero dello Sviluppo Economico
22 Patiris: Module 9 Ministero dello Sviluppo Economico
23 Patiris: Search Ministero dello Sviluppo Economico
24 Patiris: Search Results Ministero dello Sviluppo Economico
25 Patiris: Search Results Dynamic Charts
26 Patiris: Search Results Dynamic Charts
27 Patiris: Search Results Family Members
28 Patiris: Search Results Timeline Module
29 Patiris: Search Results Timeline Module
30 Patiris: Search Results Timeline Module
31 Patiris: Search Results Timeline Module
32 Patiris: Search Results Document Details
33 Patiris: Export Anything Ministero dello Sviluppo Economico