View More View Less
  • 1 IPMI – Institute of Project Management and Innovation, University of Bremen, Wilhelm-Herbst-Str. 12, 28359 Bremen, Germany jgerken@uni-bremen.de
Restricted access

Abstract

For certain tasks in patent management it makes sense to apply a quantitative measure of textual similarity between patents and/or parts thereof: be it the analysis of freedom to operate, the analysis of technology convergence, or the mapping of patents for strategic purposes. In this paper we intend to outline the process of measuring textual patent similarity on the basis of elements referred to as ‘combined concepts’. We are going to use this process in various operations leading to design decisions, and shall also provide guidance regarding these decisions. By way of two applications from patent management, namely the prioritization of patents and the analysis of convergence between two technological fields, we mean to demonstrate the crucial importance of design decisions in terms of patent analysis results.

  • Abou-Assaleh, T., Cercone, N., Keselj, V., & Sweidan, R. (2004). N-gram-based detection of new malicious code. In Proceedings of the 28th annual international computer software and applications conference. Hong-Kong.

    • Search Google Scholar
    • Export Citation
  • Batagelj, V, Bren, M 1995 Comparing resemblance measures. Journal of Classification 12 1 7390 .

  • Bonino, D, Ciaramella, A, Corno, F 2010 Review of the state-of-the-art in patent information and forthcoming evolutions in intelligent patent informatics. World Patent Information 32 1 3038 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Braam, RR, Moed, HF AFJ van Raan 1988 Mapping of science: Critical elaboration and new approaches, a case study in agricultural biochemistry L Egghe R Rousseau eds. Infometrics 87/88 Elsevier Science Amsterdam 1528.

    • Search Google Scholar
    • Export Citation
  • Buehl, A 2010 PASW 18: Einführung in die moderne Datenanalyse 12 Pearson Studium München u.a..

  • Carley, KM 1997 Extracting team mental models through textual analysis. Journal of Organizational Behavior 18 S1 533558 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Cascini, G, Russo, D 2007 Computer-aided analysis of patents and search for TRIZ contradictions. International Journal of Product Development 4 1/2 5267 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Cavanar, W. B., & Trenkle, J. M., (1994). N-gram-based text categorization. In Proceedings of SDAIR-94, 3rd annual symposium on document analysis and information retrieval. Las Vegas, NV.

    • Search Google Scholar
    • Export Citation
  • Cepela, N., & Danowski, J. A. (2009). Automatic mapping of social networks of political actors from large collections of news stories. International conference on advances in social network analysis and mining. Athens.

    • Search Google Scholar
    • Export Citation
  • Çıltık, A, Güngör, T 2008 Time-efficient spam e-mail filtering using n-gram models. Pattern Recognition Letters 29 1 1933 .

  • Collins, P, Hollo, C 2010 English grammar: An introduction Palgrave Macmillan Basingstoke u.a..

  • Corman, SR, Kuhn, T, McPhee, RD, Dooley, KJ 2002 Studying complex discursive systems. Human Communication Research 28 2 157206.

  • Curran, C, Bröring, S, Leker, J 2010 Anticipating converging industries using publicly available data. Technological Forecasting and Social Change 77 3 385395 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Curran, C., & Leker, J. (2009). Seeing the next iphone coming your way: How to anticipate converging industries. Portland International Conference on Management of Engineering & Technology, 2009. PICMET 2009..

    • Search Google Scholar
    • Export Citation
  • Curran, C, Leker, J 2011 Patent indicators for monitoring convergence—Examples from NFF and ICT. Technological Forecasting and Social Change 78 2 256273 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Daga, R., & Pandey, G. (2008). US-Patent application 2008/0162455 A1. Determination of document similarity.

  • Doerfel, ML, Barnett, GA 1996 The use of Catpac for text analysis. Field Methods 8 2 47 .

  • Dressler, A 2006 Patente in technologieorientierten Mergers & Acquisitions: Nutzen, Prozessmodell, Entwicklung und Interpretation semantischer Patentlandkarten Deutscher Universitäts-Verlag Wiesbaden.

    • Search Google Scholar
    • Export Citation
  • Egghe, L 2000 The distribution of N-grams. Scientometrics 47 2 237252 .

  • Foltz, PW, Kintsch, W, Landauer, TK 1998 The measurement of textual coherence with latent semantic analysis. Discourse Processes 25 2–3 285308 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Gerken, J. M., Walter, L., & Moehrle, M. G. (2010). Semantische Patentlandkarten. Einsatz semantischer Patentlandkarten im Anwendungsfeld der Antriebstechnik—Eine explorative Analyse am Beispiel der Planentengetriebe. Heft Nr. 924 der Forschungsvereinigung Antriebstechnik. Frankfurt/Main: VDMA.

    • Search Google Scholar
    • Export Citation
  • Gower, JC, Legendre, P 1986 Journal of Classification 3 1 548 .

  • Jeong, B, Lee, D, Cho, H, Lee, J 2008 A novel method for measuring semantic similarity for XML schema matching. Expert Systems with Applications 34 3 16511658 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Kangasabai, R., & Pan, H. (2008). US-Patent 7,346,491 B2. Method of text similarity measurement.

  • Kim, YG, Suh, JH, Park, SC 2008 Visualization of patent analysis for emerging technology. Expert Systems with Applications 34 3 18041812 .

  • Kondrak, G 2005 N-gram similarity and distance. Lecture Notes in Computer Science 3772:115126 .

  • Landauer, TK, Foltz, PW, Laham, D 1998 An introduction to latent semantic analysis. Discourse Processes 25 2 259284 .

  • Lee, S, Yoon, B, Park, Y 2009 An approach to discovering new technology opportunities: Keyword-based patent map approach. Technovation 29 6–7 481497 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Manning, C. D., & Schütze, H., (2005). Foundations of statistical natural language processing. Cambridge, MA: MIT Press.

  • Moehrle, MG 2010 Measures for textual patent similarities: A guided way to select appropriate approaches. Scientometrics 85 1 95109 .

  • Moehrle, MG, Geritz, A 2007 Developing acquisition strategies based on patent maps T Khalil Y Hosni eds. Management of technology: New directions in technology management Elsevier Oxford 1929.

    • Search Google Scholar
    • Export Citation
  • Moehrle, MG, Walter, L, Bergmann, I, Bobe, S, Skrzipale, S 2010 Patinformatics as a business process: A guideline through patent research tasks and tools. World Patent Information 32 4 291299 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Moens, M 2006 Information extraction: Algorithms and prospects in a retrieval context Springer Dordrecht.

  • Peters, HPF AFJ van Raan 1993 Co-word-based science maps of chemical engineering. Part I: Representations by direct multidimensional scaling. Research Policy 22 1 2345 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Qin, J 2000 Semantic similarities between a keyword database and a controlled vocabulary database: An investigation in the antibiotic resistance literature. Journal of the American Society for Information Science 51 2 166180 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Ranganathan, A., & Ronen, R. (2008). US-Patent application 2008/0243809 A1. Information-theory based measure of similarity between instances in ontology.

    • Search Google Scholar
    • Export Citation
  • Rip, A, Courtial, P 1984 Co-word maps of biotechnology: An example of cognitive scientometrics. Scientometrics 6 6 381400 .

  • Ryley, JF, Saffer, J, Gibbs, A 2008 Advanced document retrieval techniques for patent research. World Patent Information 30 3 238243 .

  • Sepkoski, JJ 1974 Quantified coefficients of association and measurement of similarity. Mathematical Geology 6 2 135152 .

  • Sternitzke, C 2008 Betriebswirtschaftliche Patentportfoliobewertung: Eine informationswissenschaftliche Perspektive [dissertation] Universität Bremen Bremen.

    • Search Google Scholar
    • Export Citation
  • Sternitzke, C, Bergmann, I 2009 Similarity measures for document mapping: A comparative study on the level of an individual scientist. Scientometrics 78 1 113130 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Trajtenberg, M 1990 A penny for your quotes: Patent citations and the value of innovations. The Rand Journal of Economics 21 1 172187 .

  • Trippe, AJ 2003 Patinformatics: Tasks to tools. World Patent Information 25 3 211221 .

  • Tseng, Y, Lin, C, Lin, Y 2007 Text mining techniques for patent analysis. Information Processing and Management 43 5 12161247 .

  • Tsourikov, V. M., Batchilo, L. S., & Sovpel, I. V. (2000). US-Patent 6,167,370. Document semantic analysis/selection with knowledge creativity capability utilizing subject-action-object (SAO) structures.

    • Search Google Scholar
    • Export Citation
  • Turney, PD 2001 Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. Lecture Notes in Computer Science 2167:491502 .

  • I von Wartburg Teichert, T, Rost, K 2005 Inventive progress measured by multi-stage patent citation analysis. Research Policy 34 10 15911607 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Wanner, L, Baeza-Yates, R, Brügmann, S, Codina, J, Diallo, B, Escorsa, E, Giereth, M, Kompatsiaris, Y, Papadopoulos, S, Pianta, E, Piella, G, Puhlmann, I, Rao, G, Rotard, M, Schoester, P, Serafini, L, Zervaki, V 2008 Towards content-oriented patent document processing. World Patent Information 30 1 2133 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Wen, G, Jiang, L, Shadbolt, NR 2006 Ontology-based similarity between text documents on manifold. Lecture Notes in Computer Science 4185:113125 .

  • Yang, Y, Akers, L, Klose, T, Barcelon Yang, C 2008 Text mining and visualization tools—Impressions of emerging capabilities. World Patent Information 30 4 280293 .

    • Crossref
    • Search Google Scholar
    • Export Citation

Manuscript submission: http://www.editorialmanager.com/scim/

  • Impact Factor (2019): 2.867
  • Scimago Journal Rank (2019): 1.210
  • SJR Hirsch-Index (2019): 106
  • SJR Quartile Score (2019): Q1 Computer Science Apllications
  • SJR Quartile Score (2019): Q1 Library and Information Sciences
  • SJR Quartile Score (2019): Q1 Social Sciences (miscellaneous)
  • Impact Factor (2018): 2.770
  • Scimago Journal Rank (2018): 1.113
  • SJR Hirsch-Index (2018): 95
  • SJR Quartile Score (2018): Q1 Library and Information Sciences
  • SJR Quartile Score (2018): Q1 Social Sciences (miscellaneous)

For subscription options, please visit the website of Springer

Scientometrics
Language English
Size B5
Year of
Foundation
1978
Volumes
per Year
4
Issues
per Year
12
Founder Akadémiai Kiadó
Founder's
Address
H-1117 Budapest, Hungary 1516 Budapest, PO Box 245.
Publisher Akadémiai Kiadó
Springer Nature Switzerland AG
Publisher's
Address
H-1117 Budapest, Hungary 1516 Budapest, PO Box 245.
CH-6330 Cham, Switzerland Gewerbestrasse 11.
Responsible
Publisher
Chief Executive Officer, Akadémiai Kiadó
ISSN 0138-9130 (Print)
ISSN 1588-2861 (Online)