View More View Less
  • 1 The MITRE Corporation (Ret'd), 7515 Colshire Drive, McLean, VA, 22102, USA
  • 2 Bioterrorism Preparedness and Response Program, National Center for Infectious Diseases, Center for Disease Control, Atlanta, GA, USA
Restricted access

Abstract

Text mining was used to extract technical intelligence from the open source global SARS research literature. A SARS-focused query was applied to the Science Citation Index (SCI) (SCI 2008) database for the period 1998–early 2008. The SARS research literature infrastructure (prolific authors, key journals/institutions/countries, most cited authors/journals/documents) was obtained using bibliometrics, and the SARS research literature technical structure (hierarchical taxonomy) was obtained using computational linguistics/document clustering.

  • Davidse, RJ AFJ Van Raan 1997 Out of particles: impact of CERN, DESY, and SLAC research to fields other than physics. Scientometrics 40 2 171193 .

  • Feng, YJ, Gao, GF 2007 Towards our understanding of SARS-CoV, an emerging and devastating but quickly conquered virus. Comparative Immunology, Microbiology and Infectious Diseases 30 5–6 309327 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Garfield, E 1985 History of citation indexes for chemistry—a brief review. JCICS 25 3 170174.

  • Goldman, JA, Chu, WW, Parker, DS, Goldman, RM 1999 Term domain distribution analysis: A data mining tool for text databases. Methods of Information in Medicine 38:96101.

    • Search Google Scholar
    • Export Citation
  • Greengrass, E. (1997). Information retrieval: An overview. National Security Agency. TR-R52-02-96.

  • Hao, P, Chen, M, Zhang, GQ, He, WZ, Li, YX 2006 Bioinformatics research on the SARS coronavirus (SARS_CoV) in China. Current Pharmaceutical Design 12 35 45654572 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Hearst, M. A. (1999). Untangling text data mining. In Proceedings of ACL 99, the 37th annual meeting of the association for computational linguistics, June 20-26, 1999. University of Maryland.

    • Search Google Scholar
    • Export Citation
  • Janies, D, Habib, F, Alexandrov, B, Hill, A, Pol, D 2008 Evolution of genomes, host shifts and the geographic spread of SARS-CoV and related coronaviruses. Cladistics 24 2 111130 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Karypis, G. (2004). CLUTO—A clustering toolkit. http://www.cs.umn.edu/cluto.

  • Kostoff, RN 2003 Bilateral asymmetry prediction. Medical Hypotheses 61 2 265266 .

  • Kostoff, R. N. (2008). Literature-related discovery: Introduction and background. In R. N. Kostoff (ed.), Special issue on literature-related discovery. Technological forecasting and social change, 75 (2), 165185.

    • Search Google Scholar
    • Export Citation
  • Kostoff, RN, Braun, T, Schubert, A, Toothman, DR, Humenik, J 2000 Fullerene roadmaps using bibliometrics and Database Tomography. Journal of Chemical Information and Computer Science 40 1 1939.

    • Search Google Scholar
    • Export Citation
  • Kostoff, R. N., Briggs, M., Rushenberg, R., Bowles, C. A., & Pecht, M. (2006). The structure and infrastructure of Chinese science and technology. DTIC Technical report number ADA443315. Fort Belvoir, VA: Defense Technical Information Center. http://www.dtic.mil/.

    • Search Google Scholar
    • Export Citation
  • Kostoff, R. N., Briggs, M. B., Solka, J. A., Rushenberg, R. L. (2008). Literature-related discovery: Methodology. In R. N. Kostoff (ed.), Special issue on literature-related discovery. Technological Forecasting and Social Change, 75 (2), 186202.

    • Search Google Scholar
    • Export Citation
  • Kostoff, RN, Del Rio, JA, García, EO, Ramírez, AM, Humenik, JA 2001 Citation mining: Integrating text mining and bibliometrics for research user profiling. Journal of the American Society for Information Science and Technology 52 13 11481156 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Kostoff, RN, Eberhart, HJ, Toothman, DR 1997 Database Tomography for information retrieval. Journal of Information Science 23 4 301311 .

  • Kostoff, RN, Morse, SA, Oncu, S 2007 The seminal literature of anthrax research. Critical Reviews in Microbiology 33 3 171181 .

  • Kostoff, RN, Shlesinger, MF, Malpohl, G 2004 Fractals roadmaps using bibliometrics and Database Tomography. Fractals 12 1 116 .

  • Kostoff, RN, Shlesinger, MF, Tshiteya, R 2004 Nonlinear dynamics roadmaps using bibliometrics and Database Tomography. International Journal of Bifurcation and Chaos 14 1 6192 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Losiewicz, P, Oard, D, Kostoff, RN 2000 Textual data mining to support science and technology management. Journal of Intelligent Information Systems 15:99119 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Narin, F. (1976). Evaluative bibliometrics: The use of publication and citation analysis in the evaluation of scientific activity (monograph). NSF C-637. National Science Foundation. 1976. Contract NSF C-627. NTIS accession no. PB252339/AS.

    • Search Google Scholar
    • Export Citation
  • Narin, F, Olivastro, D, Stevens, KA 1994 Bibliometrics theory, practice and problems. Evaluation Review 18 1 6576 .

  • Schubert, A, Glanzel, W, Braun, T 1987 Subject field characteristic citation scores and scales for assessing research performance. Scientometrics 12 5–6 267291 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Swanson, DR 1986 Fish Oil, Raynauds Syndrome, and undiscovered public knowledge. Perspectives in Biology and Medicine 30 1 718.

  • Swanson, DR, Smalheiser, NR, Bookstein, A 2001 Information discovery from complementary literatures: Categorizing viruses as potential weapons. Journal of the American Society for Information Science and Technology 52 10 797812 .

    • Crossref
    • Search Google Scholar
    • Export Citation
  • Zhang, ZB 2007 The outbreak pattern of SARS cases in China as revealed by a mathematical model. Ecological Modelling 204 3–4 420426 .

  • Zhao, Y, Karypis, G 2004 Empirical and theoretical comparisons of selected criterion functions for document clustering. Machine Learning 55 3 311331 .

    • Crossref
    • Search Google Scholar
    • Export Citation