However, corrections are not always possible, partly because of the database structure. This holds for entries older than 1995 (email March 3, 2011, from Thomson Reuters).
Various data repositories, namely WoS, are working to improve on the issue of multiple assigned author addresses, and newer publications in the database have direct indications of which author(s) links to which address(es).
In the case of alphabetical listings of authors, each author is assigned a value of 1/n (where n is the total number of authors).
The meta-data fields are compared across records that share the same last name only.
To make the algorithm useful for completely unchecked sets and thus avoid excessive manual checking of records we are currently working on a sampling method which will be presented in a follow-up publication.
Bates, T, Anic, A, Marusic, M, Marusic, A 2004 Authorship criteria and disclosure of contributions: comparison of 3 general medical journals with different author contribution forms. JAMA 292 1 86 .
Blondel, VD, Guillaume, JL, Lambiotte, R, Lefebvre, E 2008 Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment 2008:P10008 .
Cassiman, B, Glenisson, P B Van Looy 2007 Measuring industry-science links through inventor-author relations: a profiling methodology. Scientometrics 70 2 379–391 .
Damerau, FJ 1964 A technique for computer detection and correction of spelling errors. Communications of the ACM 7 3 171–176 .
Gurney, T., Horlings, E., & Van Den Besselaar, P. (2011). Author disambiguation using multi-aspect similarity indicators. In: E. Noyons, P. Ngulube, J. Leta (eds), Proceedings of ISSI 2011—The 13th International Conference on Scientometrics and Informetrics, Durban, 4-7 July 2011, pp 261–266.
- Search Google Scholar
- Export Citation
Gurney, T. , & Horlings, E. ( Van Den Besselaar, P. 2011). Author disambiguation using multi-aspect similarity indicators. In: , E. Noyons , P. Ngulube (eds), Proceedings of ISSI 2011—The 13th International Conference on Scientometrics and Informetrics, Durban, 4-7 July 2011, pp J. Leta 261– 266.
Healey, P, Rothman, H, Hoch, P 1986 An experiment in science mapping for research planning. Research Policy 15 5 233–251 .
Huang, J, Ertekin, S, Giles, CL 2006 Efficient name disambiguation for large-scale databases. Lecture Notes in Computer Science 4213:536 .
Kang, IS, Na, SH, Lee, S, Jung, H, Kim, P, Sung, WK et al. 2009 On co-authorship for author disambiguation. Information Processing & Management 45 1 84–97 .
Levenshtein, VI 1966 Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady 10 8 707–710.
Leydesdorff, L 1989 Words and co-words as indicators of intellectual organization 1. Research Policy 18 4 209–223 .
Leydesdorff, L, Cozzens, S P Van den Besselaar 1994 Tracking areas of strategic importance using scientometric journal mappings. Research Policy 23 2 217–229 .
Matsuo, Y, Ishizuka, M 2004 Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools 13 1 157–170 .
Meyer, M.S. (2001). Patent citation analysis in a novel field of technology: an exploration of nano-science and nano-technology (Vol. 51, pp. 163–183). Berlin: Springer.
Moed, HF 2000 Bibliometric indicators reflect publication and management strategies. Scientometrics 47 2 323–346 .
HF Moed W Glänzel U Schmoch eds. 2004 Handbook of quantitative science and technology research, The use of publication and patent statistics in studies of S&T systems Kluwer Academic Publishers Dordrecht.
Onodera, N, Iwasawa, M, Midorikawa, N, Yoshikane, F, Amano, K, Ootani, Y et al. 2011 A method for eliminating articles by homonymous authors from the large number of articles retrieved by author search. Journal of the American Society for Information Science and Technology 62 4 23 .
- Search Google Scholar
- Export Citation
Onodera, N , Iwasawa, M , Midorikawa, N , Yoshikane, F , Amano, K et al. Ootani, Y 2011 A method for eliminating articles by homonymous authors from the large number of articles retrieved by author search. Journal of the American Society for Information Science and Technology 62 4 23 10.1002/asi.21491.
Pasterkamp, G, Rotmans, JI DVP de Kleijn Borst, C 2007 Citation frequency: a biased measure of research impact significantly influenced by the geographical origin of research articles. Scientometrics 70 1 153–165 .
Raffo, J, Lhuillery, S 2009 How to play the “names game”: patent retrieval comparing different heuristics. Research Policy 38 10 1617–1627 .
Somers, A, Gurney, T, Horlings, E P Van Den Besselaar 2009 Science assessment integrated network toolkit (SAINT): a scientometric toolbox for analyzing knowledge dynamics Rathenau Institute The Hague.
Song, Y., Huang, J., Councill, I.G., Li, J., & Giles, C.L. 2007. Efficient topic-based unsupervised name disambiguation. In Proceedings of ACM/IEEE Joint Conference on Digital Libraries (JCDL 2007) (pp. 351). New York: ACM.
Tan, Y.F., Kan, M.Y., & Lee, D. (2006). Search engine driven author disambiguation. In 6th ACM/IEEE-CS joint conference on Digital libraries: Chapel Hill: ACM.
Tang, L, Walsh, JP 2010 Bibliometric fingerprints: name disambiguation based on approximate structure equivalence of cognitive maps. Scientometrics 84 3 763–784 .
Trajtenberg, M., Shiff, G., & Melamed, R. (2006). The Names Game: Harnessing Inventors’ Patent Data for Economic Research. Cambridge: NBER working paper.
van den Besselaar, P., & Leydesdorff, L. (1996). Mapping change in scientific specialities; a scientometric case study of the development of artificial intelligence. Journal of the American Society of Information Science, 47 (5).
Wagner-Döbler, R 2001 Continuity and discontinuity of collaboration behaviour since 1800—from a bibliometric point of view. Scientometrics 52 3 503–517 .
Whittaker, J, Courtial, JP, Law, J 1989 Creativity and conformity in science: titles, keywords and co-word analysis. Social Studies of Science 19 3 473–496 .
Yang, K.H., Peng, H.T., Jiang, J.Y., Lee, H.M., & Ho, J.M. (2008). Author Name Disambiguation for Citations Using Topic and Web Correlation. Research and Advanced Technology for Digital Libraries, 185–196.
Yank, V, Rennie, D 1999 Disclosure of researcher contributions: a study of original research articles in The Lancet. Annals of Internal Medicine 130 8 661.
Zhu, J., Zhou, X., & Fung, G. (2009). A term-based driven clustering approach for name disambiguation. Advances in Data and Web Management, 320–331.