View More View Less
  • 1 Katholieke Industriële Hogeschool West-Vlaanderen Zeedijk 101 B-8400 Oostende Belgium West-Vlaanderen Zeedijk 101 B-8400 Oostende Belgium
  • 2 Speciale Licentie Documentatie- en Bibliotheekwetenschap University of Antwerp Universiteitsplein 1 B-2610 Wilrijk Belgium Universiteitsplein 1 B-2610 Wilrijk Belgium
  • 3 The City University Dep. of Information Science Northampton Square EC1V OHB London UK Northampton Square EC1V OHB London UK
  • 4 China National Rice Research Institute Department of Sci-Tech Inf. Hangzhou The People's Republic of China Hangzhou The People's Republic of China
Restricted access

Abstract  

At the occasion of the 40th anniversary of George Zipf's premature dead, we reanalyse his data on the frequency of Chinese words. We find the best fitting Lotka, Zipf, Bradford and Leimkuhler distribution and show that only Lotka's function is not rejected by a Kolmogorov-Smirnov test. Using an additional term to Leimkuhler's function leads to a statistically acceptable fit. In this way we can determine a core (nucleus) of most frequently used Chinese words.