differentiation by scientific field is too crude.
Against this background, it is manifest to use broad publication databases such as the WoS to analyse the growth of science. Price already used the number of journals as evidence for the rapid growth of
Authors:Yu-Chun Chen, Hsiao-Yun Yeh, Jau-Ching Wu, Ingo Haschler, Tzeng-Ji Chen and Thomas Wetter
Administrative health care databases have long been very important data sources for health studies. In recent decades, large-scale administrative health care databases featuring extremely large case numbers, long
The database management technology has played a vital role in the advancements of the information technology field. It has been matured after extensive research and development efforts and very successfully created
The application of online analytical processing (OLAP) technology to bibliographic databases is addressed. We show that OLAP
tools can be used by librarians for periodic and ad hoc reporting, quality assurance, and data integrity checking, as well
as by research policy makers for monitoring the development of science and evaluating or comparing disciplines, fields or
research groups. It is argued that traditional relational database management systems, used mainly for day-to-day data storage
and transactional processing, are not appropriate for performing such tasks on a regular basis. For the purpose, a fully functional
OLAP solution has been implemented on Biomedicina Slovenica, a Slovenian national bibliographic database. We demonstrate the system's usefulness by extracting data for studying a selection
of scientometric issues: changes in the number of published papers, citations and pure citations over time, their dependence
on the number of co-operating authors and on the number of organisations the authors are affiliated to, and time-patterns
of citations. Hardware, software and feasibility considerations are discussed and the phases of the process of developing
bibliographic OLAP applications are outlined.
Database management technology has played a vital role in the advancements of the information technology field. It has matured after extensive research and development efforts and has very successfully created a
Quantitative data or indicators derived from databases are usually treated like any other empirical data. In this article, the social character of the different processes leading to them is outlined. As a social process taking place in systemic organizational structures the genesis of such data cannot be reconstructed as sort of mechanical application of formal rules but only as human (and as such: arbitrary) action. As a consequence the discussion of such data or indicators can no longer be subject to methodology alone but requires support by sociology.
We analyse the statistical properties a database of musical notes for the purpose of designing an information retrieval system
as part of the Musifind project. In order to reduce the amount of musical information we convert the database to the intervals
between notes, which will make the database easier to search. We also investigate a further simplification by creating equivalence
classes of musical intervals which also increases the resilience of searches to errors in the query. The Zipf, Zipf-Mandelbrot,
Generalized Waring (GW) and Generalized Inverse Gaussian-Poisson (GIGP) distributions are tested against these various representations
with the GIGP distribution providing the best overall fit for the data. There are many similarities with text databases, especially
those with short bibliographic records. There are also some differences, particularly in the highest frequency intervals which
occur with a much lower frequency than the highest frequency “stopwords” in a text database. This provides evidence to support
the hypothesis that traditional text retrieval methods will work for a music database.
Authors:Gavin LaRowe, Sumeet Ambre, John Burgoon, Weimao Ke and Katy Börner
The Scholarly Database aims to serve researchers and practitioners interested in the analysis, modelling, and visualization
of large-scale data sets. A specific focus of this database is to support macro-evolutionary studies of science and to communicate
findings via knowledge-domain visualizations. Currently, the database provides access to about 18 million publications, patents,
and grants. About 90% of the publications are available in full text. Except for some datasets with restricted access conditions,
the data can be retrieved in raw or pre-processed formats using either a web-based or a relational database client. This paper
motivates the need for the database from the perspective of bibliometric/scientometric research. It explains the database
design, setup, etc., and reports the temporal, geographical, and topic coverage of data sets currently served via the database.
Planned work and the potential for this database to become a global testbed for information science research are discussed
at the end of the paper.
Authors:Kun-Yang Chuang, Ming-Huang Wang and Yuh-Shan Ho
The Science Citation Index (SCI)-Expanded was primarily designed to help researchers retrieve relevant publications on specific topics (Garfield and Pudovkin 2003 ). As the database became available, different criteria were developed to evaluate